Update XMOS xcore.ai port to be compatible with v11.x #1096

ACascarino · 2024-06-26T18:34:59Z

Update XMOS xcore.ai port

Description

The merge of the smp branch into main in version 11 broke an assumption that the xcore.ai port relied upon - the backwards-compatibility-preserving decision to conditionally define in tasks.c either pxCurrentTCB or pxCurrentTCBs depending solely on the value of configNUMBER_OF_CORES means that the pxCurrentTCBs symbol does not always exist. The assumption was made that this symbol would always exist, and for FreeRTOS instances with one core in use its 0th element would solely be populated. This change means that ports to SMP platforms need instead to be aware of the number of cores defined and change behaviour accordingly. This went unnoticed at XMOS at the time of release of v11, but we've now noticed the issue and I've (hopefully) created a fix.

This PR attempts to make the XMOS xcore.ai port agnostic to whether it is running on a single-core or SMP instance of FreeRTOS by simply introducing an additional layer of indirection to pxCurrentTCB(s) accesses. When the scheduler is started (which is the first time in the application that the port layer needs to interact with the TCB pointer(s)), we populate a global symbol with the address of the TCB pointer. It is this symbol, rather than pxCurrentTCB(s), which is then used in scheduler initialisation and on context switches.

This PR adds one instruction (plus compiler-defined register spill/restoration to protect the r5 clobber) to scheduler initialisation, and zero instructions to RTOS interrupt processing and context switching.

This PR also includes a fix to make exception behaviour entirely predictable; the previous implementation had assumed that the symbol _TrapHandler would always exist at 0x80080. This is not the case; for example, using the --first option in our toolchain to place arbitrary data at the top of memory shifts all other symbols, breaking this assumption and causing wildly undefined behaviour on exception. _DoException is a link-time visible symbol that presents the appropriate entry-point to the platform's exception handling routines, and its use here is preferable in all contexts.

Test Steps

As this is a community-supported port, there are no automated tests to highlight the issue, nor any to prove lack of regression. However, internal testing of FreeRTOS-based products has shown no regression in their functionality due to this change.

As a query - partner-maintained ports seem to have a series of automated tests that they use to show lack of regression, but these, from a cursory look, seem entirely single-core focussed. Are there any partner-maintained SMP ports for which the test suite would exercise the additional SMP-related functionality?

Checklist:

I have tested my changes. No regression in existing tests. There are no existing tests, but implementation testing shows no regression in selected FreeRTOS-based products
I have modified and/or added unit-tests to cover the code changes in this Pull Request. No unit tests exist

Related Issue

By submitting this pull request, I confirm that you can use, modify, copy, and redistribute this contribution, under the terms of your choice.

AniruddhaKanhere · 2024-06-26T19:03:08Z

Hello @ACascarino,

Thank you for taking the time to report and fix the issue. To me, in the first glance, it looks good.
I'll pass it along to our experts in the team for review.

Thanks,
Aniruddha

AniruddhaKanhere · 2024-06-26T19:09:07Z

portable/ThirdParty/xClang/XCOREAI/portasm.S

-  ldc r9, 0x0080
-  or  r11, r11, r9
-  bau r11 //_TrapHandler is at 0x00080080. TODO: Is it always? Why can't I access the symbol _TrapHandler?
+  bu _DoException


I am not sure what this change is doing - can you please clarify this for me?

Hi @AniruddhaKanhere - this is a symbol that is generated by our toolchain on compilation. There are a couple of these which are automatically built into every xcore.ai executable to bring up the processor, set execution modes on each core, and provide routines for graceful exception handling (such as _DoException, which disables all events and interrupts and leaves the processor in a state in which an external debugger may connect and query the processor state at time of exception). Therefore, while it would not be expected to find this symbol defined in any application code, it will always be present in the final executable. _TrapHandler, referenced in commentary in the previous implementation, is also one such symbol - it is usually the initial symbol used as the kernel exception pointer and sets up the necessary processor state to then branch to _DoException. Due to the alignment of kexcept, this state has already been achieved by the time kexcept gets here, and so there is no need to go via the _TrapHandler symbol.

The previous implementation assumed that _TrapHandler always existed at 0x80080 - it is not visible to the assembler, and it is therefore required that the address be hard-coded if trying to jump to it. This assumption was not true in certain cases. _DoException, which is visible to the assembler, is therefore an appropriate symbol to jump to and a better choice in all circumstances.

I've also added commentary to clarify this inline! :)

ACascarino · 2024-06-27T11:51:08Z

Apologies - it turns out adding a new commit dismissed stale reviews. Sorry to ask you to review again!

codecov · 2024-06-27T18:58:55Z

Codecov Report

All modified and coverable lines are covered by tests ✅

Project coverage is 92.31%. Comparing base (0c79e74) to head (a0cca63).
Report is 2 commits behind head on main.

Additional details and impacted files

@@           Coverage Diff           @@
##             main    #1096   +/-   ##
=======================================
  Coverage   92.31%   92.31%           
=======================================
  Files           6        6           
  Lines        3226     3226           
  Branches      885      885           
=======================================
  Hits         2978     2978           
  Misses        132      132           
  Partials      116      116

Flag	Coverage Δ
unittests	`92.31% <ø> (ø)`

Flags with carried forward coverage won't be shown. Click here to find out more.

☔ View full report in Codecov by Sentry.
📢 Have feedback on the report? Share it here.

chinglee-iot · 2024-07-01T05:07:35Z

@ACascarino

As a query - partner-maintained ports seem to have a series of automated tests that they use to show lack of regression, but these, from a cursory look, seem entirely single-core focussed. Are there any partner-maintained SMP ports for which the test suite would exercise the additional SMP-related functionality?

We have on target test cases which could exercise the SMP-related functionality. We are still in the progress to publish all the test cases. You could reference reference this link for more information.

ACascarino · 2024-07-01T11:47:25Z

@ACascarino

As a query - partner-maintained ports seem to have a series of automated tests that they use to show lack of regression, but these, from a cursory look, seem entirely single-core focussed. Are there any partner-maintained SMP ports for which the test suite would exercise the additional SMP-related functionality?

We have on target test cases which could exercise the SMP-related functionality. We are still in the progress to publish all the test cases. You could reference reference this link for more information.

Ah great thank you - I managed to completely miss these!

ACascarino added 4 commits June 19, 2024 15:32

Fix kexcept function

47e47e7

Create dummy pxCurrentTCBs for xcore.ai port

50958c1

Additional commentary

da14835

Add a layer of indirection to cope with singlecore

01b64f5

ACascarino requested a review from a team as a code owner June 26, 2024 18:34

Merge branch 'main' into xmos_fix_freertos_port

fbc3a7a

n9wxu previously approved these changes Jun 26, 2024

View reviewed changes

AniruddhaKanhere previously approved these changes Jun 26, 2024

View reviewed changes

AniruddhaKanhere reviewed Jun 26, 2024

View reviewed changes

aggarg previously approved these changes Jun 27, 2024

View reviewed changes

Clarify use of _DoException

a0cca63

ACascarino dismissed stale reviews from aggarg, AniruddhaKanhere, and n9wxu via a0cca63 June 27, 2024 11:48

AniruddhaKanhere approved these changes Jun 27, 2024

View reviewed changes

n9wxu approved these changes Jun 27, 2024

View reviewed changes

n9wxu merged commit 17dfd0f into FreeRTOS:main Jun 27, 2024
17 checks passed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Update XMOS xcore.ai port to be compatible with v11.x #1096

Update XMOS xcore.ai port to be compatible with v11.x #1096

Uh oh!

ACascarino commented Jun 26, 2024 •

edited

Loading

Uh oh!

AniruddhaKanhere commented Jun 26, 2024

Uh oh!

AniruddhaKanhere Jun 26, 2024 •

edited

Loading

Uh oh!

ACascarino Jun 27, 2024 •

edited

Loading

Uh oh!

ACascarino Jun 27, 2024

Uh oh!

ACascarino commented Jun 27, 2024

Uh oh!

codecov bot commented Jun 27, 2024

Uh oh!

Uh oh!

chinglee-iot commented Jul 1, 2024

Uh oh!

ACascarino commented Jul 1, 2024

Uh oh!

Uh oh!

Update XMOS xcore.ai port to be compatible with v11.x #1096

Update XMOS xcore.ai port to be compatible with v11.x #1096

Uh oh!

Conversation

ACascarino commented Jun 26, 2024 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!