Skip to content

Commit 269f5fe

Browse files
stuijostannard
andauthored
[AARCH64] Add support for Cortex-A320 (#139055)
This patch adds initial support for the recently announced Armv9 Cortex-A320 processor. For more information, including the Technical Reference Manual, see: https://developer.arm.com/Processors/Cortex-A320 --------- Co-authored-by: Oliver Stannard <[email protected]>
1 parent b249b49 commit 269f5fe

File tree

9 files changed

+97
-3
lines changed

9 files changed

+97
-3
lines changed

clang/docs/ReleaseNotes.rst

Lines changed: 3 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -728,6 +728,9 @@ X86 Support
728728

729729
Arm and AArch64 Support
730730
^^^^^^^^^^^^^^^^^^^^^^^
731+
732+
- Support has been added for the following processors (command-line identifiers in parentheses):
733+
- Arm Cortex-A320 (``cortex-a320``)
731734
- For ARM targets, cc1as now considers the FPU's features for the selected CPU or Architecture.
732735
- The ``+nosimd`` attribute is now fully supported for ARM. Previously, this had no effect when being used with
733736
ARM targets, however this will now disable NEON instructions being generated. The ``simd`` option is
Lines changed: 6 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -0,0 +1,6 @@
1+
// RUN: not %clang --target=arm-arm-none-eabi -mcpu=cortex-a320 %s 2>&1 | FileCheck %s
2+
// CHECK: error: unsupported argument {{.*}} to option '-mcpu='
3+
4+
// RUN: %clang -target aarch64 -mcpu=cortex-a320 -### -c %s 2>&1 | FileCheck -check-prefix=A320 %s
5+
// A320: "-cc1"{{.*}} "-triple" "aarch64{{.*}}" "-target-cpu" "cortex-a320"
6+
Lines changed: 60 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -0,0 +1,60 @@
1+
// REQUIRES: aarch64-registered-target
2+
// RUN: %clang --target=aarch64 --print-enabled-extensions -mcpu=cortex-a320 | FileCheck --strict-whitespace --implicit-check-not=FEAT_ %s
3+
4+
// CHECK: Extensions enabled for the given AArch64 target
5+
// CHECK-EMPTY:
6+
// CHECK-NEXT: Architecture Feature(s) Description
7+
// CHECK-NEXT: FEAT_AMUv1 Enable Armv8.4-A Activity Monitors extension
8+
// CHECK-NEXT: FEAT_AMUv1p1 Enable Armv8.6-A Activity Monitors Virtualization support
9+
// CHECK-NEXT: FEAT_AdvSIMD Enable Advanced SIMD instructions
10+
// CHECK-NEXT: FEAT_BF16 Enable BFloat16 Extension
11+
// CHECK-NEXT: FEAT_BTI Enable Branch Target Identification
12+
// CHECK-NEXT: FEAT_CCIDX Enable Armv8.3-A Extend of the CCSIDR number of sets
13+
// CHECK-NEXT: FEAT_CRC32 Enable Armv8.0-A CRC-32 checksum instructions
14+
// CHECK-NEXT: FEAT_CSV2_2 Enable architectural speculation restriction
15+
// CHECK-NEXT: FEAT_DIT Enable Armv8.4-A Data Independent Timing instructions
16+
// CHECK-NEXT: FEAT_DPB Enable Armv8.2-A data Cache Clean to Point of Persistence
17+
// CHECK-NEXT: FEAT_DPB2 Enable Armv8.5-A Cache Clean to Point of Deep Persistence
18+
// CHECK-NEXT: FEAT_DotProd Enable dot product support
19+
// CHECK-NEXT: FEAT_ECV Enable enhanced counter virtualization extension
20+
// CHECK-NEXT: FEAT_ETE Enable Embedded Trace Extension
21+
// CHECK-NEXT: FEAT_FCMA Enable Armv8.3-A Floating-point complex number support
22+
// CHECK-NEXT: FEAT_FGT Enable fine grained virtualization traps extension
23+
// CHECK-NEXT: FEAT_FHM Enable FP16 FML instructions
24+
// CHECK-NEXT: FEAT_FP Enable Armv8.0-A Floating Point Extensions
25+
// CHECK-NEXT: FEAT_FP16 Enable half-precision floating-point data processing
26+
// CHECK-NEXT: FEAT_FRINTTS Enable FRInt[32|64][Z|X] instructions that round a floating-point number to an integer (in FP format) forcing it to fit into a 32- or 64-bit int
27+
// CHECK-NEXT: FEAT_FlagM Enable Armv8.4-A Flag Manipulation instructions
28+
// CHECK-NEXT: FEAT_FlagM2 Enable alternative NZCV format for floating point comparisons
29+
// CHECK-NEXT: FEAT_HCX Enable Armv8.7-A HCRX_EL2 system register
30+
// CHECK-NEXT: FEAT_I8MM Enable Matrix Multiply Int8 Extension
31+
// CHECK-NEXT: FEAT_JSCVT Enable Armv8.3-A JavaScript FP conversion instructions
32+
// CHECK-NEXT: FEAT_LOR Enable Armv8.1-A Limited Ordering Regions extension
33+
// CHECK-NEXT: FEAT_LRCPC Enable support for RCPC extension
34+
// CHECK-NEXT: FEAT_LRCPC2 Enable Armv8.4-A RCPC instructions with Immediate Offsets
35+
// CHECK-NEXT: FEAT_LSE Enable Armv8.1-A Large System Extension (LSE) atomic instructions
36+
// CHECK-NEXT: FEAT_LSE2 Enable Armv8.4-A Large System Extension 2 (LSE2) atomicity rules
37+
// CHECK-NEXT: FEAT_MPAM Enable Armv8.4-A Memory system Partitioning and Monitoring extension
38+
// CHECK-NEXT: FEAT_MTE, FEAT_MTE2 Enable Memory Tagging Extension
39+
// CHECK-NEXT: FEAT_NV, FEAT_NV2 Enable Armv8.4-A Nested Virtualization Enchancement
40+
// CHECK-NEXT: FEAT_PAN Enable Armv8.1-A Privileged Access-Never extension
41+
// CHECK-NEXT: FEAT_PAN2 Enable Armv8.2-A PAN s1e1R and s1e1W Variants
42+
// CHECK-NEXT: FEAT_PAuth Enable Armv8.3-A Pointer Authentication extension
43+
// CHECK-NEXT: FEAT_PMUv3 Enable Armv8.0-A PMUv3 Performance Monitors extension
44+
// CHECK-NEXT: FEAT_RAS, FEAT_RASv1p1 Enable Armv8.0-A Reliability, Availability and Serviceability Extensions
45+
// CHECK-NEXT: FEAT_RDM Enable Armv8.1-A Rounding Double Multiply Add/Subtract instructions
46+
// CHECK-NEXT: FEAT_SB Enable Armv8.5-A Speculation Barrier
47+
// CHECK-NEXT: FEAT_SEL2 Enable Armv8.4-A Secure Exception Level 2 extension
48+
// CHECK-NEXT: FEAT_SPECRES Enable Armv8.5-A execution and data prediction invalidation instructions
49+
// CHECK-NEXT: FEAT_SSBS, FEAT_SSBS2 Enable Speculative Store Bypass Safe bit
50+
// CHECK-NEXT: FEAT_SVE Enable Scalable Vector Extension (SVE) instructions
51+
// CHECK-NEXT: FEAT_SVE2 Enable Scalable Vector Extension 2 (SVE2) instructions
52+
// CHECK-NEXT: FEAT_SVE_BitPerm Enable bit permutation SVE2 instructions
53+
// CHECK-NEXT: FEAT_TLBIOS, FEAT_TLBIRANGE Enable Armv8.4-A TLB Range and Maintenance instructions
54+
// CHECK-NEXT: FEAT_TRBE Enable Trace Buffer Extension
55+
// CHECK-NEXT: FEAT_TRF Enable Armv8.4-A Trace extension
56+
// CHECK-NEXT: FEAT_UAO Enable Armv8.2-A UAO PState
57+
// CHECK-NEXT: FEAT_VHE Enable Armv8.1-A Virtual Host extension
58+
// CHECK-NEXT: FEAT_WFxT Enable Armv8.7-A WFET and WFIT instruction
59+
// CHECK-NEXT: FEAT_XS Enable Armv8.7-A limited-TLB-maintenance instruction
60+

clang/test/Misc/target-invalid-cpu-note/aarch64.c

Lines changed: 1 addition & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -36,6 +36,7 @@
3636
// CHECK-SAME: {{^}}, apple-s9
3737
// CHECK-SAME: {{^}}, carmel
3838
// CHECK-SAME: {{^}}, cobalt-100
39+
// CHECK-SAME: {{^}}, cortex-a320
3940
// CHECK-SAME: {{^}}, cortex-a34
4041
// CHECK-SAME: {{^}}, cortex-a35
4142
// CHECK-SAME: {{^}}, cortex-a510

llvm/lib/Target/AArch64/AArch64Processors.td

Lines changed: 20 additions & 2 deletions
Original file line numberDiff line numberDiff line change
@@ -17,6 +17,12 @@
1717
def TuneA35 : SubtargetFeature<"a35", "ARMProcFamily", "CortexA35",
1818
"Cortex-A35 ARM processors">;
1919

20+
def TuneA320 : SubtargetFeature<"a320", "ARMProcFamily", "CortexA320",
21+
"Cortex-A320 ARM processors", [
22+
FeatureFuseAES,
23+
FeatureFuseAdrpAdd,
24+
FeaturePostRAScheduler]>;
25+
2026
def TuneA53 : SubtargetFeature<"a53", "ARMProcFamily", "CortexA53",
2127
"Cortex-A53 ARM processors", [
2228
FeatureFuseAES,
@@ -709,6 +715,16 @@ def TuneOryon : SubtargetFeature<"oryon-1", "ARMProcFamily",
709715
HasV8_6aOps]>;
710716

711717
def ProcessorFeatures {
718+
list<SubtargetFeature> A320 = [HasV9_2aOps, FeatureNEON, FeatureMTE,
719+
FeatureSVEBitPerm, FeatureFP16FML, FeatureFullFP16,
720+
FeatureETE, FeaturePerfMon, FeatureCCIDX,
721+
FeatureCCPP, FeatureComplxNum, FeatureCRC,
722+
FeatureDotProd, FeatureJS, FeatureLOR,
723+
FeatureLSE, FeaturePAN, FeaturePAN_RWV,
724+
FeaturePAuth, FeaturePsUAO, FeatureRAS,
725+
FeatureRDM, FeatureTRBE, FeatureVH,
726+
FeatureFlagM, FeaturePredRes, FeatureSB, FeatureSSBS,
727+
FeatureSVE, FeatureSVE2];
712728
list<SubtargetFeature> A53 = [HasV8_0aOps, FeatureCRC, FeatureSHA2, FeatureAES,
713729
FeatureFPARMv8, FeatureNEON, FeaturePerfMon];
714730
list<SubtargetFeature> A55 = [HasV8_2aOps, FeatureSHA2, FeatureAES, FeatureFPARMv8,
@@ -1110,10 +1126,12 @@ class ProcessorAlias<string n, string alias> {
11101126
def : ProcessorModel<"generic", CortexA510Model, ProcessorFeatures.Generic,
11111127
[FeatureFuseAES, FeatureFuseAdrpAdd, FeaturePostRAScheduler,
11121128
FeatureEnableSelectOptimize]>;
1113-
def : ProcessorModel<"cortex-a35", CortexA53Model, ProcessorFeatures.A53,
1114-
[TuneA35]>;
11151129
def : ProcessorModel<"cortex-a34", CortexA53Model, ProcessorFeatures.A53,
11161130
[TuneA35]>;
1131+
def : ProcessorModel<"cortex-a35", CortexA53Model, ProcessorFeatures.A53,
1132+
[TuneA35]>;
1133+
def : ProcessorModel<"cortex-a320", CortexA510Model, ProcessorFeatures.A320,
1134+
[TuneA320]>;
11171135
def : ProcessorModel<"cortex-a53", CortexA53Model, ProcessorFeatures.A53,
11181136
[TuneA53]>;
11191137
def : ProcessorModel<"cortex-a55", CortexA55Model, ProcessorFeatures.A55,

llvm/lib/Target/AArch64/AArch64Subtarget.cpp

Lines changed: 1 addition & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -175,6 +175,7 @@ void AArch64Subtarget::initializeProperties(bool HasMinSize) {
175175
PrefLoopAlignment = Align(32);
176176
MaxBytesForLoopAlignment = 16;
177177
break;
178+
case CortexA320:
178179
case CortexA510:
179180
case CortexA520:
180181
PrefFunctionAlignment = Align(16);

llvm/lib/TargetParser/Host.cpp

Lines changed: 1 addition & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -228,6 +228,7 @@ StringRef sys::detail::getHostCPUNameForARM(StringRef ProcCpuinfoContent) {
228228
.Case("0xd14", "cortex-r82ae")
229229
.Case("0xd02", "cortex-a34")
230230
.Case("0xd04", "cortex-a35")
231+
.Case("0xd8f", "cortex-a320")
231232
.Case("0xd03", "cortex-a53")
232233
.Case("0xd05", "cortex-a55")
233234
.Case("0xd46", "cortex-a510")

llvm/unittests/TargetParser/Host.cpp

Lines changed: 3 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -72,6 +72,9 @@ Serial : 0000000000000000
7272
}
7373

7474
TEST(getLinuxHostCPUName, AArch64) {
75+
EXPECT_EQ(sys::detail::getHostCPUNameForARM("CPU implementer : 0x41\n"
76+
"CPU part : 0xd8f"),
77+
"cortex-a320");
7578
EXPECT_EQ(sys::detail::getHostCPUNameForARM("CPU implementer : 0x41\n"
7679
"CPU part : 0xd03"),
7780
"cortex-a53");

llvm/unittests/TargetParser/TargetParserTest.cpp

Lines changed: 2 additions & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -1085,6 +1085,7 @@ INSTANTIATE_TEST_SUITE_P(
10851085
AArch64CPUTests, AArch64CPUTestFixture,
10861086
::testing::Values(AArch64CPUTestParams("cortex-a34", "armv8-a"),
10871087
AArch64CPUTestParams("cortex-a35", "armv8-a"),
1088+
AArch64CPUTestParams("cortex-a320", "armv9.2-a"),
10881089
AArch64CPUTestParams("cortex-a53", "armv8-a"),
10891090
AArch64CPUTestParams("cortex-a55", "armv8.2-a"),
10901091
AArch64CPUTestParams("cortex-a510", "armv9-a"),
@@ -1262,7 +1263,7 @@ INSTANTIATE_TEST_SUITE_P(
12621263
AArch64CPUAliasTestParams::PrintToStringParamName);
12631264

12641265
// Note: number of CPUs includes aliases.
1265-
static constexpr unsigned NumAArch64CPUArchs = 89;
1266+
static constexpr unsigned NumAArch64CPUArchs = 90;
12661267

12671268
TEST(TargetParserTest, testAArch64CPUArchList) {
12681269
SmallVector<StringRef, NumAArch64CPUArchs> List;

0 commit comments

Comments
 (0)