Skip to content

Commit 90df664

Browse files
committed
[MCA][X86] Fix throughput of (V)PMOV extension/truncation 512-bit instructions
znver4 512-bit instructions are half rate of 128/256-bit variants (still 1uop though) Confirmed with Agner/uops.info Noticed while triaging #110308 and #117579
1 parent ec4c47d commit 90df664

File tree

3 files changed

+184
-184
lines changed

3 files changed

+184
-184
lines changed

llvm/lib/Target/X86/X86ScheduleZnver4.td

Lines changed: 2 additions & 2 deletions
Original file line numberDiff line numberDiff line change
@@ -1652,7 +1652,7 @@ def : InstRW<[Zn4MOVS], (instregex
16521652

16531653
def Zn4MOVSZ: SchedWriteRes<[Zn4FPFMisc12]> {
16541654
let Latency = 4;
1655-
let ReleaseAtCycles = [4];
1655+
let ReleaseAtCycles = [2];
16561656
let NumMicroOps = 1;
16571657
}
16581658
def : InstRW<[Zn4MOVSZ], (instregex
@@ -1661,7 +1661,7 @@ def : InstRW<[Zn4MOVSZ], (instregex
16611661

16621662
def Zn4MOVSrr: SchedWriteRes<[Zn4FPFMisc12]> {
16631663
let Latency = 5;
1664-
let ReleaseAtCycles = [5];
1664+
let ReleaseAtCycles = [2];
16651665
let NumMicroOps = 1;
16661666
}
16671667
def : InstRW<[Zn4MOVSrr], (instregex

0 commit comments

Comments
 (0)