Commit 977528d

dgemmjit looks identical to regular MKL, which suggests (a) that the JIT isn't working (compiler warnings about ignored preprocessor directives reinforce this), and (b) a way to benchmark both OpenBLAS and MKL.
1 parent 21cb787 commit 977528d
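
The comparison described in the commit message comes down to timing the same double-precision matrix multiply under different backends. As a minimal sketch (not the benchmark code from this repository), the hypothetical helper `gemm_gflops` below times `mul!` from Julia's `LinearAlgebra` standard library and reports the best run in GFLOPS. Which BLAS library actually sits behind `mul!` depends on how Julia was built or configured, and switching it is outside this sketch.

```julia
using LinearAlgebra

# Hypothetical helper (not part of this repository): best-of-`reps`
# throughput, in GFLOPS, of an n-by-n Float64 matrix multiply.
function gemm_gflops(n::Integer; reps::Integer = 5)
    A = rand(n, n)
    B = rand(n, n)
    C = zeros(n, n)
    mul!(C, A, B)                  # warm-up run so compilation is not timed
    best = Inf
    for _ in 1:reps
        t = @elapsed mul!(C, A, B) # wall-clock seconds for one multiply
        best = min(best, t)
    end
    return 2 * n^3 / best / 1e9    # a dense matmul costs about 2n^3 flops
end
```

If two kernels report near-identical numbers across a range of `n`, that is consistent with both calls ending up in the same underlying routine, which is the inference the commit message draws about dgemmjit and regular MKL.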

15 files changed: 42 additions and 16 deletions

docs/src/assets/bench_AmulB_v1.svg

Lines changed: 3 additions & 1 deletion

docs/src/assets/bench_AmulBt_v1.svg

Lines changed: 3 additions & 1 deletion

docs/src/assets/bench_AplusAt_v1.svg

Lines changed: 3 additions & 1 deletion

docs/src/assets/bench_AtmulB_v1.svg

Lines changed: 3 additions & 1 deletion

docs/src/assets/bench_AtmulBt_v1.svg

Lines changed: 3 additions & 1 deletion

docs/src/assets/bench_aplusBc_v1.svg

Lines changed: 3 additions & 1 deletion

docs/src/assets/bench_dot3_v1.svg

Lines changed: 3 additions & 1 deletion

docs/src/assets/bench_dot_v1.svg

Lines changed: 3 additions & 1 deletion

docs/src/assets/bench_exp_v1.svg

Lines changed: 3 additions & 1 deletion

docs/src/assets/bench_logdettriangle_v1.svg

Lines changed: 3 additions & 1 deletion

docs/src/assets/bench_random_access_v1.svg

Lines changed: 3 additions & 1 deletion

docs/src/assets/bench_selfdot_v1.svg

Lines changed: 3 additions & 1 deletion

docs/src/assets/bench_sse_v1.svg

Lines changed: 3 additions & 1 deletion

src/LoopVectorization.jl

Lines changed: 1 addition & 1 deletion
@@ -8,7 +8,7 @@ using VectorizationBase: REGISTER_SIZE, REGISTER_COUNT, extract_data, num_vector
 PackedStridedPointer, SparseStridedPointer, RowMajorStridedPointer, StaticStridedPointer, StaticStridedStruct
 using SIMDPirates: VECTOR_SYMBOLS, evadd, evmul, vrange, reduced_add, reduced_prod, reduce_to_add, reduce_to_prod,
 sizeequivalentfloat, sizeequivalentint, vadd!, vsub!, vmul!, vfdiv!, vfmadd!, vfnmadd!, vfmsub!, vfnmsub!,
-vfmadd231, vfmsub231, vfnmadd231, vfnmsub231,
+vfmadd231, vfmsub231, vfnmadd231, vfnmsub231, #prefetch,
 vmullog2, vmullog10, vdivlog2, vdivlog10, vmullog2add!, vmullog10add!, vdivlog2add!, vdivlog10add!, vfmaddaddone
 using Base.Broadcast: Broadcasted, DefaultArrayStyle
 using LinearAlgebra: Adjoint, Transpose

src/costs.jl

Lines changed: 2 additions & 2 deletions
@@ -194,8 +194,8 @@ const COST = Dict{Instruction,InstructionCost}(
 Instruction(:sincospi_fast) => InstructionCost(25,22.0,70.0,26),
 Instruction(:identity) => InstructionCost(0,0.0,0.0,0),
 Instruction(:adjoint) => InstructionCost(0,0.0,0.0,0),
-Instruction(:transpose) => InstructionCost(0,0.0,0.0,0)
-# Symbol("##CONSTANT##") => InstructionCost(0,0.0)
+Instruction(:transpose) => InstructionCost(0,0.0,0.0,0),
+Instruction(:prefetch) => InstructionCost(0,0.0,0.0,0)
 )
 
 # const KNOWNINSTRUCTIONS = keys(COST)
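
The hunk above registers `:prefetch` in the cost table with an all-zero cost, the same as `:identity`, `:adjoint`, and `:transpose`. The sketch below shows the shape of such a table in isolation. The two struct definitions are hypothetical stand-ins: the diff shows only that `InstructionCost` takes four positional arguments, so the field names and their meanings here are guesses, and `COST_SKETCH` and `lookup_cost` are illustrative names rather than anything in the package.

```julia
# Hypothetical stand-ins for the package's types. The field names are
# assumptions; the diff only shows a four-argument constructor.
struct Instruction
    name::Symbol
end

struct InstructionCost
    scalar_latency::Int
    scalar_reciprocal_throughput::Float64
    scaling::Float64
    register_pressure::Int
end

# A cut-down table in the same style as the diff. Every entry except the
# last needs a trailing comma, which is why the :transpose line changed too.
const COST_SKETCH = Dict{Instruction,InstructionCost}(
    Instruction(:sincospi_fast) => InstructionCost(25, 22.0, 70.0, 26),
    Instruction(:transpose)     => InstructionCost(0, 0.0, 0.0, 0),
    Instruction(:prefetch)      => InstructionCost(0, 0.0, 0.0, 0)
)

# Look up a cost by name; throws a KeyError for unregistered instructions.
lookup_cost(name::Symbol) = COST_SKETCH[Instruction(name)]
```

In this sketch, an instruction without an entry makes `lookup_cost` throw, so adding a table entry is what makes a new instruction usable by anything that consults the table. Whether the package itself fails the same way on a missing entry is not shown in this diff.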
