x86_64 Intel Skylake
Comparison of different BLAS implementations
Test machine: Intel Core i7 8250U @ 3.4 GHz (Skylake), theoretical maximum throughput of 54.4 (108.8) Gflops in double (single) precision.
BLASFEO target: X64_INTEL_HASWELL
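The peak throughput figures quoted for the test machine can be reproduced with a short calculation. The sketch below is only an illustration and rests on assumptions not stated on this page: a single core running at the quoted 3.4 GHz, two FMA units per core, and 256-bit vector registers (consistent with the Haswell target). The function name `peak_gflops` and its parameters are hypothetical.

```python
def peak_gflops(clock_ghz, fma_units, vector_bits, element_bits, cores=1):
    """Theoretical peak in Gflops under the stated assumptions."""
    # Number of elements processed per vector instruction.
    lanes = vector_bits // element_bits
    # Each FMA counts as two floating-point operations (multiply + add).
    flops_per_cycle = fma_units * lanes * 2
    return clock_ghz * flops_per_cycle * cores


# Assumed: 1 core, 2 FMA units, 256-bit vectors, 3.4 GHz clock.
double_peak = peak_gflops(3.4, fma_units=2, vector_bits=256, element_bits=64)
single_peak = peak_gflops(3.4, fma_units=2, vector_bits=256, element_bits=32)

print(f"double precision: {double_peak:.1f} Gflops")
print(f"single precision: {single_peak:.1f} Gflops")
```

Under these assumptions the double precision case gives 16 flops per cycle and the single precision case 32, which matches the 54.4 (108.8) Gflops quoted above.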
gemm_nn
gemm_nt
gemm_tn
gemm_tt
syrk_ln
syrk_lt
syrk_un
syrk_ut
trmm_rlnn
trmm_rutn
trsm_llnn
trsm_llnu
trsm_lltn
trsm_lltu
trsm_lunn
trsm_lunu
trsm_lutn
trsm_lutu
trsm_rlnn
trsm_rlnu
trsm_rltn
trsm_rltu
trsm_runn
trsm_runu
trsm_rutn
trsm_rutu
geqrf
gelqf
potrf_u
potrf_l
gemv_n
gemv_t
gemv_nt
symv_l
trmv_lnn
trmv_ltn
trsv_lnn
trsv_ltn