  1. 146e999 Replace QS8 4x8 with 2x8 neon microkernel. Improves performance for aarch32. by Frank Barchard · 3 years, 9 months ago
  2. f2742c4 Cortex A55r1 QS8 GEMM microkernel by Frank Barchard · 3 years, 9 months ago
  3. 0797eb1 Rename QS8 assembly GEMM kernels to ld64 by Frank Barchard · 3 years, 9 months ago
  4. a463285 4x16 QS8 GEMM use 4 less registers, avoiding push/pop. by Frank Barchard · 3 years, 9 months ago
  5. 59df88b 4x16 QS8 GEMM defer params by Frank Barchard · 3 years, 9 months ago
  6. f1fd89e 1x16 QS8 GEMM AARCH64 assembly microkernel using dot product. by Frank Barchard · 3 years, 9 months ago
  7. a5237a5 Rename 4x16 GEMM dot product microkernel file name to allow for future variations. by Frank Barchard · 3 years, 9 months ago
  8. 31bb45b 4x16 QS8 GEMM AARCH64 assembly microkernel using dot product. by Frank Barchard · 3 years, 9 months ago
  9. 66ccf64 Rename QS8 generator templates by Marat Dukhan · 3 years, 9 months ago
  10. a48848f 4x8, 6x8 and 8x16 Neon dot product GEMM microkernels by Frank Barchard · 3 years, 9 months ago
  11. 2fa1745 6x16 QS8 GEMM for Neon dot product by Frank Barchard · 3 years, 9 months ago
  12. ef4ce31 Remove trailing whitespace by Marat Dukhan · 3 years, 10 months ago
  13. d4c8303 Enable NEON DOT QS8 [I]GEMM microkernels on ARM64 by Marat Dukhan · 3 years, 10 months ago
  14. 12c5777 Optimization: 2x partial unroll to load 8 contiguous bytes. by Benoit Jacob · 3 years, 11 months ago
  15. a964473 Add xnn_qs8_gemm_minmax_ukernel_${MR}x${NR}c4__neondot (ARMv8.2+dotprod). by Benoit Jacob · 3 years, 11 months ago
  16. 0af63ab Include polyfills for intrinsics in QS8 AVX512 GEMM/IGEMM microkernels by Marat Dukhan · 3 years, 11 months ago
  17. bb00b1d AVX512 variants of QS8 GEMM and IGEMM microkernels by Marat Dukhan · 3 years, 11 months ago
  18. f124e88 Polyfill _mm_loadu_si32 and _mm_storeu_si32 intrinsics by Marat Dukhan · 3 years, 11 months ago
  19. 5b3af47 Re-generate QS8 and QU8 microkernels from templates by Marat Dukhan · 3 years, 11 months ago
  20. b33fc0e Add xnn_q{u,s}8_gemm_minmax_ukernel_MRxNRc4__scalar by Benoit Jacob · 3 years, 11 months ago
  21. 27203da WAsm SIMD versions of QS8 GEMM and IGEMM microkernels by Marat Dukhan · 4 years ago
  22. 23848db Reoptimize x86 requantization by Marat Dukhan · 4 years ago
  23. 40bbafe NEON variants of QS8 GEMM & IGEMM microkernels by Marat Dukhan · 4 years ago
  24. 683fab3 XW (eXtended Weights) optimization for QS8 GEMM microkernel by Marat Dukhan · 4 years ago
  25. e7edc80 Add 3x4c8 variants of SSE2/SSSE3/SSE4.1/XOP GEMM/IGEMM microkernels by Marat Dukhan · 4 years ago
  26. 1280952 AVX2 version of QS8 GEMM and IGEMM microkernels by Marat Dukhan · 4 years ago
  27. 1566fee XOP versions of QS8 GEMM/IGEMM microkernels by Marat Dukhan · 4 years ago
  28. dee732b LD128 versions of QS8 GEMM SSE2/SSSE3/SSE4.1 microkernels by Marat Dukhan · 4 years ago
  29. 14d3ce8 Add LD64 suffix in QS8 GEMM/IGEMM microkernels by Marat Dukhan · 4 years ago
  30. 733d0be QS8 GEMM MRx4c8 SSE2/SSSE3/SSE4.1 microkernels by Marat Dukhan · 4 years ago
  31. 595e170 QS8 GEMM microkernels and infrastructure by Marat Dukhan · 4 years ago