1. e5eee46 Refactor pre-SSE4 versions of QS8/QC8 GEMM/IGEMM microkernels by Marat Dukhan · 3 years, 3 months ago
  2. 960ae34 NEON implementations of QC8 c8 GEMM and IGEMM assembly microkernels by Frank Barchard · 3 years, 3 months ago
  3. 1663c0c NEON implementations of QS8 2x8c16 GEMM and IGEMM assembly microkernels by Frank Barchard · 3 years, 3 months ago
  4. 28138f1 QS8 Neon microkernels switch from x9 to x11 for params by Frank Barchard · 3 years, 3 months ago
  5. 14f325e C2 GEMM/IGEMM QS8/QC8 NEON microkernels with FP32 requantization by Marat Dukhan · 3 years, 3 months ago
  6. e8e8c54 QC8 neon assembly re-quantization change LDP to LDR by Frank Barchard · 3 years, 3 months ago
  7. 4741e41 WAsm SIMD implementation of QS8 GEMM/IGEMM with FP32 requantization by Marat Dukhan · 3 years, 3 months ago
  8. ee029b2 Replace deprecated wasm_simd128.h intrinsics with new versions by Marat Dukhan · 3 years, 3 months ago
  9. f10af6c NEON Dot Product implementations of QC8 c4 GEMM and IGEMM assembly microkernels by Frank Barchard · 3 years, 3 months ago
  10. 98af05c NEON 4x16 QC8 GEMM and IGEMM assembly microkernels for Cortex A53 by Frank Barchard · 3 years, 3 months ago
  11. 1a0b276 NEON Dot Product implementations of QS8 FP32 c4 GEMM and IGEMM assembly microkernels by Frank Barchard · 3 years, 3 months ago
  12. 3ebfb13 NEON Dot Product implementation of QS8 FP32 4x16c4 GEMM assembly microkernel by Frank Barchard · 3 years, 3 months ago
  13. 779b253 Scalar QS8 GEMM/IGEMM microkernels with FP32 requantization by Marat Dukhan · 3 years, 3 months ago
  14. e452560 Remove MRxNR from template name on neondot c4 microkernels by Frank Barchard · 3 years, 3 months ago
  15. 70f35ea QS8 template rename with quantization removed from name. by Frank Barchard · 3 years, 3 months ago
  16. e76478b NEON implementations of QC8 GEMM/IGEMM microkernels by Marat Dukhan · 3 years, 3 months ago
  17. aef9091 Minor optimization for AArch64 NEON QS8 microkernels with FP32 requantization by Marat Dukhan · 3 years, 3 months ago
  18. 2d3c97c NEON QS8 GEMM microkernels with FP32 requantization by Marat Dukhan · 3 years, 3 months ago
  19. 3357d9d Minor optimizations in NEON QS8 GEMM/IGEMM microkernels by Marat Dukhan · 3 years, 3 months ago
  20. e742d2a Re-generate QS8 GEMM/IGEMM microkernels by Marat Dukhan · 3 years, 3 months ago
  21. 533410e QS8 A53 GEMM bug fix for X1 - re-enable E2E by Frank Barchard · 3 years, 3 months ago
  22. 16d79ed Polyfill vcvtnq_s32_f32 for AArch32 GCC by Marat Dukhan · 3 years, 3 months ago
  23. 0ae35f2 QS8 LD128 GEMM/IGEMM dot product 4x16 microkernel by Frank Barchard · 3 years, 3 months ago
  24. 7c9f1f9 Replace // with # for lines that only contain a comment. by Frank Barchard · 3 years, 4 months ago
  25. fc188ed QC8 GEMM/IGEMM microkernels for SSE/AVX/XOP by Marat Dukhan · 3 years, 4 months ago
  26. 18630de QS8 NEONDOT GEMM/IGEMM microkernels with FP32 requantization by Marat Dukhan · 3 years, 4 months ago
  27. 0b04374 Support QC8 GEMM microkernels by Marat Dukhan · 3 years, 4 months ago
  28. 4a2d255 Remove redundant SSSE3 microkernels with FP32 requantization by Marat Dukhan · 3 years, 4 months ago
  29. c46e671 FP32 requantization in QS8 GEMM/IGEMM microkernels for SSE/AVX/XOP by Marat Dukhan · 3 years, 4 months ago
  30. c08221f Apply text format to assembly for consistency by Frank Barchard · 3 years, 4 months ago
  31. 1ecbf53 Add templates for all QS8 GEMM assembly microkernels. by Frank Barchard · 3 years, 4 months ago
  32. 71855ee Support FP32 requantization in AVX512 QS8 microkernels by Marat Dukhan · 3 years, 4 months ago
  33. d4c7d82 AVX512-specific parameters for QS8 microkernels by Marat Dukhan · 3 years, 4 months ago
  34. 9b474cf Support FP32 requantization in AVX2 QS8 microkernels by Marat Dukhan · 3 years, 4 months ago
  35. f86ee8b Refactor requantization helper functions by Marat Dukhan · 3 years, 4 months ago
  36. e3d17bf Rename microkernel-related types and structures by Marat Dukhan · 3 years, 4 months ago
  37. b07c26a Rename QS8 GEMM/IGEMM/DWCONV microkernels by Marat Dukhan · 3 years, 4 months ago
  38. d65d20e Rename QS8 GEMM/IGEMM microkernel filenames by Marat Dukhan · 3 years, 4 months ago
  39. e091adb 4x16 QS8 GEMM/IGEMM Cortex A53 microkernels reduce to use 2 GPR for temp by Frank Barchard · 3 years, 4 months ago
  40. 748fd12 Use specialized layouts in SSE4/AVX2 QS8 [I]GEMM & DWCONV microkernels by Marat Dukhan · 3 years, 4 months ago
  41. d5694df Use pointer to parameter initialization function in GEMM/IGEMM/DWCONV microkernel tests by Marat Dukhan · 3 years, 4 months ago
  42. d4416d6 4x16 QS8 microkernel for Cortex A53 by Frank Barchard · 3 years, 4 months ago
  43. 76f43f0 Apply consistent formatting to assembly by Frank Barchard · 3 years, 4 months ago
  44. a24cc08 Small refactoring of scalar QS8 microkernels by Marat Dukhan · 3 years, 4 months ago
  45. a1a4e78 Scalar QS8 GEMM and IGEMM microkernels by Marat Dukhan · 3 years, 5 months ago
  46. 938ea81 Code generate 1x8C8 nicrokernel for Cortex A75 with and without prfm by Frank Barchard · 3 years, 5 months ago
  47. b639210 Add prefetch of A for quantized microkernels. by Frank Barchard · 3 years, 5 months ago
  48. e111861 1x8 C8 A53 microkernel defer adap by Frank Barchard · 3 years, 5 months ago
  49. 7c4c771 C8 A53 microkernels prefetch A by Frank Barchard · 3 years, 5 months ago
  50. 2a3169d C8 A53 microkernels move 2nd load after MLA by Frank Barchard · 3 years, 5 months ago
  51. dddb38f QS8 1x8C8 IGEMM microkernel for Cortex A53 by Frank Barchard · 3 years, 5 months ago
  52. 46a69c9 QS8 1x8C8 GEMM microkernel for Cortex A53 by Frank Barchard · 3 years, 5 months ago
  53. 2de3bce A53 C8 microkernel load A with ldr/ldr/ins by Frank Barchard · 3 years, 5 months ago
  54. 5549735 4X8 and 4x16 mla lane microkernels for A53 by Frank Barchard · 3 years, 5 months ago
  55. d68e114 Cortex A53 tuned C8 gemm/igemm microkernels by Frank Barchard · 3 years, 5 months ago
  56. 1f51d38 Add prefetch to MLA lane microkernel by Frank Barchard · 3 years, 5 months ago
  57. 4c6640c Disable MSan in QS8 GEMM/IGEMM microkernels with KR>1 by Marat Dukhan · 3 years, 5 months ago
  58. 4a35204 PRFM variant of QS8 C8 Neon microkernel. by Frank Barchard · 3 years, 5 months ago
  59. 2e42787 2x4c2/3x4c2 microkernels for SSE2/SSSE3/SSE4.1/AVX/XOP by Marat Dukhan · 3 years, 5 months ago
  60. a3c1633 AVX versions of QS8 GEMM/IGEMM microkernels by Marat Dukhan · 3 years, 6 months ago
  61. c409471 Include XOP headers in clang-cl compatible way. Fix #1382. by Marat Dukhan · 3 years, 6 months ago
  62. 62b4ff7 Remove 12x8 QS8 GEMM and IGEMM Neon dotproduct microkernels. by Frank Barchard · 3 years, 7 months ago
  63. da78da1 QS8 C8 Neon microkernels with MUL and MLA versions. by Frank Barchard · 3 years, 7 months ago
  64. 618d85d QS8 Neon dot product intrinsics GEMM and IGEMM microkernels reduced remainder code. by Frank Barchard · 3 years, 7 months ago
  65. 6d8ca7d Quantized GEMM/IGEMM microkernels bump kc to be a multiple of channels. by Frank Barchard · 3 years, 7 months ago
  66. fd1dee7 QS8 C16 GEMM microkernel source renamed from mull to mlal by Frank Barchard · 3 years, 7 months ago
  67. 01c341b C8 MLA Neon GEMM/IGEMM microkernels count k down from kc. by Frank Barchard · 3 years, 7 months ago
  68. 71c4d1a QS8 Neon GEMM C16 microkernel with two 8 bit multiplies and vpadal to accumulate. by Frank Barchard · 3 years, 7 months ago
  69. 6d138db Remove scalar C4 QS8 and QU8 gemm microkernels. by Frank Barchard · 3 years, 7 months ago
  70. a0fe11d QS8 C8 Neon remove remainder handling code and rewind the A pointers by kc by Frank Barchard · 3 years, 7 months ago
  71. 32389c6 QS8 e2e benchmark for C2 neon microkernels by Frank Barchard · 3 years, 7 months ago
  72. aaafdc7 QS8 scalar gemm remove bias variables. by Frank Barchard · 3 years, 7 months ago
  73. fe14b85 Add space after casting by Frank Barchard · 3 years, 7 months ago
  74. c8532ae Unroll KC loop to do MULL and then MLAL to 16 bit before lengthening to 32 bit. by Frank Barchard · 3 years, 7 months ago
  75. 7e1f371 QS8 GEMM for neon reorder with MR inner loop so mull and mlal to avoid dependency on destination. by Frank Barchard · 3 years, 8 months ago
  76. 8247e21 C2 QS8 microkernel using mull then mlal with KC loop of 16 by Frank Barchard · 3 years, 8 months ago
  77. 5899012 QS8 Neon GEMM C8 microkernel with 8 bit multiply and vpadal to accumulate. by Frank Barchard · 3 years, 8 months ago
  78. 2302ffd QS8 Neon GEMM microkernel with 8 bit multiply and vpadal to accumulate by Frank Barchard · 3 years, 8 months ago
  79. ec0bf14 QS8 GEMM and IGEMM 3x8 3x16 and IGEMM 4x8 and 4x16 by Frank Barchard · 3 years, 8 months ago
  80. 4ecae2e QS8 Neon GEMM microkernel with 8 bit multiply by Frank Barchard · 3 years, 8 months ago
  81. cfbc849 Add 4x8 and 4x16 qs8 gemm microkernels by Frank Barchard · 3 years, 8 months ago
  82. 146e999 Replace QS8 4x8 with 2x8 neon microkernel. Improves performance for aarch32. by Frank Barchard · 4 years ago
  83. 66ccf64 Rename QS8 generator templates by Marat Dukhan · 4 years ago
  84. a48848f 4x8, 6x8 and 8x16 Neon dot product GEMM microkernels by Frank Barchard · 4 years ago
  85. 2fa1745 6x16 QS8 GEMM for Neon dot product by Frank Barchard · 4 years ago
  86. ef4ce31 Remove trailing whitespace by Marat Dukhan · 4 years ago
  87. d4c8303 Enable NEON DOT QS8 [I]GEMM microkernels on ARM64 by Marat Dukhan · 4 years ago
  88. 12c5777 Optimization: 2x partial unroll to load 8 contiguous bytes. by Benoit Jacob · 4 years, 1 month ago
  89. a964473 Add xnn_qs8_gemm_minmax_ukernel_${MR}x${NR}c4__neondot (ARMv8.2+dotprod). by Benoit Jacob · 4 years, 1 month ago
  90. 0af63ab Include polyfills for intrinsics in QS8 AVX512 GEMM/IGEMM microkernels by Marat Dukhan · 4 years, 1 month ago
  91. bb00b1d AVX512 variants of QS8 GEMM and IGEMM microkernels by Marat Dukhan · 4 years, 1 month ago
  92. f124e88 Polyfill _mm_loadu_si32 and _mm_storeu_si32 intrinsics by Marat Dukhan · 4 years, 1 month ago
  93. 5b3af47 Re-generate QS8 and QU8 microkernels from templates by Marat Dukhan · 4 years, 1 month ago
  94. b33fc0e Add xnn_q{u,s}8_gemm_minmax_ukernel_MRxNRc4__scalar by Benoit Jacob · 4 years, 1 month ago
  95. 27203da WAsm SIMD versions of QS8 GEMM and IGEMM microkernels by Marat Dukhan · 4 years, 2 months ago
  96. 23848db Reoptimize x86 requantization by Marat Dukhan · 4 years, 2 months ago
  97. 40bbafe NEON variants of QS8 GEMM & IGEMM microkernels by Marat Dukhan · 4 years, 2 months ago
  98. 683fab3 XW (eXtended Weights) optimization for QS8 GEMM microkernel by Marat Dukhan · 4 years, 2 months ago
  99. e7edc80 Add 3x4c8 variants of SSE2/SSSE3/SSE4.1/XOP GEMM/IGEMM microkernels by Marat Dukhan · 4 years, 2 months ago
  100. 1280952 AVX2 version of QS8 GEMM and IGEMM microkernels by Marat Dukhan · 4 years, 2 months ago