  1. 27bf92c RNDNU versions of all Neon lane microkernels. by Frank Barchard · 2 years, 10 months ago
  2. 5cffb64 4x16 lane AArch64 NEON GEMM/IGEMM ld64 microkernel by Frank Barchard · 2 years, 11 months ago
  3. 64ab1b7 LD1R and LD2R variants of c4 microkernel by Frank Barchard · 2 years, 11 months ago
  4. 15eec02 LD1R and LD2R variants of c2 microkernel by Frank Barchard · 2 years, 11 months ago
  5. 42f5c50 LOADDUP variant of c2 microkernel by Frank Barchard · 2 years, 11 months ago
  6. e22685a Remove padal from quantized microkernel names. by Frank Barchard · 2 years, 11 months ago
  7. eb704f7 QS8 C4S2 Neon GEMM/IGEMM microkernels by Frank Barchard · 2 years, 11 months ago
  8. c7a032d C2S4 QS8 Neon GEMM/IGEMM microkernels. by Frank Barchard · 2 years, 11 months ago
  9. 287952a QS8 C4 Neon GEMM/IGEMM microkernels by Frank Barchard · 2 years, 11 months ago
  10. 494cd2b S4 variant of C2 Neon GEMM/IGEMM microkernel by Frank Barchard · 3 years ago
  11. 952cb51 S4 variant of C2 Neon GEMM/IGEMM mull microkernel by Frank Barchard · 3 years ago
  12. ccbaedf C2 Neon microkernel remove duplicate DUP instructions from NR loop. by Frank Barchard · 3 years ago
  13. 1d41247 Neon C2 microkernels switch to rndnu from gemmlowp by Frank Barchard · 3 years ago
  14. 0bf8afa Leverage f32x4.pmin and f32x4.pmax WAsm SIMD instructions by Marat Dukhan · 3 years ago
  15. d460d0b Neon IGEMM do remainder with reversed MR for shifts by Frank Barchard · 3 years, 1 month ago
  16. 1ce78ab Leverage Load-Zero WAsm SIMD instructions in Chrome M88 microkernels by Marat Dukhan · 3 years, 1 month ago
  17. 90cd7df Fix rewind params for qs8 4x16c4 by Frank Barchard · 3 years, 1 month ago
  18. b7a7c30 NEON GEMM/IGEMM microkernels change store/dup to 2 of each by Frank Barchard · 3 years, 1 month ago
  19. 29833fd Change stores to non-lane STR by Frank Barchard · 3 years, 1 month ago
  20. e7e001f Fix bug in QC8/QS8/QU8 IGEMM DOT16x2 LD128 WAsm SIMD microkernels by Marat Dukhan · 3 years, 1 month ago
  21. 8589ecd QS8 IGEMM use x11 for params, x10 for a3 and x0 for cn_stride by Frank Barchard · 3 years, 1 month ago
  22. 4810905 Leverage v128.const WAsm SIMD instruction by Marat Dukhan · 3 years, 1 month ago
  23. 8dc106e QC8/QS8/QU8 GEMM/IGEMM WAsm SIMD microkernels using i32x4.dot_i16x8_s instruction by Marat Dukhan · 3 years, 1 month ago
  24. 6b30b73 Remainder branch move before label. by Frank Barchard · 3 years, 1 month ago
  25. 56f157c Relabel branches for quantized assembly ARM microkernels by Frank Barchard · 3 years, 1 month ago
  26. 7a8dd87 Work around generating v128.storeXX_lane for quantized WAsm SIMD microkernels by Marat Dukhan · 3 years, 2 months ago
  27. 0c2a31e Improve unpacking in SSE4+ QC8/QS8/QU8 GEMM/IGEMM microkernels by Marat Dukhan · 3 years, 2 months ago
  28. 07706f6 Replace generic shuffle with narrow instructions in WAsm SIMD QS8/QU8/QC8 microkernels by Marat Dukhan · 3 years, 2 months ago
  29. dfc2db0 Add prefix to QC8/QS8/QU8 WAsm SIMD GEMM/IGEMM microkernels by Marat Dukhan · 3 years, 2 months ago
  30. 3e9dc22 Remove WAsm SIMD GEMM/IGEMM microkernels with GEMMLOWP requantization by Marat Dukhan · 3 years, 2 months ago
  31. 4486f87 Prune NEON-DOT QS8 GEMM/IGEMM microkernels with FP32 & GEMMLOWP requantization by Marat Dukhan · 3 years, 2 months ago
  32. 400e7cb Prune WAsm SIMD QS8 GEMM/IGEMM microkernels with GEMMLOWP requantization by Marat Dukhan · 3 years, 2 months ago
  33. e16bf7d Prune AVX2/AVX512 QS8 GEMM/IGEMM microkernels with GEMMLOWP requantization by Marat Dukhan · 3 years, 2 months ago
  34. d4db6af Replace wasm_i32x4_lt(vzero, vXX) with wasm_i32x4_shr(vXX, 31) by Marat Dukhan · 3 years, 2 months ago
  35. 2c6d196 Q8 4x16 and 1x16 Neon GEMM/IGEMM quantize using V0-V3 by Frank Barchard · 3 years, 2 months ago
  36. fbe0c6f Q8 4x16 Neon IGEMM quantize using V0-V3 by Frank Barchard · 3 years, 2 months ago
  37. 59ed1da QU8 4x16 Neon assembly microkernel by Frank Barchard · 3 years, 2 months ago
  38. 6967eb0 Add a rewind variable for params. - no impact on code, just simplified source by Frank Barchard · 3 years, 2 months ago
  39. 793c8da QS8 igemm comment for zero use int8_t* instead of float* by Frank Barchard · 3 years, 2 months ago
  40. 510b8e0 Code generator for RNDNU quantization mode on neon-mull-addw-dup microkernel by Frank Barchard · 3 years, 2 months ago
  41. 26e8378 Reduce register pressure in GEMMLOWP quantization on NEON by Frank Barchard · 3 years, 2 months ago
  42. 1a2dbe1 RNDNU scalar GEMM/IGEMM microkernel by Frank Barchard · 3 years, 2 months ago
  43. efa123d Update Neon code with generators for added comment by Frank Barchard · 3 years, 2 months ago
  44. 22fbe77 RNDNU quantized 1x16 and 4x16 Neon lane GEMM/IGEMM microkernels. by Frank Barchard · 3 years, 3 months ago
  45. 13db60f RNDNU quantized Neon assembly GEMM/IGEMM microkernels. by Frank Barchard · 3 years, 3 months ago
  46. 60729d0 4x16c4 RNDNU quantized Neon assembly GEMM/IGEMM microkernel. by Frank Barchard · 3 years, 3 months ago
  47. e903dff QS8 GEMM/IGEMM microkernels with RNDNU requantization by Marat Dukhan · 3 years, 3 months ago
  48. 927d474 Scalar implementations of QU8 GEMM/IGEMM microkernels by Marat Dukhan · 3 years, 3 months ago
  49. 43bee05 WAsm SIMD implementation of QU8 GEMM/IGEMM microkernels by Marat Dukhan · 3 years, 3 months ago
  50. 69c8a29 NEON-MLAL implementations of QU8 GEMM/IGEMM microkernels by Marat Dukhan · 3 years, 3 months ago
  51. 3cf2e22 QU8 GEMM/IGEMM microkernels for AVX512 by Marat Dukhan · 3 years, 3 months ago
  52. 902ef7f QU8 GEMM/IGEMM AVX2 microkernels with FP32 requantization by Marat Dukhan · 3 years, 3 months ago
  53. 3d5aac6 Remove remnant SSE GEMM/IGEMM microkernels with GEMMLOWP requantization by Marat Dukhan · 3 years, 3 months ago
  54. e60e997 Remove most GEMM/IGEMM microkernels with GEMMLOWP requantization by Marat Dukhan · 3 years, 3 months ago
  55. cdbe9a3 Code-generate QU8 GEMM and IGEMM microkernels for SSE2/SSSE3/SSE4.1 by Marat Dukhan · 3 years, 3 months ago
  56. e5eee46 Refactor pre-SSE4 versions of QS8/QC8 GEMM/IGEMM microkernels by Marat Dukhan · 3 years, 3 months ago
  57. 960ae34 NEON implementations of QC8 c8 GEMM and IGEMM assembly microkernels by Frank Barchard · 3 years, 3 months ago
  58. 1663c0c NEON implementations of QS8 2x8c16 GEMM and IGEMM assembly microkernels by Frank Barchard · 3 years, 3 months ago
  59. 14f325e C2 GEMM/IGEMM QS8/QC8 NEON microkernels with FP32 requantization by Marat Dukhan · 3 years, 3 months ago
  60. a8864fb QS8 IGEMM use x11 for params by Frank Barchard · 3 years, 3 months ago
  61. e8e8c54 QC8 neon assembly re-quantization change LDP to LDR by Frank Barchard · 3 years, 3 months ago
  62. 4741e41 WAsm SIMD implementation of QS8 GEMM/IGEMM with FP32 requantization by Marat Dukhan · 3 years, 3 months ago
  63. ee029b2 Replace deprecated wasm_simd128.h intrinsics with new versions by Marat Dukhan · 3 years, 3 months ago
  64. f10af6c NEON Dot Product implementations of QC8 c4 GEMM and IGEMM assembly microkernels by Frank Barchard · 3 years, 3 months ago
  65. 98af05c NEON 4x16 QC8 GEMM and IGEMM assembly microkernels for Cortex A53 by Frank Barchard · 3 years, 3 months ago
  66. 1a0b276 NEON Dot Product implementations of QS8 FP32 c4 GEMM and IGEMM assembly microkernels by Frank Barchard · 3 years, 3 months ago
  67. 779b253 Scalar QS8 GEMM/IGEMM microkernels with FP32 requantization by Marat Dukhan · 3 years, 3 months ago
  68. e452560 Remove MRxNR from template name on neondot c4 microkernels by Frank Barchard · 3 years, 3 months ago
  69. 70f35ea QS8 template rename with quantization removed from name. by Frank Barchard · 3 years, 3 months ago
  70. e76478b NEON implementations of QC8 GEMM/IGEMM microkernels by Marat Dukhan · 3 years, 3 months ago
  71. aef9091 Minor optimization for AArch64 NEON QS8 microkernels with FP32 requantization by Marat Dukhan · 3 years, 3 months ago
  72. cf05585 NEON QS8 IGEMM microkernels with FP32 requantization by Marat Dukhan · 3 years, 3 months ago
  73. 3357d9d Minor optimizations in NEON QS8 GEMM/IGEMM microkernels by Marat Dukhan · 3 years, 3 months ago
  74. e742d2a Re-generate QS8 GEMM/IGEMM microkernels by Marat Dukhan · 3 years, 3 months ago
  75. 533410e QS8 A53 GEMM bug fix for X1 - re-enable E2E by Frank Barchard · 3 years, 4 months ago
  76. 16d79ed Polyfill vcvtnq_s32_f32 for AArch32 GCC by Marat Dukhan · 3 years, 4 months ago
  77. 0ae35f2 QS8 LD128 GEMM/IGEMM dot product 4x16 microkernel by Frank Barchard · 3 years, 4 months ago
  78. 7c9f1f9 Replace // with # for lines that only contain a comment. by Frank Barchard · 3 years, 4 months ago
  79. 18630de QS8 NEONDOT GEMM/IGEMM microkernels with FP32 requantization by Marat Dukhan · 3 years, 4 months ago
  80. 801d2c2 Fix QS8 IGEMM with FP32 requantization for SSE/AVX/XOP by Marat Dukhan · 3 years, 4 months ago
  81. e695791 4x16C4 QS8 IGEMM Cortex A55 microkernel reuse X10 to save push by Frank Barchard · 3 years, 4 months ago
  82. 4a2d255 Remove redundant SSSE3 microkernels with FP32 requantization by Marat Dukhan · 3 years, 4 months ago
  83. c46e671 FP32 requantization in QS8 GEMM/IGEMM microkernels for SSE/AVX/XOP by Marat Dukhan · 3 years, 4 months ago
  84. c08221f Apply text format to assembly for consistency by Frank Barchard · 3 years, 4 months ago
  85. 1c538cd Add templates for all QS8 IGEMM assembly microkernels. by Frank Barchard · 3 years, 4 months ago
  86. 71855ee Support FP32 requantization in AVX512 QS8 microkernels by Marat Dukhan · 3 years, 4 months ago
  87. d4c7d82 AVX512-specific parameters for QS8 microkernels by Marat Dukhan · 3 years, 4 months ago
  88. 9b474cf Support FP32 requantization in AVX2 QS8 microkernels by Marat Dukhan · 3 years, 4 months ago
  89. f86ee8b Refactor requantization helper functions by Marat Dukhan · 3 years, 4 months ago
  90. e3d17bf Rename microkernel-related types and structures by Marat Dukhan · 3 years, 4 months ago
  91. b07c26a Rename QS8 GEMM/IGEMM/DWCONV microkernels by Marat Dukhan · 3 years, 4 months ago
  92. d65d20e Rename QS8 GEMM/IGEMM microkernel filenames by Marat Dukhan · 3 years, 4 months ago
  93. e091adb 4x16 QS8 GEMM/IGEMM Cortex A53 microkernels reduce to use 2 GPR for temp by Frank Barchard · 3 years, 4 months ago
  94. 748fd12 Use specialized layouts in SSE4/AVX2 QS8 [I]GEMM & DWCONV microkernels by Marat Dukhan · 3 years, 4 months ago
  95. 4bb82cc 4x16 QS8 IGEMM microkernels use x8 for temp by Frank Barchard · 3 years, 5 months ago
  96. 4be4bd7 4x16 QS8 IGEMM microkernels use x14 for A1 by Frank Barchard · 3 years, 5 months ago
  97. fb672aa 4x16 QS8 IGEMM microkernel for Cortex A53 avoid a push by Frank Barchard · 3 years, 5 months ago
  98. d4416d6 4x16 QS8 microkernel for Cortex A53 by Frank Barchard · 3 years, 5 months ago
  99. 76f43f0 Apply consistent formatting to assembly by Frank Barchard · 3 years, 5 months ago
  100. a24cc08 Small refactoring of scalar QS8 microkernels by Marat Dukhan · 3 years, 5 months ago