1. 77d2885 QS8 AArch32 GEMM benchmark build fix by Frank Barchard · 2 years, 8 months ago
  2. 6cb0fd0 Add AArch32 GEMM benchmarks for Cortex A53 and Cortex A7 by Frank Barchard · 2 years, 8 months ago
  3. ca51090 QS8 GEMM benchmark for JIT add ISA check by Frank Barchard · 2 years, 8 months ago
  4. 34251d8 QS8 4x8 lane GEMM AArch32 microkernel for Cortex A7 by Frank Barchard · 2 years, 8 months ago
  5. 9e4d2aa QS8 4x8 lane GEMM AArch32 microkernel for Cortex A53 by Frank Barchard · 2 years, 8 months ago
  6. 348c377 QU8 GEMM/IGEMM WAsm SIMD microkernels with SR=4 by Marat Dukhan · 2 years, 8 months ago
  7. fbd67a7 Pad K to a multiple of SR in GEMM/IGEMM microkernels by Marat Dukhan · 2 years, 8 months ago
  8. 0f294ad QS8 4x8 dot product GEMM AArch32 microkernel for Cortex A55 by Frank Barchard · 2 years, 8 months ago
  9. 83844ae Change JIT generator signature to accept nc and kc to specialize on those values by Zhi An Ng · 2 years, 8 months ago
  10. 1bef0f2 Add JIT microkernels to QS8 GEMM benchmarks by Zhi An Ng · 2 years, 9 months ago
  11. 7c1115f Reoptimize microkernel selection for WAsm 1.0 by Marat Dukhan · 2 years, 9 months ago
  12. 440e8ed Add FMAGIC/IMAGIC/LRINTF requantization variants in microkernel benchmarks by Marat Dukhan · 2 years, 9 months ago
  13. 2ac722e Refactor requantization in scalar QS8/QC8/QU8 microkernels by Marat Dukhan · 2 years, 9 months ago
  14. 4c61779 Minimally support WebAssembly Relaxed SIMD builds by Marat Dukhan · 2 years, 9 months ago
  15. 9f3f420 QS8 4x8 LD64 dot product GEMM AArch32 microkernel by Frank Barchard · 2 years, 9 months ago
  16. da7b2e2 QS8 4x8 lane GEMM AArch32 microkernel by Frank Barchard · 2 years, 9 months ago
  17. 914f57b AArch64 4x8 lane ld64 GEMM/IGEMM microkernels. by Frank Barchard · 2 years, 9 months ago
  18. 8999190 Remove GEMMLOWP requantization from QS8 GEMM/IGEMM templates by Marat Dukhan · 2 years, 10 months ago
  19. f82ea82 Add PRFM benchmarks for qs8 lane by Frank Barchard · 2 years, 10 months ago
  20. 27bf92c RNDNU versions of all Neon lane microkernels. by Frank Barchard · 2 years, 10 months ago
  21. 5cffb64 4x16 lane AArch64 NEON GEMM/IGEMM ld64 microkernel by Frank Barchard · 2 years, 10 months ago
  22. 64ab1b7 LD1R and LD2R variants of c4 microkernel by Frank Barchard · 2 years, 10 months ago
  23. 15eec02 LD1R and LD2R variants of c2 microkernel by Frank Barchard · 2 years, 10 months ago
  24. 42f5c50 LOADDUP variant of c2 microkernel by Frank Barchard · 2 years, 10 months ago
  25. e22685a Remove padal from quantized microkernel names. by Frank Barchard · 2 years, 11 months ago
  26. c7a032d C2S4 QS8 Neon GEMM/IGEMM microkernels. by Frank Barchard · 2 years, 11 months ago
  27. 5132010 QS8 C4 Neon GEMM and E2E benchmarks by Frank Barchard · 2 years, 11 months ago
  28. 494cd2b S4 variant of C2 Neon GEMM/IGEMM microkernel by Frank Barchard · 2 years, 11 months ago
  29. 952cb51 S4 variant of C2 Neon GEMM/IGEMM mull microkernel by Frank Barchard · 2 years, 11 months ago
  30. 1d41247 Neon C2 microkernels switch to rndnu from gemmlowp by Frank Barchard · 2 years, 11 months ago
  31. 42a17dd Switch scalar gemmlowp to rndnu for benchmarks by Frank Barchard · 3 years, 1 month ago
  32. 8dc106e QC8/QS8/QU8 GEMM/IGEMM WAsm SIMD microkernels using i32x4.dot_i16x8_s instruction by Marat Dukhan · 3 years, 1 month ago
  33. 889ed10 QS8 gemm benchmarks switch from GEMMLOWP to RNDNU for AARCH64 assembly by Frank Barchard · 3 years, 1 month ago
  34. 4c3e5a9 GEMM benchmark assembly microkernels before intrinsics. by Frank Barchard · 3 years, 1 month ago
  35. dfc2db0 Add prefix to QC8/QS8/QU8 WAsm SIMD GEMM/IGEMM microkernels by Marat Dukhan · 3 years, 2 months ago
  36. 4486f87 Prune NEON-DOT QS8 GEMM/IGEMM microkernels with FP32 & GEMMLOWP requantization by Marat Dukhan · 3 years, 2 months ago
  37. 8674629 Use QS8 GEMM WAsm SIMD microkernels with FP32 requantization in the benchmark by Marat Dukhan · 3 years, 2 months ago
  38. 0ff7989 Use FP32 requantization for extended-weights QS8 GEMM microkernels on x86 by Marat Dukhan · 3 years, 2 months ago
  39. 529d2c1 Remove x86 QS8 GEMM microkernels with GEMMLOWP requantization from benchmarks by Marat Dukhan · 3 years, 2 months ago
  40. ec47958 Prune redundant NEON GEMM/IGEMM microkernels with GEMMLOWP requantization by Marat Dukhan · 3 years, 2 months ago
  41. 91351ef Allocate additional XNN_EXTRA_BYTES for input in QS8/QU8 GEMM benchmarks by Marat Dukhan · 3 years, 2 months ago
  42. e13e639 Align packed weights on 64 bytes in microkernel benchmarks by Marat Dukhan · 3 years, 2 months ago
  43. b657605 Fix random number generation in QS8 GEMM benchmark by Marat Dukhan · 3 years, 2 months ago
  44. e60e997 Remove most GEMM/IGEMM microkernels with GEMMLOWP requantization by Marat Dukhan · 3 years, 3 months ago
  45. 0ae35f2 QS8 LD128 GEMM/IGEMM dot product 4x16 microkernel by Frank Barchard · 3 years, 3 months ago
  46. 0b04374 Support QC8 GEMM microkernels by Marat Dukhan · 3 years, 4 months ago
  47. 4a2d255 Remove redundant SSSE3 microkernels with FP32 requantization by Marat Dukhan · 3 years, 4 months ago
  48. c46e671 FP32 requantization in QS8 GEMM/IGEMM microkernels for SSE/AVX/XOP by Marat Dukhan · 3 years, 4 months ago
  49. 8102593 Update GEMM benchmarks by Marat Dukhan · 3 years, 4 months ago
  50. 9b474cf Support FP32 requantization in AVX2 QS8 microkernels by Marat Dukhan · 3 years, 4 months ago
  51. e3d17bf Rename microkernel-related types and structures by Marat Dukhan · 3 years, 4 months ago
  52. b07c26a Rename QS8 GEMM/IGEMM/DWCONV microkernels by Marat Dukhan · 3 years, 4 months ago
  53. 748fd12 Use specialized layouts in SSE4/AVX2 QS8 [I]GEMM & DWCONV microkernels by Marat Dukhan · 3 years, 4 months ago
  54. 725f47e Split QS8/QU8 GEMM parameter initialization by datatype by Marat Dukhan · 3 years, 4 months ago
  55. d5694df Use pointer to parameter initialization function in GEMM/IGEMM/DWCONV microkernel tests by Marat Dukhan · 3 years, 4 months ago
  56. d4416d6 4x16 QS8 microkernel for Cortex A53 by Frank Barchard · 3 years, 4 months ago
  57. f56f4c4 Refactor interface of microkernel parameter initialization by Marat Dukhan · 3 years, 4 months ago
  58. a1a4e78 Scalar QS8 GEMM and IGEMM microkernels by Marat Dukhan · 3 years, 5 months ago
  59. 938ea81 Code generate 1x8C8 microkernel for Cortex A75 with and without prfm by Frank Barchard · 3 years, 5 months ago
  60. 46a69c9 QS8 1x8C8 GEMM microkernel for Cortex A53 by Frank Barchard · 3 years, 5 months ago
  61. 5549735 4X8 and 4x16 mla lane microkernels for A53 by Frank Barchard · 3 years, 5 months ago
  62. d68e114 Cortex A53 tuned C8 gemm/igemm microkernels by Frank Barchard · 3 years, 5 months ago
  63. 1f51d38 Add prefetch to MLA lane microkernel by Frank Barchard · 3 years, 5 months ago
  64. 4a35204 PRFM variant of QS8 C8 Neon microkernel. by Frank Barchard · 3 years, 5 months ago
  65. 2e42787 2x4c2/3x4c2 microkernels for SSE2/SSSE3/SSE4.1/AVX/XOP by Marat Dukhan · 3 years, 6 months ago
  66. a3c1633 AVX versions of QS8 GEMM/IGEMM microkernels by Marat Dukhan · 3 years, 6 months ago
  67. b876263 QS8 1X8C8 GEMM microkernel by Frank Barchard · 3 years, 6 months ago
  68. 2f06150 xnn_qs8_gemm_minmax_ukernel_2x8c8__aarch64_neon_mlal_padal GEMM microkernel by Frank Barchard · 3 years, 7 months ago
  69. 5655cb7 QS8 GEMM 2x8c16 MLAL PADAL assembly microkernel for AArch64 by Frank Barchard · 3 years, 7 months ago
  70. 62b4ff7 Remove 12x8 QS8 GEMM and IGEMM Neon dotproduct microkernels. by Frank Barchard · 3 years, 7 months ago
  71. da78da1 QS8 C8 Neon microkernels with MUL and MLA versions. by Frank Barchard · 3 years, 7 months ago
  72. 4a4be4e QS8 1x16c4 ld32 GEMM microkernel using NEON dot product by Frank Barchard · 3 years, 7 months ago
  73. a5e242c QS8 LD32 GEMM microkernel for big cores with dotproduct by Frank Barchard · 3 years, 7 months ago
  74. 71c4d1a QS8 Neon GEMM C16 microkernel with two 8 bit multiplies and vpadal to accumulate. by Frank Barchard · 3 years, 7 months ago
  75. 8247e21 C2 QS8 microkernel using mull then mlal with KC loop of 16 by Frank Barchard · 3 years, 8 months ago
  76. 5899012 QS8 Neon GEMM C8 microkernel with 8 bit multiply and vpadal to accumulate. by Frank Barchard · 3 years, 8 months ago
  77. a93765f Add MR=1 versions of QS8 gemm benchmarks by Frank Barchard · 3 years, 8 months ago
  78. 2302ffd QS8 Neon GEMM microkernel with 8 bit multiply and vpadal to accumulate by Frank Barchard · 3 years, 8 months ago
  79. ec0bf14 QS8 GEMM and IGEMM 3x8 3x16 and IGEMM 4x8 and 4x16 by Frank Barchard · 3 years, 8 months ago
  80. 4ecae2e QS8 Neon GEMM microkernel with 8 bit multiply by Frank Barchard · 3 years, 8 months ago
  81. cfbc849 Add 4x8 and 4x16 qs8 gemm microkernels by Frank Barchard · 3 years, 8 months ago
  82. d713e8a Refactor microbenchmarks by Marat Dukhan · 3 years, 10 months ago
  83. 146e999 Replace QS8 4x8 with 2x8 neon microkernel. Improves performance for aarch32. by Frank Barchard · 4 years ago
  84. f2742c4 Cortex A55r1 QS8 GEMM microkernel by Frank Barchard · 4 years ago
  85. 31328cb Add RUY benchmark to qs8_gemm_bench by Frank Barchard · 4 years ago
  86. 0797eb1 Rename QS8 assembly GEMM kernels to ld64 by Frank Barchard · 4 years ago
  87. f1fd89e 1x16 QS8 GEMM AARCH64 assembly microkernel using dot product. by Frank Barchard · 4 years ago
  88. 31bb45b 4x16 QS8 GEMM AARCH64 assembly microkernel using dot product. by Frank Barchard · 4 years ago
  89. a48848f 4x8, 6x8 and 8x16 Neon dot product GEMM microkernels by Frank Barchard · 4 years ago
  90. 2fa1745 6x16 QS8 GEMM for Neon dot product by Frank Barchard · 4 years ago
  91. a964473 Add xnn_qs8_gemm_minmax_ukernel_${MR}x${NR}c4__neondot (ARMv8.2+dotprod). by Benoit Jacob · 4 years, 1 month ago
  92. bb00b1d AVX512 variants of QS8 GEMM and IGEMM microkernels by Marat Dukhan · 4 years, 1 month ago
  93. ab67142 Benchmark ARM NEON versions of QS8 GEMM microkernels by Marat Dukhan · 4 years, 2 months ago
  94. 27203da WAsm SIMD versions of QS8 GEMM and IGEMM microkernels by Marat Dukhan · 4 years, 2 months ago
  95. ecd8311 Rename s8rng/s32rng -> i8rng/i32rng by Marat Dukhan · 4 years, 2 months ago
  96. 683fab3 XW (eXtended Weights) optimization for QS8 GEMM microkernel by Marat Dukhan · 4 years, 2 months ago
  97. e7edc80 Add 3x4c8 variants of SSE2/SSSE3/SSE4.1/XOP GEMM/IGEMM microkernels by Marat Dukhan · 4 years, 2 months ago
  98. 1280952 AVX2 version of QS8 GEMM and IGEMM microkernels by Marat Dukhan · 4 years, 2 months ago
  99. 1566fee XOP versions of QS8 GEMM/IGEMM microkernels by Marat Dukhan · 4 years, 2 months ago
  100. 44f0ca7 Bind RNG by reference in microbenchmarks by Marat Dukhan · 4 years, 2 months ago