1. d67539d Auto-generate X8 LUT microkernels and tests by Marat Dukhan · 3 years, 2 months ago
  2. cdf59a5 Add QU8 NR=32 microkernels by Frank Barchard · 3 years, 2 months ago
  3. df8e604 4x8 QU8 Neon Dotproduct microkernel rename from ld64 to ld128 by Frank Barchard · 3 years, 2 months ago
  4. a49e41f QU8 4x16C4 NEON Dot Product GEMM/IGEMM microkernels for Cortex A55r1 by Frank Barchard · 3 years, 2 months ago
  5. 0a3093c QU8 vadd neon use x32 instead of x8 by Frank Barchard · 3 years, 2 months ago
  6. 7da8b02 Q8 dwconv switch from 8x25 to 16x25 by Frank Barchard · 3 years, 2 months ago
  7. e252f92 End-to-end benchmarks on QC8 MobileNet v1/v2 models by Marat Dukhan · 3 years, 2 months ago
  8. 0d06573 dwconv Q8 switch from 8x9 to 16x9 tile. by Frank Barchard · 3 years, 2 months ago
  9. 8b69802 Enable QU8 C4 NEON Dot Product GEMM/IGEMM microkernels for Cortex A55r1 by Frank Barchard · 3 years, 3 months ago
  10. ca4c68e QU8 C4 NEON Dot Product GEMM/IGEMM microkernels for Cortex A55r1 by Frank Barchard · 3 years, 3 months ago
  11. 0c76422 QU8 NEON Assembly remove channel wise by Frank Barchard · 3 years, 3 months ago
  12. 4066898 QU8 4x16 C4 NEON Assembly Dot Product GEMM/IGEMM microkernels by Frank Barchard · 3 years, 3 months ago
  13. 0049e89 QU8 C4 NEON Assembly Dot Product GEMM/IGEMM microkernels by Frank Barchard · 3 years, 3 months ago
  14. de9c64a Enable 4x16 QU8 dot production microkernels by Frank Barchard · 3 years, 3 months ago
  15. 65692c7 Fix build for Clang on Windows by peter · 3 years, 3 months ago
  16. e79acb7 S8 VCLAMP microkernels by Marat Dukhan · 3 years, 3 months ago
  17. 2314753 S8 MAXPOOL microkernels for all architectures by Marat Dukhan · 3 years, 3 months ago
  18. 9098aba E2E for QU8 GEMM microkernels by Frank Barchard · 3 years, 3 months ago
  19. e033126 Generate more tile sizes for QU8 gemm/igemm by Frank Barchard · 3 years, 3 months ago
  20. 2025515 Enable dot production microkernels for QU8 on ARM by Frank Barchard · 3 years, 3 months ago
  21. 88e839c QU8 C4 NEON Dot Product GEMM/IGEMM microkernels by Frank Barchard · 3 years, 3 months ago
  22. 0461f2d Generalize PAD microkernels to all 8-/16-/32-bit data types by Marat Dukhan · 3 years, 3 months ago
  23. 933051b Generalize FILL microkernels to all 8-/16-/32-bit data types by Marat Dukhan · 3 years, 3 months ago
  24. 7c74aff Add F32 VLRELU benchmarks by Marat Dukhan · 3 years, 3 months ago
  25. 4486f87 Prune NEON-DOT QS8 GEMM/IGEMM microkernels with FP32 & GEMMLOWP requantization by Marat Dukhan · 3 years, 3 months ago
  26. e16bf7d Prune AVX2/AVX512 QS8 GEMM/IGEMM microkernels with GEMMLOWP requantization by Marat Dukhan · 3 years, 3 months ago
  27. 66a3ca1 Initialize QS8 microkernel pointers on pre-NEON ARM architecture by Marat Dukhan · 3 years, 3 months ago
  28. 0ff7989 Use FP32 requantization for extended-weights QS8 GEMM microkernels on x86 by Marat Dukhan · 3 years, 3 months ago
  29. ec47958 Prune redundant NEON GEMM/IGEMM microkernels with GEMMLOWP requantization by Marat Dukhan · 3 years, 3 months ago
  30. f879d9e Add qs8-requantization-test to CMake build by Marat Dukhan · 3 years, 3 months ago
  31. 599d3db Fix CMake build by Marat Dukhan · 3 years, 3 months ago
  32. 0853b8a QS8/QU8 Multiply ND operators by Marat Dukhan · 3 years, 3 months ago
  33. 8b024c9 QS8/QU8 VMULC microkernel benchmark by Marat Dukhan · 3 years, 3 months ago
  34. fb3a94f QU8 4x16 Neon assembly microkernel for Cortex A75 by Frank Barchard · 3 years, 3 months ago
  35. 795e5ab QS8/QU8 VMUL microkernel benchmarks by Marat Dukhan · 3 years, 3 months ago
  36. 4a7b70f QS8/QU8 VMUL[C] microkernels in NEON implementation by Marat Dukhan · 3 years, 3 months ago
  37. 7999341 QS8/QU8 VMUL[C] microkernels in scalar implementation by Marat Dukhan · 3 years, 3 months ago
  38. 59ed1da QU8 4x16 Neon assembly microkernel by Frank Barchard · 3 years, 3 months ago
  39. a212eac QS8/QU8 VMUL[C] microkernels in SSE2/SSE4.1/AVX implementation by Marat Dukhan · 3 years, 3 months ago
  40. eb3cff3 LD128 versions of QS8/QU8 VADD[C] NEON microkernels by Marat Dukhan · 3 years, 3 months ago
  41. 01debd9 Optimize QS8 VADD[C] microkernel selection on ARM/ARM64 by Marat Dukhan · 3 years, 4 months ago
  42. 1ef9de8 QU8 VADD/VADDC microkernel benchmarks by Marat Dukhan · 3 years, 4 months ago
  43. 83a8d2f QS8 VADD/VADDC microkernel benchmarks by Marat Dukhan · 3 years, 4 months ago
  44. 60bb7ec Accumulate in 16 bits once in AVX2 MUL16 VPUNPCK QS8/QC8 DWCONV before extending to 32 bits by Marat Dukhan · 3 years, 4 months ago
  45. 881ab02 AVX2 MUL16 QS8/QC8 DWCONV microkernels using VPUNPCK instructions to extend the product by Marat Dukhan · 3 years, 4 months ago
  46. 2848059 Optimize QC8 DWCONV microkernel selection on AVX and XOP by Marat Dukhan · 3 years, 4 months ago
  47. 195b72f Split microkernel lists in CMakeLists into production and non-production by Marat Dukhan · 3 years, 4 months ago
  48. db3b0a7 Refactor microkernel lists in BUILD and CMakeLists.txt by Marat Dukhan · 3 years, 4 months ago
  49. 6084fb8 E2E benchmark for QU8 DWCONV microkernels by Marat Dukhan · 3 years, 4 months ago
  50. 73a899a QU8 DWCONV NEON microkernels with RNDNU requantization by Marat Dukhan · 3 years, 4 months ago
  51. 173661d QU8 GEMM/IGEMM NEON microkernels with RNDNU requantization by Marat Dukhan · 3 years, 4 months ago
  52. 0744fa0 QS8 DWCONV microkernel benchmark by Marat Dukhan · 3 years, 4 months ago
  53. bbfc6d3 E2E benchmark for QS8 DWCONV microkernels by Marat Dukhan · 3 years, 4 months ago
  54. 510b8e0 Code generator for RNDNU quantization mode on neon-mull-addw-dup microkernel by Frank Barchard · 3 years, 4 months ago
  55. 0966856 Accumulate in 16 bits once in SSE2/SSE4/AVX/XOP MUL16 QS8/QC8 DWCONV before extending to 32 bits by Marat Dukhan · 3 years, 4 months ago
  56. 5f2939f QS8/QC8 DWCONV NEON MUL8/MLA8 microkernels using 128-bit loads by Marat Dukhan · 3 years, 4 months ago
  57. 476eb84 Fix CMake build by Marat Dukhan · 3 years, 4 months ago
  58. caccd8e Accumulate in 16 bits once in NEON QS8/QC8 DWCONV before extending to 32 bits by Marat Dukhan · 3 years, 4 months ago
  59. 1a2dbe1 RNDNU scalar GEMM/IGEMM microkernel by Frank Barchard · 3 years, 4 months ago
  60. e76049a AVX512 implementation of QS8/QU8 VADD[C] microkernels by Marat Dukhan · 3 years, 4 months ago
  61. 28c82b2 Fix CMake build by Marat Dukhan · 3 years, 4 months ago
  62. 3eac69c Optimized QU8 VADD[C] microkernels for SSE4/AVX/XOP/AVX2 by Marat Dukhan · 3 years, 4 months ago
  63. 036b2b1 Add QU8 MobileNet v2 model to end-to-end benchmark by Marat Dukhan · 3 years, 4 months ago
  64. 76e78c8 Generalize QS8 VADD[C] templates to cover QU8 VADD[C] microkernels by Marat Dukhan · 3 years, 4 months ago
  65. 22fbe77 RNDNU quantized 1x16 and 4x16 Neon lane GEMM/IGEMM microkernels. by Frank Barchard · 3 years, 4 months ago
  66. 13db60f RNDNU quantized Neon assembly GEMM/IGEMM microkernels. by Frank Barchard · 3 years, 4 months ago
  67. 60729d0 4x16c4 RNDNU quantized Neon assembly GEMM/IGEMM microkernel. by Frank Barchard · 3 years, 4 months ago
  68. 4ba70b7 QS8/QC8 NEON microkernels using 8x8->16-bit multiplication by Marat Dukhan · 3 years, 4 months ago
  69. 20c36d4 Fix CMake build by Marat Dukhan · 3 years, 4 months ago
  70. e903dff QS8 GEMM/IGEMM microkernels with RNDNU requantization by Marat Dukhan · 3 years, 4 months ago
  71. be18f5c QS8 DWCONV microkernels with RNDNU requantization by Marat Dukhan · 3 years, 4 months ago
  72. d3d818c Fix requantization stubs for Ruy requantization schema by Marat Dukhan · 3 years, 4 months ago
  73. 7b1aeb9 Evaluation stubs for Ruy requantization schema by Marat Dukhan · 3 years, 4 months ago
  74. 89cd59b Remove legacy QU8 GEMM/IGEMM/DWCONV microkernels by Marat Dukhan · 3 years, 4 months ago
  75. 605696a NEON implementations of QU8 DWCONV microkernels by Marat Dukhan · 3 years, 4 months ago
  76. 1f71428 Scalar implementations of QU8 DWCONV microkernels by Marat Dukhan · 3 years, 4 months ago
  77. 927d474 Scalar implementations of QU8 GEMM/IGEMM microkernels by Marat Dukhan · 3 years, 4 months ago
  78. 69c8a29 NEON-MLAL implementations of QU8 GEMM/IGEMM microkernels by Marat Dukhan · 3 years, 4 months ago
  79. ac67ae8 Fix CMake build for ARM64 by Marat Dukhan · 3 years, 4 months ago
  80. cfd606b QU8 DWCONV microkernels for AVX512 by Marat Dukhan · 3 years, 4 months ago
  81. 09c312b QU8 DWCONV microkernels for AVX2 by Marat Dukhan · 3 years, 4 months ago
  82. f0f2881 QS8 DWCONV microkernels for SSE2/SSE4.1/AVX by Marat Dukhan · 3 years, 4 months ago
  83. 3c35f7a QU8 DWCONV microkernels for SSE4.1/AVX/XOP by Marat Dukhan · 3 years, 4 months ago
  84. 3cf2e22 QU8 GEMM/IGEMM microkernels for AVX512 by Marat Dukhan · 3 years, 4 months ago
  85. 902ef7f QU8 GEMM/IGEMM AVX2 microkernels with FP32 requantization by Marat Dukhan · 3 years, 4 months ago
  86. ef47f8d QU8 GEMM/IGEMM microkernels for SSE/AVX/XOP with FP32 requantization by Marat Dukhan · 3 years, 4 months ago
  87. e60e997 Remove most GEMM/IGEMM microkernels with GEMMLOWP requantization by Marat Dukhan · 3 years, 4 months ago
  88. cdbe9a3 Code-generate QU8 GEMM and IGEMM microkernels for SSE2/SSSE3/SSE4.1 by Marat Dukhan · 3 years, 4 months ago
  89. c2e8f66 Unify naming of QU8 GEMM/IGEMM/DWCONV microkernels with QS8/QC8 by Marat Dukhan · 3 years, 4 months ago
  90. 960ae34 NEON implementations of QC8 c8 GEMM and IGEMM assembly microkernels by Frank Barchard · 3 years, 4 months ago
  91. 1663c0c NEON implementations of QS8 2x8c16 GEMM and IGEMM assembly microkernels by Frank Barchard · 3 years, 4 months ago
  92. 14f325e C2 GEMM/IGEMM QS8/QC8 NEON microkernels with FP32 requantization by Marat Dukhan · 3 years, 4 months ago
  93. 5754706 Scalar implementation of QC8 DWCONV microkernels by Marat Dukhan · 3 years, 4 months ago
  94. f10af6c NEON Dot Product implementations of QC8 c4 GEMM and IGEMM assembly microkernels by Frank Barchard · 3 years, 4 months ago
  95. 98af05c NEON 4x16 QC8 GEMM and IGEMM assembly microkernels for Cortex A53 by Frank Barchard · 3 years, 4 months ago
  96. 85d772b QS8 DWCONV microkernels with FP32 requantization by Marat Dukhan · 3 years, 4 months ago
  97. d602154 Scalar implementations of QC8 GEMM/IGEMM microkernels by Marat Dukhan · 3 years, 5 months ago
  98. 258b639 Fix CMake build by Marat Dukhan · 3 years, 5 months ago
  99. fcfdd2d Refactor initialization of microkernel parameters by Marat Dukhan · 3 years, 5 months ago
  100. 1a0b276 NEON Dot Product implementations of QS8 FP32 c4 GEMM and IGEMM assembly microkernels by Frank Barchard · 3 years, 5 months ago