1. eb704f7 QS8 C4S2 Neon GEMM/IGEMM microkernels by Frank Barchard · 2 years, 7 months ago
  2. 4133313 Remove duplicate e2e benchmark. by Frank Barchard · 2 years, 7 months ago
  3. 07228a3 Remove E2E MR=1 benchmarks by Frank Barchard · 2 years, 7 months ago
  4. c7a032d C2S4 QS8 Neon GEMM/IGEMM microkernels. by Frank Barchard · 2 years, 7 months ago
  5. 1fe8995 Scalar F32->F16 VCVT microkernels by Marat Dukhan · 2 years, 7 months ago
  6. 4edfdbf NEON F32->F16 VCVT microkernels by Marat Dukhan · 2 years, 7 months ago
  7. 22e31c8 WAsm SIMD F32->F16 VCVT microkernels by Marat Dukhan · 2 years, 7 months ago
  8. eb84423 SSE2, SSE4.1, and AVX F32->F16 VCVT microkernels by Marat Dukhan · 2 years, 7 months ago
  9. 5132010 QS8 C4 Neon GEMM and E2E benchmarks by Frank Barchard · 2 years, 8 months ago
  10. 494cd2b S4 variant of C2 Neon GEMM/IGEMM microkernel by Frank Barchard · 2 years, 8 months ago
  11. 952cb51 S4 variant of C2 Neon GEMM/IGEMM mull microkernel by Frank Barchard · 2 years, 8 months ago
  12. fa4daf0 Add ISA check to QU8 GEMM benchmark by Frank Barchard · 2 years, 8 months ago
  13. 1d41247 Neon C2 microkernels switch to rndnu from gemmlowp by Frank Barchard · 2 years, 8 months ago
  14. d77f77d F32->F16 VCVT microkernels for NEON-FP16, F16C, and AVX512 by Marat Dukhan · 2 years, 8 months ago
  15. c9f9d67 Add Channel Tile of 16 for float and 32 for half float. by Frank Barchard · 2 years, 8 months ago
  16. e2c0001 Scalar FP16->FP32 VCVT microkernels by Marat Dukhan · 2 years, 8 months ago
  17. 434352f Benchmarks for FP16->FP32 VCVT microkernels by Marat Dukhan · 2 years, 8 months ago
  18. 354cbc6 QU8 MUL8 variant of DWCONV by Frank Barchard · 2 years, 9 months ago
  19. a4ad988 X8 LUT microkernels for WAsm SIMD by Marat Dukhan · 2 years, 9 months ago
  20. 2aa2e2a q8 dwconv add channel tiles of 24 and 32 for mul16 rndnu microkernels by Frank Barchard · 2 years, 9 months ago
  21. 2366290 Add qu8_gemm_4x16__aarch64_neon_mlal_lane_cortex_a75 benchmark to E2E by Frank Barchard · 2 years, 9 months ago
  22. 2b3c410 AVX512BW implementations of X8 LUT microkernels by Marat Dukhan · 2 years, 9 months ago
  23. 7c478e3 SSSE3, AVX, and AVX2 X8 LUT microkernels by Marat Dukhan · 2 years, 9 months ago
  24. f718232 X8 LUT NEON microkernels by Marat Dukhan · 2 years, 9 months ago
  25. 5407437 Benchmark for X8 LUT microkernels by Marat Dukhan · 2 years, 10 months ago
  26. 2df7542 Add qu8_4x8__neon_mlal_lane benchmark by Frank Barchard · 2 years, 10 months ago
  27. cdf59a5 Add QU8 NR=32 microkernels by Frank Barchard · 2 years, 10 months ago
  28. 6428725 Rename ADD quantization parameters to ADDSUB by Marat Dukhan · 2 years, 10 months ago
  29. df8e604 4x8 QU8 Neon Dotproduct microkernel rename from ld64 to ld128 by Frank Barchard · 2 years, 10 months ago
  30. 33b4f75 VRND microkernels using native WAsm SIMD instructions by Marat Dukhan · 2 years, 10 months ago
  31. 42a17dd Switch scalar gemmlowp to rndnu for benchmarks by Frank Barchard · 2 years, 10 months ago
  32. efc3ccf Add 4x16c4 cortex_a55 microkernels to GEMM and E2E benchmarks by Frank Barchard · 2 years, 10 months ago
  33. 8dc106e QC8/QS8/QU8 GEMM/IGEMM WAsm SIMD microkernels using i32x4.dot_i16x8_s instruction by Marat Dukhan · 2 years, 10 months ago
  34. feee77f Leverage f32x4.nearest, f32x4.floor, f32x4.ceil, f32x4.trunc WAsm SIMD instructions by Marat Dukhan · 2 years, 10 months ago
  35. 5d27a7b Leverage f32x4.nearest, f32x4.floor, f32x4.ceil, f32x4.trunc WAsm SIMD instructions by Marat Dukhan · 2 years, 10 months ago
  36. 0a3093c QU8 vadd neon use x32 instead of x8 by Frank Barchard · 2 years, 10 months ago
  37. 7da8b02 Q8 dwconv switch from 8x25 to 16x25 by Frank Barchard · 2 years, 10 months ago
  38. e252f92 End-to-end benchmarks on QC8 MobileNet v1/v2 models by Marat Dukhan · 2 years, 10 months ago
  39. ca4c68e QU8 C4 NEON Dot Product GEMM/IGEMM microkernels for Cortex A55r1 by Frank Barchard · 2 years, 10 months ago
  40. 4066898 QU8 4x16 C4 NEON Assembly Dot Product GEMM/IGEMM microkernels by Frank Barchard · 2 years, 10 months ago
  41. 0049e89 QU8 C4 NEON Assembly Dot Product GEMM/IGEMM microkernels by Frank Barchard · 2 years, 10 months ago
  42. a29b57e QU8 e2e benchmark remove rndnu from benchmark names. by Frank Barchard · 2 years, 10 months ago
  43. 889ed10 QS8 gemm benchmarks switch from GEMMLOWP to RNDNU for AARCH64 assembly by Frank Barchard · 2 years, 10 months ago
  44. 9cedb59 Accumulate in 16 bits once in WAsm SIMD MUL16 QS8/QC8 DWCONV before extending to 32 bits by Marat Dukhan · 2 years, 10 months ago
  45. 4c3e5a9 GEMM benchmark assembly microkernels before intrinsics. by Frank Barchard · 2 years, 10 months ago
  46. 9098aba E2E for QU8 GEMM microkernels by Frank Barchard · 2 years, 10 months ago
  47. e033126 Generate more tile sizes for QU8 gemm/igemm by Frank Barchard · 2 years, 10 months ago
  48. 88e839c QU8 C4 NEON Dot Product GEMM/IGEMM microkernels by Frank Barchard · 2 years, 10 months ago
  49. dfc2db0 Add prefix to QC8/QS8/QU8 WAsm SIMD GEMM/IGEMM microkernels by Marat Dukhan · 2 years, 11 months ago
  50. 7c74aff Add F32 VLRELU benchmarks by Marat Dukhan · 2 years, 11 months ago
  51. 4486f87 Prune NEON-DOT QS8 GEMM/IGEMM microkernels with FP32 & GEMMLOWP requantization by Marat Dukhan · 2 years, 11 months ago
  52. dc020ff Add 4x16c4 rndnu e2e benchmark for qs8. by Frank Barchard · 2 years, 11 months ago
  53. 8634f7e Refactor F32 VHSWISH benchmarks by Marat Dukhan · 2 years, 11 months ago
  54. 12e426c Refactor F32 VELU benchmarks by Marat Dukhan · 2 years, 11 months ago
  55. 9f8ea9b Refactor F32 VSIGMOID benchmarks by Marat Dukhan · 2 years, 11 months ago
  56. 5aeb32b Refactor F32 VSQRT benchmarks by Marat Dukhan · 2 years, 11 months ago
  57. 3b6c36e Refactor F32 VRELU benchmarks by Marat Dukhan · 2 years, 11 months ago
  58. 8674629 Use QS8 GEMM WAsm SIMD microkernels with FP32 requantization in the benchmark by Marat Dukhan · 2 years, 11 months ago
  59. 0ff7989 Use FP32 requantization for extended-weights QS8 GEMM microkernels on x86 by Marat Dukhan · 2 years, 11 months ago
  60. 529d2c1 Remove x86 QS8 GEMM microkernels with GEMMLOWP requantization from benchmarks by Marat Dukhan · 2 years, 11 months ago
  61. ec47958 Prune redundant NEON GEMM/IGEMM microkernels with GEMMLOWP requantization by Marat Dukhan · 2 years, 11 months ago
  62. 066a0cb Evaluate convertsion-based WAsm SIMD implementations in the rounding benchmark by Marat Dukhan · 2 years, 11 months ago
  63. 91351ef Allocate additional XNN_EXTRA_BYTES for input in QS8/QU8 GEMM benchmarks by Marat Dukhan · 2 years, 11 months ago
  64. e961ecf QU8 benchmark remove quantization from Neon names for consistency by Frank Barchard · 2 years, 11 months ago
  65. 8b024c9 QS8/QU8 VMULC microkernel benchmark by Marat Dukhan · 2 years, 11 months ago
  66. fb3a94f QU8 4x16 Neon assembly microkernel for Cortex A75 by Frank Barchard · 2 years, 11 months ago
  67. 795e5ab QS8/QU8 VMUL microkernel benchmarks by Marat Dukhan · 2 years, 11 months ago
  68. e2163bc Benchmark QU8 4x16 Neon assembly GEMM microkernel - Rename QS8 benchmarks to QU8 by Frank Barchard · 2 years, 11 months ago
  69. eb3cff3 LD128 versions of QS8/QU8 VADD[C] NEON microkernels by Marat Dukhan · 2 years, 11 months ago
  70. 1ef9de8 QU8 VADD/VADDC microkernel benchmarks by Marat Dukhan · 2 years, 11 months ago
  71. 83a8d2f QS8 VADD/VADDC microkernel benchmarks by Marat Dukhan · 2 years, 11 months ago
  72. bbe8824 Enable AVX2 MUL16 ADD16 microkernels in QS8 DWCONV benchmarks by Marat Dukhan · 2 years, 11 months ago
  73. 60bb7ec Accumulate in 16 bits once in AVX2 MUL16 VPUNPCK QS8/QC8 DWCONV before extending to 32 bits by Marat Dukhan · 2 years, 11 months ago
  74. 881ab02 AVX2 MUL16 QS8/QC8 DWCONV microkernels using VPUNPCK instructions to extend the product by Marat Dukhan · 2 years, 11 months ago
  75. cc96770 Evaluate MUL32 XOP QS8 DWCONV microkernels in E2E benchmark by Marat Dukhan · 2 years, 11 months ago
  76. 6084fb8 E2E benchmark for QU8 DWCONV microkernels by Marat Dukhan · 2 years, 11 months ago
  77. afd2ed9 Use NEON microkernels with RNDNU requantization in QU8 GEMM benchmark by Marat Dukhan · 2 years, 11 months ago
  78. d8e2d71 Update QU8 GEMM microkernel benchmarks by Marat Dukhan · 2 years, 11 months ago
  79. 0744fa0 QS8 DWCONV microkernel benchmark by Marat Dukhan · 2 years, 11 months ago
  80. e13e639 Align packed weights on 64 bytes in microkernel benchmarks by Marat Dukhan · 2 years, 11 months ago
  81. b657605 Fix random number generation in QS8 GEMM benchmark by Marat Dukhan · 2 years, 11 months ago
  82. bbfc6d3 E2E benchmark for QS8 DWCONV microkernels by Marat Dukhan · 2 years, 11 months ago
  83. 42b441b Specify parameter initialization function in F32 DWCONV benchmark by Marat Dukhan · 2 years, 11 months ago
  84. 88780d3 Specify parameter initialization function in F32 DWCONV E2E benchmark by Marat Dukhan · 2 years, 11 months ago
  85. 036b2b1 Add QU8 MobileNet v2 model to end-to-end benchmark by Marat Dukhan · 3 years ago
  86. d3d818c Fix requantization stubs for Ruy requantization schema by Marat Dukhan · 3 years ago
  87. 7b1aeb9 Evaluation stubs for Ruy requantization schema by Marat Dukhan · 3 years ago
  88. 89cd59b Remove legacy QU8 GEMM/IGEMM/DWCONV microkernels by Marat Dukhan · 3 years ago
  89. bd3c9aa Add cpufreq to requantization benchmarks by Frank Barchard · 3 years ago
  90. ef47f8d QU8 GEMM/IGEMM microkernels for SSE/AVX/XOP with FP32 requantization by Marat Dukhan · 3 years ago
  91. e60e997 Remove most GEMM/IGEMM microkernels with GEMMLOWP requantization by Marat Dukhan · 3 years ago
  92. cdbe9a3 Code-generate QU8 GEMM and IGEMM microkernels for SSE2/SSSE3/SSE4.1 by Marat Dukhan · 3 years ago
  93. c698c11 Refactor xnn_qu8_conv_minmax_params by Marat Dukhan · 3 years ago
  94. c2e8f66 Unify naming of QU8 GEMM/IGEMM/DWCONV microkernels with QS8/QC8 by Marat Dukhan · 3 years ago
  95. 40c0eaa E2E benchmark 4x16 c4 neondot fp32 microkernel by Frank Barchard · 3 years ago
  96. 79cd5f9 FP32 LD128 IGEMM for Cortex X1 by Frank Barchard · 3 years ago
  97. 533410e QS8 A53 GEMM bug fix for X1 - re-enable E2E by Frank Barchard · 3 years ago
  98. 0ae35f2 QS8 LD128 GEMM/IGEMM dot product 4x16 microkernel by Frank Barchard · 3 years ago
  99. 2fca144 Disable E2E benchmark for qs8_gemm_4x16_gemmlowp__aarch64_neon_mlal_lane_cortex_a53 by Frank Barchard · 3 years ago
  100. 143a110 Rename GEMM/IGEMM microkernels from Cortex-A57/A75 to prfm_cortex_a75 by Frank Barchard · 3 years ago