1. c5aa242 F32->QS8 and F32->QU8 microkernels for SSE by Marat Dukhan · 2 years, 7 months ago
  2. 5f7cf55 Avoid using gcc-specific intrinsics in NEON microkernels by Marat Dukhan · 2 years, 7 months ago
  3. 27bf92c RNDNU versions of all Neon lane microkernels. by Frank Barchard · 2 years, 7 months ago
  4. 24abe6b Initialize S8/U8 IBILINEAR microkernel pointers by Marat Dukhan · 2 years, 7 months ago
  5. 6a69c8e Scalar versions of S8/U8 IBILINEAR microkernels by Marat Dukhan · 2 years, 7 months ago
  6. 7519eb1 SSE2 & SSE4.1 versions of S8/U8 IBILINEAR microkernels by Marat Dukhan · 2 years, 7 months ago
  7. cdb42a5 NEON versions of S8/U8 IBILINEAR microkernels by Marat Dukhan · 2 years, 7 months ago
  8. 9cdc10d QU8 4x16 lane AArch64 NEON GEMM/IGEMM ld64 microkernel by Frank Barchard · 2 years, 7 months ago
  9. 0bc5801 QC8 AArch32 use NeonV8 when available. by Frank Barchard · 2 years, 7 months ago
  10. 5cffb64 4x16 lane AArch64 NEON GEMM/IGEMM ld64 microkernel by Frank Barchard · 2 years, 7 months ago
  11. 64ab1b7 LD1R and LD2R variants of c4 microkernel by Frank Barchard · 2 years, 7 months ago
  12. 15eec02 LD1R and LD2R variants of c2 microkernel by Frank Barchard · 2 years, 7 months ago
  13. 42f5c50 LOADDUP variant of c2 microkernel by Frank Barchard · 2 years, 8 months ago
  14. e22685a Remove padal from quantized microkernel names. by Frank Barchard · 2 years, 8 months ago
  15. eb704f7 QS8 C4S2 Neon GEMM/IGEMM microkernels by Frank Barchard · 2 years, 8 months ago
  16. a0c6168 F32->F16 Convert operator by Marat Dukhan · 2 years, 8 months ago
  17. e7043ff Enable C2S4 for QC8 GEMM/IGEMM microkernels. by Frank Barchard · 2 years, 8 months ago
  18. c7a032d C2S4 QS8 Neon GEMM/IGEMM microkernels. by Frank Barchard · 2 years, 8 months ago
  19. 1fe8995 Scalar F32->F16 VCVT microkernels by Marat Dukhan · 2 years, 8 months ago
  20. 78f039d Scalar F16->F32 evaluation stubs of bitcast-based and fabsf-based variants by Marat Dukhan · 2 years, 8 months ago
  21. 4edfdbf NEON F32->F16 VCVT microkernels by Marat Dukhan · 2 years, 8 months ago
  22. b4cde5a Fix CMake build on ARM by Marat Dukhan · 2 years, 8 months ago
  23. eb84423 SSE2, SSE4.1, and AVX F32->F16 VCVT microkernels by Marat Dukhan · 2 years, 8 months ago
  24. 056f49d Evaluation stubs for SSE2 & SSE4.1 F32->F16 conversion by Marat Dukhan · 2 years, 8 months ago
  25. a6eb1e5 Evaluation stubs for NEON F32->F16 conversion by Marat Dukhan · 2 years, 8 months ago
  26. 46cc1e1 Evaluation stubs for scalar F32->F16 conversion by Marat Dukhan · 2 years, 8 months ago
  27. 287952a QS8 C4 Neon GEMM/IGEMM microkernels by Frank Barchard · 2 years, 8 months ago
  28. 66ae257 Switch from C2 to S4C2 for qs8 microkernels on 32 bit ARM by Frank Barchard · 2 years, 8 months ago
  29. 47a74db Add specific microkernel for 1D convolutions with 1x3 kernel size for Android backend by Artsiom Ablavatski · 2 years, 8 months ago
  30. 494cd2b S4 variant of C2 Neon GEMM/IGEMM microkernel by Frank Barchard · 2 years, 8 months ago
  31. 952cb51 S4 variant of C2 Neon GEMM/IGEMM mull microkernel by Frank Barchard · 2 years, 8 months ago
  32. 1d41247 Neon C2 microkernels switch to rndnu from gemmlowp by Frank Barchard · 2 years, 8 months ago
  33. 582e184 Evaluation stubs and tests for FP16->FP32 conversion by Marat Dukhan · 2 years, 8 months ago
  34. ddb3d16 F16 Fully Connected operator by Marat Dukhan · 2 years, 8 months ago
  35. d77f77d F32->F16 VCVT microkernels for NEON-FP16, F16C, and AVX512 by Marat Dukhan · 2 years, 8 months ago
  36. af2ba00 F16->F32 Convert operator by Marat Dukhan · 2 years, 8 months ago
  37. c9f9d67 Add Channel Tile of 16 for float and 32 for half float. by Frank Barchard · 2 years, 9 months ago
  38. dbe781b Enable 8x4, 8x9, 8x25 f32 dwconv by Frank Barchard · 2 years, 9 months ago
  39. e2c0001 Scalar FP16->FP32 VCVT microkernels by Marat Dukhan · 2 years, 9 months ago
  40. 434352f Benchmarks for FP16->FP32 VCVT microkernels by Marat Dukhan · 2 years, 9 months ago
  41. 322ed6f NEON FP16->FP32 VCVT microkernels by Marat Dukhan · 2 years, 9 months ago
  42. 1227adb SSE2/SSE4.1/AVX FP16->FP32 VCVT microkernels by Marat Dukhan · 2 years, 9 months ago
  43. 60f903b NEON FP16->FP32 conversion evaluation stubs by Marat Dukhan · 2 years, 9 months ago
  44. 3ed866b Test evaluation stubs for F16->F32 conversion by Marat Dukhan · 2 years, 9 months ago
  45. 8ff372c NEON-FP16 implementation of F16->F32 VCVT microkernels by Marat Dukhan · 2 years, 9 months ago
  46. 354cbc6 QU8 MUL8 variant of DWCONV by Frank Barchard · 2 years, 9 months ago
  47. 79c76ab F16->F32 conversion microkernels in AVX512-SKX implementation by Marat Dukhan · 2 years, 9 months ago
  48. f1a6ed3 F16->F32 conversion microkernels in F16C implementation by Marat Dukhan · 2 years, 9 months ago
  49. 2aa2e2a q8 dwconv add channel tiles of 24 and 32 for mul16 rndnu microkernels by Frank Barchard · 2 years, 10 months ago
  50. e4118ef Polyfill vld1q_u8_x4 for older AArch64 gcc versions by Marat Dukhan · 2 years, 10 months ago
  51. 98e054b Enable vectorized X8 LUT microkernels by Marat Dukhan · 2 years, 10 months ago
  52. 2b3c410 AVX512BW implementations of X8 LUT microkernels by Marat Dukhan · 2 years, 10 months ago
  53. 7c478e3 SSSE3, AVX, and AVX2 X8 LUT microkernels by Marat Dukhan · 2 years, 10 months ago
  54. 5de7bc0 QS8/QU8 Tanh operator using LUT microkernels by Marat Dukhan · 2 years, 10 months ago
  55. f718232 X8 LUT NEON microkernels by Marat Dukhan · 2 years, 10 months ago
  56. 548542c Fix CMake build by Marat Dukhan · 2 years, 10 months ago
  57. f6c991e Implement generic LUT-based elementwise operator by Marat Dukhan · 2 years, 10 months ago
  58. 5407437 Benchmark for X8 LUT microkernels by Marat Dukhan · 2 years, 10 months ago
  59. d67539d Auto-generate X8 LUT microkernels and tests by Marat Dukhan · 2 years, 10 months ago
  60. cdf59a5 Add QU8 NR=32 microkernels by Frank Barchard · 2 years, 10 months ago
  61. df8e604 4x8 QU8 Neon Dotproduct microkernel rename from ld64 to ld128 by Frank Barchard · 2 years, 10 months ago
  62. a49e41f QU8 4x16C4 NEON Dot Product GEMM/IGEMM microkernels for Cortex A55r1 by Frank Barchard · 2 years, 10 months ago
  63. 0a3093c QU8 vadd neon use x32 instead of x8 by Frank Barchard · 2 years, 10 months ago
  64. 7da8b02 Q8 dwconv switch from 8x25 to 16x25 by Frank Barchard · 2 years, 10 months ago
  65. e252f92 End-to-end benchmarks on QC8 MobileNet v1/v2 models by Marat Dukhan · 2 years, 10 months ago
  66. 0d06573 dwconv Q8 switch from 8x9 to 16x9 tile. by Frank Barchard · 2 years, 10 months ago
  67. 8b69802 Enable QU8 C4 NEON Dot Product GEMM/IGEMM microkernels for Cortex A55r1 by Frank Barchard · 2 years, 10 months ago
  68. ca4c68e QU8 C4 NEON Dot Product GEMM/IGEMM microkernels for Cortex A55r1 by Frank Barchard · 2 years, 10 months ago
  69. 0c76422 QU8 NEON Assembly remove channel wise by Frank Barchard · 2 years, 10 months ago
  70. 4066898 QU8 4x16 C4 NEON Assembly Dot Product GEMM/IGEMM microkernels by Frank Barchard · 2 years, 10 months ago
  71. 0049e89 QU8 C4 NEON Assembly Dot Product GEMM/IGEMM microkernels by Frank Barchard · 2 years, 10 months ago
  72. de9c64a Enable 4x16 QU8 dot production microkernels by Frank Barchard · 2 years, 11 months ago
  73. 65692c7 Fix build for Clang on Windows by peter · 2 years, 11 months ago
  74. e79acb7 S8 VCLAMP microkernels by Marat Dukhan · 2 years, 11 months ago
  75. 2314753 S8 MAXPOOL microkernels for all architectures by Marat Dukhan · 2 years, 11 months ago
  76. 9098aba E2E for QU8 GEMM microkernels by Frank Barchard · 2 years, 11 months ago
  77. e033126 Generate more tile sizes for QU8 gemm/igemm by Frank Barchard · 2 years, 11 months ago
  78. 2025515 Enable dot production microkernels for QU8 on ARM by Frank Barchard · 2 years, 11 months ago
  79. 88e839c QU8 C4 NEON Dot Product GEMM/IGEMM microkernels by Frank Barchard · 2 years, 11 months ago
  80. 0461f2d Generalize PAD microkernels to all 8-/16-/32-bit data types by Marat Dukhan · 2 years, 11 months ago
  81. 933051b Generalize FILL microkernels to all 8-/16-/32-bit data types by Marat Dukhan · 2 years, 11 months ago
  82. 7c74aff Add F32 VLRELU benchmarks by Marat Dukhan · 2 years, 11 months ago
  83. 4486f87 Prune NEON-DOT QS8 GEMM/IGEMM microkernels with FP32 & GEMMLOWP requantization by Marat Dukhan · 2 years, 11 months ago
  84. e16bf7d Prune AVX2/AVX512 QS8 GEMM/IGEMM microkernels with GEMMLOWP requantization by Marat Dukhan · 2 years, 11 months ago
  85. 66a3ca1 Initialize QS8 microkernel pointers on pre-NEON ARM architecture by Marat Dukhan · 2 years, 11 months ago
  86. 0ff7989 Use FP32 requantization for extended-weights QS8 GEMM microkernels on x86 by Marat Dukhan · 2 years, 11 months ago
  87. ec47958 Prune redundant NEON GEMM/IGEMM microkernels with GEMMLOWP requantization by Marat Dukhan · 2 years, 11 months ago
  88. f879d9e Add qs8-requantization-test to CMake build by Marat Dukhan · 2 years, 11 months ago
  89. 599d3db Fix CMake build by Marat Dukhan · 2 years, 11 months ago
  90. 0853b8a QS8/QU8 Multiply ND operators by Marat Dukhan · 2 years, 11 months ago
  91. 8b024c9 QS8/QU8 VMULC microkernel benchmark by Marat Dukhan · 2 years, 11 months ago
  92. fb3a94f QU8 4x16 Neon assembly microkernel for Cortex A75 by Frank Barchard · 2 years, 11 months ago
  93. 795e5ab QS8/QU8 VMUL microkernel benchmarks by Marat Dukhan · 2 years, 11 months ago
  94. 4a7b70f QS8/QU8 VMUL[C] microkernels in NEON implementation by Marat Dukhan · 2 years, 11 months ago
  95. 7999341 QS8/QU8 VMUL[C] microkernels in scalar implementation by Marat Dukhan · 2 years, 11 months ago
  96. 59ed1da QU8 4x16 Neon assembly microkernel by Frank Barchard · 2 years, 11 months ago
  97. a212eac QS8/QU8 VMUL[C] microkernels in SSE2/SSE4.1/AVX implementation by Marat Dukhan · 2 years, 11 months ago
  98. eb3cff3 LD128 versions of QS8/QU8 VADD[C] NEON microkernels by Marat Dukhan · 3 years ago
  99. 01debd9 Optimize QS8 VADD[C] microkernel selection on ARM/ARM64 by Marat Dukhan · 3 years ago
  100. 1ef9de8 QU8 VADD/VADDC microkernel benchmarks by Marat Dukhan · 3 years ago