  1. f8475d6 SSE2 version of LUT64 Sigmoid evaluation stub by Marat Dukhan · 4 years ago
  2. 66f3ccd End-to-end FP16 MobileNet v3 benchmarks by Marat Dukhan · 4 years ago
  3. d4c8303 Enable NEON DOT QS8 [I]GEMM microkernels on ARM64 by Marat Dukhan · 4 years ago
  4. e6dc0b6 AVX2 versions of QS8 VADD[C] microkernels by Marat Dukhan · 4 years ago
  5. bb9225e SSE4.1 and XOP versions of MUL32 VADD[C] microkernels by Marat Dukhan · 4 years, 1 month ago
  6. 2ffc5e6 AVX512 versions of QS8 DWCONV microkernels by Marat Dukhan · 4 years, 1 month ago
  7. 70a9618 End-to-end QS8 MobileNet v2 benchmark by Marat Dukhan · 4 years, 1 month ago
  8. ba7b279 NEON variants of QS8 VADD[C] microkernels by Marat Dukhan · 4 years, 1 month ago
  9. 9c7308f vbinary microkernels unrolled to x8 for scalar and web assembly and x16 web assembly simd by Frank Barchard · 4 years, 1 month ago
  10. 37297a6 F32-RELU unrolled more for improved performance on Web Assembly by Frank Barchard · 4 years, 1 month ago
  11. 7359463 Fix incompatibilities with ARM GCC by Marat Dukhan · 4 years, 1 month ago
  12. a05487f Add xnn_qs8_igemm_minmax_ukernel_${MR}x${NR}c4__neondot (ARMv8.2+dotprod). by Benoit Jacob · 4 years, 1 month ago
  13. a964473 Add xnn_qs8_gemm_minmax_ukernel_${MR}x${NR}c4__neondot (ARMv8.2+dotprod). by Benoit Jacob · 4 years, 1 month ago
  14. 0270d9f QS8 VADDC microkernels in SSE2 and SSE4.1 implementations by Marat Dukhan · 4 years, 1 month ago
  15. 8432486 Fix CMake build by Marat Dukhan · 4 years, 1 month ago
  16. 281262d NEON variant of QS8 GAVGPOOL microkernel by Marat Dukhan · 4 years, 1 month ago
  17. 023bcf9 NEON variant of QS8 DWCONV microkernels by Marat Dukhan · 4 years, 1 month ago
  18. d9f3ad4 QS8 ADD microkernels in SSE2 and SSE4.1 implementations by Marat Dukhan · 4 years, 1 month ago
  19. bb00b1d AVX512 variants of QS8 GEMM and IGEMM microkernels by Marat Dukhan · 4 years, 1 month ago
  20. 674778d Add binary op microkernels with RELU activation by Frank Barchard · 4 years, 2 months ago
  21. c15aa4e Remove XOP variants of QS8 DWCONV by Marat Dukhan · 4 years, 2 months ago
  22. 453f4b8 Fix CMake build by Marat Dukhan · 4 years, 2 months ago
  23. c9c320e CMake build fix by Frank Barchard · 4 years, 2 months ago
  24. b33fc0e Add xnn_q{u,s}8_gemm_minmax_ukernel_MRxNRc4__scalar by Benoit Jacob · 4 years, 2 months ago
  25. 4013552 AVX2 versions of QS8 DWCONV microkernels using 16-bit multiplication by Marat Dukhan · 4 years, 2 months ago
  26. 4ed53f4 Unipass QS8 GAVGPOOL microkernels in SSE2/SSSE3/SSE4.1 implementations by Marat Dukhan · 4 years, 2 months ago
  27. d65a152 AVX2 versions of QS8 DWCONV microkernels by Marat Dukhan · 4 years, 2 months ago
  28. 0743cdf Hardcoded end-to-end benchmark on MobileNet v1 in QS8 format by Marat Dukhan · 4 years, 2 months ago
  29. f62bbdc SSE2/SSSE3/SSE4.1/XOP implementation of QS8 DWCONV microkernels by Marat Dukhan · 4 years, 2 months ago
  30. 40bbafe NEON variants of QS8 GEMM & IGEMM microkernels by Marat Dukhan · 4 years, 2 months ago
  31. 56bdd4a QS8 requantization benchmark by Marat Dukhan · 4 years, 2 months ago
  32. 683fab3 XW (eXtended Weights) optimization for QS8 GEMM microkernel by Marat Dukhan · 4 years, 2 months ago
  33. e7edc80 Add 3x4c8 variants of SSE2/SSSE3/SSE4.1/XOP GEMM/IGEMM microkernels by Marat Dukhan · 4 years, 2 months ago
  34. 1280952 AVX2 version of QS8 GEMM and IGEMM microkernels by Marat Dukhan · 4 years, 2 months ago
  35. 1566fee XOP versions of QS8 GEMM/IGEMM microkernels by Marat Dukhan · 4 years, 2 months ago
  36. b02e522 Fix CMake build by Marat Dukhan · 4 years, 2 months ago
  37. dee732b LD128 versions of QS8 GEMM SSE2/SSSE3/SSE4.1 microkernels by Marat Dukhan · 4 years, 2 months ago
  38. 733d0be QS8 GEMM MRx4c8 SSE2/SSSE3/SSE4.1 microkernels by Marat Dukhan · 4 years, 2 months ago
  39. f948068 QS8 IGEMM microkernels and infrastructure by Marat Dukhan · 4 years, 2 months ago
  40. 595e170 QS8 GEMM microkernels and infrastructure by Marat Dukhan · 4 years, 2 months ago
  41. 2e23d2b Signed requantization evaluation stubs and unit tests by Marat Dukhan · 4 years, 2 months ago
  42. c5045bf Remove PSIMD variant of GAVGPOOL CW microkernel by Marat Dukhan · 4 years, 2 months ago
  43. 5b69f8b Rename requantization functions and filenames by Marat Dukhan · 4 years, 2 months ago
  44. fc73f86 Remove PSIMD versions of ARGMAXPOOL microkernels by Marat Dukhan · 4 years, 2 months ago
  45. 92162da Remove PSIMD versions of PPMM and PACKX microkernels by Marat Dukhan · 4 years, 2 months ago
  46. ef25c6d NEON versions of ARGMAXPOOL microkernels by Marat Dukhan · 4 years, 2 months ago
  47. a531698 PRELU microkernels on Neon and WASMSIMD with 1, 2 or 4 rows, and 4, 8 or 16 channels. by Frank Barchard · 4 years, 2 months ago
  48. aff24e2 Support Resize Bilinear 2D in Subgraph API by Marat Dukhan · 4 years, 2 months ago
  49. 859b9a8 Remove PSIMD variant of UNPOOL microkernel by Marat Dukhan · 4 years, 2 months ago
  50. 718294e Remove PSIMD variants of X32 ZIP microkernels by Marat Dukhan · 4 years, 2 months ago
  51. fb877e0 Remove PSIMD variants of IBILINEAR microkernels by Marat Dukhan · 4 years, 2 months ago
  52. dc2e437 Fix CMake build by Marat Dukhan · 4 years, 2 months ago
  53. 88f1fe1 Remove PSIMD version of PAD microkernel by Marat Dukhan · 4 years, 2 months ago
  54. 8e0abea Remove PSIMD version of FILL microkernel by Marat Dukhan · 4 years, 2 months ago
  55. a46efa5 Remove PSIMD variants of MAXPOOL/AVGPOOL microkernels by Marat Dukhan · 4 years, 2 months ago
  56. c201006 FP16 ReLU microkernel benchmark by Frank Barchard · 4 years, 2 months ago
  57. 3c4a952 relu microkernel benchmark by Frank Barchard · 4 years, 2 months ago
  58. 749cf6b Remove PSIMD variant of RMAX microkernel by Marat Dukhan · 4 years, 2 months ago
  59. 249cd6a Remove PSIMD variants of RADDSTOREEXPMINUSMAX microkernels by Marat Dukhan · 4 years, 2 months ago
  60. b67aec0 Remove PSIMD variants of PReLU microkernels by Marat Dukhan · 4 years, 2 months ago
  61. 2f1b1ce Remove PSIMD versions of evaluation stubs by Marat Dukhan · 4 years, 2 months ago
  62. 1ede9d8 Remove PSIMD variants of VMULCADDC microkernels by Marat Dukhan · 4 years, 2 months ago
  63. 6eca86c Remove PSIMD variants of unary elementwise microkernels by Marat Dukhan · 4 years, 2 months ago
  64. 6f40296 Remove PSIMD versions of vector binary elementwise microkernels by Marat Dukhan · 4 years, 2 months ago
  65. bbf7f3f Remove PSIMD variants of DWCONV microkernels by Marat Dukhan · 4 years, 2 months ago
  66. 115d3e2 Remove PSIMD variants of GEMM and IGEMM microkernels by Marat Dukhan · 4 years, 2 months ago
  67. 490febe Cortex A7 microkernel based on LD64 with PLD added. 3.2% faster in end to end mobilenet v2 by Frank Barchard · 4 years, 2 months ago
  68. efaac27 Prefix generated HSWISH microkernels with hswish- by Marat Dukhan · 4 years, 2 months ago
  69. 19dd91d Prefix generated VLRELU microkernels with vlrelu- by Marat Dukhan · 4 years, 2 months ago
  70. 320cb46 Update CLAMP microkernels by Frank Barchard · 4 years, 2 months ago
  71. fb158e2 RELU microkernel to clamp values to 0 for a specialized clamp operator by Frank Barchard · 4 years, 2 months ago
  72. 08b7a97 Rename Q8 microkernels and operators to QU8 by Marat Dukhan · 4 years, 2 months ago
  73. 55dde5b NEON F32 HSWISH microkernel unrolled by 16 by Marat Dukhan · 4 years, 2 months ago
  74. 9df9dc6 Reoptimize HSWISH microkernels by Marat Dukhan · 4 years, 2 months ago
  75. d27202d Support Reshape Node in Subgraph API by Marat Dukhan · 4 years, 3 months ago
  76. 51a01c6 Support Square Root node in Subgraph API by Marat Dukhan · 4 years, 3 months ago
  77. 8e229db Add scalar binary ops with linear activation by Frank Barchard · 4 years, 3 months ago
  78. ab58238 Make packing functions non-inline by Marat Dukhan · 4 years, 3 months ago
  79. 1d6d403 Exclude WAsm SIMD microkernels from CMake build by Marat Dukhan · 4 years, 3 months ago
  80. d5b9f1c Add WASMSIMD binary ops with linear activation by Frank Barchard · 4 years, 3 months ago
  81. 6804bbd Square Root operator by Marat Dukhan · 4 years, 3 months ago
  82. 676d582 Fix CMake build by Marat Dukhan · 4 years, 3 months ago
  83. f4db2f3 Vector SQRT microkernels by Marat Dukhan · 4 years, 3 months ago
  84. 8400076 Square root evaluation stubs by Marat Dukhan · 4 years, 3 months ago
  85. f2ebd89 Remove VRSQRDIFFC microkernels by Marat Dukhan · 4 years, 3 months ago
  86. 49b4dcc FP16 Convolution NHWC operator by Frank Barchard · 4 years, 3 months ago
  87. a11ca34 Microbenchmarks for HSWISH microkernels by Marat Dukhan · 4 years, 3 months ago
  88. ad35260 HardSwish operator benchmark by Marat Dukhan · 4 years, 3 months ago
  89. 0d3f467 SSE2 and SSE4.1 versions of Leaky ReLU microkernels by Marat Dukhan · 4 years, 3 months ago
  90. 8772714 Benchmarks for rounding operators by Marat Dukhan · 4 years, 3 months ago
  91. 39b5e94 SSE versions of PReLU microkernels by Marat Dukhan · 4 years, 3 months ago
  92. 5fd403b Fix incompatibility with CMake 3.5 by Marat Dukhan · 4 years, 3 months ago
  93. 7ae00ad Remove FP16 dependency from Global Average Pooling NWC test by Frank Barchard · 4 years, 3 months ago
  94. ef61d02 Guard FP16 Global Average Pooling by initialization flags by Marat Dukhan · 4 years, 3 months ago
  95. af4dad4 Add ISA check for FP16 to GLOBAL_AVERAGE_POOLING_NWC_F16 tests by Frank Barchard · 4 years, 3 months ago
  96. 569561d Generate PLD variation of AARCH32 LD64 by Frank Barchard · 4 years, 3 months ago
  97. d986004 Remove cpuinfo as dependency for operator benchmarks by Frank Barchard · 4 years, 3 months ago
  98. ea2088b Fix file name in CMakeLists.txt. by Im Sunghoon · 4 years, 3 months ago
  99. 7465a89 Add PSIMD DWCONV CHW 5X5S2P2 kernel. by Erich Elsen · 4 years, 3 months ago
  100. 0c84973 FP16 Global Average Pooling operator benchmark by Frank Barchard · 4 years, 3 months ago