1. 62b4ff7 Remove 12x8 QS8 GEMM and IGEMM Neon dotproduct microkernels. by Frank Barchard · 3 years, 3 months ago
  2. da78da1 QS8 C8 Neon microkernels with MUL and MLA versions. by Frank Barchard · 3 years, 3 months ago
  3. 02121ca QS8 Neon IGEMM microkernels with 8 bit MUL using DUP by Frank Barchard · 3 years, 3 months ago
  4. 36f95cf QS8 Neon IGEMM C16 microkernel with two 8 bit multiplies and vpadal to accumulate. by Frank Barchard · 3 years, 3 months ago
  5. 71c4d1a QS8 Neon GEMM C16 microkernel with two 8 bit multiplies and vpadal to accumulate. by Frank Barchard · 3 years, 3 months ago
  6. 6d138db Remove scalar C4 QS8 and QU8 gemm microkernels. by Frank Barchard · 3 years, 3 months ago
  7. 6fa8078 QS8 C2 Neon igemm by Frank Barchard · 3 years, 3 months ago
  8. d79391d QS8 C8 Neon igemm by Frank Barchard · 3 years, 3 months ago
  9. 8247e21 C2 QS8 microkernel using mull then mlal with KC loop of 16 by Frank Barchard · 3 years, 4 months ago
  10. 5899012 QS8 Neon GEMM C8 microkernel with 8 bit multiply and vpadal to accumulate. by Frank Barchard · 3 years, 4 months ago
  11. 2202c81 Implement bilinear upsampling (CHW layout) for ARM architecture by Artsiom Ablavatski · 3 years, 4 months ago
  12. 2302ffd QS8 Neon GEMM microkernel with 8 bit multiply and vpadal to accumulate by Frank Barchard · 3 years, 4 months ago
  13. ec0bf14 QS8 GEMM and IGEMM 3x8 3x16 and IGEMM 4x8 and 4x16 by Frank Barchard · 3 years, 4 months ago
  14. 4ecae2e QS8 Neon GEMM microkernel with 8 bit multiply by Frank Barchard · 3 years, 4 months ago
  15. cfbc849 Add 4x8 and 4x16 qs8 gemm microkernels by Frank Barchard · 3 years, 4 months ago
  16. c5704bf WebAssembly DWConv2D 3x3 stride 2 loadsplat by Frank Barchard · 3 years, 5 months ago
  17. c6889b3 WebAssembly DWConv2D 5x5 stride 2 loadsplat by Frank Barchard · 3 years, 5 months ago
  18. 02bb429 WebAssembly DWConv2D 3x3p1 adapted from NEON by Frank Barchard · 3 years, 5 months ago
  19. b20dcd6 WASMSIMD dwconv2d 5x5p2 use loadsplat by Frank Barchard · 3 years, 5 months ago
  20. 802fcae Additional SSE/SSE2 GEMM/IGEMM microkernels by Marat Dukhan · 3 years, 5 months ago
  21. 412e2f4 Rename WASMSIMD dwconv2d functions to splat or loadsplat by Frank Barchard · 3 years, 5 months ago
  22. 0725b8d Rename WebAssembly SIMD source files and functions with x86 or arm suffix after wasmsimd by Frank Barchard · 3 years, 6 months ago
  23. 3a30521 Refactor accuracy evaluation benchmarks by Marat Dukhan · 3 years, 6 months ago
  24. 5b86c43 NEON versions of non-blocked F32 SpMM microkernels by Marat Dukhan · 3 years, 6 months ago
  25. b88d011 WebAssembly SIMD DWConv2D 3x3 stride-2 adapted from NEON by Frank Barchard · 3 years, 6 months ago
  26. 729f07b WebAssembly SIMD DWConv2D 5x5 stride 2 adapted from NEON by Frank Barchard · 3 years, 6 months ago
  27. 6b1629a Remove code generator for old 5x5p2 by Frank Barchard · 3 years, 6 months ago
  28. ed6baaf Vector ELU microkernels by Marat Dukhan · 3 years, 6 months ago
  29. 20a0741 Web Assemble DWConv2D f32_dwconv2d_chw_ukernel_5x5p2__wasmsimd adapted from Neon by Frank Barchard · 3 years, 6 months ago
  30. 3b80045 WAsm SIMD version of DWCONV2D CHW 3x3p1 by Frank Barchard · 3 years, 6 months ago
  31. db5c32d WasmSIMD dwconv2d generate x86 optimized version. by Frank Barchard · 3 years, 6 months ago
  32. 8ef44cd Pipelined Web Assembly Sparse Matrix Multiply by Frank Barchard · 3 years, 7 months ago
  33. beca652 Rename unroll to x for SpMM microkernels with unrolled loop by Frank Barchard · 3 years, 7 months ago
  34. ccca214 SSE variant of 5x5s2 DWCONV CHW micro-kernels by Marat Dukhan · 3 years, 7 months ago
  35. d050389 SSE variants of 5x5 DWCONV CHW micro-kernels by Marat Dukhan · 3 years, 7 months ago
  36. 30d4b25 Auto-generate 5x5s2 DWCONV CHW micro-kernels by Marat Dukhan · 3 years, 7 months ago
  37. 29c0c33 Auto-generate 5x5s2p2 DWCONV CHW micro-kernels by Marat Dukhan · 3 years, 7 months ago
  38. 846c0c6 Add 32x1 32x2 32x4 SPMM microkernels and remove 4x1 4x2 4x4 for WASMSIMD, Neon and SSE by Frank Barchard · 3 years, 7 months ago
  39. 149f0ea Auto-generate NEON 5x5p2 DWCONV micro-kernels by Marat Dukhan · 3 years, 7 months ago
  40. c4efb00 Auto-generate scalar 5x5p2 DWCONV CHW micro-kernels by Marat Dukhan · 3 years, 7 months ago
  41. cf5b3c3 Auto-generate scalar versions of DWCONV2D CHW 3x3s2p1 micro-kernels by Marat Dukhan · 3 years, 7 months ago
  42. 82f0c32 Auto-generate NEON/NEONFMA versions of DWCONV2D CHW 3x3s2p1 micro-kernels by Marat Dukhan · 3 years, 7 months ago
  43. 0ff9718 Auto-generate SSE versions of DWCONV2D CHW 3x3s2p1 micro-kernels by Marat Dukhan · 3 years, 7 months ago
  44. 91249d2 Auto-generate scalar versions of DWCONV2D CHW 3x3p1 micro-kernels by Marat Dukhan · 3 years, 7 months ago
  45. c581e48 NEON versions of DWCONV2D CHW 3x3p1 micro-kernels by Marat Dukhan · 3 years, 7 months ago
  46. 1268a24 Auto-generate AArch64 NEONFMA versions of DWCONV2D CHW 3x3p1 micro-kernels by Marat Dukhan · 3 years, 7 months ago
  47. 98f2eeb SSSE3 versions of DWCONV2D CHW 3x3p1 micro-kernels by Marat Dukhan · 3 years, 7 months ago
  48. 470078a Auto-generate SSE versions of DWCONV2D CHW 3x3p1 micro-kernels by Marat Dukhan · 3 years, 7 months ago
  49. 965272b Add WebAssembly SIMD IBILINEAR microkernels for CHW layout by XNNPACK Team · 3 years, 7 months ago
  50. bf715f9 Rename DWCONV CHW microkernels to DWCONV2D CHW by Marat Dukhan · 3 years, 7 months ago
  51. cb2b667 Roll back the decision to split the packed weights for the CHW IBILINEAR microkernel interface by XNNPACK Team · 3 years, 7 months ago
  52. dc6c77f Generate DWCONV CHW microkernel tests from a YAML specification by Marat Dukhan · 3 years, 7 months ago
  53. 0dde1af Split packed weights into horizontal and vertical in IBILINEAR CHW microkernel interface by XNNPACK Team · 3 years, 7 months ago
  54. 6be46b2 Add input increment parameter in IBILINEAR CHW microkernels by XNNPACK Team · 3 years, 7 months ago
  55. c451e8a WAsm SpMM microkernels unrolled by 2 and 4. by Frank Barchard · 3 years, 7 months ago
  56. 9e05340 Replace PSIMD SpMM microkernels with WAsm SIMD. by Frank Barchard · 3 years, 7 months ago
  57. dc2b29c AVX float32 sigmoid ukernels. by T.J. Alumbaugh · 3 years, 7 months ago
  58. 146e999 Replace QS8 4x8 with 2x8 neon microkernel. Improves performance for aarch32. by Frank Barchard · 3 years, 7 months ago
  59. 66ccf64 Rename QS8 generator templates by Marat Dukhan · 3 years, 8 months ago
  60. a48848f 4x8, 6x8 and 8x16 Neon dot product GEMM microkernels by Frank Barchard · 3 years, 8 months ago
  61. d9ca7e6 AVX512F versions of Sigmoid microkernel by Marat Dukhan · 3 years, 8 months ago
  62. 2fa1745 6x16 QS8 GEMM for Neon dot product by Frank Barchard · 3 years, 8 months ago
  63. d243c1a LUT-based SSE Sigmoid microkernels by Marat Dukhan · 3 years, 8 months ago
  64. ef4ce31 Remove trailing whitespace by Marat Dukhan · 3 years, 9 months ago
  65. d4c8303 Enable NEON DOT QS8 [I]GEMM microkernels on ARM64 by Marat Dukhan · 3 years, 9 months ago
  66. e6dc0b6 AVX2 versions of QS8 VADD[C] microkernels by Marat Dukhan · 3 years, 9 months ago
  67. bb9225e SSE4.1 and XOP versions of MUL32 VADD[C] microkernels by Marat Dukhan · 3 years, 9 months ago
  68. 2ffc5e6 AVX512 versions of QS8 DWCONV microkernels by Marat Dukhan · 3 years, 9 months ago
  69. 5df27f8 WAsm SIMD versions of QS8 VADD[C] microkernels by Marat Dukhan · 3 years, 9 months ago
  70. ba7b279 NEON variants of QS8 VADD[C] microkernels by Marat Dukhan · 3 years, 9 months ago
  71. 9c7308f vbinary microkernels unrolled to x8 for scalar and web assembly and x16 web assembly simd by Frank Barchard · 3 years, 9 months ago
  72. 37297a6 F32-RELU unrolled more for improved performance on Web Assembly by Frank Barchard · 3 years, 9 months ago
  73. a05487f Add xnn_qs8_igemm_minmax_ukernel_${MR}x${NR}c4__neondot (ARMv8.2+dotprod). by Benoit Jacob · 3 years, 9 months ago
  74. a964473 Add xnn_qs8_gemm_minmax_ukernel_${MR}x${NR}c4__neondot (ARMv8.2+dotprod). by Benoit Jacob · 3 years, 9 months ago
  75. 0270d9f QS8 VADDC microkernels in SSE2 and SSE4.1 implementations by Marat Dukhan · 3 years, 10 months ago
  76. 281262d NEON variant of QS8 GAVGPOOL microkernel by Marat Dukhan · 3 years, 10 months ago
  77. 023bcf9 NEON variant of QS8 DWCONV microkernels by Marat Dukhan · 3 years, 10 months ago
  78. d9f3ad4 QS8 ADD microkernels in SSE2 and SSE4.1 implementations by Marat Dukhan · 3 years, 10 months ago
  79. bb00b1d AVX512 variants of QS8 GEMM and IGEMM microkernels by Marat Dukhan · 3 years, 10 months ago
  80. 674778d Add binary op microkernels with RELU activation by Frank Barchard · 3 years, 10 months ago
  81. c15aa4e Remove XOP variants of QS8 DWCONV by Marat Dukhan · 3 years, 10 months ago
  82. b33fc0e Add xnn_q{u,s}8_gemm_minmax_ukernel_MRxNRc4__scalar by Benoit Jacob · 3 years, 10 months ago
  83. 4013552 AVX2 versions of QS8 DWCONV microkernels using 16-bit multiplication by Marat Dukhan · 3 years, 10 months ago
  84. b5e3d17 Multipass QS8 GAVGPOOL microkernel in WAsm SIMD implementation by Marat Dukhan · 3 years, 10 months ago
  85. ef45180 Unipass QS8 GAVGPOOL microkernel in WAsm SIMD implementation by Marat Dukhan · 3 years, 10 months ago
  86. 159688f Multipass QS8 GAVGPOOL microkernels in SSE2/SSSE3/SSE4.1 implementations by Marat Dukhan · 3 years, 10 months ago
  87. 4ed53f4 Unipass QS8 GAVGPOOL microkernels in SSE2/SSSE3/SSE4.1 implementations by Marat Dukhan · 3 years, 10 months ago
  88. cc8f34c WAsm SIMD variants of QS8 DWCONV microkernels by Marat Dukhan · 3 years, 10 months ago
  89. 27203da WAsm SIMD versions of QS8 GEMM and IGEMM microkernels by Marat Dukhan · 3 years, 10 months ago
  90. d65a152 AVX2 versions of QS8 DWCONV microkernels by Marat Dukhan · 3 years, 10 months ago
  91. f62bbdc SSE2/SSSE3/SSE4.1/XOP implementation of QS8 DWCONV microkernels by Marat Dukhan · 3 years, 10 months ago
  92. 40bbafe NEON variants of QS8 GEMM & IGEMM microkernels by Marat Dukhan · 3 years, 10 months ago
  93. 683fab3 XW (eXtended Weights) optimization for QS8 GEMM microkernel by Marat Dukhan · 3 years, 10 months ago
  94. e7edc80 Add 3x4c8 variants of SSE2/SSSE3/SSE4.1/XOP GEMM/IGEMM microkernels by Marat Dukhan · 3 years, 10 months ago
  95. 1280952 AVX2 version of QS8 GEMM and IGEMM microkernels by Marat Dukhan · 3 years, 10 months ago
  96. 1566fee XOP versions of QS8 GEMM/IGEMM microkernels by Marat Dukhan · 3 years, 10 months ago
  97. 07bd252 QS8 IGEMM MRx4c8 SSE2/SSSE3/SSE4.1 microkernels by Marat Dukhan · 3 years, 10 months ago
  98. dee732b LD128 versions of QS8 GEMM SSE2/SSSE3/SSE4.1 microkernels by Marat Dukhan · 3 years, 10 months ago
  99. 14d3ce8 Add LD64 suffix in QS8 GEMM/IGEMM microkernels by Marat Dukhan · 3 years, 10 months ago
  100. 733d0be QS8 GEMM MRx4c8 SSE2/SSSE3/SSE4.1 microkernels by Marat Dukhan · 3 years, 10 months ago