1. bbf5182 Enable QS8 2x8c8-aarch64-neon-mlal-padal GEMM / IGEMM microkernels by Frank Barchard · 3 years, 7 months ago
  2. 4610854 Disable QS8 1x16c4-aarch64-neondot-ld64 IGEMM microkernel by Frank Barchard · 3 years, 7 months ago
  3. 3522c0a Enable QS8 4x16c4-aarch64-neondot-ld64 IGEMM microkernel by Frank Barchard · 3 years, 7 months ago
  4. b75840f Enable QS8 IGEMM for Cortex A55 by Frank Barchard · 3 years, 7 months ago
  5. fb0ab0b QS8 enable 4x8c4__neondot for ARM32 by Frank Barchard · 3 years, 7 months ago
  6. a414daa Enable Quantized C2 microkernel for Neon by Frank Barchard · 3 years, 7 months ago
  7. 4baa2ac Process 32 pixels at a time in ARM64 SpMM microkernels by Marat Dukhan · 3 years, 8 months ago
  8. 2d6bcbb Reorder a few gemm1 initializations to match end to end order of gemm, igemm, gemm1, igemm1 by Jared Duke · 3 years, 8 months ago
  9. 9b7562b Reorder a few gemm1 initializations to match end to end order of gemm, igemm, gemm1, igemm1 by Frank Barchard · 3 years, 8 months ago
  10. 2202c81 Implement bilinear upsampling (CHW layout) for ARM architecture by Artsiom Ablavatski · 3 years, 8 months ago
  11. b94e34b QS8 GEMM select 2x16 for Neon MLAL. by Frank Barchard · 3 years, 9 months ago
  12. dfe47b9 Use iOS microkernels for Apple Silicon Macs by Marat Dukhan · 3 years, 10 months ago
  13. 412e2f4 Rename WASMSIMD dwconv2d functions to splat or loadsplat by Frank Barchard · 3 years, 10 months ago
  14. cfbed0a Disable sparse graph rewriting on x86 with AVX+ by Marat Dukhan · 3 years, 10 months ago
  15. 6f7d4a2 Remove unused input_width_tile from dwconv2d_chw_parameters by Marat Dukhan · 3 years, 10 months ago
  16. 4ddfab4 Optimize CHW microkernel selection for pre-NEON AArch32 by Marat Dukhan · 3 years, 10 months ago
  17. c763488 CONV2D HWC2CHW microkernel for ARM NEON by Marat Dukhan · 3 years, 10 months ago
  18. 3e91338 Initialize pointers to NEON CHW microkernels by Marat Dukhan · 3 years, 10 months ago
  19. 0725b8d Rename WebAssembly SIMD source files and functions with x86 or arm suffix after wasmsimd by Frank Barchard · 3 years, 10 months ago
  20. ff0624e Add WebAssembly dwconv2d_chw_3x3s2p1 benchmark by Frank Barchard · 3 years, 10 months ago
  21. b6bd4bc Implement ELU operator by Marat Dukhan · 3 years, 10 months ago
  22. 048931b Extract memcpy wrapper used by Copy operator into a microkernel by Marat Dukhan · 3 years, 10 months ago
  23. 2213606 xnn_f32_conv_hwc2chw_ukernel_3x3s2p1c3x4__wasmsimd_2x2 based on SSE version by Frank Barchard · 3 years, 10 months ago
  24. 97883b8 Enable dwconv2d_chw_3x3p1__wasmsimd_x86_2x4 microkernel by Frank Barchard · 3 years, 10 months ago
  25. 0b18cb3 Enable dwconv2d_chw_3x3p1__ssse3_2x4_acc2 microkernel by Frank Barchard · 3 years, 10 months ago
  26. ad71b9a Refactor naming of DEPTHTOSPACE microkernels by Marat Dukhan · 3 years, 10 months ago
  27. db5c32d WasmSIMD dwconv2d generate x86 optimized version. by Frank Barchard · 3 years, 10 months ago
  28. 498cb50 Initialize select SpMM microkernel for x86 or ARM based on cpu detect, by Frank Barchard · 3 years, 10 months ago
  29. 1a95305 Replace DWConv2D PSIMD with WAsm SIMD. by Frank Barchard · 3 years, 10 months ago
  30. bbe8506 Introduce DEPTH_TO_SPACE operator and enable it for graph rewriting by Artsiom Ablavatski · 3 years, 11 months ago
  31. ccca214 SSE variant of 5x5s2 DWCONV CHW micro-kernels by Marat Dukhan · 4 years ago
  32. 4fd38b2 Enable 32x1 SpMM microkernels for WAsm and SSE by Frank Barchard · 4 years ago
  33. d050389 SSE variants of 5x5 DWCONV CHW micro-kernels by Marat Dukhan · 4 years ago
  34. 29c0c33 Auto-generate 5x5s2p2 DWCONV CHW micro-kernels by Marat Dukhan · 4 years ago
  35. 9791810 Add operator implementation and tests for IBILINEAR CHW microkernel by Artsiom Ablavatski · 4 years ago
  36. b392f8e VDIV unrolled for WebAssembly by Frank Barchard · 4 years ago
  37. 149f0ea Auto-generate NEON 5x5p2 DWCONV micro-kernels by Marat Dukhan · 4 years ago
  38. c4efb00 Auto-generate scalar 5x5p2 DWCONV CHW micro-kernels by Marat Dukhan · 4 years ago
  39. cf5b3c3 Auto-generate scalar versions of DWCONV2D CHW 3x3s2p1 micro-kernels by Marat Dukhan · 4 years ago
  40. 82f0c32 Auto-generate NEON/NEONFMA versions of DWCONV2D CHW 3x3s2p1 micro-kernels by Marat Dukhan · 4 years ago
  41. 91249d2 Auto-generate scalar versions of DWCONV2D CHW 3x3p1 micro-kernels by Marat Dukhan · 4 years ago
  42. 470078a Auto-generate SSE versions of DWCONV2D CHW 3x3p1 micro-kernels by Marat Dukhan · 4 years ago
  43. bf715f9 Rename DWCONV CHW microkernels to DWCONV2D CHW by Marat Dukhan · 4 years ago
  44. 6f469a5 Minor refactoring in DWCONV CHW microkernels by Marat Dukhan · 4 years ago
  45. 1c6cad9 Suffix DWCONV CHW microkernels with block size by Marat Dukhan · 4 years ago
  46. 9e05340 Replace PSIMD SpMM microkernels with WAsm SIMD. by Frank Barchard · 4 years ago
  47. dc2b29c AVX float32 sigmoid ukernels. by T.J. Alumbaugh · 4 years ago
  48. 31677ad Enable Cortex-A55 QS8 GEMM microkernel on HMP systems by Marat Dukhan · 4 years ago
  49. 146e999 Replace QS8 4x8 with 2x8 neon microkernel. Improves performance for aarch32. by Frank Barchard · 4 years ago
  50. 1e8590e Enable QS8 A55 GEMM microkernel by Frank Barchard · 4 years ago
  51. 0797eb1 Rename QS8 assembly GEMM kernels to ld64 by Frank Barchard · 4 years ago
  52. 46aadda Enable 1x16 QS8 assembly GEMM for Neon dotproduct by Frank Barchard · 4 years ago
  53. bc0c729 Enable GEMM 4x16 QS8 using dot product microkernels. by Frank Barchard · 4 years ago
  54. d9ca7e6 AVX512F versions of Sigmoid microkernel by Marat Dukhan · 4 years ago
  55. 6dd7136 Use LUT-based Sigmoid microkernels on SSE2/SSE4 systems by Marat Dukhan · 4 years ago
  56. a96948e FP16 HardSwish operator by Frank Barchard · 4 years, 1 month ago
  57. d4c8303 Enable NEON DOT QS8 [I]GEMM microkernels on ARM64 by Marat Dukhan · 4 years, 1 month ago
  58. 0ea6a77 FP16 binary multiply operator by Frank Barchard · 4 years, 1 month ago
  59. bb9225e SSE4.1 and XOP versions of MUL32 VADD[C] microkernels by Marat Dukhan · 4 years, 1 month ago
  60. 2ffc5e6 AVX512 versions of QS8 DWCONV microkernels by Marat Dukhan · 4 years, 1 month ago
  61. ff20948 QS8 version of ND ADD operator by Marat Dukhan · 4 years, 1 month ago
  62. 9c7308f vbinary microkernels unrolled to x8 for scalar and web assembly and x16 web assembly simd by Frank Barchard · 4 years, 1 month ago
  63. 37297a6 F32-RELU unrolled more for improved performance on Web Assembly by Frank Barchard · 4 years, 1 month ago
  64. f28cddf Initialize QS8 microkernels in ARM/ARM64 builds by Marat Dukhan · 4 years, 2 months ago
  65. bb00b1d AVX512 variants of QS8 GEMM and IGEMM microkernels by Marat Dukhan · 4 years, 2 months ago
  66. 75215d8 Enable XOP versions of GEMM/IGEMM microkernels by Marat Dukhan · 4 years, 2 months ago
  67. 9e0b539 QS8 variant of NWC Global Average Pooling operator by Marat Dukhan · 4 years, 2 months ago
  68. 07e5040 Initialize QS8 microkernels for WAsm SIMD by Marat Dukhan · 4 years, 2 months ago
  69. d65a152 AVX2 versions of QS8 DWCONV microkernels by Marat Dukhan · 4 years, 2 months ago
  70. 16f1e1a QS8 version of NHWC Convolution operator by Marat Dukhan · 4 years, 2 months ago
  71. c5045bf Remove PSIMD variant of GAVGPOOL CW microkernel by Marat Dukhan · 4 years, 2 months ago
  72. 9531e9f Suffix VMULCADDC microkernels with activation name by Marat Dukhan · 4 years, 2 months ago
  73. a199d49 Remove support for direct Asm.js builds by Marat Dukhan · 4 years, 2 months ago
  74. ef25c6d NEON versions of ARGMAXPOOL microkernels by Marat Dukhan · 4 years, 2 months ago
  75. cfa217d Remove ReLU microkernel initialization on native ARM and Intel. by Frank Barchard · 4 years, 2 months ago
  76. 62c5e23 Clamp operator with ReLU activation. by Frank Barchard · 4 years, 2 months ago
  77. 40f0552 WAsm SIMD versions of ARGMAXPOOL microkernels by Marat Dukhan · 4 years, 2 months ago
  78. e3b7876 WAsm SIMD versions of X32 ZIP microkernels by Marat Dukhan · 4 years, 2 months ago
  79. 9d4bfa2 WAsm SIMD version of X32 UNPOOL microkernel by Marat Dukhan · 4 years, 2 months ago
  80. c601680 WAsm SIMD versions of GAVGPOOL microkernels by Marat Dukhan · 4 years, 2 months ago
  81. 490febe Cortex A7 microkernel based on LD64 with PLD added. 3.2% faster in end to end mobilenet v2 by Frank Barchard · 4 years, 2 months ago
  82. 1483c53 WAsm SIMD version of F32 PAVGPOOL microkernels by Marat Dukhan · 4 years, 2 months ago
  83. 3b7432d WAsm SIMD versions of F32 AVGPOOL microkernels by Marat Dukhan · 4 years, 2 months ago
  84. f4935a2 Enable WAsm SIMD microkernels for Leaky ReLU by Marat Dukhan · 4 years, 2 months ago
  85. 9306ae0 WAsm SIMD version of X32 PAD microkernel by Marat Dukhan · 4 years, 2 months ago
  86. 52238f0 WAsm SIMD versions of RADDSTOREEXPMINUSMAX microkernels by Marat Dukhan · 4 years, 2 months ago
  87. 8ee3701 WAsm SIMD version of X32 FILL microkernel by Marat Dukhan · 4 years, 2 months ago
  88. b3635ed Port SIGMOID microkernels to WAsm SIMD by Marat Dukhan · 4 years, 2 months ago
  89. b82b2cd WAsm SIMD conversion-based variants of VRND microkernels by Marat Dukhan · 4 years, 2 months ago
  90. 7829928 Reoptimize WAsm SIMD PReLU microkernels by Marat Dukhan · 4 years, 2 months ago
  91. d816f62 WAsm SIMD versions of VMULCADDC microkernels by Marat Dukhan · 4 years, 2 months ago
  92. 08b7a97 Rename Q8 microkernels and operators to QU8 by Marat Dukhan · 4 years, 2 months ago
  93. 688f6d8 Unify x86 and ARM flavors of WAsm SIMD GEMM/IGEMM/DWCONV with RELU by Marat Dukhan · 4 years, 2 months ago
  94. 55dde5b NEON F32 HSWISH microkernel unrolled by 16 by Marat Dukhan · 4 years, 3 months ago
  95. 9df9dc6 Reoptimize HSWISH microkernels by Marat Dukhan · 4 years, 3 months ago
  96. 00d1d6e WAsm SIMD variants of F32 IBILINEAR microkernels by Marat Dukhan · 4 years, 3 months ago
  97. e39e646 WAsm SIMD versions of [I]GEMM microkernels with NR=2 by Marat Dukhan · 4 years, 3 months ago
  98. f6e2480 WAsm SIMD variants of F32 MAXPOOL microkernels by Marat Dukhan · 4 years, 3 months ago
  99. 3fa52c8 WAsm SIMD versions of F32 CLAMP microkernel by Marat Dukhan · 4 years, 3 months ago
  100. 8c41796 WAsm SIMD versions of F32 RMAX microkernel by Marat Dukhan · 4 years, 3 months ago