  1. 6f469a5 Minor refactoring in DWCONV CHW microkernels by Marat Dukhan · 4 years ago
  2. 1c6cad9 Suffix DWCONV CHW microkernels with block size by Marat Dukhan · 4 years ago
  3. 9e05340 Replace PSIMD SpMM microkernels with WAsm SIMD. by Frank Barchard · 4 years ago
  4. dc2b29c AVX float32 sigmoid ukernels. by T.J. Alumbaugh · 4 years ago
  5. 31677ad Enable Cortex-A55 QS8 GEMM microkernel on HMP systems by Marat Dukhan · 4 years ago
  6. 146e999 Replace QS8 4x8 with 2x8 neon microkernel. Improves performance for aarch32. by Frank Barchard · 4 years ago
  7. 1e8590e Enable QS8 A55 GEMM microkernel by Frank Barchard · 4 years ago
  8. 0797eb1 Rename QS8 assembly GEMM kernels to ld64 by Frank Barchard · 4 years ago
  9. 46aadda Enable 1x16 QS8 assembly GEMM for Neon dotproduct by Frank Barchard · 4 years ago
  10. bc0c729 Enable GEMM 4x16 QS8 using dot product microkernels. by Frank Barchard · 4 years ago
  11. d9ca7e6 AVX512F versions of Sigmoid microkernel by Marat Dukhan · 4 years ago
  12. 6dd7136 Use LUT-based Sigmoid microkernels on SSE2/SSE4 systems by Marat Dukhan · 4 years ago
  13. a96948e FP16 HardSwish operator by Frank Barchard · 4 years, 1 month ago
  14. d4c8303 Enable NEON DOT QS8 [I]GEMM microkernels on ARM64 by Marat Dukhan · 4 years, 1 month ago
  15. 0ea6a77 FP16 binary multiply operator by Frank Barchard · 4 years, 1 month ago
  16. bb9225e SSE4.1 and XOP versions of MUL32 VADD[C] microkernels by Marat Dukhan · 4 years, 1 month ago
  17. 2ffc5e6 AVX512 versions of QS8 DWCONV microkernels by Marat Dukhan · 4 years, 1 month ago
  18. ff20948 QS8 version of ND ADD operator by Marat Dukhan · 4 years, 1 month ago
  19. 9c7308f vbinary microkernels unrolled to x8 for scalar and web assembly and x16 web assembly simd by Frank Barchard · 4 years, 1 month ago
  20. 37297a6 F32-RELU unrolled more for improved performance on Web Assembly by Frank Barchard · 4 years, 1 month ago
  21. f28cddf Initialize QS8 microkernels in ARM/ARM64 builds by Marat Dukhan · 4 years, 2 months ago
  22. bb00b1d AVX512 variants of QS8 GEMM and IGEMM microkernels by Marat Dukhan · 4 years, 2 months ago
  23. 75215d8 Enable XOP versions of GEMM/IGEMM microkernels by Marat Dukhan · 4 years, 2 months ago
  24. 9e0b539 QS8 variant of NWC Global Average Pooling operator by Marat Dukhan · 4 years, 2 months ago
  25. 07e5040 Initialize QS8 microkernels for WAsm SIMD by Marat Dukhan · 4 years, 2 months ago
  26. d65a152 AVX2 versions of QS8 DWCONV microkernels by Marat Dukhan · 4 years, 2 months ago
  27. 16f1e1a QS8 version of NHWC Convolution operator by Marat Dukhan · 4 years, 2 months ago
  28. c5045bf Remove PSIMD variant of GAVGPOOL CW microkernel by Marat Dukhan · 4 years, 2 months ago
  29. 9531e9f Suffix VMULCADDC microkernels with activation name by Marat Dukhan · 4 years, 2 months ago
  30. a199d49 Remove support for direct Asm.js builds by Marat Dukhan · 4 years, 2 months ago
  31. ef25c6d NEON versions of ARGMAXPOOL microkernels by Marat Dukhan · 4 years, 2 months ago
  32. cfa217d Remove ReLU microkernel initialization on native ARM and Intel. by Frank Barchard · 4 years, 2 months ago
  33. 62c5e23 Clamp operator with ReLU activation. by Frank Barchard · 4 years, 2 months ago
  34. 40f0552 WAsm SIMD versions of ARGMAXPOOL microkernels by Marat Dukhan · 4 years, 3 months ago
  35. e3b7876 WAsm SIMD versions of X32 ZIP microkernels by Marat Dukhan · 4 years, 3 months ago
  36. 9d4bfa2 WAsm SIMD version of X32 UNPOOL microkernel by Marat Dukhan · 4 years, 3 months ago
  37. c601680 WAsm SIMD versions of GAVGPOOL microkernels by Marat Dukhan · 4 years, 3 months ago
  38. 490febe Cortex A7 microkernel based on LD64 with PLD added. 3.2% faster in end to end mobilenet v2 by Frank Barchard · 4 years, 3 months ago
  39. 1483c53 WAsm SIMD version of F32 PAVGPOOL microkernels by Marat Dukhan · 4 years, 3 months ago
  40. 3b7432d WAsm SIMD versions of F32 AVGPOOL microkernels by Marat Dukhan · 4 years, 3 months ago
  41. f4935a2 Enable WAsm SIMD microkernels for Leaky ReLU by Marat Dukhan · 4 years, 3 months ago
  42. 9306ae0 WAsm SIMD version of X32 PAD microkernel by Marat Dukhan · 4 years, 3 months ago
  43. 52238f0 WAsm SIMD versions of RADDSTOREEXPMINUSMAX microkernels by Marat Dukhan · 4 years, 3 months ago
  44. 8ee3701 WAsm SIMD version of X32 FILL microkernel by Marat Dukhan · 4 years, 3 months ago
  45. b3635ed Port SIGMOID microkernels to WAsm SIMD by Marat Dukhan · 4 years, 3 months ago
  46. b82b2cd WAsm SIMD conversion-based variants of VRND microkernels by Marat Dukhan · 4 years, 3 months ago
  47. 7829928 Reoptimize WAsm SIMD PReLU microkernels by Marat Dukhan · 4 years, 3 months ago
  48. d816f62 WAsm SIMD versions of VMULCADDC microkernels by Marat Dukhan · 4 years, 3 months ago
  49. 08b7a97 Rename Q8 microkernels and operators to QU8 by Marat Dukhan · 4 years, 3 months ago
  50. 688f6d8 Unify x86 and ARM flavors of WAsm SIMD GEMM/IGEMM/DWCONV with RELU by Marat Dukhan · 4 years, 3 months ago
  51. 55dde5b NEON F32 HSWISH microkernel unrolled by 16 by Marat Dukhan · 4 years, 3 months ago
  52. 9df9dc6 Reoptimize HSWISH microkernels by Marat Dukhan · 4 years, 3 months ago
  53. 00d1d6e WAsm SIMD variants of F32 IBILINEAR microkernels by Marat Dukhan · 4 years, 3 months ago
  54. e39e646 WAsm SIMD versions of [I]GEMM microkernels with NR=2 by Marat Dukhan · 4 years, 3 months ago
  55. f6e2480 WAsm SIMD variants of F32 MAXPOOL microkernels by Marat Dukhan · 4 years, 3 months ago
  56. 3fa52c8 WAsm SIMD versions of F32 CLAMP microkernel by Marat Dukhan · 4 years, 3 months ago
  57. 8c41796 WAsm SIMD versions of F32 RMAX microkernel by Marat Dukhan · 4 years, 3 months ago
  58. c67dd7f Initialize linear vs minmax binary operator microkernels for web assembly. by Frank Barchard · 4 years, 3 months ago
  59. 6804bbd Square Root operator by Marat Dukhan · 4 years, 3 months ago
  60. f4df5fe Cortex-A7 use prefetch version of GEMM microkernel. by Frank Barchard · 4 years, 3 months ago
  61. 37c8351 Port unary elementwise microkernels to WAsm SIMD by Marat Dukhan · 4 years, 3 months ago
  62. 72b399a Port RND microkernels to WAsm SIMD intrinsics by Marat Dukhan · 4 years, 3 months ago
  63. f2ebd89 Remove VRSQRDIFFC microkernels by Marat Dukhan · 4 years, 3 months ago
  64. cdc5655 Enable WebAssembly SIMD kernels for binary elementwise operators by Marat Dukhan · 4 years, 3 months ago
  65. 6b73c4f FP16 use 6x16 aarch64 microkernel by Frank Barchard · 4 years, 3 months ago
  66. 49b4dcc FP16 Convolution NHWC operator by Frank Barchard · 4 years, 3 months ago
  67. c303fe6 Optimize selection of HSWISH microkernels in WAsm SIMD by Marat Dukhan · 4 years, 3 months ago
  68. 0d3f467 SSE2 and SSE4.1 versions of Leaky ReLU microkernels by Marat Dukhan · 4 years, 3 months ago
  69. 7c1f808 WAsm implementation of PReLU microkernels by Marat Dukhan · 4 years, 3 months ago
  70. 195f8eb WAsm SIMD implementation of PReLU microkernels by Marat Dukhan · 4 years, 3 months ago
  71. 39b5e94 SSE versions of PReLU microkernels by Marat Dukhan · 4 years, 3 months ago
  72. 01898c0 FP16 binary add operator by Frank Barchard · 4 years, 3 months ago
  73. 8d5d259 Check NEON FP16 arithmetics support by Marat Dukhan · 4 years, 3 months ago
  74. 854fb6b Replace xnn_params.initialized with fine-grained xnn_params.init_flags by Marat Dukhan · 4 years, 3 months ago
  75. b8e7b07 DWCONV microkernels with alternative activations in WAsm SIMD by Marat Dukhan · 4 years, 4 months ago
  76. 802808c GEMM/IGEMM microkernels with alternative activations in WAsm SIMD by Marat Dukhan · 4 years, 4 months ago
  77. ac014d7 DWCONV microkernels in WAsm SIMD intrinsics by Marat Dukhan · 4 years, 4 months ago
  78. 1bbf96b GEMM/IGEMM implementations in WAsm SIMD intrinsics by Marat Dukhan · 4 years, 4 months ago
  79. 7465a89 Add PSIMD DWCONV CHW 5X5S2P2 kernel. by Erich Elsen · 4 years, 4 months ago
  80. 2892889 Add PSIMD DWCONV 5x5s2 kernel. by Erich Elsen · 4 years, 4 months ago
  81. 7e2cbb0 FP16 Global Average Pooling operator by Frank Barchard · 4 years, 4 months ago
  82. 016e586 iOS use Cortex-A75 microkernel which avoids x18 register by Frank Barchard · 4 years, 4 months ago
  83. 2881333 FP32 Leaky ReLU operator by Marat Dukhan · 4 years, 4 months ago
  84. 0a1970e PSIMD F32-CONV-HWC2CHW kernel by Erich Elsen · 4 years, 4 months ago
  85. 6e80fdc Add 16x1 SSE f32-SpMM kernels, which is faster than the existing 8x1 kernel. by Erich Elsen · 4 years, 4 months ago
  86. 64e5251 Rounding operators by Marat Dukhan · 4 years, 4 months ago
  87. 5b2e07a Add new x86 sse chw2hwc conv kernel to init.c by Erich Elsen · 4 years, 4 months ago
  88. 5020b96 Abs, Negate, and Square NC operators by Marat Dukhan · 4 years, 4 months ago
  89. f739926 Squared Difference operator by Marat Dukhan · 4 years, 4 months ago
  90. 467f636 Fused [I]GEMM+RELU micro-kernels by Marat Dukhan · 4 years, 4 months ago
  91. 63523d4 Refactor X32 PAD micro-kernels by Marat Dukhan · 4 years, 4 months ago
  92. 4662b19 N-dimensional Pad operator by Marat Dukhan · 4 years, 4 months ago
  93. 1f29b80 Refactor CHW micro-kernels by Marat Dukhan · 4 years, 5 months ago
  94. bcdb1c1 Remove xnn_q8_dwconv_minmax_ukernel_up8x9__aarch32_neon by Frank Barchard · 4 years, 5 months ago
  95. 3b745a4 Initialize micro-kernels for pre-NEON ARM in non-mobile builds by Marat Dukhan · 4 years, 5 months ago
  96. 0184901 Simplify x86 detection in WAsm builds by Marat Dukhan · 4 years, 5 months ago
  97. f5425ea Additional NEON/NEONFMA DWCONV microkernels by Marat Dukhan · 4 years, 5 months ago
  98. 57dccd8 NEON and SSE2 implementations of X32 UNPOOL micro-kernel by Marat Dukhan · 4 years, 6 months ago
  99. 57133c0 Port xnn_initialize to Windows by Marat Dukhan · 4 years, 6 months ago
  100. 9993660 Add MINMAX suffix to remaining micro-kernels by Marat Dukhan · 4 years, 6 months ago