- 6f469a5 Minor refactoring in DWCONV CHW microkernels by Marat Dukhan · 4 years ago
- 1c6cad9 Suffix DWCONV CHW microkernels with block size by Marat Dukhan · 4 years ago
- 9e05340 Replace PSIMD SpMM microkernels with WAsm SIMD. by Frank Barchard · 4 years ago
- dc2b29c AVX float32 sigmoid ukernels. by T.J. Alumbaugh · 4 years ago
- 31677ad Enable Cortex-A55 QS8 GEMM microkernel on HMP systems by Marat Dukhan · 4 years ago
- 146e999 Replace QS8 4x8 with 2x8 neon microkernel. Improves performance for aarch32. by Frank Barchard · 4 years ago
- 1e8590e Enable QS8 A55 GEMM microkernel by Frank Barchard · 4 years ago
- 0797eb1 Rename QS8 assembly GEMM kernels to ld64 by Frank Barchard · 4 years ago
- 46aadda Enable 1x16 QS8 assembly GEMM for Neon dotproduct by Frank Barchard · 4 years ago
- bc0c729 Enable GEMM 4x16 QS8 using dot product microkernels. by Frank Barchard · 4 years ago
- d9ca7e6 AVX512F versions of Sigmoid microkernel by Marat Dukhan · 4 years ago
- 6dd7136 Use LUT-based Sigmoid microkernels on SSE2/SSE4 systems by Marat Dukhan · 4 years ago
- a96948e FP16 HardSwish operator by Frank Barchard · 4 years, 1 month ago
- d4c8303 Enable NEON DOT QS8 [I]GEMM microkernels on ARM64 by Marat Dukhan · 4 years, 1 month ago
- 0ea6a77 FP16 binary multiply operator by Frank Barchard · 4 years, 1 month ago
- bb9225e SSE4.1 and XOP versions of MUL32 VADD[C] microkernels by Marat Dukhan · 4 years, 1 month ago
- 2ffc5e6 AVX512 versions of QS8 DWCONV microkernels by Marat Dukhan · 4 years, 1 month ago
- ff20948 QS8 version of ND ADD operator by Marat Dukhan · 4 years, 1 month ago
- 9c7308f vbinary microkernels unrolled to x8 for scalar and web assembly and x16 web assembly simd by Frank Barchard · 4 years, 1 month ago
- 37297a6 F32-RELU unrolled more for improved performance on Web Assembly by Frank Barchard · 4 years, 1 month ago
- f28cddf Initialize QS8 microkernels in ARM/ARM64 builds by Marat Dukhan · 4 years, 2 months ago
- bb00b1d AVX512 variants of QS8 GEMM and IGEMM microkernels by Marat Dukhan · 4 years, 2 months ago
- 75215d8 Enable XOP versions of GEMM/IGEMM microkernels by Marat Dukhan · 4 years, 2 months ago
- 9e0b539 QS8 variant of NWC Global Average Pooling operator by Marat Dukhan · 4 years, 2 months ago
- 07e5040 Initialize QS8 microkernels for WAsm SIMD by Marat Dukhan · 4 years, 2 months ago
- d65a152 AVX2 versions of QS8 DWCONV microkernels by Marat Dukhan · 4 years, 2 months ago
- 16f1e1a QS8 version of NHWC Convolution operator by Marat Dukhan · 4 years, 2 months ago
- c5045bf Remove PSIMD variant of GAVGPOOL CW microkernel by Marat Dukhan · 4 years, 2 months ago
- 9531e9f Suffix VMULCADDC microkernels with activation name by Marat Dukhan · 4 years, 2 months ago
- a199d49 Remove support for direct Asm.js builds by Marat Dukhan · 4 years, 2 months ago
- ef25c6d NEON versions of ARGMAXPOOL microkernels by Marat Dukhan · 4 years, 2 months ago
- cfa217d Remove ReLU microkernel initialization on native ARM and Intel. by Frank Barchard · 4 years, 2 months ago
- 62c5e23 Clamp operator with ReLU activation. by Frank Barchard · 4 years, 2 months ago
- 40f0552 WAsm SIMD versions of ARGMAXPOOL microkernels by Marat Dukhan · 4 years, 3 months ago
- e3b7876 WAsm SIMD versions of X32 ZIP microkernels by Marat Dukhan · 4 years, 3 months ago
- 9d4bfa2 WAsm SIMD version of X32 UNPOOL microkernel by Marat Dukhan · 4 years, 3 months ago
- c601680 WAsm SIMD versions of GAVGPOOL microkernels by Marat Dukhan · 4 years, 3 months ago
- 490febe Cortex A7 microkernel based on LD64 with PLD added. 3.2% faster in end to end mobilenet v2 by Frank Barchard · 4 years, 3 months ago
- 1483c53 WAsm SIMD version of F32 PAVGPOOL microkernels by Marat Dukhan · 4 years, 3 months ago
- 3b7432d WAsm SIMD versions of F32 AVGPOOL microkernels by Marat Dukhan · 4 years, 3 months ago
- f4935a2 Enable WAsm SIMD microkernels for Leaky ReLU by Marat Dukhan · 4 years, 3 months ago
- 9306ae0 WAsm SIMD version of X32 PAD microkernel by Marat Dukhan · 4 years, 3 months ago
- 52238f0 WAsm SIMD versions of RADDSTOREEXPMINUSMAX microkernels by Marat Dukhan · 4 years, 3 months ago
- 8ee3701 WAsm SIMD version of X32 FILL microkernel by Marat Dukhan · 4 years, 3 months ago
- b3635ed Port SIGMOID microkernels to WAsm SIMD by Marat Dukhan · 4 years, 3 months ago
- b82b2cd WAsm SIMD conversion-based variants of VRND microkernels by Marat Dukhan · 4 years, 3 months ago
- 7829928 Reoptimize WAsm SIMD PReLU microkernels by Marat Dukhan · 4 years, 3 months ago
- d816f62 WAsm SIMD versions of VMULCADDC microkernels by Marat Dukhan · 4 years, 3 months ago
- 08b7a97 Rename Q8 microkernels and operators to QU8 by Marat Dukhan · 4 years, 3 months ago
- 688f6d8 Unify x86 and ARM flavors of WAsm SIMD GEMM/IGEMM/DWCONV with RELU by Marat Dukhan · 4 years, 3 months ago
- 55dde5b NEON F32 HSWISH microkernel unrolled by 16 by Marat Dukhan · 4 years, 3 months ago
- 9df9dc6 Reoptimize HSWISH microkernels by Marat Dukhan · 4 years, 3 months ago
- 00d1d6e WAsm SIMD variants of F32 IBILINEAR microkernels by Marat Dukhan · 4 years, 3 months ago
- e39e646 WAsm SIMD versions of [I]GEMM microkernels with NR=2 by Marat Dukhan · 4 years, 3 months ago
- f6e2480 WAsm SIMD variants of F32 MAXPOOL microkernels by Marat Dukhan · 4 years, 3 months ago
- 3fa52c8 WAsm SIMD versions of F32 CLAMP microkernel by Marat Dukhan · 4 years, 3 months ago
- 8c41796 WAsm SIMD versions of F32 RMAX microkernel by Marat Dukhan · 4 years, 3 months ago
- c67dd7f Initialize linear vs minmax binary operator microkernels for web assembly. by Frank Barchard · 4 years, 3 months ago
- 6804bbd Square Root operator by Marat Dukhan · 4 years, 3 months ago
- f4df5fe Cortex-A7 use prefetch version of GEMM microkernel. by Frank Barchard · 4 years, 3 months ago
- 37c8351 Port unary elementwise microkernels to WAsm SIMD by Marat Dukhan · 4 years, 3 months ago
- 72b399a Port RND microkernels to WAsm SIMD intrinsics by Marat Dukhan · 4 years, 3 months ago
- f2ebd89 Remove VRSQRDIFFC microkernels by Marat Dukhan · 4 years, 3 months ago
- cdc5655 Enable WebAssembly SIMD kernels for binary elementwise operators by Marat Dukhan · 4 years, 3 months ago
- 6b73c4f FP16 use 6x16 aarch64 microkernel by Frank Barchard · 4 years, 3 months ago
- 49b4dcc FP16 Convolution NHWC operator by Frank Barchard · 4 years, 3 months ago
- c303fe6 Optimize selection of HSWISH microkernels in WAsm SIMD by Marat Dukhan · 4 years, 3 months ago
- 0d3f467 SSE2 and SSE4.1 versions of Leaky ReLU microkernels by Marat Dukhan · 4 years, 3 months ago
- 7c1f808 WAsm implementation of PReLU microkernels by Marat Dukhan · 4 years, 3 months ago
- 195f8eb WAsm SIMD implementation of PReLU microkernels by Marat Dukhan · 4 years, 3 months ago
- 39b5e94 SSE versions of PReLU microkernels by Marat Dukhan · 4 years, 3 months ago
- 01898c0 FP16 binary add operator by Frank Barchard · 4 years, 3 months ago
- 8d5d259 Check NEON FP16 arithmetics support by Marat Dukhan · 4 years, 3 months ago
- 854fb6b Replace xnn_params.initialized with fine-grained xnn_params.init_flags by Marat Dukhan · 4 years, 3 months ago
- b8e7b07 DWCONV microkernels with alternative activations in WAsm SIMD by Marat Dukhan · 4 years, 4 months ago
- 802808c GEMM/IGEMM microkernels with alternative activations in WAsm SIMD by Marat Dukhan · 4 years, 4 months ago
- ac014d7 DWCONV microkernels in WAsm SIMD intrinsics by Marat Dukhan · 4 years, 4 months ago
- 1bbf96b GEMM/IGEMM implementations in WAsm SIMD intrinsics by Marat Dukhan · 4 years, 4 months ago
- 7465a89 Add PSIMD DWCONV CHW 5X5S2P2 kernel. by Erich Elsen · 4 years, 4 months ago
- 2892889 Add PSIMD DWCONV 5x5s2 kernel. by Erich Elsen · 4 years, 4 months ago
- 7e2cbb0 FP16 Global Average Pooling operator by Frank Barchard · 4 years, 4 months ago
- 016e586 iOS use Cortex-A75 microkernel which avoids x18 register by Frank Barchard · 4 years, 4 months ago
- 2881333 FP32 Leaky ReLU operator by Marat Dukhan · 4 years, 4 months ago
- 0a1970e PSIMD F32-CONV-HWC2CHW kernel by Erich Elsen · 4 years, 4 months ago
- 6e80fdc Add 16x1 SSE f32-SpMM kernels, which is faster than the existing 8x1 kernel. by Erich Elsen · 4 years, 4 months ago
- 64e5251 Rounding operators by Marat Dukhan · 4 years, 4 months ago
- 5b2e07a Add new x86 sse chw2hwc conv kernel to init.c by Erich Elsen · 4 years, 4 months ago
- 5020b96 Abs, Negate, and Square NC operators by Marat Dukhan · 4 years, 4 months ago
- f739926 Squared Difference operator by Marat Dukhan · 4 years, 4 months ago
- 467f636 Fused [I]GEMM+RELU micro-kernels by Marat Dukhan · 4 years, 4 months ago
- 63523d4 Refactor X32 PAD micro-kernels by Marat Dukhan · 4 years, 4 months ago
- 4662b19 N-dimensional Pad operator by Marat Dukhan · 4 years, 4 months ago
- 1f29b80 Refactor CHW micro-kernels by Marat Dukhan · 4 years, 5 months ago
- bcdb1c1 Remove xnn_q8_dwconv_minmax_ukernel_up8x9__aarch32_neon by Frank Barchard · 4 years, 5 months ago
- 3b745a4 Initialize micro-kernels for pre-NEON ARM in non-mobile builds by Marat Dukhan · 4 years, 5 months ago
- 0184901 Simplify x86 detection in WAsm builds by Marat Dukhan · 4 years, 5 months ago
- f5425ea Additional NEON/NEONFMA DWCONV microkernels by Marat Dukhan · 4 years, 5 months ago
- 57dccd8 NEON and SSE2 implementations of X32 UNPOOL micro-kernel by Marat Dukhan · 4 years, 6 months ago
- 57133c0 Port xnn_initialize to Windows by Marat Dukhan · 4 years, 6 months ago
- 9993660 Add MINMAX suffix to remaining micro-kernels by Marat Dukhan · 4 years, 6 months ago