- bbf5182 Enable QS8 2x8c8-aarch64-neon-mlal-padal GEMM / IGEMM microkernels by Frank Barchard · 3 years, 7 months ago
- 4610854 Disable QS8 1x16c4-aarch64-neondot-ld64 IGEMM microkernel by Frank Barchard · 3 years, 7 months ago
- 3522c0a Enable QS8 4x16c4-aarch64-neondot-ld64 IGEMM microkernel by Frank Barchard · 3 years, 7 months ago
- b75840f Enable QS8 IGEMM for Cortex A55 by Frank Barchard · 3 years, 7 months ago
- fb0ab0b QS8 enable 4x8c4__neondot for ARM32 by Frank Barchard · 3 years, 7 months ago
- a414daa Enable Quantized C2 microkernel for Neon by Frank Barchard · 3 years, 7 months ago
- 4baa2ac Process 32 pixels at a time in ARM64 SpMM microkernels by Marat Dukhan · 3 years, 8 months ago
- 2d6bcbb Reorder a few gemm1 initializations to match end to end order of gemm, igemm, gemm1, igemm1 by Jared Duke · 3 years, 8 months ago
- 9b7562b Reorder a few gemm1 initializations to match end to end order of gemm, igemm, gemm1, igemm1 by Frank Barchard · 3 years, 8 months ago
- 2202c81 Implement bilinear upsampling (CHW layout) for ARM architecture by Artsiom Ablavatski · 3 years, 8 months ago
- b94e34b QS8 GEMM select 2x16 for Neon MLAL. by Frank Barchard · 3 years, 9 months ago
- dfe47b9 Use iOS microkernels for Apple Silicon Macs by Marat Dukhan · 3 years, 10 months ago
- 412e2f4 Rename WASMSIMD dwconv2d functions to splat or loadsplat by Frank Barchard · 3 years, 10 months ago
- cfbed0a Disable sparse graph rewriting on x86 with AVX+ by Marat Dukhan · 3 years, 10 months ago
- 6f7d4a2 Remove unused input_width_tile from dwconv2d_chw_parameters by Marat Dukhan · 3 years, 10 months ago
- 4ddfab4 Optimize CHW microkernel selection for pre-NEON AArch32 by Marat Dukhan · 3 years, 10 months ago
- c763488 CONV2D HWC2CHW microkernel for ARM NEON by Marat Dukhan · 3 years, 10 months ago
- 3e91338 Initialize pointers to NEON CHW microkernels by Marat Dukhan · 3 years, 10 months ago
- 0725b8d Rename WebAssembly SIMD source files and functions with x86 or arm suffix after wasmsimd by Frank Barchard · 3 years, 10 months ago
- ff0624e Add WebAssembly dwconv2d_chw_3x3s2p1 benchmark by Frank Barchard · 3 years, 10 months ago
- b6bd4bc Implement ELU operator by Marat Dukhan · 3 years, 10 months ago
- 048931b Extract memcpy wrapper used by Copy operator into a microkernel by Marat Dukhan · 3 years, 10 months ago
- 2213606 xnn_f32_conv_hwc2chw_ukernel_3x3s2p1c3x4__wasmsimd_2x2 based on SSE version by Frank Barchard · 3 years, 10 months ago
- 97883b8 Enable dwconv2d_chw_3x3p1__wasmsimd_x86_2x4 microkernel by Frank Barchard · 3 years, 10 months ago
- 0b18cb3 Enable dwconv2d_chw_3x3p1__ssse3_2x4_acc2 microkernel by Frank Barchard · 3 years, 10 months ago
- ad71b9a Refactor naming of DEPTHTOSPACE microkernels by Marat Dukhan · 3 years, 10 months ago
- db5c32d WasmSIMD dwconv2d generate x86 optimized version. by Frank Barchard · 3 years, 10 months ago
- 498cb50 Initialize select SpMM microkernel for x86 or ARM based on cpu detect, by Frank Barchard · 3 years, 10 months ago
- 1a95305 Replace DWConv2D PSIMD with WAsm SIMD. by Frank Barchard · 3 years, 10 months ago
- bbe8506 Introduce DEPTH_TO_SPACE operator and enable it for graph rewriting by Artsiom Ablavatski · 3 years, 11 months ago
- ccca214 SSE variant of 5x5s2 DWCONV CHW micro-kernels by Marat Dukhan · 4 years ago
- 4fd38b2 Enable 32x1 SpMM microkernels for WAsm and SSE by Frank Barchard · 4 years ago
- d050389 SSE variants of 5x5 DWCONV CHW micro-kernels by Marat Dukhan · 4 years ago
- 29c0c33 Auto-generate 5x5s2p2 DWCONV CHW micro-kernels by Marat Dukhan · 4 years ago
- 9791810 Add operator implementation and tests for IBILINEAR CHW microkernel by Artsiom Ablavatski · 4 years ago
- b392f8e VDIV unrolled for WebAssembly by Frank Barchard · 4 years ago
- 149f0ea Auto-generate NEON 5x5p2 DWCONV micro-kernels by Marat Dukhan · 4 years ago
- c4efb00 Auto-generate scalar 5x5p2 DWCONV CHW micro-kernels by Marat Dukhan · 4 years ago
- cf5b3c3 Auto-generate scalar versions of DWCONV2D CHW 3x3s2p1 micro-kernels by Marat Dukhan · 4 years ago
- 82f0c32 Auto-generate NEON/NEONFMA versions of DWCONV2D CHW 3x3s2p1 micro-kernels by Marat Dukhan · 4 years ago
- 91249d2 Auto-generate scalar versions of DWCONV2D CHW 3x3p1 micro-kernels by Marat Dukhan · 4 years ago
- 470078a Auto-generate SSE versions of DWCONV2D CHW 3x3p1 micro-kernels by Marat Dukhan · 4 years ago
- bf715f9 Rename DWCONV CHW microkernels to DWCONV2D CHW by Marat Dukhan · 4 years ago
- 6f469a5 Minor refactoring in DWCONV CHW microkernels by Marat Dukhan · 4 years ago
- 1c6cad9 Suffix DWCONV CHW microkernels with block size by Marat Dukhan · 4 years ago
- 9e05340 Replace PSIMD SpMM microkernels with WAsm SIMD. by Frank Barchard · 4 years ago
- dc2b29c AVX float32 sigmoid ukernels. by T.J. Alumbaugh · 4 years ago
- 31677ad Enable Cortex-A55 QS8 GEMM microkernel on HMP systems by Marat Dukhan · 4 years ago
- 146e999 Replace QS8 4x8 with 2x8 neon microkernel. Improves performance for aarch32. by Frank Barchard · 4 years ago
- 1e8590e Enable QS8 A55 GEMM microkernel by Frank Barchard · 4 years ago
- 0797eb1 Rename QS8 assembly GEMM kernels to ld64 by Frank Barchard · 4 years ago
- 46aadda Enable 1x16 QS8 assembly GEMM for Neon dotproduct by Frank Barchard · 4 years ago
- bc0c729 Enable GEMM 4x16 QS8 using dot product microkernels. by Frank Barchard · 4 years ago
- d9ca7e6 AVX512F versions of Sigmoid microkernel by Marat Dukhan · 4 years ago
- 6dd7136 Use LUT-based Sigmoid microkernels on SSE2/SSE4 systems by Marat Dukhan · 4 years ago
- a96948e FP16 HardSwish operator by Frank Barchard · 4 years, 1 month ago
- d4c8303 Enable NEON DOT QS8 [I]GEMM microkernels on ARM64 by Marat Dukhan · 4 years, 1 month ago
- 0ea6a77 FP16 binary multiply operator by Frank Barchard · 4 years, 1 month ago
- bb9225e SSE4.1 and XOP versions of MUL32 VADD[C] microkernels by Marat Dukhan · 4 years, 1 month ago
- 2ffc5e6 AVX512 versions of QS8 DWCONV microkernels by Marat Dukhan · 4 years, 1 month ago
- ff20948 QS8 version of ND ADD operator by Marat Dukhan · 4 years, 1 month ago
- 9c7308f vbinary microkernels unrolled to x8 for scalar and web assembly and x16 web assembly simd by Frank Barchard · 4 years, 1 month ago
- 37297a6 F32-RELU unrolled more for improved performance on Web Assembly by Frank Barchard · 4 years, 1 month ago
- f28cddf Initialize QS8 microkernels in ARM/ARM64 builds by Marat Dukhan · 4 years, 2 months ago
- bb00b1d AVX512 variants of QS8 GEMM and IGEMM microkernels by Marat Dukhan · 4 years, 2 months ago
- 75215d8 Enable XOP versions of GEMM/IGEMM microkernels by Marat Dukhan · 4 years, 2 months ago
- 9e0b539 QS8 variant of NWC Global Average Pooling operator by Marat Dukhan · 4 years, 2 months ago
- 07e5040 Initialize QS8 microkernels for WAsm SIMD by Marat Dukhan · 4 years, 2 months ago
- d65a152 AVX2 versions of QS8 DWCONV microkernels by Marat Dukhan · 4 years, 2 months ago
- 16f1e1a QS8 version of NHWC Convolution operator by Marat Dukhan · 4 years, 2 months ago
- c5045bf Remove PSIMD variant of GAVGPOOL CW microkernel by Marat Dukhan · 4 years, 2 months ago
- 9531e9f Suffix VMULCADDC microkernels with activation name by Marat Dukhan · 4 years, 2 months ago
- a199d49 Remove support for direct Asm.js builds by Marat Dukhan · 4 years, 2 months ago
- ef25c6d NEON versions of ARGMAXPOOL microkernels by Marat Dukhan · 4 years, 2 months ago
- cfa217d Remove ReLU microkernel initialization on native ARM and Intel. by Frank Barchard · 4 years, 2 months ago
- 62c5e23 Clamp operator with ReLU activation. by Frank Barchard · 4 years, 2 months ago
- 40f0552 WAsm SIMD versions of ARGMAXPOOL microkernels by Marat Dukhan · 4 years, 2 months ago
- e3b7876 WAsm SIMD versions of X32 ZIP microkernels by Marat Dukhan · 4 years, 2 months ago
- 9d4bfa2 WAsm SIMD version of X32 UNPOOL microkernel by Marat Dukhan · 4 years, 2 months ago
- c601680 WAsm SIMD versions of GAVGPOOL microkernels by Marat Dukhan · 4 years, 2 months ago
- 490febe Cortex A7 microkernel based on LD64 with PLD added. 3.2% faster in end to end mobilenet v2 by Frank Barchard · 4 years, 2 months ago
- 1483c53 WAsm SIMD version of F32 PAVGPOOL microkernels by Marat Dukhan · 4 years, 2 months ago
- 3b7432d WAsm SIMD versions of F32 AVGPOOL microkernels by Marat Dukhan · 4 years, 2 months ago
- f4935a2 Enable WAsm SIMD microkernels for Leaky ReLU by Marat Dukhan · 4 years, 2 months ago
- 9306ae0 WAsm SIMD version of X32 PAD microkernel by Marat Dukhan · 4 years, 2 months ago
- 52238f0 WAsm SIMD versions of RADDSTOREEXPMINUSMAX microkernels by Marat Dukhan · 4 years, 2 months ago
- 8ee3701 WAsm SIMD version of X32 FILL microkernel by Marat Dukhan · 4 years, 2 months ago
- b3635ed Port SIGMOID microkernels to WAsm SIMD by Marat Dukhan · 4 years, 2 months ago
- b82b2cd WAsm SIMD conversion-based variants of VRND microkernels by Marat Dukhan · 4 years, 2 months ago
- 7829928 Reoptimize WAsm SIMD PReLU microkernels by Marat Dukhan · 4 years, 2 months ago
- d816f62 WAsm SIMD versions of VMULCADDC microkernels by Marat Dukhan · 4 years, 2 months ago
- 08b7a97 Rename Q8 microkernels and operators to QU8 by Marat Dukhan · 4 years, 2 months ago
- 688f6d8 Unify x86 and ARM flavors of WAsm SIMD GEMM/IGEMM/DWCONV with RELU by Marat Dukhan · 4 years, 2 months ago
- 55dde5b NEON F32 HSWISH microkernel unrolled by 16 by Marat Dukhan · 4 years, 3 months ago
- 9df9dc6 Reoptimize HSWISH microkernels by Marat Dukhan · 4 years, 3 months ago
- 00d1d6e WAsm SIMD variants of F32 IBILINEAR microkernels by Marat Dukhan · 4 years, 3 months ago
- e39e646 WAsm SIMD versions of [I]GEMM microkernels with NR=2 by Marat Dukhan · 4 years, 3 months ago
- f6e2480 WAsm SIMD variants of F32 MAXPOOL microkernels by Marat Dukhan · 4 years, 3 months ago
- 3fa52c8 WAsm SIMD versions of F32 CLAMP microkernel by Marat Dukhan · 4 years, 3 months ago
- 8c41796 WAsm SIMD versions of F32 RMAX microkernel by Marat Dukhan · 4 years, 3 months ago