- 801d2c2 Fix QS8 IGEMM with FP32 requantization for SSE/AVX/XOP by Marat Dukhan · 3 years, 5 months ago
- 0b04374 Support QC8 GEMM microkernels by Marat Dukhan · 3 years, 5 months ago
- 8b0e381 Remove bias_n accessor in GemmMicrokernelTester by Marat Dukhan · 3 years, 5 months ago
- e695791 4x16C4 QS8 IGEMM Cortex A55 microkernel reuse X10 to save push by Frank Barchard · 3 years, 5 months ago
- 4a2d255 Remove redundant SSSE3 microkernels with FP32 requantization by Marat Dukhan · 3 years, 5 months ago
- caf4831 FP32 requantization in QS8 DWCONV microkernels for SSE/AVX/XOP by Marat Dukhan · 3 years, 5 months ago
- c46e671 FP32 requantization in QS8 GEMM/IGEMM microkernels for SSE/AVX/XOP by Marat Dukhan · 3 years, 5 months ago
- c6e6ee0 Refactor RNDNA and RNDNU reference requantization by Marat Dukhan · 3 years, 5 months ago
- 062bee3 Evaluation stubs for RNDNU requantization by Marat Dukhan · 3 years, 5 months ago
- 0671624 Rename PRECISE requantization schema to RNDNA by Marat Dukhan · 3 years, 5 months ago
- 71855ee Support FP32 requantization in AVX512 QS8 microkernels by Marat Dukhan · 3 years, 5 months ago
- d4c7d82 AVX512-specific parameters for QS8 microkernels by Marat Dukhan · 3 years, 5 months ago
- 77ded05 Use byte-wide MIN/MAX in AVX512 QS8 DWCONV microkernels by Marat Dukhan · 3 years, 5 months ago
- 9b474cf Support FP32 requantization in AVX2 QS8 microkernels by Marat Dukhan · 3 years, 5 months ago
- a5d1261 Explicitly specify requantization in GEMM/IGEMM/DWCONV tests by Marat Dukhan · 3 years, 5 months ago
- 5ca0d8d Consolidate requantization structures and functions in a single header by Marat Dukhan · 3 years, 5 months ago
- 9976cd8 Rename Q31 requantization to GEMMLOWP requantization by Marat Dukhan · 3 years, 5 months ago
- e3d17bf Rename microkernel-related types and structures by Marat Dukhan · 3 years, 5 months ago
- b07c26a Rename QS8 GEMM/IGEMM/DWCONV microkernels by Marat Dukhan · 3 years, 5 months ago
- a0acc15 Use pointer to parameter initialization function in VMULCADDC microkernel tests by Marat Dukhan · 3 years, 5 months ago
- 104ae5e Use ISA-specific layouts in F32 [I]GEMM & DWCONV microkernels by Marat Dukhan · 3 years, 5 months ago
- 748fd12 Use specialized layouts in SSE4/AVX2 QS8 [I]GEMM & DWCONV microkernels by Marat Dukhan · 3 years, 5 months ago
- 725f47e Split QS8/QU8 GEMM parameter initialization by datatype by Marat Dukhan · 3 years, 5 months ago
- d5694df Use pointer to parameter initialization function in GEMM/IGEMM/DWCONV microkernel tests by Marat Dukhan · 3 years, 5 months ago
- d4416d6 4x16 QS8 microkernel for Cortex A53 by Frank Barchard · 3 years, 5 months ago
- f56f4c4 Refactor interface of microkernel parameter initialization by Marat Dukhan · 3 years, 5 months ago
- a6c0516 Migrate remaining CLAMP and HSWISH tests to VUNARY test gen by Marat Dukhan · 3 years, 5 months ago
- 6eaab71 Remove pointer casting in generated vector unary tests by Marat Dukhan · 3 years, 5 months ago
- 10f1fe0 Rename VBinOpMicrokernelTester -> VBinaryMicrokernelTester by Marat Dukhan · 3 years, 5 months ago
- 87ed45c Rename VUnOpMicrokernelTester -> VUnaryMicrokernelTester by Marat Dukhan · 3 years, 5 months ago
- 60d3f24 Migrate F32 VCLAMP microkernel tests to VUNARY test gen by Marat Dukhan · 3 years, 5 months ago
- 949b6e7 Migrate F32 HSWISH microkernel tests to VUNARY test gen by Marat Dukhan · 3 years, 5 months ago
- 4ed1488 QS8 DWCONV25 microkernels by Marat Dukhan · 3 years, 5 months ago
- d481c28 QS8 VADD microkernels by Marat Dukhan · 3 years, 6 months ago
- 047b620 Scalar QS8 GAVGPOOL microkernels by Marat Dukhan · 3 years, 6 months ago
- 4454288 Scalar QS8 DWCONV microkernels by Marat Dukhan · 3 years, 6 months ago
- a1a4e78 Scalar QS8 GEMM and IGEMM microkernels by Marat Dukhan · 3 years, 6 months ago
- 938ea81 Code generate 1x8C8 microkernel for Cortex A75 with and without prfm by Frank Barchard · 3 years, 6 months ago
- a91559a Move declarations of VHSWISH microkernels into vunary.h by Marat Dukhan · 3 years, 6 months ago
- 6674d69 Refactor naming of unary elementwise microkernels by Marat Dukhan · 3 years, 6 months ago
- dddb38f QS8 1x8C8 IGEMM microkernel for Cortex A53 by Frank Barchard · 3 years, 6 months ago
- 46a69c9 QS8 1x8C8 GEMM microkernel for Cortex A53 by Frank Barchard · 3 years, 6 months ago
- 5549735 4X8 and 4x16 mla lane microkernels for A53 by Frank Barchard · 3 years, 6 months ago
- d68e114 Cortex A53 tuned C8 gemm/igemm microkernels by Frank Barchard · 3 years, 6 months ago
- 1f51d38 Add prefetch to MLA lane microkernel by Frank Barchard · 3 years, 6 months ago
- 4a35204 PRFM variant of QS8 C8 Neon microkernel. by Frank Barchard · 3 years, 6 months ago
- 6e8c0ce Disable compilation of neondot microkernels for AArch32 iOS by Marat Dukhan · 3 years, 6 months ago
- 3fd4e27 XOP versions of QS8 DWCONV MUL32 microkernels by Marat Dukhan · 3 years, 7 months ago
- 2e42787 2x4c2/3x4c2 microkernels for SSE2/SSSE3/SSE4.1/AVX/XOP by Marat Dukhan · 3 years, 7 months ago
- 07feec8 MUL32 versions of SSE4.1 & AVX QS8 DWCONV microkernels by Marat Dukhan · 3 years, 7 months ago
- fa0ab85 AVX versions of QS8 DWCONV microkernels by Marat Dukhan · 3 years, 7 months ago
- e9c4b96 AVX versions of QS8 VADD/VADDC microkernels by Marat Dukhan · 3 years, 7 months ago
- a3c1633 AVX versions of QS8 GEMM/IGEMM microkernels by Marat Dukhan · 3 years, 7 months ago
- d23cb6e Fully Connected operator for QS8 datatype by Marat Dukhan · 3 years, 7 months ago
- b3ffd58 Implement bilinear upsampling for SSE architecture by Artsiom Ablavatski · 3 years, 7 months ago
- 6e35de5 QS8 1X8C8 IGEMM microkernel by Frank Barchard · 3 years, 7 months ago
- b876263 QS8 1X8C8 GEMM microkernel by Frank Barchard · 3 years, 7 months ago
- 967712d Limit range of test values for f16 binary minmax ops. by Frank Barchard · 3 years, 7 months ago
- cbb8e70 QS8 2x8c8-aarch64-neon-mlal-padal IGEMM microkernel by Frank Barchard · 3 years, 8 months ago
- 7ca54df QS8 2x8c16-aarch64-neondot-ld64 IGEMM microkernel by Frank Barchard · 3 years, 8 months ago
- 2f06150 xnn_qs8_gemm_minmax_ukernel_2x8c8__aarch64_neon_mlal_padal GEMM microkernel by Frank Barchard · 3 years, 8 months ago
- 1dc9fef QS8 2x8c8-aarch64 GEMM microkernel by Frank Barchard · 3 years, 8 months ago
- 671d1b0 QS8 4x16c4-aarch64-neondot-ld64 IGEMM microkernel by Frank Barchard · 3 years, 8 months ago
- 5655cb7 QS8 GEMM 2x8c16 MLAL PADAL assembly microkernel for AArch64 by Frank Barchard · 3 years, 8 months ago
- 89e12f8 QS8 IGEMM for Cortex A55 by Frank Barchard · 3 years, 8 months ago
- fb8d1f1 Increase minimum value to avoid f16_vrdivc producing inf by Frank Barchard · 3 years, 8 months ago
- 62b4ff7 Remove 12x8 QS8 GEMM and IGEMM Neon dotproduct microkernels. by Frank Barchard · 3 years, 8 months ago
- da78da1 QS8 C8 Neon microkernels with MUL and MLA versions. by Frank Barchard · 3 years, 8 months ago
- 4a4be4e QS8 1x16c4 ld32 GEMM microkernel using NEON dot product by Frank Barchard · 3 years, 8 months ago
- 02121ca QS8 Neon IGEMM microkernels with 8 bit MUL using DUP by Frank Barchard · 3 years, 8 months ago
- 77e93a2 Fix mismatch in block layout in mixed-layout Depth-To-Space operator by Marat Dukhan · 3 years, 8 months ago
- a5e242c QS8 LD32 GEMM microkernel for big cores with dotproduct by Frank Barchard · 3 years, 8 months ago
- 36f95cf QS8 Neon IGEMM C16 microkernel with two 8 bit multiplies and vpadal to accumulate. by Frank Barchard · 3 years, 8 months ago
- 71c4d1a QS8 Neon GEMM C16 microkernel with two 8 bit multiplies and vpadal to accumulate. by Frank Barchard · 3 years, 8 months ago
- 6d138db Remove scalar C4 QS8 and QU8 gemm microkernels. by Frank Barchard · 3 years, 8 months ago
- 6fa8078 QS8 C2 Neon igemm by Frank Barchard · 3 years, 8 months ago
- d79391d QS8 C8 Neon igemm by Frank Barchard · 3 years, 8 months ago
- c8532ae Unroll KC loop to do MULL and then MLAL to 16 bit before lengthening to 32 bit. by Frank Barchard · 3 years, 9 months ago
- 8247e21 C2 QS8 microkernel using mull then mlal with KC loop of 16 by Frank Barchard · 3 years, 9 months ago
- 5899012 QS8 Neon GEMM C8 microkernel with 8 bit multiply and vpadal to accumulate. by Frank Barchard · 3 years, 9 months ago
- 6d490f7 Change isfinite() to std::isfinite() by Anush Elangovan · 3 years, 9 months ago
- 2202c81 Implement bilinear upsampling (CHW layout) for ARM architecture by Artsiom Ablavatski · 3 years, 9 months ago
- 2302ffd QS8 Neon GEMM microkernel with 8 bit multiply and vpadal to accumulate by Frank Barchard · 3 years, 9 months ago
- ec0bf14 QS8 GEMM and IGEMM 3x8 3x16 and IGEMM 4x8 and 4x16 by Frank Barchard · 3 years, 9 months ago
- 4ecae2e QS8 Neon GEMM microkernel with 8 bit multiply by Frank Barchard · 3 years, 9 months ago
- cfbc849 Add 4x8 and 4x16 qs8 gemm microkernels by Frank Barchard · 3 years, 9 months ago
- c5704bf WebAssembly DWConv2D 3x3 stride 2 loadsplat by Frank Barchard · 3 years, 10 months ago
- c6889b3 WebAssembly DWConv2D 5x5 stride 2 loadsplat by Frank Barchard · 3 years, 10 months ago
- 02bb429 WebAssembly DWConv2D 3x3p1 adapted from NEON by Frank Barchard · 3 years, 10 months ago
- b20dcd6 WASMSIMD dwconv2d 5x5p2 use loadsplat by Frank Barchard · 3 years, 10 months ago
- 4eddb9c Fix incompatibility with Apple Clang in Subgraph tester by Marat Dukhan · 3 years, 10 months ago
- 802fcae Additional SSE/SSE2 GEMM/IGEMM microkernels by Marat Dukhan · 3 years, 11 months ago
- 412e2f4 Rename WASMSIMD dwconv2d functions to splat or loadsplat by Frank Barchard · 3 years, 11 months ago
- 3de5dfa Remove PSIMD dependency by Marat Dukhan · 3 years, 11 months ago
- b36582b Enable sparse inference by default by Marat Dukhan · 3 years, 11 months ago
- c10585f Minor refactoring of SubgraphTester by Marat Dukhan · 3 years, 11 months ago
- 54b2d54 Disable sparse graph rewriting for clusters with <= 2/3 zeroes by Marat Dukhan · 3 years, 11 months ago
- c763488 CONV2D HWC2CHW microkernel for ARM NEON by Marat Dukhan · 3 years, 11 months ago
- 0725b8d Rename WebAssembly SIMD source files and functions with x86 or arm suffix after wasmsimd by Frank Barchard · 3 years, 11 months ago
- 5d7ca1a Remove duplicate WASMSIMD dwconv2d 5x5s2 tests by Frank Barchard · 3 years, 11 months ago