- 3357d9d Minor optimizations in NEON QS8 GEMM/IGEMM microkernels by Marat Dukhan · 3 years, 3 months ago
- e742d2a Re-generate QS8 GEMM/IGEMM microkernels by Marat Dukhan · 3 years, 3 months ago
- 533410e QS8 A53 GEMM bug fix for X1 - re-enable E2E by Frank Barchard · 3 years, 4 months ago
- 16d79ed Polyfill vcvtnq_s32_f32 for AArch32 GCC by Marat Dukhan · 3 years, 4 months ago
- 0ae35f2 QS8 LD128 GEMM/IGEMM dot product 4x16 microkernel by Frank Barchard · 3 years, 4 months ago
- 7c9f1f9 Replace // with # for lines that only contain a comment. by Frank Barchard · 3 years, 4 months ago
- 18630de QS8 NEONDOT GEMM/IGEMM microkernels with FP32 requantization by Marat Dukhan · 3 years, 4 months ago
- 801d2c2 Fix QS8 IGEMM with FP32 requantization for SSE/AVX/XOP by Marat Dukhan · 3 years, 4 months ago
- e695791 4x16C4 QS8 IGEMM Cortex A55 microkernel reuse X10 to save push by Frank Barchard · 3 years, 4 months ago
- 4a2d255 Remove redundant SSSE3 microkernels with FP32 requantization by Marat Dukhan · 3 years, 4 months ago
- c46e671 FP32 requantization in QS8 GEMM/IGEMM microkernels for SSE/AVX/XOP by Marat Dukhan · 3 years, 4 months ago
- c08221f Apply text format to assembly for consistency by Frank Barchard · 3 years, 4 months ago
- 1c538cd Add templates for all QS8 IGEMM assembly microkernels. by Frank Barchard · 3 years, 4 months ago
- 71855ee Support FP32 requantization in AVX512 QS8 microkernels by Marat Dukhan · 3 years, 4 months ago
- d4c7d82 AVX512-specific parameters for QS8 microkernels by Marat Dukhan · 3 years, 4 months ago
- 9b474cf Support FP32 requantization in AVX2 QS8 microkernels by Marat Dukhan · 3 years, 4 months ago
- f86ee8b Refactor requantization helper functions by Marat Dukhan · 3 years, 4 months ago
- e3d17bf Rename microkernel-related types and structures by Marat Dukhan · 3 years, 4 months ago
- b07c26a Rename QS8 GEMM/IGEMM/DWCONV microkernels by Marat Dukhan · 3 years, 4 months ago
- d65d20e Rename QS8 GEMM/IGEMM microkernel filenames by Marat Dukhan · 3 years, 4 months ago
- e091adb 4x16 QS8 GEMM/IGEMM Cortex A53 microkernels reduce to use 2 GPR for temp by Frank Barchard · 3 years, 4 months ago
- 748fd12 Use specialized layouts in SSE4/AVX2 QS8 [I]GEMM & DWCONV microkernels by Marat Dukhan · 3 years, 5 months ago
- 4bb82cc 4x16 QS8 IGEMM microkernels use x8 for temp by Frank Barchard · 3 years, 5 months ago
- 4be4bd7 4x16 QS8 IGEMM microkernels use x14 for A1 by Frank Barchard · 3 years, 5 months ago
- fb672aa 4x16 QS8 IGEMM microkernel for Cortex A53 avoid a push by Frank Barchard · 3 years, 5 months ago
- d4416d6 4x16 QS8 microkernel for Cortex A53 by Frank Barchard · 3 years, 5 months ago
- 76f43f0 Apply consistent formatting to assembly by Frank Barchard · 3 years, 5 months ago
- a24cc08 Small refactoring of scalar QS8 microkernels by Marat Dukhan · 3 years, 5 months ago
- a1a4e78 Scalar QS8 GEMM and IGEMM microkernels by Marat Dukhan · 3 years, 5 months ago
- 938ea81 Code generate 1x8C8 nicrokernel for Cortex A75 with and without prfm by Frank Barchard · 3 years, 5 months ago
- b639210 Add prefetch of A for quantized microkernels. by Frank Barchard · 3 years, 5 months ago
- e111861 1x8 C8 A53 microkernel defer adap by Frank Barchard · 3 years, 5 months ago
- 7c4c771 C8 A53 microkernels prefetch A by Frank Barchard · 3 years, 5 months ago
- 2a3169d C8 A53 microkernels move 2nd load after MLA by Frank Barchard · 3 years, 5 months ago
- dddb38f QS8 1x8C8 IGEMM microkernel for Cortex A53 by Frank Barchard · 3 years, 5 months ago
- 2de3bce A53 C8 microkernel load A with ldr/ldr/ins by Frank Barchard · 3 years, 5 months ago
- 5549735 4X8 and 4x16 mla lane microkernels for A53 by Frank Barchard · 3 years, 5 months ago
- d68e114 Cortex A53 tuned C8 gemm/igemm microkernels by Frank Barchard · 3 years, 5 months ago
- 1f51d38 Add prefetch to MLA lane microkernel by Frank Barchard · 3 years, 6 months ago
- 4c6640c Disable MSan in QS8 GEMM/IGEMM microkernels with KR>1 by Marat Dukhan · 3 years, 6 months ago
- 4a35204 PRFM variant of QS8 C8 Neon microkernel. by Frank Barchard · 3 years, 6 months ago
- 2e42787 2x4c2/3x4c2 microkernels for SSE2/SSSE3/SSE4.1/AVX/XOP by Marat Dukhan · 3 years, 6 months ago
- a3c1633 AVX versions of QS8 GEMM/IGEMM microkernels by Marat Dukhan · 3 years, 6 months ago
- c409471 Include XOP headers in clang-cl compatible way. Fix #1382. by Marat Dukhan · 3 years, 6 months ago
- 62b4ff7 Remove 12x8 QS8 GEMM and IGEMM Neon dotproduct microkernels. by Frank Barchard · 3 years, 7 months ago
- da78da1 QS8 C8 Neon microkernels with MUL and MLA versions. by Frank Barchard · 3 years, 7 months ago
- 618d85d QS8 Neon dot product intrinsics GEMM and IGEMM microkernels reduced remainder code. by Frank Barchard · 3 years, 7 months ago
- 6d8ca7d Quantized GEMM/IGEMM microkernels bump kc to be a multiple of channels. by Frank Barchard · 3 years, 7 months ago
- 02121ca QS8 Neon IGEMM microkernels with 8 bit MUL using DUP by Frank Barchard · 3 years, 7 months ago
- 01c341b C8 MLA Neon GEMM/IGEMM microkernels count k down from kc. by Frank Barchard · 3 years, 7 months ago
- 36f95cf QS8 Neon IGEMM C16 microkernel with two 8 bit multiplies and vpadal to accumulate. by Frank Barchard · 3 years, 7 months ago
- a0fe11d QS8 C8 Neon remove remainder handling code and rewind the A pointers by kc by Frank Barchard · 3 years, 8 months ago
- 6fa8078 QS8 C2 Neon igemm by Frank Barchard · 3 years, 8 months ago
- d79391d QS8 C8 Neon igemm by Frank Barchard · 3 years, 8 months ago
- fe14b85 Add space after casting by Frank Barchard · 3 years, 8 months ago
- ec0bf14 QS8 GEMM and IGEMM 3x8 3x16 and IGEMM 4x8 and 4x16 by Frank Barchard · 3 years, 9 months ago
- 146e999 Replace QS8 4x8 with 2x8 neon microkernel. Improves performance for aarch32. by Frank Barchard · 4 years ago
- 66ccf64 Rename QS8 generator templates by Marat Dukhan · 4 years ago
- a48848f 4x8, 6x8 and 8x16 Neon dot product GEMM microkernels by Frank Barchard · 4 years ago
- 2fa1745 6x16 QS8 GEMM for Neon dot product by Frank Barchard · 4 years ago
- ef4ce31 Remove trailing whitespace by Marat Dukhan · 4 years, 1 month ago
- d4c8303 Enable NEON DOT QS8 [I]GEMM microkernels on ARM64 by Marat Dukhan · 4 years, 1 month ago
- 12c5777 Optimization: 2x partial unroll to load 8 contiguous bytes. by Benoit Jacob · 4 years, 2 months ago
- a05487f Add xnn_qs8_igemm_minmax_ukernel_${MR}x${NR}c4__neondot (ARMv8.2+dotprod). by Benoit Jacob · 4 years, 2 months ago
- 0af63ab Include polyfills for intrinsics in QS8 AVX512 GEMM/IGEMM microkernels by Marat Dukhan · 4 years, 2 months ago
- bb00b1d AVX512 variants of QS8 GEMM and IGEMM microkernels by Marat Dukhan · 4 years, 2 months ago
- f124e88 Polyfill _mm_loadu_si32 and _mm_storeu_si32 intrinsics by Marat Dukhan · 4 years, 2 months ago
- 27203da WAsm SIMD versions of QS8 GEMM and IGEMM microkernels by Marat Dukhan · 4 years, 2 months ago
- 23848db Reoptimize x86 requantization by Marat Dukhan · 4 years, 2 months ago
- 40bbafe NEON variants of QS8 GEMM & IGEMM microkernels by Marat Dukhan · 4 years, 2 months ago
- e7edc80 Add 3x4c8 variants of SSE2/SSSE3/SSE4.1/XOP GEMM/IGEMM microkernels by Marat Dukhan · 4 years, 2 months ago
- 1280952 AVX2 version of QS8 GEMM and IGEMM microkernels by Marat Dukhan · 4 years, 2 months ago
- 1566fee XOP versions of QS8 GEMM/IGEMM microkernels by Marat Dukhan · 4 years, 2 months ago
- 07bd252 QS8 IGEMM MRx4c8 SSE2/SSSE3/SSE4.1 microkernels by Marat Dukhan · 4 years, 2 months ago
- dee732b LD128 versions of QS8 GEMM SSE2/SSSE3/SSE4.1 microkernels by Marat Dukhan · 4 years, 2 months ago
- 14d3ce8 Add LD64 suffix in QS8 GEMM/IGEMM microkernels by Marat Dukhan · 4 years, 2 months ago
- f948068 QS8 IGEMM microkernels and infrastructure by Marat Dukhan · 4 years, 2 months ago