- ccbaedf C2 Neon microkernel remove duplicate DUP instructions from NR loop. by Frank Barchard · 3 years ago
- 0bf8afa Leverage f32x4.pmin and f32x4.pmax WAsm SIMD instructions by Marat Dukhan · 3 years, 1 month ago
- d460d0b Neon IGEMM do remainder with reversed MR for shifts by Frank Barchard · 3 years, 2 months ago
- 4c49494 Fix crash on AArch32 in scalar quantized microkernels by Marat Dukhan · 3 years, 2 months ago
- 1ce78ab Leverage Load-Zero WAsm SIMD instructions in Chrome M88 microkernels by Marat Dukhan · 3 years, 2 months ago
- 90cd7df Fix rewind params for qs8 4x16c4 by Frank Barchard · 3 years, 2 months ago
- b7a7c30 NEON GEMM/IGEMM microkernels change store/dup to 2 of each by Frank Barchard · 3 years, 2 months ago
- 29833fd Change stores to non-lane STR by Frank Barchard · 3 years, 2 months ago
- e7e001f Fix bug in QC8/QS8/QU8 IGEMM DOT16x2 LD128 WAsm SIMD microkernels by Marat Dukhan · 3 years, 2 months ago
- 8589ecd QS8 IGEMM use x11 for params, x10 for a3 and x0 for cn_stride by Frank Barchard · 3 years, 2 months ago
- 4810905 Leverage v128.const WAsm SIMD instruction by Marat Dukhan · 3 years, 2 months ago
- 8dc106e QC8/QS8/QU8 GEMM/IGEMM WAsm SIMD microkernels using i32x4.dot_i16x8_s instruction by Marat Dukhan · 3 years, 2 months ago
- 6b30b73 Remainder branch move before label. by Frank Barchard · 3 years, 2 months ago
- 56f157c Relabel branches for quantized assembly ARM microkernels by Frank Barchard · 3 years, 2 months ago
- 7a8dd87 Work around generating v128.storeXX_lane for quantized WAsm SIMD microkernels by Marat Dukhan · 3 years, 3 months ago
- 0c2a31e Improve unpacking in SSE4+ QC8/QS8/QU8 GEMM/IGEMM microkernels by Marat Dukhan · 3 years, 3 months ago
- 07706f6 Replace generic shuffle with narrow instructions in WAsm SIMD QS8/QU8/QC8 microkernels by Marat Dukhan · 3 years, 3 months ago
- dfc2db0 Add prefix to QC8/QS8/QU8 WAsm SIMD GEMM/IGEMM microkernels by Marat Dukhan · 3 years, 3 months ago
- 2c6d196 Q8 4x16 and 1x16 Neon GEMM/IGEMM quantize using V0-V3 by Frank Barchard · 3 years, 3 months ago
- fbe0c6f Q8 4x16 Neon IGEMM quantize using V0-V3 by Frank Barchard · 3 years, 3 months ago
- 59ed1da QU8 4x16 Neon assembly microkernel by Frank Barchard · 3 years, 3 months ago
- 6967eb0 Add a rewind variable for params. - no impact on code, just simplified source by Frank Barchard · 3 years, 3 months ago
- 793c8da QS8 igemm comment for zero use int8_t* instead of float* by Frank Barchard · 3 years, 3 months ago
- efa123d Update Neon code with generators for added comment by Frank Barchard · 3 years, 3 months ago
- 13db60f RNDNU quantized Neon assembly GEMM/IGEMM microkernels. by Frank Barchard · 3 years, 4 months ago
- 60729d0 4x16c4 RNDNU quantized Neon assembly GEMM/IGEMM microkernel. by Frank Barchard · 3 years, 4 months ago
- 927d474 Scalar implementations of QU8 GEMM/IGEMM microkernels by Marat Dukhan · 3 years, 4 months ago
- 43bee05 WAsm SIMD implementation of QU8 GEMM/IGEMM microkernels by Marat Dukhan · 3 years, 4 months ago
- 69c8a29 NEON-MLAL implementations of QU8 GEMM/IGEMM microkernels by Marat Dukhan · 3 years, 4 months ago
- 3cf2e22 QU8 GEMM/IGEMM microkernels for AVX512 by Marat Dukhan · 3 years, 4 months ago
- 902ef7f QU8 GEMM/IGEMM AVX2 microkernels with FP32 requantization by Marat Dukhan · 3 years, 4 months ago
- cdbe9a3 Code-generate QU8 GEMM and IGEMM microkernels for SSE2/SSSE3/SSE4.1 by Marat Dukhan · 3 years, 4 months ago
- e5eee46 Refactor pre-SSE4 versions of QS8/QC8 GEMM/IGEMM microkernels by Marat Dukhan · 3 years, 4 months ago
- 960ae34 NEON implementations of QC8 c8 GEMM and IGEMM assembly microkernels by Frank Barchard · 3 years, 4 months ago
- 1663c0c NEON implementations of QS8 2x8c16 GEMM and IGEMM assembly microkernels by Frank Barchard · 3 years, 4 months ago
- 14f325e C2 GEMM/IGEMM QS8/QC8 NEON microkernels with FP32 requantization by Marat Dukhan · 3 years, 4 months ago
- 4b291bc Re-generate QC8 IGEMM microkernels by Marat Dukhan · 3 years, 4 months ago
- 47c1220 WAsm SIMD implementations of QC8 GEMM/IGEMM microkernels by Marat Dukhan · 3 years, 4 months ago
- e8e8c54 QC8 neon assembly re-quantization change LDP to LDR by Frank Barchard · 3 years, 4 months ago
- f10af6c NEON Dot Product implementations of QC8 c4 GEMM and IGEMM assembly microkernels by Frank Barchard · 3 years, 4 months ago
- 98af05c NEON 4x16 QC8 GEMM and IGEMM assembly microkernels for Cortex A53 by Frank Barchard · 3 years, 4 months ago
- d602154 Scalar implementations of QC8 GEMM/IGEMM microkernels by Marat Dukhan · 3 years, 4 months ago
- e452560 Remove MRxNR from template name on neondot c4 microkernels by Frank Barchard · 3 years, 4 months ago
- e76478b NEON implementations of QC8 GEMM/IGEMM microkernels by Marat Dukhan · 3 years, 4 months ago
- fc188ed QC8 GEMM/IGEMM microkernels for SSE/AVX/XOP by Marat Dukhan · 3 years, 5 months ago
- c3e3f1c QC8 GEMM/IGEMM microkernels for AVX512 by Marat Dukhan · 3 years, 5 months ago
- e06c813 Support QC8 IGEMM microkernels by Marat Dukhan · 3 years, 5 months ago