- 88d06fc Disable neondot microkernels on iOS 32 bit by Frank Barchard · 2 years, 8 months ago
- cde8bdf Q8 GEMM for Cortex A7 reduce prefetch to weights by Frank Barchard · 2 years, 8 months ago
- 3e3124e Make void* params argument of JIT generators const by Zhi An Ng · 2 years, 8 months ago
- 34251d8 QS8 4x8 lane GEMM AArch32 microkernel for Cortex A7 by Frank Barchard · 2 years, 8 months ago
- 101271e QC8 4x8 lane GEMM AArch32 microkernel for Cortex A7 by Frank Barchard · 2 years, 8 months ago
- 9e4d2aa QS8 4x8 lane GEMM AArch32 microkernel for Cortex A53 by Frank Barchard · 2 years, 8 months ago
- 3ceb4f1 Reoptimize NEON QC8/QS8 GEMM/IGEMM microkernels with SR > 1 by Marat Dukhan · 2 years, 8 months ago
- 69b7f14 Reoptimize QS8/QC8 GEMM/IGEMM WAsm SIMD microkernels with swizzle by Marat Dukhan · 2 years, 8 months ago
- fbd67a7 Pad K to a multiple of SR in GEMM/IGEMM microkernels by Marat Dukhan · 2 years, 8 months ago
- c607028 Remove wb from JIT aarch32 instructions, use mem operand and ++ instead by Zhi An Ng · 2 years, 8 months ago
- 870108c QS8/QC8 4x8 dot product IGEMM AArch32 microkernel for Cortex A55 by Frank Barchard · 2 years, 8 months ago
- adf087d Remove 3 blank lines after last jit assembly instruction before end of function by Frank Barchard · 2 years, 8 months ago
- 752b980 Avoid importing the entire xnnpack namespace in aarch32 assembler by Zhi An Ng · 2 years, 8 months ago
- e1ff738 Update assembly register usage comments. by Frank Barchard · 2 years, 8 months ago
- ac654f1 QC8 4x8 dot product GEMM AArch32 microkernel for Cortex A55 by Frank Barchard · 2 years, 8 months ago
- 901845c QU8 4x8 NEON MLA Lane microkernel AArch32 assembly language by Frank Barchard · 2 years, 8 months ago
- 5e1a303 QC8 GEMM/IGEMM assembly microkernels for ARMv7 NEON by Frank Barchard · 2 years, 8 months ago
- 83844ae Change JIT generator signature to accept nc and kc to specialize on those values by Zhi An Ng · 2 years, 8 months ago
- 48d74c3 Replicate QC8/QS8/QU8 CONV WAsm SIMD parameters to 64 bit rather than 128 bit by Marat Dukhan · 2 years, 8 months ago
- 16b734c Add more QC8 GEMM/IGEMM JIT microkernels. by Zhi An Ng · 2 years, 9 months ago
- ed73fb6 Add qc8 gemm and igemm JIT microkernels by Zhi An Ng · 2 years, 9 months ago
- f623740 QC8 NEON lane microkernels by Frank Barchard · 2 years, 9 months ago
- 7c1115f Reoptimize microkernel selection for WAsm 1.0 by Marat Dukhan · 2 years, 9 months ago
- cccb012 Apply sort and formatting to ARM code by Frank Barchard · 2 years, 9 months ago
- 272d4d9 FP32 IMAGIC variants of scalar QC8/QS8/QU8 GEMM/IGEMM/DWCONV microkernels by Marat Dukhan · 2 years, 9 months ago
- 2ac722e Refactor requantization in scalar QS8/QC8/QU8 microkernels by Marat Dukhan · 2 years, 9 months ago
- dc54e12 Replace vshll_n_u32(v, 0) with vmovl_u32 in C4/C4S2 GEMM/IGEMM microkernels by Marat Dukhan · 2 years, 9 months ago
- 87fe410 QC8 quantization for all aarch32 GEMM/IGEMM microkernels by Frank Barchard · 2 years, 9 months ago
- d541fc0 Annotate remaining microkernels with Out-of-Bounds reads with XNN_OOB_READS by Marat Dukhan · 2 years, 9 months ago
- 914f57b Aarch64 4x8 lane ld64 GEMM/IGEMM microkernels. by Frank Barchard · 2 years, 9 months ago
- 0f1ed94 QS8/QC8 GEMM/IGEMM WAsm SIMD microkernels using C2S4 layout by Marat Dukhan · 2 years, 10 months ago
- 03efa0f Reoptimize FP32 requantization in NEON QS8/QC8/QU8 GEMM/IGEMM/DWCONV by Marat Dukhan · 2 years, 10 months ago
- 5a31dc6 Optimize FP32 requantization in NEON QS8/QC8/QU8 GEMM/IGEMM/DWCONV by Marat Dukhan · 2 years, 10 months ago
- 7988a18 Refactoring xnn_qs8_minmax_params for NEON/NEONv8 by Marat Dukhan · 2 years, 10 months ago
- 13c9f8d Support requantization scale over 1 in SSE/AVX GEMM/IGEMM/DWCONV by Marat Dukhan · 2 years, 10 months ago
- 411c18d Optimize FP32 requantization in WAsm SIMD QS8/QC8/QU8 GEMM/IGEMM/DWCONV by Marat Dukhan · 2 years, 10 months ago
- 5f7cf55 Avoid using gcc-specific intrinsics in NEON microkernels by Marat Dukhan · 2 years, 10 months ago
- 5cffb64 4x16 lane AArch64 NEON GEMM/IGEMM ld64 microkernel by Frank Barchard · 2 years, 10 months ago
- 64ab1b7 LD1R and LD2R variants of c4 microkernel by Frank Barchard · 2 years, 10 months ago
- 15eec02 LD1R and LD2R variants of c2 microkernel by Frank Barchard · 2 years, 10 months ago
- 42f5c50 LOADDUP variant of c2 microkernel by Frank Barchard · 2 years, 10 months ago
- e22685a Remove padal from quantized microkernel names. by Frank Barchard · 2 years, 10 months ago
- eb704f7 QS8 C4S2 Neon GEMM/IGEMM microkernels by Frank Barchard · 2 years, 10 months ago
- c7a032d C2S4 QS8 Neon GEMM/IGEMM microkernels. by Frank Barchard · 2 years, 11 months ago
- 287952a QS8 C4 Neon GEMM/IGEMM microkernels by Frank Barchard · 2 years, 11 months ago
- ccbaedf C2 Neon microkernel remove duplicate DUP instructions from NR loop. by Frank Barchard · 2 years, 11 months ago
- 0bf8afa Leverage f32x4.pmin and f32x4.pmax WAsm SIMD instructions by Marat Dukhan · 3 years ago
- 031ff4b Template bug fix in stores for remainder of 8 in Neon QS8 microkernels by Frank Barchard · 3 years ago
- 4c49494 Fix crash on AArch32 in scalar quantized microkernels by Marat Dukhan · 3 years, 1 month ago
- 1ce78ab Leverage Load-Zero WAsm SIMD instructions in Chrome M88 microkernels by Marat Dukhan · 3 years, 1 month ago
- b7a7c30 NEON GEMM/IGEMM microkernels change store/dup to 2 of each by Frank Barchard · 3 years, 1 month ago
- 132774e QU8 microkernels change stores to non-lane STR by Frank Barchard · 3 years, 1 month ago
- 29833fd Change stores to non-lane STR by Frank Barchard · 3 years, 1 month ago
- 1c70764 4x16c4 cortex_a55 microkernel tuning by Frank Barchard · 3 years, 1 month ago
- a49e41f QU8 4x16C4 NEON Dot Product GEMM/IGEMM microkernels for Cortex A55r1 by Frank Barchard · 3 years, 1 month ago
- 4810905 Leverage v128.const WAsm SIMD instruction by Marat Dukhan · 3 years, 1 month ago
- 8dc106e QC8/QS8/QU8 GEMM/IGEMM WAsm SIMD microkernels using i32x4.dot_i16x8_s instruction by Marat Dukhan · 3 years, 1 month ago
- 1215c9a QS8 NEON GEMM microkernels use rewind instead of reload by Frank Barchard · 3 years, 1 month ago
- 6b30b73 Remainder branch move before label. by Frank Barchard · 3 years, 1 month ago
- 56f157c Relabel branches for quantized assembly ARM microkernels by Frank Barchard · 3 years, 1 month ago
- 7a8dd87 Work around generating v128.storeXX_lane for quantized WAsm SIMD microkernels by Marat Dukhan · 3 years, 1 month ago
- 0c2a31e Improve unpacking in SSE4+ QC8/QS8/QU8 GEMM/IGEMM microkernels by Marat Dukhan · 3 years, 1 month ago
- 07706f6 Replace generic shuffle with narrow instructions in WAsm SIMD QS8/QU8/QC8 microkernels by Marat Dukhan · 3 years, 1 month ago
- dfc2db0 Add prefix to QC8/QS8/QU8 WAsm SIMD GEMM/IGEMM microkernels by Marat Dukhan · 3 years, 1 month ago
- 2c6d196 Q8 4x16 and 1x16 Neon GEMM/IGEMM quantize using V0-V3 by Frank Barchard · 3 years, 2 months ago
- 6967eb0 Add a rewind variable for params. - no impact on code, just simplified source by Frank Barchard · 3 years, 2 months ago
- 26e8378 Reduce register pressure in GEMMLOWP quantization on NEON by Frank Barchard · 3 years, 2 months ago
- efa123d Update Neon code with generators for added comment by Frank Barchard · 3 years, 2 months ago
- 13db60f RNDNU quantized Neon assembly GEMM/IGEMM microkernels. by Frank Barchard · 3 years, 2 months ago
- 2837e8b Remove 0 offset from loads. by Frank Barchard · 3 years, 2 months ago
- d194311 4x16c4-aarch64-neondot-ld32 use LD1R instead of lanes by Frank Barchard · 3 years, 2 months ago
- 927d474 Scalar implementations of QU8 GEMM/IGEMM microkernels by Marat Dukhan · 3 years, 2 months ago
- 43bee05 WAsm SIMD implementation of QU8 GEMM/IGEMM microkernels by Marat Dukhan · 3 years, 2 months ago
- 69c8a29 NEON-MLAL implementations of QU8 GEMM/IGEMM microkernels by Marat Dukhan · 3 years, 2 months ago
- 3cf2e22 QU8 GEMM/IGEMM microkernels for AVX512 by Marat Dukhan · 3 years, 3 months ago
- 902ef7f QU8 GEMM/IGEMM AVX2 microkernels with FP32 requantization by Marat Dukhan · 3 years, 3 months ago
- cdbe9a3 Code-generate QU8 GEMM and IGEMM microkernels for SSE2/SSSE3/SSE4.1 by Marat Dukhan · 3 years, 3 months ago
- e5eee46 Refactor pre-SSE4 versions of QS8/QC8 GEMM/IGEMM microkernels by Marat Dukhan · 3 years, 3 months ago
- 960ae34 NEON implementations of QC8 c8 GEMM and IGEMM assembly microkernels by Frank Barchard · 3 years, 3 months ago
- 1663c0c NEON implementations of QS8 2x8c16 GEMM and IGEMM assembly microkernels by Frank Barchard · 3 years, 3 months ago
- 14f325e C2 GEMM/IGEMM QS8/QC8 NEON microkernels with FP32 requantization by Marat Dukhan · 3 years, 3 months ago
- 47c1220 WAsm SIMD implementations of QC8 GEMM/IGEMM microkernels by Marat Dukhan · 3 years, 3 months ago
- e8e8c54 QC8 neon assembly re-quantization change LDP to LDR by Frank Barchard · 3 years, 3 months ago
- f10af6c NEON Dot Product implementations of QC8 c4 GEMM and IGEMM assembly microkernels by Frank Barchard · 3 years, 3 months ago
- 98af05c NEON 4x16 QC8 GEMM and IGEMM assembly microkernels for Cortex A53 by Frank Barchard · 3 years, 3 months ago
- d602154 Scalar implementations of QC8 GEMM/IGEMM microkernels by Marat Dukhan · 3 years, 3 months ago
- e452560 Remove MRxNR from template name on neondot c4 microkernels by Frank Barchard · 3 years, 3 months ago
- e76478b NEON implementations of QC8 GEMM/IGEMM microkernels by Marat Dukhan · 3 years, 3 months ago
- fc188ed QC8 GEMM/IGEMM microkernels for SSE/AVX/XOP by Marat Dukhan · 3 years, 4 months ago
- c3e3f1c QC8 GEMM/IGEMM microkernels for AVX512 by Marat Dukhan · 3 years, 4 months ago
- 0b04374 Support QC8 GEMM microkernels by Marat Dukhan · 3 years, 4 months ago