- 48d74c3 Replicate QC8/QS8/QU8 CONV WAsm SIMD parameters to 64 bit rather than 128 bit by Marat Dukhan · 2 years, 9 months ago
- 7c1115f Reoptimize microkernel selection for WAsm 1.0 by Marat Dukhan · 2 years, 9 months ago
- 272d4d9 FP32 IMAGIC variants of scalar QC8/QS8/QU8 GEMM/IGEMM/DWCONV microkernels by Marat Dukhan · 2 years, 9 months ago
- 2ac722e Refactor requantization in scalar QS8/QC8/QU8 microkernels by Marat Dukhan · 2 years, 9 months ago
- 6150425 Disable MSan in AVX512SKX QS8/QC8/QU8 DWCONV microkernels by Marat Dukhan · 2 years, 10 months ago
- 7be427a Disable MSan and TSan in most microkernels with Out-of-Bounds reads by Marat Dukhan · 2 years, 10 months ago
- 03efa0f Reoptimize FP32 requantization in NEON QS8/QC8/QU8 GEMM/IGEMM/DWCONV by Marat Dukhan · 2 years, 10 months ago
- 5a31dc6 Optimize FP32 requantization in NEON QS8/QC8/QU8 GEMM/IGEMM/DWCONV by Marat Dukhan · 2 years, 10 months ago
- 7988a18 Refactoring xnn_qs8_minmax_params for NEON/NEONv8 by Marat Dukhan · 2 years, 10 months ago
- 8978ac2 Support requantization scale greater than 1 in RNDNU NEON microkernels by Marat Dukhan · 2 years, 10 months ago
- 13c9f8d Support requantization scale over 1 in SSE/AVX GEMM/IGEMM/DWCONV by Marat Dukhan · 2 years, 10 months ago
- 17a9e3f Remove GEMMLOWP requantization from QS8 DWCONV templates by Marat Dukhan · 2 years, 10 months ago
- 411c18d Optimize FP32 requantization in WAsm SIMD QS8/QC8/QU8 GEMM/IGEMM/DWCONV by Marat Dukhan · 2 years, 10 months ago
- 5f7cf55 Avoid using gcc-specific intrinsics in NEON microkernels by Marat Dukhan · 2 years, 10 months ago
- 0bf8afa Leverage f32x4.pmin and f32x4.pmax WAsm SIMD instructions by Marat Dukhan · 3 years ago
- 2aa2e2a q8 dwconv add channel tiles of 24 and 32 for mul16 rndnu microkernels by Frank Barchard · 3 years ago
- 4c49494 Fix crash on AArch32 in scalar quantized microkernels by Marat Dukhan · 3 years, 1 month ago
- 7a8dd87 Work around generating v128.storeXX_lane for quantized WAsm SIMD microkernels by Marat Dukhan · 3 years, 1 month ago
- 9cedb59 Accumulate in 16 bits once in WAsm SIMD MUL16 QS8/QC8 DWCONV before extending to 32 bits by Marat Dukhan · 3 years, 1 month ago
- 36fe5aa Remove WAsm SIMD QS8 DWCONV microkernels with GEMMLOWP requantization by Marat Dukhan · 3 years, 2 months ago
- 07706f6 Replace generic shuffle with narrow instructions in WAsm SIMD QS8/QU8/QC8 microkernels by Marat Dukhan · 3 years, 2 months ago
- 1e6fc21 Fix incompatible pointer type in QU8 DWCONV NEON microkernels by Marat Dukhan · 3 years, 2 months ago
- 60bb7ec Accumulate in 16 bits once in AVX2 MUL16 VPUNPCK QS8/QC8 DWCONV before extending to 32 bits by Marat Dukhan · 3 years, 2 months ago
- 881ab02 AVX2 MUL16 QS8/QC8 DWCONV microkernels using VPUNPCK instructions to extend the product by Marat Dukhan · 3 years, 2 months ago
- 0966856 Accumulate in 16 bits once in SSE2/SSE4/AVX/XOP MUL16 QS8/QC8 DWCONV before extending to 32 bits by Marat Dukhan · 3 years, 2 months ago
- ab952f1 Remove multiplication in QS8/QC8 DWCONV MUL16 microkernels for SSE4 by Marat Dukhan · 3 years, 2 months ago
- 5f2939f QS8/QC8 DWCONV NEON MUL8/MLA8 microkernels using 128-bit loads by Marat Dukhan · 3 years, 2 months ago
- caccd8e Accumulate in 16 bits once in NEON QS8/QC8 DWCONV before extending to 32 bits by Marat Dukhan · 3 years, 2 months ago
- 575dfb9 Disable MSan in quantized DWCONV microkernels by Marat Dukhan · 3 years, 2 months ago
- 4ba70b7 QS8/QC8 NEON microkernels using 8x8->16-bit multiplication by Marat Dukhan · 3 years, 2 months ago
- 5c92195 Fix incompatibilities with GCC on ARM by Marat Dukhan · 3 years, 2 months ago
- be18f5c QS8 DWCONV microkernels with RNDNU requantization by Marat Dukhan · 3 years, 2 months ago
- 605696a NEON implementations of QU8 DWCONV microkernels by Marat Dukhan · 3 years, 2 months ago
- 1f71428 Scalar implementations of QU8 DWCONV microkernels by Marat Dukhan · 3 years, 2 months ago
- f601135 WAsm SIMD implementations of QU8 DWCONV microkernels by Marat Dukhan · 3 years, 2 months ago
- cfd606b QU8 DWCONV microkernels for AVX512 by Marat Dukhan · 3 years, 3 months ago
- 09c312b QU8 DWCONV microkernels for AVX2 by Marat Dukhan · 3 years, 3 months ago
- f0f2881 QS8 DWCONV microkernels for SSE2/SSE4.1/AVX by Marat Dukhan · 3 years, 3 months ago
- 3c35f7a QU8 DWCONV microkernels for SSE4.1/AVX/XOP by Marat Dukhan · 3 years, 3 months ago
- e5eee46 Refactor pre-SSE4 versions of QS8/QC8 GEMM/IGEMM microkernels by Marat Dukhan · 3 years, 3 months ago
- b3336d9 Refactoring in QS8/QC8 GEMM/IGEMM/DWCONV microkernels by Marat Dukhan · 3 years, 3 months ago
- 9ca2333 Minor refactoring in QC8 microkernels by Marat Dukhan · 3 years, 3 months ago
- 0fae3bc Include polyfill in NEON QS8/QC8 DWCONV microkernels by Marat Dukhan · 3 years, 3 months ago
- 5754706 Scalar implementation of QC8 DWCONV microkernels by Marat Dukhan · 3 years, 3 months ago
- 313eef7 WAsm SIMD implementations of QC8 DWCONV microkernels by Marat Dukhan · 3 years, 3 months ago
- 69aa623 WAsm SIMD QS8 DWCONV with FP32 requantization by Marat Dukhan · 3 years, 3 months ago
- ee029b2 Replace deprecated wasm_simd128.h intrinsics with new versions by Marat Dukhan · 3 years, 3 months ago
- 85d772b QS8 DWCONV microkernels with FP32 requantization by Marat Dukhan · 3 years, 3 months ago
- 59af581 NEON implementations of QC8 DWCONV microkernels by Marat Dukhan · 3 years, 3 months ago
- aef9091 Minor optimization for AArch64 NEON QS8 microkernels with FP32 requantization by Marat Dukhan · 3 years, 3 months ago
- 6f90529 QS8 DWCONV microkernels for ARM NEON with FP32 requantization by Marat Dukhan · 3 years, 3 months ago
- 98042f2 QC8 DWCONV microkernels for SSE/AVX/XOP/AVX512 by Marat Dukhan · 3 years, 3 months ago
- 8228689 Support QC8 DWCONV microkernels by Marat Dukhan · 3 years, 4 months ago
- 4a2d255 Remove redundant SSSE3 microkernels with FP32 requantization by Marat Dukhan · 3 years, 4 months ago
- caf4831 FP32 requantization in QS8 DWCONV microkernels for SSE/AVX/XOP by Marat Dukhan · 3 years, 4 months ago
- 71855ee Support FP32 requantization in AVX512 QS8 microkernels by Marat Dukhan · 3 years, 4 months ago
- d4c7d82 AVX512-specific parameters for QS8 microkernels by Marat Dukhan · 3 years, 4 months ago
- 77ded05 Use byte-wide MIN/MAX in AVX512 QS8 DWCONV microkernels by Marat Dukhan · 3 years, 4 months ago
- 9b474cf Support FP32 requantization in AVX2 QS8 microkernels by Marat Dukhan · 3 years, 4 months ago
- f86ee8b Refactor requantization helper functions by Marat Dukhan · 3 years, 4 months ago
- e3d17bf Rename microkernel-related types and structures by Marat Dukhan · 3 years, 4 months ago
- b07c26a Rename QS8 GEMM/IGEMM/DWCONV microkernels by Marat Dukhan · 3 years, 4 months ago
- e1ff248 Rename QS8 DWCONV microkernel filenames by Marat Dukhan · 3 years, 4 months ago
- 748fd12 Use specialized layouts in SSE4/AVX2 QS8 [I]GEMM & DWCONV microkernels by Marat Dukhan · 3 years, 4 months ago
- 4ed1488 QS8 DWCONV25 microkernels by Marat Dukhan · 3 years, 5 months ago
- a24cc08 Small refactoring of scalar QS8 microkernels by Marat Dukhan · 3 years, 5 months ago
- 4454288 Scalar QS8 DWCONV microkernels by Marat Dukhan · 3 years, 5 months ago
- 3fd4e27 XOP versions of QS8 DWCONV MUL32 microkernels by Marat Dukhan · 3 years, 6 months ago
- 60fc613 Polyfill _mm_loadu_si32 in MUL32 QS8 DWCONV SSE4.1/AVX microkernels by Marat Dukhan · 3 years, 6 months ago
- 07feec8 MUL32 versions of SSE4.1 & AVX QS8 DWCONV microkernels by Marat Dukhan · 3 years, 6 months ago
- fa0ab85 AVX versions of QS8 DWCONV microkernels by Marat Dukhan · 3 years, 6 months ago
- 66ccf64 Rename QS8 generator templates by Marat Dukhan · 4 years ago
- 2ffc5e6 AVX512 versions of QS8 DWCONV microkernels by Marat Dukhan · 4 years, 1 month ago
- 023bcf9 NEON variant of QS8 DWCONV microkernels by Marat Dukhan · 4 years, 2 months ago
- c15aa4e Remove XOP variants of QS8 DWCONV by Marat Dukhan · 4 years, 2 months ago
- 4013552 AVX2 versions of QS8 DWCONV microkernels using 16-bit multiplication by Marat Dukhan · 4 years, 2 months ago
- cc8f34c WAsm SIMD variants of QS8 DWCONV microkernels by Marat Dukhan · 4 years, 2 months ago
- 23848db Reoptimize x86 requantization by Marat Dukhan · 4 years, 2 months ago
- d65a152 AVX2 versions of QS8 DWCONV microkernels by Marat Dukhan · 4 years, 2 months ago
- f62bbdc SSE2/SSSE3/SSE4.1/XOP implementation of QS8 DWCONV microkernels by Marat Dukhan · 4 years, 2 months ago