- 27bf92c RNDNU versions of all Neon lane microkernels. by Frank Barchard · 2 years, 7 months ago
- 24abe6b Initialize S8/U8 IBILINEAR microkernel pointers by Marat Dukhan · 2 years, 7 months ago
- 6a69c8e Scalar versions of S8/U8 IBILINEAR microkernels by Marat Dukhan · 2 years, 7 months ago
- 266a47b WAsm SIMD versions of S8/U8 IBILINEAR microkernels by Marat Dukhan · 2 years, 7 months ago
- 7519eb1 SSE2 & SSE4.1 versions of S8/U8 IBILINEAR microkernels by Marat Dukhan · 2 years, 7 months ago
- cdb42a5 NEON versions of S8/U8 IBILINEAR microkernels by Marat Dukhan · 2 years, 8 months ago
- 9cdc10d QU8 4x16 lane AArch64 NEON GEMM/IGEMM ld64 microkernel by Frank Barchard · 2 years, 8 months ago
- 5cffb64 4x16 lane AArch64 NEON GEMM/IGEMM ld64 microkernel by Frank Barchard · 2 years, 8 months ago
- 64ab1b7 LD1R and LD2R variants of c4 microkernel by Frank Barchard · 2 years, 8 months ago
- 15eec02 LD1R and LD2R variants of c2 microkernel by Frank Barchard · 2 years, 8 months ago
- 42f5c50 LOADDUP variant of c2 microkernel by Frank Barchard · 2 years, 8 months ago
- b1325b9 Introduce xnn_compute_type in Subgraph Nodes by Marat Dukhan · 2 years, 8 months ago
- e22685a Remove padal from quantized microkernel names. by Frank Barchard · 2 years, 8 months ago
- eb704f7 QS8 C4S2 Neon GEMM/IGEMM microkernels by Frank Barchard · 2 years, 8 months ago
- a0c6168 F32->F16 Convert operator by Marat Dukhan · 2 years, 8 months ago
- c7a032d C2S4 QS8 Neon GEMM/IGEMM microkernels. by Frank Barchard · 2 years, 8 months ago
- 1fe8995 Scalar F32->F16 VCVT microkernels by Marat Dukhan · 2 years, 8 months ago
- 78f039d Scalar F16->F32 evaluation stubs of bitcast-based and fabsf-based variants by Marat Dukhan · 2 years, 8 months ago
- 4edfdbf NEON F32->F16 VCVT microkernels by Marat Dukhan · 2 years, 8 months ago
- 22e31c8 WAsm SIMD F32->F16 VCVT microkernels by Marat Dukhan · 2 years, 8 months ago
- eb84423 SSE2, SSE4.1, and AVX F32->F16 VCVT microkernels by Marat Dukhan · 2 years, 8 months ago
- 79c78b2 Evaluation stubs for WAsm SIMD F32->F16 conversion by Marat Dukhan · 2 years, 8 months ago
- 056f49d Evaluation stubs for SSE2 & SSE4.1 F32->F16 conversion by Marat Dukhan · 2 years, 8 months ago
- a6eb1e5 Evaluation stubs for NEON F32->F16 conversion by Marat Dukhan · 2 years, 8 months ago
- 46cc1e1 Evaluation stubs for scalar F32->F16 conversion by Marat Dukhan · 2 years, 8 months ago
- 287952a QS8 C4 Neon GEMM/IGEMM microkernels by Frank Barchard · 2 years, 8 months ago
- 47a74db Add specific microkernel for 1D convolutions with 1x3 kernel size for Android backend by Artsiom Ablavatski · 2 years, 8 months ago
- 494cd2b S4 variant of C2 Neon GEMM/IGEMM microkernel by Frank Barchard · 2 years, 8 months ago
- 952cb51 S4 variant of C2 Neon GEMM/IGEMM mull microkernel by Frank Barchard · 2 years, 8 months ago
- 1d41247 Neon C2 microkernels switch to rndnu from gemmlowp by Frank Barchard · 2 years, 8 months ago
- 582e184 Evaluation stubs and tests for FP16->FP32 conversion by Marat Dukhan · 2 years, 8 months ago
- ddb3d16 F16 Fully Connected operator by Marat Dukhan · 2 years, 8 months ago
- d77f77d F32->F16 VCVT microkernels for NEON-FP16, F16C, and AVX512 by Marat Dukhan · 2 years, 8 months ago
- af2ba00 F16->F32 Convert operator by Marat Dukhan · 2 years, 8 months ago
- c9f9d67 Add Channel Tile of 16 for float and 32 for half float. by Frank Barchard · 2 years, 9 months ago
- e2c0001 Scalar FP16->FP32 VCVT microkernels by Marat Dukhan · 2 years, 9 months ago
- f6507f8 WAsm SIMD FP16->FP32 VCVT microkernels by Marat Dukhan · 2 years, 9 months ago
- 322ed6f NEON FP16->FP32 VCVT microkernels by Marat Dukhan · 2 years, 9 months ago
- 1227adb SSE2/SSE4.1/AVX FP16->FP32 VCVT microkernels by Marat Dukhan · 2 years, 9 months ago
- 60f903b NEON FP16->FP32 conversion evaluation stubs by Marat Dukhan · 2 years, 9 months ago
- a18926a WAsm SIMD FP16->FP32 conversion evaluation stubs by Marat Dukhan · 2 years, 9 months ago
- 3ed866b Test evaluation stubs for F16->F32 conversion by Marat Dukhan · 2 years, 9 months ago
- 8ff372c NEON-FP16 implementation of F16->F32 VCVT microkernels by Marat Dukhan · 2 years, 9 months ago
- 0630d29 Refactor creation and setup of Operators from Nodes by Marat Dukhan · 2 years, 9 months ago
- 354cbc6 QU8 MUL8 variant of DWCONV by Frank Barchard · 2 years, 9 months ago
- 79c76ab F16->F32 conversion microkernels in AVX512-SKX implementation by Marat Dukhan · 2 years, 9 months ago
- f1a6ed3 F16->F32 conversion microkernels in F16C implementation by Marat Dukhan · 2 years, 9 months ago
- a4ad988 X8 LUT microkernels for WAsm SIMD by Marat Dukhan · 2 years, 10 months ago
- 2aa2e2a q8 dwconv add channel tiles of 24 and 32 for mul16 rndnu microkernels by Frank Barchard · 2 years, 10 months ago
- 5cc31e3 Replace _mm512_(loadu/storeu)_epi8 with _mm512_(loadu/storeu)_si512 by Marat Dukhan · 2 years, 10 months ago
- 37c3077 Avoid _mm512_(loadu/storeu)_epi32 in _mm512_(loadu/storeu)_epi8 polyfills by Marat Dukhan · 2 years, 10 months ago
- b54871d Polyfill _mm512_loadu_epi8 & _mm512_storeu_epi8 for pre GCC-11 by Marat Dukhan · 2 years, 10 months ago
- eec0052 QS8 ELU operator by Marat Dukhan · 2 years, 10 months ago
- e4118ef Polyfill vld1q_u8_x4 for older AArch64 gcc versions by Marat Dukhan · 2 years, 10 months ago
- 2b3c410 AVX512BW implementations of X8 LUT microkernels by Marat Dukhan · 2 years, 10 months ago
- 7c478e3 SSSE3, AVX, and AVX2 X8 LUT microkernels by Marat Dukhan · 2 years, 10 months ago
- 5de7bc0 QS8/QU8 Tanh operator using LUT microkernels by Marat Dukhan · 2 years, 10 months ago
- f718232 X8 LUT NEON microkernels by Marat Dukhan · 2 years, 10 months ago
- 71a9bb1 QS8 Sigmoid operator by Marat Dukhan · 2 years, 10 months ago
- d67539d Auto-generate X8 LUT microkernels and tests by Marat Dukhan · 2 years, 10 months ago
- cdf59a5 Add QU8 NR=32 microkernels by Frank Barchard · 2 years, 10 months ago
- b8cbcb5 Fuse rounding term into bias in QS8 & QU8 VADD[C] microkernels by Marat Dukhan · 2 years, 10 months ago
- 8e2fd20 QS8 and QU8 Subtract ND operators by Marat Dukhan · 2 years, 10 months ago
- 6428725 Rename ADD quantization parameters to ADDSUB by Marat Dukhan · 2 years, 10 months ago
- 8ae1a53 Remove duplicate prototypes by Frank Barchard · 2 years, 10 months ago
- 4c49494 Fix crash on AArch32 in scalar quantized microkernels by Marat Dukhan · 2 years, 10 months ago
- df8e604 4x8 QU8 Neon Dotproduct microkernel rename from ld64 to ld128 by Frank Barchard · 2 years, 10 months ago
- 33b4f75 VRND microkernels using native WAsm SIMD instructions by Marat Dukhan · 2 years, 10 months ago
- a49e41f QU8 4x16C4 NEON Dot Product GEMM/IGEMM microkernels for Cortex A55r1 by Frank Barchard · 2 years, 10 months ago
- 8dc106e QC8/QS8/QU8 GEMM/IGEMM WAsm SIMD microkernels using i32x4.dot_i16x8_s instruction by Marat Dukhan · 2 years, 10 months ago
- feee77f Leverage f32x4.nearest, f32x4.floor, f32x4.ceil, f32x4.trunc WAsm SIMD instructions by Marat Dukhan · 2 years, 10 months ago
- 5d27a7b Leverage f32x4.nearest, f32x4.floor, f32x4.ceil, f32x4.trunc WAsm SIMD instructions by Marat Dukhan · 2 years, 10 months ago
- 0a3093c QU8 vadd neon use x32 instead of x8 by Frank Barchard · 2 years, 10 months ago
- ca4c68e QU8 C4 NEON Dot Product GEMM/IGEMM microkernels for Cortex A55r1 by Frank Barchard · 2 years, 10 months ago
- 4066898 QU8 4x16 C4 NEON Assembly Dot Product GEMM/IGEMM microkernels by Frank Barchard · 2 years, 11 months ago
- 0049e89 QU8 C4 NEON Assembly Dot Product GEMM/IGEMM microkernels by Frank Barchard · 2 years, 11 months ago
- 9cedb59 Accumulate in 16 bits once in WAsm SIMD MUL16 QS8/QC8 DWCONV before extending to 32 bits by Marat Dukhan · 2 years, 11 months ago
- 61c0c9e Clamp NC operator for S8 data type by Marat Dukhan · 2 years, 11 months ago
- 9491279 Refactor parameter initialization for VCLAMP microkernels by Marat Dukhan · 2 years, 11 months ago
- e79acb7 S8 VCLAMP microkernels by Marat Dukhan · 2 years, 11 months ago
- 1f5b108 Refactor U8 CLAMP microkernels by Marat Dukhan · 2 years, 11 months ago
- 2ea50a0 Refactor U8 MAXPOOL microkernels similarly to S8 MAXPOOL by Marat Dukhan · 2 years, 11 months ago
- dc5c148 S8 Max Pooling operator by Marat Dukhan · 2 years, 11 months ago
- 2314753 S8 MAXPOOL microkernels for all architectures by Marat Dukhan · 2 years, 11 months ago
- f158942 WAsm SIMD implementation of U8 MAXPOOL microkernel by Marat Dukhan · 2 years, 11 months ago
- 91ae165 Refactor initialization of MAXPOOL microkernel parameters by Marat Dukhan · 2 years, 11 months ago
- e033126 Generate more tile sizes for QU8 gemm/igemm by Frank Barchard · 2 years, 11 months ago
- 88e839c QU8 C4 NEON Dot Product GEMM/IGEMM microkernels by Frank Barchard · 2 years, 11 months ago
- 139e961 X8 version of Constand Pad ND operator by Marat Dukhan · 2 years, 11 months ago
- dfc2db0 Add prefix to QC8/QS8/QU8 WAsm SIMD GEMM/IGEMM microkernels by Marat Dukhan · 2 years, 11 months ago
- 0461f2d Generalize PAD microkernels to all 8-/16-/32-bit data types by Marat Dukhan · 2 years, 11 months ago
- 933051b Generalize FILL microkernels to all 8-/16-/32-bit data types by Marat Dukhan · 2 years, 11 months ago
- 4486f87 Prune NEON-DOT QS8 GEMM/IGEMM microkernels with FP32 & GEMMLOWP requantization by Marat Dukhan · 2 years, 11 months ago
- 400e7cb Prune WAsm SIMD QS8 GEMM/IGEMM microkernels with GEMMLOWP requantization by Marat Dukhan · 2 years, 11 months ago
- e16bf7d Prune AVX2/AVX512 QS8 GEMM/IGEMM microkernels with GEMMLOWP requantization by Marat Dukhan · 2 years, 11 months ago
- 8674629 Use QS8 GEMM WAsm SIMD microkernels with FP32 requantization in the benchmark by Marat Dukhan · 2 years, 11 months ago
- 0ff7989 Use FP32 requantization for extended-weights QS8 GEMM microkernels on x86 by Marat Dukhan · 2 years, 11 months ago
- 0853b8a QS8/QU8 Multiply ND operators by Marat Dukhan · 3 years ago
- fb3a94f QU8 4x16 Neon assembly microkernel for Cortex A75 by Frank Barchard · 3 years ago
- 4a7b70f QS8/QU8 VMUL[C] microkernels in NEON implementation by Marat Dukhan · 3 years ago