- b91432c AVX F32->QS8 and F32->QU8 VCVT microkernels by Marat Dukhan · 2 years, 10 months ago
- 6883abb JIT memory allocation and integration into Assembler by Zhi An Ng · 2 years, 10 months ago
- da7b2e2 QS8 4x8 lane GEMM AArch32 microkernel by Frank Barchard · 2 years, 10 months ago
- 710fb42 Benchmark for the Convert (F32->QS8) operator by Marat Dukhan · 2 years, 10 months ago
- 914f57b Aarch64 4x8 lane ld64 GEMM/IGEMM microkernels. by Frank Barchard · 2 years, 10 months ago
- f92206b QS8->F32 and QU8->F32 Convert NC operators by Marat Dukhan · 2 years, 10 months ago
- ad6f2dc Benchmarks for QS8->F32 and QU8->F32 VCVT microkernels by Marat Dukhan · 2 years, 10 months ago
- cb052a3 Remove duplicate template line for 1x8c4 NEON dot product. by Frank Barchard · 2 years, 10 months ago
- 86bd270 Scalar QS8/QU8 -> F32 VCVT microkernels by Marat Dukhan · 2 years, 10 months ago
- d873fa2 SSE2 QS8/QU8->F32 VCVT microkernels by Marat Dukhan · 2 years, 10 months ago
- f9cf55d SSE4.1 QS8/QU8->F32 VCVT microkernels by Marat Dukhan · 2 years, 10 months ago
- fee66be NEON QS8/QU8 -> F32 VCVT microkernels by Marat Dukhan · 2 years, 10 months ago
- 59d6515 Enable FP32 requant variant for QU8 [1,4]x8 Neon MLAL [I]GEMM kernels by Digant Desai · 2 years, 10 months ago
- 9982ed3 Enable FP32 requant variant for QU8 NEON dotprod [I]GEMM kernels by Digant Desai · 2 years, 10 months ago
- 2e2d179 Enable FP32 requant variant for QU8 4x16c4 NEON asm dotprod [I]GEMM kernels by Digant Desai · 2 years, 10 months ago
- 10f9f62 Enable FP32 requant variant for QU8 4x16c4 NEON asm dotprod [I]GEMM kernels for CA55r1 by Digant Desai · 2 years, 10 months ago
- b559fe9 Initial AArch32 structure by Zhi An Ng · 2 years, 10 months ago
- 8999190 Remove GEMMLOWP requantization from QS8 GEMM/IGEMM templates by Marat Dukhan · 2 years, 10 months ago
- 17a9e3f Remove GEMMLOWP requantization from QS8 DWCONV templates by Marat Dukhan · 2 years, 10 months ago
- 20483c7 Expose Convert operator in Subgraph API by Marat Dukhan · 2 years, 10 months ago
- 430b173 F32->QS8/QU8 VCVT scalar microkernels using FP32 min/max by Marat Dukhan · 2 years, 10 months ago
- ed2d776 F32->QS8 and F32->QU8 Convert NC operators by Marat Dukhan · 2 years, 10 months ago
- 03f1297 F32->QS8 and F32->QU8 Convert NC operators by XNNPACK Team · 2 years, 10 months ago
- 7d2d85c F32->QS8 and F32->QU8 Convert NC operators by Marat Dukhan · 2 years, 10 months ago
- 563eee1 Benchmarks for F32->QS8 and F32->QU8 VCVT microkernels by Marat Dukhan · 2 years, 10 months ago
- 00a1085 F32->QS8 and F32->QU8 VCVT scalar microkernels by Marat Dukhan · 2 years, 10 months ago
- b2d0a2a F32->QS8 and F32->QU8 VCVT NEON microkernels by Marat Dukhan · 2 years, 10 months ago
- d24301d F32->QS8/QU8 CVT evaluation stubs for NEON and NEON v8 by Marat Dukhan · 2 years, 10 months ago
- 9551075 Fix CMake build by Marat Dukhan · 2 years, 10 months ago
- 3df14d3 F32->QS8 and F32->QU8 VCVT NEON V8 microkernels by Marat Dukhan · 2 years, 10 months ago
- c5aa242 F32->QS8 and F32->QU8 microkernels for SSE by Marat Dukhan · 2 years, 10 months ago
- 5f7cf55 Avoid using gcc-specific intrinsics in NEON microkernels by Marat Dukhan · 2 years, 10 months ago
- 27bf92c RNDNU versions of all Neon lane microkernels. by Frank Barchard · 2 years, 10 months ago
- 24abe6b Initialize S8/U8 IBILINEAR microkernel pointers by Marat Dukhan · 2 years, 10 months ago
- 6a69c8e Scalar versions of S8/U8 IBILINEAR microkernels by Marat Dukhan · 2 years, 10 months ago
- 7519eb1 SSE2 & SSE4.1 versions of S8/U8 IBILINEAR microkernels by Marat Dukhan · 2 years, 10 months ago
- cdb42a5 NEON versions of S8/U8 IBILINEAR microkernels by Marat Dukhan · 2 years, 10 months ago
- 9cdc10d QU8 4x16 lane AArch64 NEON GEMM/IGEMM ld64 microkernel by Frank Barchard · 2 years, 10 months ago
- 0bc5801 QC8 AArch32 use NeonV8 when available. by Frank Barchard · 2 years, 10 months ago
- 5cffb64 4x16 lane AArch64 NEON GEMM/IGEMM ld64 microkernel by Frank Barchard · 2 years, 10 months ago
- 64ab1b7 LD1R and LD2R variants of c4 microkernel by Frank Barchard · 2 years, 10 months ago
- 15eec02 LD1R and LD2R variants of c2 microkernel by Frank Barchard · 2 years, 10 months ago
- 42f5c50 LOADDUP variant of c2 microkernel by Frank Barchard · 2 years, 11 months ago
- e22685a Remove padal from quantized microkernel names. by Frank Barchard · 2 years, 11 months ago
- eb704f7 QS8 C4S2 Neon GEMM/IGEMM microkernels by Frank Barchard · 2 years, 11 months ago
- a0c6168 F32->F16 Convert operator by Marat Dukhan · 2 years, 11 months ago
- e7043ff Enable C2S4 for QC8 GEMM/IGEMM microkernels. by Frank Barchard · 2 years, 11 months ago
- c7a032d C2S4 QS8 Neon GEMM/IGEMM microkernels. by Frank Barchard · 2 years, 11 months ago
- 1fe8995 Scalar F32->F16 VCVT microkernels by Marat Dukhan · 2 years, 11 months ago
- 78f039d Scalar F16->F32 evaluation stubs of bitcast-based and fabsf-based variants by Marat Dukhan · 2 years, 11 months ago
- 4edfdbf NEON F32->F16 VCVT microkernels by Marat Dukhan · 2 years, 11 months ago
- b4cde5a Fix CMake build on ARM by Marat Dukhan · 2 years, 11 months ago
- eb84423 SSE2, SSE4.1, and AVX F32->F16 VCVT microkernels by Marat Dukhan · 2 years, 11 months ago
- 056f49d Evaluation stubs for SSE2 & SSE4.1 F32->F16 conversion by Marat Dukhan · 2 years, 11 months ago
- a6eb1e5 Evaluation stubs for NEON F32->F16 conversion by Marat Dukhan · 2 years, 11 months ago
- 46cc1e1 Evaluation stubs for scalar F32->F16 conversion by Marat Dukhan · 2 years, 11 months ago
- 287952a QS8 C4 Neon GEMM/IGEMM microkernels by Frank Barchard · 2 years, 11 months ago
- 66ae257 Switch from C2 to S4C2 for qs8 microkernels on 32 bit ARM by Frank Barchard · 2 years, 11 months ago
- 47a74db Add specific microkernel for 1D convolutions with 1x3 kernel size for Android backend by Artsiom Ablavatski · 2 years, 11 months ago
- 494cd2b S4 variant of C2 Neon GEMM/IGEMM microkernel by Frank Barchard · 2 years, 11 months ago
- 952cb51 S4 variant of C2 Neon GEMM/IGEMM mull microkernel by Frank Barchard · 2 years, 11 months ago
- 1d41247 Neon C2 microkernels switch to rndnu from gemmlowp by Frank Barchard · 3 years ago
- 582e184 Evaluation stubs and tests for FP16->FP32 conversion by Marat Dukhan · 3 years ago
- ddb3d16 F16 Fully Connected operator by Marat Dukhan · 3 years ago
- d77f77d F32->F16 VCVT microkernels for NEON-FP16, F16C, and AVX512 by Marat Dukhan · 3 years ago
- af2ba00 F16->F32 Convert operator by Marat Dukhan · 3 years ago
- c9f9d67 Add Channel Tile of 16 for float and 32 for half float. by Frank Barchard · 3 years ago
- dbe781b Enable 8x4, 8x9, 8x25 f32 dwconv by Frank Barchard · 3 years ago
- e2c0001 Scalar FP16->FP32 VCVT microkernels by Marat Dukhan · 3 years ago
- 434352f Benchmarks for FP16->FP32 VCVT microkernels by Marat Dukhan · 3 years ago
- 322ed6f NEON FP16->FP32 VCVT microkernels by Marat Dukhan · 3 years ago
- 1227adb SSE2/SSE4.1/AVX FP16->FP32 VCVT microkernels by Marat Dukhan · 3 years ago
- 60f903b NEON FP16->FP32 conversion evaluation stubs by Marat Dukhan · 3 years ago
- 3ed866b Test evaluation stubs for F16->F32 conversion by Marat Dukhan · 3 years ago
- 8ff372c NEON-FP16 implementation of F16->F32 VCVT microkernels by Marat Dukhan · 3 years ago
- 354cbc6 QU8 MUL8 variant of DWCONV by Frank Barchard · 3 years ago
- 79c76ab F16->F32 conversion microkernels in AVX512-SKX implementation by Marat Dukhan · 3 years ago
- f1a6ed3 F16->F32 conversion microkernels in F16C implementation by Marat Dukhan · 3 years ago
- 2aa2e2a q8 dwconv add channel tiles of 24 and 32 for mul16 rndnu microkernels by Frank Barchard · 3 years ago
- e4118ef Polyfill vld1q_u8_x4 for older AArch64 gcc versions by Marat Dukhan · 3 years ago
- 98e054b Enable vectorized X8 LUT microkernels by Marat Dukhan · 3 years ago
- 2b3c410 AVX512BW implementations of X8 LUT microkernels by Marat Dukhan · 3 years, 1 month ago
- 7c478e3 SSSE3, AVX, and AVX2 X8 LUT microkernels by Marat Dukhan · 3 years, 1 month ago
- 5de7bc0 QS8/QU8 Tanh operator using LUT microkernels by Marat Dukhan · 3 years, 1 month ago
- f718232 X8 LUT NEON microkernels by Marat Dukhan · 3 years, 1 month ago
- 548542c Fix CMake build by Marat Dukhan · 3 years, 1 month ago
- f6c991e Implement generic LUT-based elementwise operator by Marat Dukhan · 3 years, 1 month ago
- 5407437 Benchmark for X8 LUT microkernels by Marat Dukhan · 3 years, 1 month ago
- d67539d Auto-generate X8 LUT microkernels and tests by Marat Dukhan · 3 years, 1 month ago
- cdf59a5 Add QU8 NR=32 microkernels by Frank Barchard · 3 years, 1 month ago
- df8e604 4x8 QU8 Neon Dotproduct microkernel rename from ld64 to ld128 by Frank Barchard · 3 years, 1 month ago
- a49e41f QU8 4x16C4 NEON Dot Product GEMM/IGEMM microkernels for Cortex A55r1 by Frank Barchard · 3 years, 1 month ago
- 0a3093c QU8 vadd neon use x32 instead of x8 by Frank Barchard · 3 years, 1 month ago
- 7da8b02 Q8 dwconv switch from 8x25 to 16x25 by Frank Barchard · 3 years, 1 month ago
- e252f92 End-to-end benchmarks on QC8 MobileNet v1/v2 models by Marat Dukhan · 3 years, 1 month ago
- 0d06573 dwconv Q8 switch from 8x9 to 16x9 tile. by Frank Barchard · 3 years, 1 month ago
- 8b69802 Enable QU8 C4 NEON Dot Product GEMM/IGEMM microkernels for Cortex A55r1 by Frank Barchard · 3 years, 1 month ago
- ca4c68e QU8 C4 NEON Dot Product GEMM/IGEMM microkernels for Cortex A55r1 by Frank Barchard · 3 years, 1 month ago
- 0c76422 QU8 NEON Assembly remove channel wise by Frank Barchard · 3 years, 1 month ago
- 4066898 QU8 4x16 C4 NEON Assembly Dot Product GEMM/IGEMM microkernels by Frank Barchard · 3 years, 1 month ago