- e3cb19b Minimal RISC-V support by Marat Dukhan · 3 years, 6 months ago
- b9ebcad Fix CMake build by Marat Dukhan · 3 years, 6 months ago
- b639210 Add prefetch of A for quantized microkernels. by Frank Barchard · 3 years, 6 months ago
- a91559a Move declarations of VHSWISH microkernels into vunary.h by Marat Dukhan · 3 years, 6 months ago
- b1eec08 Refactor filenames of Clamp/Relu/Sigmoid microkernels by Marat Dukhan · 3 years, 6 months ago
- 43558a7 Fix CMake build for ARM64 by Marat Dukhan · 3 years, 6 months ago
- 6674d69 Refactor naming of unary elementwise microkernels by Marat Dukhan · 3 years, 6 months ago
- 3a6bb68 Merge pull request #1426 from microblink:mb-patches by XNNPACK Team · 3 years, 6 months ago
- e111861 1x8 C8 A53 microkernel defer adap by Frank Barchard · 3 years, 6 months ago
- 7c4c771 C8 A53 microkernels prefetch A by Frank Barchard · 3 years, 6 months ago
- 2a3169d C8 A53 microkernels move 2nd load after MLA by Frank Barchard · 3 years, 6 months ago
- ec51a4e Enable QS8 1x8C8 IGEMM microkernel for Cortex A53 by Frank Barchard · 3 years, 6 months ago
- dddb38f QS8 1x8C8 IGEMM microkernel for Cortex A53 by Frank Barchard · 3 years, 6 months ago
- 21acdd0 Enable QS8 1x8C8 GEMM microkernel for Cortex A53. by Frank Barchard · 3 years, 6 months ago
- 46a69c9 QS8 1x8C8 GEMM microkernel for Cortex A53 by Frank Barchard · 3 years, 6 months ago
- 042cdaf GCC 11 no longer needs this polyfill by Nenad Mikša · 3 years, 6 months ago
- 93757bf add `*.swp` to .gitignore by Nenad Mikša · 3 years, 6 months ago
- 2de3bce A53 C8 microkernel load A with ldr/ldr/ins by Frank Barchard · 3 years, 7 months ago
- 184a8e1 Enable A53 C8 microkernel load A with ldr/ldr/ins by Frank Barchard · 3 years, 7 months ago
- 5549735 4X8 and 4x16 mla lane microkernels for A53 by Frank Barchard · 3 years, 7 months ago
- 90f520b Enable Cortex A53 tuned C8 gemm/igemm microkernels for Cortex A53 and Cortex A55r0 by Frank Barchard · 3 years, 7 months ago
- d68e114 Cortex A53 tuned C8 gemm/igemm microkernels by Frank Barchard · 3 years, 7 months ago
- fb5983d Enable prefetch to MLA lane microkernel on Cortex A53 by Frank Barchard · 3 years, 7 months ago
- 1f51d38 Add prefetch to MLA lane microkernel by Frank Barchard · 3 years, 7 months ago
- 8f15372 Expose QS8 Fully Connected operator in Subgraph API by Marat Dukhan · 3 years, 7 months ago
- a999225 Support 2D Convolution and 2D Depthwise Convolution without bias by Marat Dukhan · 3 years, 7 months ago
- 853bb7a Reformat Subgraph API documentation by Marat Dukhan · 3 years, 7 months ago
- d74a53f Improve docs for xnn_define_fully_connected by Marat Dukhan · 3 years, 7 months ago
- 281f13e Simplify Fully Connected Node without bias by Marat Dukhan · 3 years, 7 months ago
- 4c6640c Disable MSan in QS8 GEMM/IGEMM microkernels with KR>1 by Marat Dukhan · 3 years, 7 months ago
- d9487b8 Merge pull request #1409 from larryliu0820:patch-1 by XNNPACK Team · 3 years, 7 months ago
- 3dd80b3 Fix allocator initialize issue on Windows by Larry Liu · 3 years, 7 months ago
- 676322f Merge pull request #1396 from huningxin:fully_connected by XNNPACK Team · 3 years, 7 months ago
- 6ac1d18 Cortex A53 used MLAL lane by Frank Barchard · 3 years, 7 months ago
- c77fc4c Bug fix add missing break for qs8 select on big core. by Frank Barchard · 3 years, 7 months ago
- 2b26670 Merge pull request #1406 from larryliu0820:master by XNNPACK Team · 3 years, 7 months ago
- 39c539d Fix neon v8 for ios armv7 by Larry Liu · 3 years, 7 months ago
- ec56b7e Avoid selection of NEON-DOT microkernels on AArch32 iOS by Marat Dukhan · 3 years, 7 months ago
- 2a995e7 Enable PRFM variant of QS8 C8 Neon microkernel on Cortex A53, A72, A73 and Kryo. by Frank Barchard · 3 years, 7 months ago
- 4a35204 PRFM variant of QS8 C8 Neon microkernel. by Frank Barchard · 3 years, 7 months ago
- 6e8c0ce Disable compilation of neondot microkernels for AArch32 iOS by Marat Dukhan · 3 years, 7 months ago
- 3fd4e27 XOP versions of QS8 DWCONV MUL32 microkernels by Marat Dukhan · 3 years, 7 months ago
- 4181f94 Optimize QS8 GEMM/IGEMM microkernel selection for AVX by Marat Dukhan · 3 years, 7 months ago
- 2e42787 2x4c2/3x4c2 microkernels for SSE2/SSSE3/SSE4.1/AVX/XOP by Marat Dukhan · 3 years, 7 months ago
- 30fa853 Support flags and optional bias for fully_connected node by Ningxin Hu · 3 years, 7 months ago
- be3d8fd QS8 E2E GEMM benchmarks for C2 AVX microkernels by Marat Dukhan · 3 years, 7 months ago
- d5cc508 Add AVX microkernels to QS8 E2E GEMM benchmark by Marat Dukhan · 3 years, 7 months ago
- 496389f Make xnn_initialize thread-safe by Marat Dukhan · 3 years, 7 months ago
- e696c3f QS8 move loads to end of loop, 1 every 2 neon instructions. by Frank Barchard · 3 years, 7 months ago
- 60fc613 Polyfill _mm_loadu_si32 in MUL32 QS8 DWCONV SSE4.1/AVX microkernels by Marat Dukhan · 3 years, 7 months ago
- 07feec8 MUL32 versions of SSE4.1 & AVX QS8 DWCONV microkernels by Marat Dukhan · 3 years, 7 months ago
- fa0ab85 AVX versions of QS8 DWCONV microkernels by Marat Dukhan · 3 years, 7 months ago
- e9c4b96 AVX versions of QS8 VADD/VADDC microkernels by Marat Dukhan · 3 years, 7 months ago
- a3c1633 AVX versions of QS8 GEMM/IGEMM microkernels by Marat Dukhan · 3 years, 7 months ago
- b8ad46a Refactor code-generation templates for XOP microkernels by Marat Dukhan · 3 years, 7 months ago
- ae5082e QS8 C8 GEMM/IGEMM use load a/b last technique for Cortex A75 performance. by Frank Barchard · 3 years, 7 months ago
- c409471 Include XOP headers in clang-cl compatible way. Fix #1382. by Marat Dukhan · 3 years, 7 months ago
- d23cb6e Fully Connected operator for QS8 datatype by Marat Dukhan · 3 years, 8 months ago
- b3ffd58 Implement bilinear upsampling for SSE architecture by Artsiom Ablavatski · 3 years, 8 months ago
- 1f5099e Support quantized inference in Subgraph API with xnn_enable_qs8=true by Marat Dukhan · 3 years, 8 months ago
- 03f4621 Rename xnnpack_enable_memopt Bazel option to xnn_enable_memopt by Marat Dukhan · 3 years, 8 months ago
- b939cdb Bazel flag --xnn_enable_qs8=true to include QS8 operators in :xnnpack_for_tflite by Marat Dukhan · 3 years, 8 months ago
- 96d95e0 Remove unused :xnnpack_f32 target by Marat Dukhan · 3 years, 8 months ago
- d09ca26 Remove Android NDK r17 support by Marat Dukhan · 3 years, 8 months ago
- f0cb70a Rename TFLite- and TF.js- optimized targets by Marat Dukhan · 3 years, 8 months ago
- 09c0591 Validate static tensors in Subgraph API by Marat Dukhan · 3 years, 8 months ago
- 3075719 Clarify documentation of xnn_define_quantized_tensor_value by Marat Dukhan · 3 years, 8 months ago
- 43ebc05 Extend Subgraph API to support quantized tensors by Marat Dukhan · 3 years, 8 months ago
- ccd3a1d Validate tensor data types in Subgraph API by Marat Dukhan · 3 years, 8 months ago
- 3bfbdaf Update emscripten config settings to conform to the official emscripten toolchain. by XNNPACK Team · 3 years, 8 months ago
- f7baa13 Merge pull request #1367 from JerryShih:upgrade-fp16 by XNNPACK Team · 3 years, 8 months ago
- 6e35de5 QS8 1X8C8 IGEMM microkernel by Frank Barchard · 3 years, 8 months ago
- b876263 QS8 1X8C8 GEMM microkernel by Frank Barchard · 3 years, 8 months ago
- accc49a Update FP16 dependency. by Jerry Shih · 3 years, 8 months ago
- a0f9bdc Validate tensor types in Subgraph API by Marat Dukhan · 3 years, 8 months ago
- 967712d Limit range of test values for f16 binary minmax ops. by Frank Barchard · 3 years, 8 months ago
- 2c525e5 MOV 16b instead of 4s for GCC compatability. Fix #1360 by Frank Barchard · 3 years, 8 months ago
- b0da47a QS8 C8 neon microkernel load B at end of loop and PADAP at top of loop. by Frank Barchard · 3 years, 8 months ago
- f5f9cec Miscellaneous tweeks to QS8 IGEMM microkernels by Frank Barchard · 3 years, 8 months ago
- 8e58994 2x8c8__aarch64_neon_mlal_padal GEMM microkernel load A0 last by Frank Barchard · 3 years, 8 months ago
- bbf5182 Enable QS8 2x8c8-aarch64-neon-mlal-padal GEMM / IGEMM microkernels by Frank Barchard · 3 years, 8 months ago
- cbb8e70 QS8 2x8c8-aarch64-neon-mlal-padal IGEMM microkernel by Frank Barchard · 3 years, 8 months ago
- 7ca54df QS8 2x8c16-aarch64-neondot-ld64 IGEMM microkernel by Frank Barchard · 3 years, 8 months ago
- 0ac9b7f Fix bugs in AVX512F LUT-based EXP evaluation stubs by Marat Dukhan · 3 years, 8 months ago
- 7825897 C8 mul microkernel labels sorted and registers documented by Frank Barchard · 3 years, 8 months ago
- dbb2292 Fix bug in AVX512F RR2 P5 SCALEF EXP evaluation stubs by Marat Dukhan · 3 years, 8 months ago
- 2f06150 xnn_qs8_gemm_minmax_ukernel_2x8c8__aarch64_neon_mlal_padal GEMM microkernel by Frank Barchard · 3 years, 8 months ago
- 1dc9fef QS8 2x8c8-aarch64 GEMM microkernel by Frank Barchard · 3 years, 8 months ago
- 4610854 Disable QS8 1x16c4-aarch64-neondot-ld64 IGEMM microkernel by Frank Barchard · 3 years, 8 months ago
- 3522c0a Enable QS8 4x16c4-aarch64-neondot-ld64 IGEMM microkernel by Frank Barchard · 3 years, 8 months ago
- 671d1b0 QS8 4x16c4-aarch64-neondot-ld64 IGEMM microkernel by Frank Barchard · 3 years, 8 months ago
- 24c2dec QU8 remove prototypes for microkernels that do not exist. by Frank Barchard · 3 years, 8 months ago
- baf46fc Tuned QS8 GEMM 2x8c16 MLAL PADAL assembly microkernel for AArch64 by Frank Barchard · 3 years, 8 months ago
- 5655cb7 QS8 GEMM 2x8c16 MLAL PADAL assembly microkernel for AArch64 by Frank Barchard · 3 years, 8 months ago
- 4cea232 Built-in end-to-end benchmark on sparse models by Marat Dukhan · 3 years, 8 months ago
- 52e061d Use std::array for fixed-sized arrays in hardcoded models by Marat Dukhan · 3 years, 8 months ago
- b7941cb Round KC up for assembly microkernels. by Frank Barchard · 3 years, 8 months ago
- b75840f Enable QS8 IGEMM for Cortex A55 by Frank Barchard · 3 years, 8 months ago
- 89e12f8 QS8 IGEMM for Cortex A55 by Frank Barchard · 3 years, 8 months ago
- 12a23bb Benchmark randomized QU8 MobileNet v1 model by Marat Dukhan · 3 years, 8 months ago