- e64f91a Pipelined 6x8 GEMM for Cortex A53 by Frank Barchard · 4 years, 9 months ago
- 9fab3f9 Support input offset in BILINEAR micro-kernels by Marat Dukhan · 4 years, 9 months ago
- 38709a6 Add scalar chw 5x5p2 and 5x5s2p2 kernels by Erich Elsen · 4 years, 9 months ago
- 2a64a1a Fix incompatibility with ARM gcc by Marat Dukhan · 4 years, 9 months ago
- 0f06b5c Fix gcc incompatibility in SSE PReLU microkernels by Marat Dukhan · 4 years, 9 months ago
- c465fc2 Add missed f32-bilinear-test to CMake build by Marat Dukhan · 4 years, 9 months ago
- 35dacfb BILINEAR micro-kernels by Marat Dukhan · 4 years, 9 months ago
- 5098c3e Refactor DWCONV micro-kernels by Marat Dukhan · 4 years, 9 months ago
- 49e6ee9 Refactor VMulCAddC micro-kernel by Marat Dukhan · 4 years, 9 months ago
- 69c3f2c Refactor PReLU microkernels by Marat Dukhan · 4 years, 9 months ago
- d5208d6 Remove a_sum buffer by Marat Dukhan · 4 years, 9 months ago
- 1898b91 Move adjustment_* arguments of Deconvolution into setup by Marat Dukhan · 4 years, 9 months ago
- a41533d Reduce image sizes in Deconvolution unit tests by Marat Dukhan · 4 years, 9 months ago
- bad48fe Vary number of threads in the End-to-End benchmark by Marat Dukhan · 4 years, 9 months ago
- 70ad409 Make ARM microkernels compatible with gcc by Marat Dukhan · 4 years, 9 months ago
- fb60914 Make F32 CLAMP NEON micro-kernel compatible with gcc on AArch32 by Marat Dukhan · 4 years, 9 months ago
- bd41971 A57 branch a version of A53 kernel by Frank Barchard · 4 years, 9 months ago
- 8e6e997 Fix ARM64 build with CMake by Marat Dukhan · 4 years, 9 months ago
- c9d2f3f Fix CMake build of End-to-End DWCONV & GEMM benchmarks by Marat Dukhan · 4 years, 9 months ago
- c712fa4 Add Freq to end2end benchmark. by Frank Barchard · 4 years, 9 months ago
- ef4416e End-to-end benchmarks for DWCONV microkernels by Marat Dukhan · 4 years, 9 months ago
- e72e287 Add Freq to E2E benchmark by Frank Barchard · 4 years, 9 months ago
- 0a5a53f Fix CMake build of GEMM E2E benchmark by Marat Dukhan · 4 years, 9 months ago
- e2142e7 Merge pull request #103 from Maratyszcza:master by XNNPACK Team · 4 years, 9 months ago
- 63ba2ed Fix typos in AVX2 ExtExp micro-kernels by Marat Dukhan · 4 years, 9 months ago
- 5f18d26 End-to-end benchmarks for GEMM/IGEMM microkernels by Marat Dukhan · 4 years, 9 months ago
- 4fa0fbe Add missing <chrono> header in f32-softargmax benchmark by Marat Dukhan · 4 years, 9 months ago
- 791d01d Minor revision in README by Marat Dukhan · 4 years, 9 months ago
- a5977df Fix CMake build by Marat Dukhan · 4 years, 9 months ago
- 64a5bfe A53 6x8 IGEMM kernel prefetch by Frank Barchard · 4 years, 9 months ago
- 5a9c48d Merge pull request #82 from AshkanAliabadi:master by XNNPACK Team · 4 years, 9 months ago
- 95b2243 PReLU benchmark on characteristic arguments of ImageNet classifiers by Marat Dukhan · 4 years, 9 months ago
- bd1d5d9 6x8 A53 GEMM use prefetch. by Frank Barchard · 4 years, 9 months ago
- b864235 Refactor BUILD.bazel file by Marat Dukhan · 4 years, 9 months ago
- 8c19e3c Internal change by Marat Dukhan · 4 years, 9 months ago
- ab7424a Fix typos in Deconvolution operator tests by Marat Dukhan · 4 years, 9 months ago
- f568f08 Support Convolution, Deconvolution, and Fully Connected operators without bias by Marat Dukhan · 4 years, 9 months ago
- d70028a Update pthreadpool revision by Marat Dukhan · 4 years, 9 months ago
- 263bb09 Cortex A76 use 6x8 micro kernel by Frank Barchard · 4 years, 9 months ago
- feb4923 AVX512F exp implementation based on PERM2 by Marat Dukhan · 4 years, 9 months ago
- ba7c3bb Merge generate-f32-gemminc.sh script into generate-f32-gemm.sh by Marat Dukhan · 4 years, 9 months ago
- 918a4a6 Extract common parts of test generators into a separate file by Marat Dukhan · 4 years, 9 months ago
- 00bf68e A53 6x8 GEMM unrolled by Frank Barchard · 4 years, 9 months ago
- c452eb1 Re-generate SpMM micro-kernels by Marat Dukhan · 4 years, 9 months ago
- ae777b4 4x8 a53 eliminate pushes to stack by Frank Barchard · 4 years, 9 months ago
- e0601b5 Sort include order for params-init.h and log.h by Frank Barchard · 4 years, 9 months ago
- 4a2bbc6 Benchmark for Two-Pass Softmax algorithm by Marat Dukhan · 4 years, 9 months ago
- eeaa7bd Refactor initialization of micro-kernel parameters by Marat Dukhan · 4 years, 9 months ago
- 6f8d4d3 RADDEXTEXP and VSCALEEXTEXP micro-kernels for AVX2 and AVX512F by Marat Dukhan · 4 years, 9 months ago
- b3c6c6e 6x8 A53 remove pushes for NEON by Frank Barchard · 4 years, 9 months ago
- 46fb807 4x8 A53 GEMM, and GEMMINC unpipelined microkernels. by Frank Barchard · 4 years, 9 months ago
- cd945c6 Re-enable swizzle GEMM/IGEMM micro-kernels in WAsm SIMD on ARM by Marat Dukhan · 4 years, 9 months ago
- f753a7d Rename BUILD to BUILD.bazel by Marat Dukhan · 4 years, 9 months ago
- c4ae7de Propagate IGEMM SR argument to weights packing in Deconvolution operator by Marat Dukhan · 4 years, 9 months ago
- c6afd9b Add blocked scalar spmm kernels. by Erich Elsen · 4 years, 9 months ago
- 7892d97 Fix softargmax-bench CMake target. by Ashkan Aliabadi · 4 years, 9 months ago
- 8440fde Support TF-style SAME padding via explicit flag by Marat Dukhan · 4 years, 9 months ago
- bff791e Use 8x1 SpMM micro-kernel on WebAssembly by Marat Dukhan · 4 years, 9 months ago
- 32c74f7 Fix xnn_f32_gavgpool_spchw_ukernel__scalar_x1 test cases by Marat Dukhan · 4 years, 9 months ago
- 14fe0b2 Enable sparse MobileNet v1/v2 operators on WebAssembly by Marat Dukhan · 4 years, 9 months ago
- a7fb855 6x8 A53 GEMM, GEMMINC and IGEMM unpipelined microkernels. by Frank Barchard · 4 years, 9 months ago
- 563df5f Add scalar version of hwc2spchw convolution. by Erich Elsen · 4 years, 10 months ago
- 98ba441 Vectorized extexp functions by Marat Dukhan · 4 years, 10 months ago
- cb80197 Disable GEMM/IGEMM micro-kernels with swizzle by Marat Dukhan · 4 years, 10 months ago
- 4232323 Unify naming of functions in benchmark::utils:: by Marat Dukhan · 4 years, 10 months ago
- 31a98d7 Remove warnings about inefficient padding parameters in Convolution by Marat Dukhan · 4 years, 10 months ago
- 1756f9e Propagate GEMM/IGEMM SR argument to weights packing in Fully Connected operator by Marat Dukhan · 4 years, 10 months ago
- e0df831 Remove trailing whitespace by Marat Dukhan · 4 years, 10 months ago
- 07cb676 Refactor initialization of even/odd masks in parameters for SpCHW micro-kernels by Marat Dukhan · 4 years, 10 months ago
- 838c8e3 Refactor initialization of masks in parameters for SpCHW micro-kernels by Marat Dukhan · 4 years, 10 months ago
- caf8544 LD64/LD128 kernels remove all pushes (d8-d15) Remap d12-d15 to d16-d19 by Frank Barchard · 4 years, 10 months ago
- fcfdc0e Automated g4 rollback of changelist 274728310. by Frank Barchard · 4 years, 10 months ago
- 05ac8e3 VSCALE microkernel and SoftMax Three-Pass algorithm with Reloading by Marat Dukhan · 4 years, 10 months ago
- 4a4a7fa Three-Pass Softargmax benchmark (recomputing version) by Marat Dukhan · 4 years, 10 months ago
- 8e3c551 1x8 a53 kernel refactor based on a57. by Frank Barchard · 4 years, 10 months ago
- baa9ead Update assembly Copyright notice to // comment by Frank Barchard · 4 years, 10 months ago
- 9757953 Refactor and open-source Three-Pass Softmax micro-kernels by Marat Dukhan · 4 years, 10 months ago
- 459c9fc 6x8 and a53 kernel comments. by Frank Barchard · 4 years, 10 months ago
- 515c977 Refactor and open-source vectorized expminus function by Marat Dukhan · 4 years, 10 months ago
- f6839e1 Refactor vectorized exp functions by Marat Dukhan · 4 years, 10 months ago
- 2af471b Switch default intrinsics kernel to 6x8 by Frank Barchard · 4 years, 10 months ago
- 9cdade3 Add prefetch instructions to 16x1, 16x2, 16x4 kernels. by Erich Elsen · 4 years, 10 months ago
- 34dc2c0 Add gavgpool_spchw_scalar__x1 kernel. by Erich Elsen · 4 years, 10 months ago
- ac4de80 Add chw 3x3s2p1_scalar kernels. by Erich Elsen · 4 years, 10 months ago
- 0cc2c53 add 3x3p1_scalar kernel by Erich Elsen · 4 years, 10 months ago
- a5ca10e Neon intrinsics clamping - Replace 2 LD1R with 1 LD2R by Frank Barchard · 4 years, 10 months ago
- 6adff4e Vectorized implementations of expf function for AVX2 and AVX512F by Marat Dukhan · 4 years, 10 months ago
- bd9e495 Remove 4x12 intrinsics kernels. by Frank Barchard · 4 years, 10 months ago
- 7e95597 Pass XNN_ENABLE_ASSEMBLY for all tests and kernel benchmarks by Frank Barchard · 4 years, 10 months ago
- 8fe54e4 Extra :xnnpack_operators_nhwc_f32 target with only F32 operators in NHWC layout by Marat Dukhan · 4 years, 10 months ago
- 810171d Enable assembly by default. by Frank Barchard · 4 years, 10 months ago
- 21be34f 1x8 A53 GEMM, GEMMINC and IGEMM microkernels. by Frank Barchard · 4 years, 10 months ago
- db45b6a 1x8 neonfma IGEMM microkernel and 1x8 benchmarks. by Frank Barchard · 4 years, 10 months ago
- 174706e Fix misleading comments for debug_build/optimized_build Bazel configs by Marat Dukhan · 4 years, 10 months ago
- dbafc58 extend build flag --define=xnn_enable_assembly=true to GEMM and IGEMM benchmarks. by Frank Barchard · 4 years, 10 months ago
- 4e0249a Add performance results on MobileNets & Pixel phones by Marat Dukhan · 4 years, 10 months ago
- 466b523 Use GEMM/IGEMM micro-kernels with Swizzle on WAsm SIMD by Marat Dukhan · 4 years, 10 months ago
- f633c2c Fix Bazel symlinks in .gitignore by Marat Dukhan · 4 years, 10 months ago
- 523448a Add .gitignore file by Marat Dukhan · 4 years, 10 months ago
- 2dbdc2f CMake build configurations by Marat Dukhan · 4 years, 10 months ago