- eb7256b Port F32 GEMM A75 1x8 microkernel to JIT and specialize for min/max, add tests and benchmarks by Zhi An Ng · 2 years, 8 months ago
- c2e2da8 Fix conversion script for aarch64 assembly kernels and convert a single F32 GEMM as a test by Zhi An Ng · 2 years, 8 months ago
- 0ec25cf Duplicate test methods in gemm-microkernel-test for JIT codegen, update IGEMM generator signature and test generation script. by Zhi An Ng · 2 years, 8 months ago
- e7225eb Specialize F32 GEMM (a53) on kc by Zhi An Ng · 2 years, 8 months ago
- 83844ae Change JIT generator signature to accept nc and kc to specialize on those values by Zhi An Ng · 2 years, 9 months ago
- af9ff85 Fix GEMM test templates to use variable n instead of fixed NR and regenerate tests by Zhi An Ng · 2 years, 9 months ago
- 4c1fd6f Allow generate-gemm-test.py to accept multiple output files, and shard the generated tests across all specified output files. by Zhi An Ng · 2 years, 9 months ago
- 13b57dd Add more converted microkernels used in init.c. by Zhi An Ng · 2 years, 9 months ago
- 7873586 Rename PLD to PRFM for aarch32 microkernels. by Frank Barchard · 2 years, 9 months ago
- c83ef3b Refactor F32 MINMAX parameters for WAsm SIMD by Marat Dukhan · 2 years, 9 months ago
- b43b47a Add a script to convert existing assembly microkernels to JIT codegen. by Zhi An Ng · 2 years, 9 months ago
- c80ffb0 Fix generation of gemm tests for ADJBLOCK and rerun scripts. by Zhi An Ng · 2 years, 9 months ago
- 4c61779 Minimally support WebAssembly Relaxed SIMD builds by Marat Dukhan · 2 years, 9 months ago
- 143a110 Rename GEMM/IGEMM microkernels from Cortex-A57/A75 to prfm_cortex_a75 by Frank Barchard · 3 years, 3 months ago
- 104ae5e Use ISA-specific layouts in F32 [I]GEMM & DWCONV microkernels by Marat Dukhan · 3 years, 4 months ago
- d5694df Use pointer to parameter initialization function in GEMM/IGEMM/DWCONV microkernel tests by Marat Dukhan · 3 years, 4 months ago
- 802fcae Additional SSE/SSE2 GEMM/IGEMM microkernels by Marat Dukhan · 3 years, 10 months ago
- 0725b8d Rename WebAssembly SIMD source files and functions with x86 or arm suffix after wasmsimd by Frank Barchard · 3 years, 10 months ago
- 115d3e2 Remove PSIMD variants of GEMM and IGEMM microkernels by Marat Dukhan · 4 years, 2 months ago
- 490febe Cortex A7 microkernel based on LD64 with PLD added. 3.2% faster in end to end mobilenet v2 by Frank Barchard · 4 years, 2 months ago
- e39e646 WAsm SIMD versions of [I]GEMM microkernels with NR=2 by Marat Dukhan · 4 years, 3 months ago
- 47387d6 Fix parameter initializion and WAsm microkernel tests by Marat Dukhan · 4 years, 3 months ago
- fb5b20a Enable WAsm MVP tests in WAsm SIMD builds by Marat Dukhan · 4 years, 3 months ago
- 569561d Generate PLD variation of AARCH32 LD64 by Frank Barchard · 4 years, 3 months ago
- 1bbf96b GEMM/IGEMM implementations in WAsm SIMD intrinsics by Marat Dukhan · 4 years, 3 months ago
- 016e586 iOS use Cortex-A75 microkernel which avoids x18 register by Frank Barchard · 4 years, 4 months ago
- 32f9381 4x4 LD64 GEMM microkernel in AArch32+VFP assembly by Marat Dukhan · 4 years, 4 months ago
- 3b98f6b 4x4 LD64 GEMM+MINMAX microkernel in AArch32+VFP assembly by Marat Dukhan · 4 years, 4 months ago
- f606806 Add missing "assembly: true" specifications by Marat Dukhan · 4 years, 4 months ago
- 29c6b26 Exlude PSIMD micro-kernels from the MSVC/ICC build by Marat Dukhan · 4 years, 6 months ago
- 3cb54f9 1x8 LD64 F32 GEMM by Frank Barchard · 4 years, 6 months ago
- de06f49 Add MINMAX suffix to GEMM/IGEMM/DWCONV/PPMM micro-kernel names by Marat Dukhan · 4 years, 6 months ago
- 1c58711 Add MINMAX suffix to filenames of GEMM/IGEMM/PPMM/DWCONV micro-kernels by Marat Dukhan · 4 years, 6 months ago[Renamed (99%) from test/f32-gemm.cc]
- 0d1052c iOS 6x8 microkernel based on Cortex-A75 but with X18 avoided. by Frank Barchard · 4 years, 6 months ago
- 8fb9055 4x8 GEMM and IGEMM microkernels for Cortex A55. 7.8% faster for e2e mobile net v2. by Frank Barchard · 4 years, 6 months ago
- b7dd29e 4x8 GEMM and IGEMM microkernels for AARCH32 Cortex A55. 11.5% faster end to end: by Frank Barchard · 4 years, 7 months ago
- 91e1999 6x8 GEMM and IGEMM microkernels for Cortex A55. 9% faster end to end: by Frank Barchard · 4 years, 7 months ago
- b00004d 4x2c4 GEMM micro-kernels for PSIMD and SSE by Marat Dukhan · 4 years, 8 months ago
- ca27b40 Add pipelined to gemm tests for aarch32. by Frank Barchard · 4 years, 8 months ago
- 387c2d1 Generate A57 micro-kernels from A75 source. by Frank Barchard · 4 years, 10 months ago
- 9f7d555 Prefetch version of the aarch32 a75 GEMM kernel by Frank Barchard · 4 years, 10 months ago
- 1391604 Initial Cortex A53 kernel for aarch32 by Frank Barchard · 4 years, 10 months ago
- 2712132 FMA3 microkernels with 4-wide shuffle by Marat Dukhan · 4 years, 10 months ago
- eccfd71 NR=16 GEMM and IGEMM micro-kernels in AVX and FMA3 implementations by Marat Dukhan · 4 years, 10 months ago
- 3e237f2 AARCH32 4x8 for Cortex A75 by Frank Barchard · 4 years, 10 months ago
- 436ebe6 Separate WAsm micro-kernels and scalar micro-kernels by Marat Dukhan · 4 years, 10 months ago
- 8b0f026 AARCH32 4x8 NEON GEMM Assembly version of 4x8 for 32 bit ARM. Based on LD64. by Frank Barchard · 4 years, 10 months ago
- 0f349c4 AVX512F implementation of GEMM & IGEMM micro-kernels by Marat Dukhan · 4 years, 10 months ago
- 69172d9 6x8 ld128 GEMM microkernels by Frank Barchard · 4 years, 10 months ago
- 5243bb0 DUP Neon GEMM kernels for Exynos by Frank Barchard · 4 years, 10 months ago
- 91317c5 Rename neon intrinsics to lane. by Frank Barchard · 4 years, 10 months ago
- fda12b8 AVX and FMA3 microkernels for GEMM/GEMMINC/IGEMM by Marat Dukhan · 4 years, 10 months ago
- df06d80 Neon shuffle GEMM and IGEMM kernels. by Frank Barchard · 4 years, 10 months ago
- 0ecc2ab 4x8 GEMM for Cortex A53 by Frank Barchard · 4 years, 11 months ago
- e64f91a Pipelined 6x8 GEMM for Cortex A53 by Frank Barchard · 4 years, 11 months ago
- bd41971 A57 branch a version of A53 kernel by Frank Barchard · 5 years ago
- 00bf68e A53 6x8 GEMM unrolled by Frank Barchard · 5 years ago
- 46fb807 4x8 A53 GEMM, and GEMMINC unpipelined microkernels. by Frank Barchard · 5 years ago
- a7fb855 6x8 A53 GEMM, GEMMINC and IGEMM unpipelined microkernels. by Frank Barchard · 5 years ago
- bd9e495 Remove 4x12 intrinsics kernels. by Frank Barchard · 5 years ago
- 7e95597 Pass XNN_ENABLE_ASSEMBLY for all tests and kernel benchmarks by Frank Barchard · 5 years ago
- 21be34f 1x8 A53 GEMM, GEMMINC and IGEMM microkernels. by Frank Barchard · 5 years ago
- 1dadbf7 Limit direct dependencies on cpuinfo by Marat Dukhan · 5 years ago
- b455b12 Initial open-source release by XNNPACK Team · 5 years ago