  1. f30a859 Port aarch64 F32 IGEMM 1x8 A75 microkernel to JIT, add tests, benchmarks, enable in init.c if JIT is enabled by Zhi An Ng · 2 years, 5 months ago
  2. 1425eb5 Copy IGEMM benchmark code into JIT's IGEMM benchmark code, and add JIT aarch64 generators to benchmarks by Zhi An Ng · 2 years, 5 months ago
  3. 2188833 Fix F32 IGEMM benchmark loop to not require capping NC to NR by Zhi An Ng · 2 years, 5 months ago
  4. fbd67a7 Pad K to a multiple of SR in GEMM/IGEMM microkernels by Marat Dukhan · 2 years, 5 months ago
  5. 0ec25cf Duplicate test methods in gemm-microkernel-test for JIT codegen, update IGEMM generator signature and test generation script. by Zhi An Ng · 2 years, 5 months ago
  6. 83844ae Change JIT generator signature to accept nc and kc to specialize on those values by Zhi An Ng · 2 years, 5 months ago
  7. 665cb23 Add JIT microkernels to F32 IGEMM benchmarks by Zhi An Ng · 2 years, 5 months ago
  8. 7873586 Rename PLD to PRFM for aarch32 microkernels. by Frank Barchard · 2 years, 6 months ago
  9. 4c61779 Minimally support WebAssembly Relaxed SIMD builds by Marat Dukhan · 2 years, 6 months ago
  10. 4c3e5a9 GEMM benchmark assembly microkernels before intrinsics. by Frank Barchard · 2 years, 10 months ago
  11. e13e639 Align packed weights on 64 bytes in microkernel benchmarks by Marat Dukhan · 2 years, 11 months ago
  12. 79cd5f9 FP32 LD128 IGEMM for Cortex X1 by Frank Barchard · 3 years ago
  13. 143a110 Rename GEMM/IGEMM microkernels from Cortex-A57/A75 to prfm_cortex_a75 by Frank Barchard · 3 years ago
  14. e349124 fp32 IGEMM 4x8 and 6x8 ld64 microkernels by Frank Barchard · 3 years ago
  15. e06c813 Support QC8 IGEMM microkernels by Marat Dukhan · 3 years, 1 month ago
  16. 104ae5e Use ISA-specific layouts in F32 [I]GEMM & DWCONV microkernels by Marat Dukhan · 3 years, 1 month ago
  17. f56f4c4 Refactor interface of microkernel parameter initialization by Marat Dukhan · 3 years, 1 month ago
  18. 802fcae Additional SSE/SSE2 GEMM/IGEMM microkernels by Marat Dukhan · 3 years, 6 months ago
  19. 0725b8d Rename WebAssembly SIMD source files and functions with x86 or arm suffix after wasmsimd by Frank Barchard · 3 years, 7 months ago
  20. 4e89587 Guard microbenchmarks against running microkernels on incompatible CPUs by Marat Dukhan · 3 years, 7 months ago
  21. d713e8a Refactor microbenchmarks by Marat Dukhan · 3 years, 7 months ago
  22. 44f0ca7 Bind RNG by reference in microbenchmarks by Marat Dukhan · 3 years, 11 months ago
  23. 115d3e2 Remove PSIMD variants of GEMM and IGEMM microkernels by Marat Dukhan · 4 years ago
  24. 490febe Cortex A7 microkernel based on LD64 with PLD added. 3.2% faster in end to end mobilenet v2 by Frank Barchard · 4 years ago
  25. b42f866 Unify interface of weights packing functions by Marat Dukhan · 4 years ago
  26. 569561d Generate PLD variation of AARCH32 LD64 by Frank Barchard · 4 years ago
  27. 1bbf96b GEMM/IGEMM implementations in WAsm SIMD intrinsics by Marat Dukhan · 4 years ago
  28. 016e586 iOS use Cortex-A75 microkernel which avoids x18 register by Frank Barchard · 4 years ago
  29. e70dbeb Rename minmax_params to params for variables. by Frank Barchard · 4 years, 2 months ago
  30. 29c6b26 Exclude PSIMD micro-kernels from the MSVC/ICC build by Marat Dukhan · 4 years, 2 months ago
  31. de06f49 Add MINMAX suffix to GEMM/IGEMM/DWCONV/PPMM micro-kernel names by Marat Dukhan · 4 years, 2 months ago
  32. eb09a6b Rename F32/U8 output params to minmax params by Marat Dukhan · 4 years, 2 months ago
  33. 0d1052c iOS 6x8 microkernel based on Cortex-A75 but with X18 avoided. by Frank Barchard · 4 years, 3 months ago
  34. 8fb9055 4x8 GEMM and IGEMM microkernels for Cortex A55. 7.8% faster for e2e mobile net v2. by Frank Barchard · 4 years, 3 months ago
  35. b7dd29e 4x8 GEMM and IGEMM microkernels for AARCH32 Cortex A55. 11.5% faster end to end: by Frank Barchard · 4 years, 3 months ago
  36. 91e1999 6x8 GEMM and IGEMM microkernels for Cortex A55. 9% faster end to end: by Frank Barchard · 4 years, 3 months ago
  37. c87a8fd Cortex A53 IGEMM 32 bit ARM by Frank Barchard · 4 years, 4 months ago
  38. 90ce789 Cortex A75 IGEMM 32 bit ARM. by Frank Barchard · 4 years, 4 months ago
  39. dc38f07 LD64 IGEMM 32 bit ARM by Frank Barchard · 4 years, 4 months ago
  40. a7b22c1 Fix wrong MR specifications in IGEMM benchmark by Marat Dukhan · 4 years, 5 months ago
  41. 387c2d1 Generate A57 micro-kernels from A75 source. by Frank Barchard · 4 years, 6 months ago
  42. 0f349c4 AVX512F implementation of GEMM & IGEMM micro-kernels by Marat Dukhan · 4 years, 7 months ago
  43. 69172d9 6x8 ld128 GEMM microkernels by Frank Barchard · 4 years, 7 months ago
  44. c8466f5 Add checks for target ISA in microbenchmarks by Marat Dukhan · 4 years, 7 months ago
  45. 5243bb0 DUP Neon GEMM kernels for Exynos by Frank Barchard · 4 years, 7 months ago
  46. 91317c5 Rename neon intrinsics to lane. by Frank Barchard · 4 years, 7 months ago
  47. fda12b8 AVX and FMA3 microkernels for GEMM/GEMMINC/IGEMM by Marat Dukhan · 4 years, 7 months ago
  48. df06d80 Neon shuffle GEMM and IGEMM kernels. by Frank Barchard · 4 years, 7 months ago
  49. eeaa7bd Refactor initialization of micro-kernel parameters by Marat Dukhan · 4 years, 8 months ago
  50. 46fb807 4x8 A53 GEMM, and GEMMINC unpipelined microkernels. by Frank Barchard · 4 years, 8 months ago
  51. a7fb855 6x8 A53 GEMM, GEMMINC and IGEMM unpipelined microkernels. by Frank Barchard · 4 years, 8 months ago
  52. 4232323 Unify naming of functions in benchmark::utils:: by Marat Dukhan · 4 years, 8 months ago
  53. bd9e495 Remove 4x12 intrinsics kernels. by Frank Barchard · 4 years, 8 months ago
  54. 21be34f 1x8 A53 GEMM, GEMMINC and IGEMM microkernels. by Frank Barchard · 4 years, 9 months ago
  55. db45b6a 1x8 neonfma IGEMM microkernel and 1x8 benchmarks. by Frank Barchard · 4 years, 9 months ago
  56. dbafc58 extend build flag --define=xnn_enable_assembly=true to GEMM and IGEMM benchmarks. by Frank Barchard · 4 years, 9 months ago
  57. d62f3cc Avoid using cpuinfo_get_max_cache_size() function by Marat Dukhan · 4 years, 9 months ago
  58. 1dadbf7 Limit direct dependencies on cpuinfo by Marat Dukhan · 4 years, 9 months ago
  59. b455b12 Initial open-source release by XNNPACK Team · 4 years, 9 months ago