1. eb7256b Port F32 GEMM A75 1x8 microkernel to JIT and specialize for min/max, add tests and benchmarks by Zhi An Ng · 2 years, 8 months ago
  2. 9fd2f3e Fix passing of kc JIT generator in F32 GEMM benchmarks by Zhi An Ng · 2 years, 8 months ago
  3. f9fc9ec Integrate JIT generated GEMM microkernels into create_convolution2d_nhwc by Zhi An Ng · 2 years, 8 months ago
  4. fbd67a7 Pad K to a multiple of SR in GEMM/IGEMM microkernels by Marat Dukhan · 2 years, 8 months ago
  5. 8b758bf Integrate JIT generated GEMM microkernels into create_convolution2d_nhwc by XNNPACK Team · 2 years, 8 months ago
  6. df51e11 Integrate JIT generated GEMM microkernels into create_convolution2d_nhwc by Zhi An Ng · 2 years, 8 months ago
  7. d236074 Add F32 GEMM 6x8 aarch64 neonfma cortex a75 JIT microkernel to benchmark by Zhi An Ng · 2 years, 8 months ago
  8. 70ea0a2 Specialize F32 GEMM A53 JIT microkernel for min/max params by Zhi An Ng · 2 years, 8 months ago
  9. 83844ae Change JIT generator signature to accept nc and kc to specialize on those values by Zhi An Ng · 2 years, 8 months ago
  10. 25764d8 Add JIT microkernels to bench/f32-gemm by Zhi An Ng · 2 years, 9 months ago
  11. 7873586 Rename PLD to PRFM for aarch32 microkernels. by Frank Barchard · 2 years, 9 months ago
  12. 4c61779 Minimally support WebAssembly Relaxed SIMD builds by Marat Dukhan · 2 years, 9 months ago
  13. e13e639 Align packed weights on 64 bytes in microkernel benchmarks by Marat Dukhan · 3 years, 2 months ago
  14. 143a110 Rename GEMM/IGEMM microkernels from Cortex-A57/A75 to prfm_cortex_a75 by Frank Barchard · 3 years, 3 months ago
  15. 0b04374 Support QC8 GEMM microkernels by Marat Dukhan · 3 years, 4 months ago
  16. 8102593 Update GEMM benchmarks by Marat Dukhan · 3 years, 4 months ago
  17. 104ae5e Use ISA-specific layouts in F32 [I]GEMM & DWCONV microkernels by Marat Dukhan · 3 years, 4 months ago
  18. f56f4c4 Refactor interface of microkernel parameter initialization by Marat Dukhan · 3 years, 4 months ago
  19. 802fcae Additional SSE/SSE2 GEMM/IGEMM microkernels by Marat Dukhan · 3 years, 10 months ago
  20. 0725b8d Rename WebAssembly SIMD source files and functions with x86 or arm suffix after wasmsimd by Frank Barchard · 3 years, 10 months ago
  21. 4e89587 Guard microbenchmarks against running microkernels on incompatible CPUs by Marat Dukhan · 3 years, 10 months ago
  22. d713e8a Refactor microbenchmarks by Marat Dukhan · 3 years, 10 months ago
  23. 44f0ca7 Bind RNG by reference in microbenchmarks by Marat Dukhan · 4 years, 2 months ago
  24. 92162da Remove PSIMD versions of PPMM and PACKX microkernels by Marat Dukhan · 4 years, 2 months ago
  25. 0d0d882 WAsm SIMD version of PPMM microkernel by Marat Dukhan · 4 years, 2 months ago
  26. 115d3e2 Remove PSIMD variants of GEMM and IGEMM microkernels by Marat Dukhan · 4 years, 2 months ago
  27. 490febe Cortex A7 microkernel based on LD64 with PLD added. 3.2% faster in end to end mobilenet v2 by Frank Barchard · 4 years, 2 months ago
  28. b42f866 Unify interface of weights packing functions by Marat Dukhan · 4 years, 3 months ago
  29. 569561d Generate PLD variation of AARCH32 LD64 by Frank Barchard · 4 years, 3 months ago
  30. 1bbf96b GEMM/IGEMM implementations in WAsm SIMD intrinsics by Marat Dukhan · 4 years, 3 months ago
  31. 016e586 iOS use Cortex-A75 microkernel which avoids x18 register by Frank Barchard · 4 years, 4 months ago
  32. 3b98f6b 4x4 LD64 GEMM+MINMAX microkernel in AArch32+VFP assembly by Marat Dukhan · 4 years, 4 months ago
  33. e70dbeb Rename minmax_params to params for variables. by Frank Barchard · 4 years, 5 months ago
  34. b026e22 Adapt to one last ruy API change. by Benoit Jacob · 4 years, 5 months ago
  35. 349701a Adapt XNNPACK to the new ruy interface. by Benoit Jacob · 4 years, 5 months ago
  36. 29c6b26 Exlude PSIMD micro-kernels from the MSVC/ICC build by Marat Dukhan · 4 years, 5 months ago
  37. 3cb54f9 1x8 LD64 F32 GEMM by Frank Barchard · 4 years, 6 months ago
  38. de06f49 Add MINMAX suffix to GEMM/IGEMM/DWCONV/PPMM micro-kernel names by Marat Dukhan · 4 years, 6 months ago
  39. eb09a6b Rename F32/U8 output params to minmax params by Marat Dukhan · 4 years, 6 months ago
  40. b038fdc Adapt XNNPACK to the move of ruy to its own GitHub repository. by Benoit Jacob · 4 years, 6 months ago
  41. 0d1052c iOS 6x8 microkernel based on Cortex-A75 but with X18 avoided. by Frank Barchard · 4 years, 6 months ago
  42. 8fb9055 4x8 GEMM and IGEMM microkernels for Cortex A55. 7.8% faster for e2e mobile net v2. by Frank Barchard · 4 years, 6 months ago
  43. b7dd29e 4x8 GEMM and IGEMM microkernels for AARCH32 Cortex A55. 11.5% faster end to end: by Frank Barchard · 4 years, 7 months ago
  44. 91e1999 6x8 GEMM and IGEMM microkernels for Cortex A55. 9% faster end to end: by Frank Barchard · 4 years, 7 months ago
  45. 387c2d1 Generate A57 micro-kernels from A75 source. by Frank Barchard · 4 years, 9 months ago
  46. 9f7d555 Prefetch version of the aarch32 a75 GEMM kernel by Frank Barchard · 4 years, 10 months ago
  47. 1391604 Initial Cortex A53 kernel for aarch32 by Frank Barchard · 4 years, 10 months ago
  48. 2712132 FMA3 microkernels with 4-wide shuffle by Marat Dukhan · 4 years, 10 months ago
  49. 3e237f2 AARCH32 4x8 for Cortex A75 by Frank Barchard · 4 years, 10 months ago
  50. 8b0f026 AARCH32 4x8 NEON GEMM Assembly version of 4x8 for 32 bit ARM. Based on LD64. by Frank Barchard · 4 years, 10 months ago
  51. 0f349c4 AVX512F implementation of GEMM & IGEMM micro-kernels by Marat Dukhan · 4 years, 10 months ago
  52. 69172d9 6x8 ld128 GEMM microkernels by Frank Barchard · 4 years, 10 months ago
  53. c8466f5 Add checks for target ISA in microbenchmarks by Marat Dukhan · 4 years, 10 months ago
  54. 5243bb0 DUP Neon GEMM kernels for Exynos by Frank Barchard · 4 years, 10 months ago
  55. 91317c5 Rename neon intrinsics to lane. by Frank Barchard · 4 years, 10 months ago
  56. fda12b8 AVX and FMA3 microkernels for GEMM/GEMMINC/IGEMM by Marat Dukhan · 4 years, 10 months ago
  57. df06d80 Neon shuffle GEMM and IGEMM kernels. by Frank Barchard · 4 years, 10 months ago
  58. 95bebc9 Benchmarks rename sgemm and sppmm to f32_gemm and f32_ppmm by Frank Barchard · 4 years, 10 months ago
  59. eeaa7bd Refactor initialization of micro-kernel parameters by Marat Dukhan · 5 years ago
  60. 46fb807 4x8 A53 GEMM, and GEMMINC unpipelined microkernels. by Frank Barchard · 5 years ago
  61. a7fb855 6x8 A53 GEMM, GEMMINC and IGEMM unpipelined microkernels. by Frank Barchard · 5 years ago
  62. 4232323 Unify naming of functions in benchmark::utils:: by Marat Dukhan · 5 years ago
  63. bd9e495 Remove 4x12 intrinsics kernels. by Frank Barchard · 5 years ago
  64. 21be34f 1x8 A53 GEMM, GEMMINC and IGEMM microkernels. by Frank Barchard · 5 years ago
  65. dbafc58 extend build flag --define=xnn_enable_assembly=true to GEMM and IGEMM benchmarks. by Frank Barchard · 5 years ago
  66. 33f0c7a Guard Ruy and GemmLowp benchmarks by Marat Dukhan · 5 years ago
  67. d62f3cc Avoid using cpuinfo_get_max_cache_size() function by Marat Dukhan · 5 years ago
  68. 1dadbf7 Limit direct dependencies on cpuinfo by Marat Dukhan · 5 years ago
  69. bb4c18b Report Freq in additional benchmarks by Frank Barchard · 5 years ago
  70. b455b12 Initial open-source release by XNNPACK Team · 5 years ago