1. b177732 Remove prefetch of output buffer from A53 kernels. by Frank Barchard · 4 years, 9 months ago
  2. 279908a A75 / A53 aarch32 epilogue reordered by B the same as main loop. by Frank Barchard · 4 years, 9 months ago
  3. 387c2d1 Generate A57 micro-kernels from A75 source. by Frank Barchard · 4 years, 10 months ago
  4. 0090f5b 4x8 FMA sorted by B to match load order by Frank Barchard · 4 years, 10 months ago
  5. abf8154 Code generator for PLD and non-PLD versions of aarch32 4x8 Cortex-A75 kernel by Frank Barchard · 4 years, 10 months ago
  6. 07efec4 Run generator for A73 kernel NOP by Frank Barchard · 4 years, 10 months ago
  7. 73ccfb4 Move SUBS to 2nd instruction of clamp code. by Frank Barchard · 4 years, 10 months ago
  8. c659140 a73 kernel move SUBS before clamp and add NOP before branch by Frank Barchard · 4 years, 10 months ago
  9. d94b856 Rename strided gemm and igemm fma3 broadcasts. by Ashkan Aliabadi · 4 years, 10 months ago
  10. 2712132 FMA3 microkernels with 4-wide shuffle by Marat Dukhan · 4 years, 10 months ago
  11. eccfd71 NR=16 GEMM and IGEMM micro-kernels in AVX and FMA3 implementations by Marat Dukhan · 4 years, 10 months ago
  12. cfb3134 Polyfill missing _cvtu32_mask16 intrinsic on old gcc by Marat Dukhan · 4 years, 10 months ago
  13. 6383f49 Assembly GEMM kernel NC loop use SUBS instead of CMP+SUBS by Frank Barchard · 4 years, 10 months ago
  14. 436ebe6 Separate WAsm micro-kernels and scalar micro-kernels by Marat Dukhan · 4 years, 10 months ago
  15. 0f349c4 AVX512F implementation of GEMM & IGEMM micro-kernels by Marat Dukhan · 4 years, 10 months ago
  16. c72fa1e Use XNN_ARCH_* macros for architecture-specific parts in micro-kernels by Marat Dukhan · 4 years, 10 months ago
  17. 69172d9 6x8 ld128 GEMM microkernels by Frank Barchard · 4 years, 10 months ago
  18. 40a672f Move generated micro-kernels into a subdirectory by Marat Dukhan · 4 years, 10 months ago