1. b177732 Remove prefetch of output buffer from A53 kernels. by Frank Barchard · 4 years, 6 months ago
  2. 279908a A75 / A53 aarch32 epilogue reordered by B the same as main loop. by Frank Barchard · 4 years, 6 months ago
  3. 387c2d1 Generate A57 micro-kernels from A75 source. by Frank Barchard · 4 years, 7 months ago
  4. 005feb8 A53 push r1, r2 so they can be used as scratch. Reorder FMA by B by Frank Barchard · 4 years, 7 months ago
  5. 0090f5b 4x8 FMA sorted by B to match load order by Frank Barchard · 4 years, 7 months ago
  6. abf8154 Code generator for PLD and non-PLD versions of aarch32 4x8 Cortex-A75 kernel by Frank Barchard · 4 years, 7 months ago
  7. 07efec4 Run generator for A73 kernel NOP by Frank Barchard · 4 years, 7 months ago
  8. 9f7d555 Prefetch version of the aarch32 a75 GEMM kernel by Frank Barchard · 4 years, 7 months ago
  9. 73ccfb4 Move SUBS to 2nd instruction of clamp code. by Frank Barchard · 4 years, 7 months ago
  10. c659140 a73 kernel move SUBS before clamp and add NOP before branch by Frank Barchard · 4 years, 7 months ago
  11. 1391604 Initial Cortex A53 kernel for aarch32 by Frank Barchard · 4 years, 7 months ago
  12. d94b856 Rename strided gemm and igemm fma3 broadcasts. by Ashkan Aliabadi · 4 years, 7 months ago
  13. b738ad2 fix for linux arm 32 bit by Frank Barchard · 4 years, 7 months ago
  14. 2712132 FMA3 microkernels with 4-wide shuffle by Marat Dukhan · 4 years, 7 months ago
  15. eccfd71 NR=16 GEMM and IGEMM micro-kernels in AVX and FMA3 implementations by Marat Dukhan · 4 years, 7 months ago
  16. cfb3134 Polyfill missing _cvtu32_mask16 intrinsic on old gcc by Marat Dukhan · 4 years, 7 months ago
  17. 3e237f2 AARCH32 4x8 for Cortex A75 by Frank Barchard · 4 years, 7 months ago
  18. f917cbd AARCH32 4x8 LD64 stores simplified by Frank Barchard · 4 years, 7 months ago
  19. 6383f49 Assembly GEMM kernel NC loop use SUBS instead of CMP+SUBS by Frank Barchard · 4 years, 7 months ago
  20. 436ebe6 Separate WAsm micro-kernels and scalar micro-kernels by Marat Dukhan · 4 years, 7 months ago
  21. 61cad89 AARCH32 4x8 GEMM kernel fully register based. by Frank Barchard · 4 years, 7 months ago
  22. 72d6afb AARCH32 4x8 kernel code clean up. by Frank Barchard · 4 years, 7 months ago
  23. 8b0f026 AARCH32 4x8 NEON GEMM Assembly version of 4x8 for 32 bit ARM. Based on LD64. by Frank Barchard · 4 years, 7 months ago
  24. 0f349c4 AVX512F implementation of GEMM & IGEMM micro-kernels by Marat Dukhan · 4 years, 7 months ago
  25. c72fa1e Use XNN_ARCH_* macros for architecture-specific parts in micro-kernels by Marat Dukhan · 4 years, 7 months ago
  26. 69172d9 6x8 ld128 GEMM microkernels by Frank Barchard · 4 years, 7 months ago
  27. 40a672f Move generated micro-kernels into a subdirectory by Marat Dukhan · 4 years, 7 months ago
  28. 5243bb0 DUP Neon GEMM kernels for Exynos by Frank Barchard · 4 years, 7 months ago
  29. 91317c5 Rename neon intrinsics to lane. by Frank Barchard · 4 years, 7 months ago
  30. 5743193 Fix typos in END_FUNCTION arguments in ARM64 assembly kernels by Marat Dukhan · 4 years, 7 months ago
  31. fda12b8 AVX and FMA3 microkernels for GEMM/GEMMINC/IGEMM by Marat Dukhan · 4 years, 7 months ago
  32. 5480997 Replace IDLETTERS with ABC by Frank Barchard · 4 years, 7 months ago
  33. df06d80 Neon shuffle GEMM and IGEMM kernels. by Frank Barchard · 4 years, 8 months ago
  34. 684bbb0 CMP 2 instructions earlier in A/C clamping. by Frank Barchard · 4 years, 8 months ago
  35. 9efaed7 A53 GEMM and IGEMM pipelined kernels prefetch C in epilogue by Frank Barchard · 4 years, 8 months ago
  36. 19418b5 GEMM 4x8 and 4x12 kernels use forward stores for C. by Frank Barchard · 4 years, 8 months ago
  37. 82cfe18 4x8 a53 epilogue NOPs in group 5 by Frank Barchard · 4 years, 8 months ago
  38. 0ecc2ab 4x8 GEMM for Cortex A53 by Frank Barchard · 4 years, 8 months ago
  39. 5abe43c ST1 post increment for Cortex A53 GEMM/IGEMM microkernels by Frank Barchard · 4 years, 8 months ago
  40. e67b783 ST1 post increment for ld64/ld128 GEMM/IGEMM microkernels by Frank Barchard · 4 years, 8 months ago
  41. e64f91a Pipelined 6x8 GEMM for Cortex A53 by Frank Barchard · 4 years, 8 months ago
  42. bd41971 A57 branch a version of A53 kernel by Frank Barchard · 4 years, 8 months ago
  43. 64a5bfe A53 6x8 IGEMM kernel prefetch by Frank Barchard · 4 years, 8 months ago
  44. bd1d5d9 6x8 A53 GEMM use prefetch. by Frank Barchard · 4 years, 8 months ago
  45. 00bf68e A53 6x8 GEMM unrolled by Frank Barchard · 4 years, 8 months ago
  46. b3c6c6e 6x8 A53 remove pushes for NEON by Frank Barchard · 4 years, 8 months ago
  47. 46fb807 4x8 A53 GEMM, and GEMMINC unpipelined microkernels. by Frank Barchard · 4 years, 8 months ago
  48. a7fb855 6x8 A53 GEMM, GEMMINC and IGEMM unpipelined microkernels. by Frank Barchard · 4 years, 8 months ago
  49. caf8544 LD64/LD128 kernels remove all pushes (d8-d15) Remap d12-d15 to d16-d19 by Frank Barchard · 4 years, 9 months ago
  50. fcfdc0e Automated g4 rollback of changelist 274728310. by Frank Barchard · 4 years, 9 months ago
  51. 8e3c551 1x8 a53 kernel refactor based on a57. by Frank Barchard · 4 years, 9 months ago
  52. baa9ead Update assembly Copyright notice to // comment by Frank Barchard · 4 years, 9 months ago
  53. 459c9fc 6x8 and a53 kernel comments. by Frank Barchard · 4 years, 9 months ago
  54. a5ca10e Neon intrinsics clamping - Replace 2 LD1R with 1 LD2R by Frank Barchard · 4 years, 9 months ago
  55. bd9e495 Remove 4x12 intrinsics kernels. by Frank Barchard · 4 years, 9 months ago
  56. 21be34f 1x8 A53 GEMM, GEMMINC and IGEMM microkernels. by Frank Barchard · 4 years, 9 months ago
  57. 1b0421b 1x12 and 1x8 assembly kernel cleanup by Frank Barchard · 4 years, 9 months ago
  58. 629a33e Fix incompatibilities with open-source Bazel-based build by Marat Dukhan · 4 years, 9 months ago
  59. 4c2637d LD2R for loading clamp parameters by Frank Barchard · 4 years, 9 months ago
  60. 80fc932 Unify comments style by Marat Dukhan · 4 years, 9 months ago
  61. b455b12 Initial open-source release by XNNPACK Team · 4 years, 9 months ago