1. 0b04374 Support QC8 GEMM microkernels by Marat Dukhan · 3 years, 1 month ago
  2. 4a2d255 Remove redundant SSSE3 microkernels with FP32 requantization by Marat Dukhan · 3 years, 1 month ago
  3. c46e671 FP32 requantization in QS8 GEMM/IGEMM microkernels for SSE/AVX/XOP by Marat Dukhan · 3 years, 1 month ago
  4. 8102593 Update GEMM benchmarks by Marat Dukhan · 3 years, 2 months ago
  5. 9b474cf Support FP32 requantization in AVX2 QS8 microkernels by Marat Dukhan · 3 years, 2 months ago
  6. e3d17bf Rename microkernel-related types and structures by Marat Dukhan · 3 years, 2 months ago
  7. b07c26a Rename QS8 GEMM/IGEMM/DWCONV microkernels by Marat Dukhan · 3 years, 2 months ago
  8. 748fd12 Use specialized layouts in SSE4/AVX2 QS8 [I]GEMM & DWCONV microkernels by Marat Dukhan · 3 years, 2 months ago
  9. 725f47e Split QS8/QU8 GEMM parameter initialization by datatype by Marat Dukhan · 3 years, 2 months ago
  10. d5694df Use pointer to parameter initialization function in GEMM/IGEMM/DWCONV microkernel tests by Marat Dukhan · 3 years, 2 months ago
  11. d4416d6 4x16 QS8 microkernel for Cortex A53 by Frank Barchard · 3 years, 2 months ago
  12. f56f4c4 Refactor interface of microkernel parameter initialization by Marat Dukhan · 3 years, 2 months ago
  13. a1a4e78 Scalar QS8 GEMM and IGEMM microkernels by Marat Dukhan · 3 years, 2 months ago
  14. 938ea81 Code generate 1x8C8 nicrokernel for Cortex A75 with and without prfm by Frank Barchard · 3 years, 2 months ago
  15. 46a69c9 QS8 1x8C8 GEMM microkernel for Cortex A53 by Frank Barchard · 3 years, 2 months ago
  16. 5549735 4X8 and 4x16 mla lane microkernels for A53 by Frank Barchard · 3 years, 2 months ago
  17. d68e114 Cortex A53 tuned C8 gemm/igemm microkernels by Frank Barchard · 3 years, 3 months ago
  18. 1f51d38 Add prefetch to MLA lane microkernel by Frank Barchard · 3 years, 3 months ago
  19. 4a35204 PRFM variant of QS8 C8 Neon microkernel. by Frank Barchard · 3 years, 3 months ago
  20. 2e42787 2x4c2/3x4c2 microkernels for SSE2/SSSE3/SSE4.1/AVX/XOP by Marat Dukhan · 3 years, 3 months ago
  21. a3c1633 AVX versions of QS8 GEMM/IGEMM microkernels by Marat Dukhan · 3 years, 3 months ago
  22. b876263 QS8 1X8C8 GEMM microkernel by Frank Barchard · 3 years, 4 months ago
  23. 2f06150 xnn_qs8_gemm_minmax_ukernel_2x8c8__aarch64_neon_mlal_padal GEMM microkernel by Frank Barchard · 3 years, 4 months ago
  24. 5655cb7 QS8 GEMM 2x8c16 MLAL PADAL assembly microkernel for AArch64 by Frank Barchard · 3 years, 4 months ago
  25. 62b4ff7 Remove 12x8 QS8 GEMM and IGEMM Neon dotproduct microkernels. by Frank Barchard · 3 years, 4 months ago
  26. da78da1 QS8 C8 Neon microkernels with MUL and MLA versions. by Frank Barchard · 3 years, 4 months ago
  27. 4a4be4e QS8 1x16c4 ld32 GEMM microkernel using NEON dot product by Frank Barchard · 3 years, 4 months ago
  28. a5e242c QS8 LD32 GEMM microkernel for big cores with dotproduct by Frank Barchard · 3 years, 5 months ago
  29. 71c4d1a QS8 Neon GEMM C16 microkernel with two 8 bit multiplies and vpadal to accumulate. by Frank Barchard · 3 years, 5 months ago
  30. 8247e21 C2 QS8 microkernel using mull then mlal with KC loop of 16 by Frank Barchard · 3 years, 5 months ago
  31. 5899012 QS8 Neon GEMM C8 microkernel with 8 bit multiply and vpadal to accumulate. by Frank Barchard · 3 years, 5 months ago
  32. a93765f Add MR=1 versions of QS8 gemm benchmarks by Frank Barchard · 3 years, 6 months ago
  33. 2302ffd QS8 Neon GEMM microkernel with 8 bit multiply and vpadal to accumulate by Frank Barchard · 3 years, 6 months ago
  34. ec0bf14 QS8 GEMM and IGEMM 3x8 3x16 and IGEMM 4x8 and 4x16 by Frank Barchard · 3 years, 6 months ago
  35. 4ecae2e QS8 Neon GEMM microkernel with 8 bit multiply by Frank Barchard · 3 years, 6 months ago
  36. cfbc849 Add 4x8 and 4x16 qs8 gemm microkernels by Frank Barchard · 3 years, 6 months ago
  37. d713e8a Refactor microbenchmarks by Marat Dukhan · 3 years, 7 months ago
  38. 146e999 Replace QS8 4x8 with 2x8 neon microkernel. Improves performance for aarch32. by Frank Barchard · 3 years, 9 months ago
  39. f2742c4 Cortex A55r1 QS8 GEMM microkernel by Frank Barchard · 3 years, 9 months ago
  40. 31328cb Add RUY benchmark to qs8_gemm_bench by Frank Barchard · 3 years, 9 months ago
  41. 0797eb1 Rename QS8 assembly GEMM kernels to ld64 by Frank Barchard · 3 years, 9 months ago
  42. f1fd89e 1x16 QS8 GEMM AARCH64 assembly microkernel using dot product. by Frank Barchard · 3 years, 9 months ago
  43. 31bb45b 4x16 QS8 GEMM AARCH64 assembly microkernel using dot product. by Frank Barchard · 3 years, 9 months ago
  44. a48848f 4x8, 6x8 and 8x16 Neon dot product GEMM microkernels by Frank Barchard · 3 years, 10 months ago
  45. 2fa1745 6x16 QS8 GEMM for Neon dot product by Frank Barchard · 3 years, 10 months ago
  46. a964473 Add xnn_qs8_gemm_minmax_ukernel_${MR}x${NR}c4__neondot (ARMv8.2+dotprod). by Benoit Jacob · 4 years ago
  47. bb00b1d AVX512 variants of QS8 GEMM and IGEMM microkernels by Marat Dukhan · 4 years ago
  48. ab67142 Benchmark ARM NEON versions of QS8 GEMM microkernels by Marat Dukhan · 4 years ago
  49. 27203da WAsm SIMD versions of QS8 GEMM and IGEMM microkernels by Marat Dukhan · 4 years ago
  50. ecd8311 Rename s8rng/s32rng -> i8rng/i32rng by Marat Dukhan · 4 years ago
  51. 683fab3 XW (eXtended Weights) optimization for QS8 GEMM microkernel by Marat Dukhan · 4 years ago
  52. e7edc80 Add 3x4c8 variants of SSE2/SSSE3/SSE4.1/XOP GEMM/IGEMM microkernels by Marat Dukhan · 4 years ago
  53. 1280952 AVX2 version of QS8 GEMM and IGEMM microkernels by Marat Dukhan · 4 years ago
  54. 1566fee XOP versions of QS8 GEMM/IGEMM microkernels by Marat Dukhan · 4 years ago
  55. 44f0ca7 Bind RNG by reference in microbenchmarks by Marat Dukhan · 4 years ago
  56. dee732b LD128 versions of QS8 GEMM SSE2/SSSE3/SSE4.1 microkernels by Marat Dukhan · 4 years ago
  57. 14d3ce8 Add LD64 suffix in QS8 GEMM/IGEMM microkernels by Marat Dukhan · 4 years ago
  58. 733d0be QS8 GEMM MRx4c8 SSE2/SSSE3/SSE4.1 microkernels by Marat Dukhan · 4 years ago
  59. 595e170 QS8 GEMM microkernels and infrastructure by Marat Dukhan · 4 years ago