1. 0e0f726 Split qs8-gemm-minmax-rndnu.yaml into 2 files, all the microkernels with c2 suffix goes into a separate file. by Zhi An Ng · 2 years, 9 months ago
  2. c4302c2 AVX2 implementations of F16 GEMM/IGEMM microkernels by Marat Dukhan · 2 years, 9 months ago
  3. 842bea9 Remove F16 VRELU microkernels by Marat Dukhan · 2 years, 9 months ago
  4. 16b734c Add more QC8 GEMM/IGEMM JIT microkernels. by Zhi An Ng · 2 years, 9 months ago
  5. 58b17ba Remove VSCALE microkernels by Marat Dukhan · 2 years, 9 months ago
  6. ed73fb6 Add qc8 gemm and igemm JIT microkernels by Zhi An Ng · 2 years, 9 months ago
  7. 13b57dd Add more converted microkernels used in init.c. by Zhi An Ng · 2 years, 9 months ago
  8. 5999c92 Refactor naming of RADDSTOREEXPMINUSMAX microkernels by Marat Dukhan · 2 years, 9 months ago
  9. ed90216 aarch64 transpose TBL microkernel by Alan Kelly · 2 years, 9 months ago
  10. f290a14 Enable QC8 4x8 mla lane assembler microkernel by Frank Barchard · 2 years, 9 months ago
  11. f623740 QC8 NEON lane microkernels by Frank Barchard · 2 years, 9 months ago
  12. d8a1dbe Add RISC-V scalar microkernels to CMake build by Marat Dukhan · 2 years, 9 months ago
  13. 7873586 Rename PLD to PRFM for aarch32 microkernels. by Frank Barchard · 2 years, 9 months ago
  14. bd11e6a Add -fno-math-errno compilation option for scalar microkernels by Marat Dukhan · 2 years, 9 months ago
  15. cccb012 Apply sort and formatting to ARM code by Frank Barchard · 2 years, 9 months ago
  16. 272d4d9 FP32 IMAGIC variants of scalar QC8/QS8/QU8 GEMM/IGEMM/DWCONV microkernels by Marat Dukhan · 2 years, 9 months ago
  17. f721e37 LRINTF variants of scalar F32->QS8 and F32->QU8 VCVT microkernels by Marat Dukhan · 2 years, 9 months ago
  18. bdf1099 Refactor scalar F32->QS8 and F32->QU8 microkernels by Marat Dukhan · 2 years, 9 months ago
  19. 2ac722e Refactor requantization in scalar QS8/QC8/QU8 microkernels by Marat Dukhan · 2 years, 9 months ago
  20. ce834ad Refactor parameters in F32 VSIGMOID microkernels by Marat Dukhan · 2 years, 9 months ago
  21. 3ddc20c Benchmarks for Abs, Negate, and Square operators by Marat Dukhan · 2 years, 9 months ago
  22. 5c7fd89 Benchmark for Leaky ReLU operator by Marat Dukhan · 2 years, 9 months ago
  23. 134f984 Refactor F16->F32 VCVT microkernels by Marat Dukhan · 2 years, 9 months ago
  24. 2700809 Specify -mfp16-format=ieee for AArch32 GCC builds by Marat Dukhan · 2 years, 9 months ago
  25. 87fe410 QC8 quantization for all aarch32 GEMM/IGEMM microkernels by Frank Barchard · 2 years, 9 months ago
  26. 1945f0b SSE transpose x16 microkernel (4x8) by Alan Kelly · 2 years, 9 months ago
  27. b43b47a Add a script to convert existing assembly microkernels to JIT codegen. by Zhi An Ng · 2 years, 9 months ago
  28. 7a03a0f Merge pull request #2191 from xbwee1024:bugfix by XNNPACK Team · 2 years, 9 months ago
  29. e0f15ad Split scalar production microkernels into portable, AArch32, and Wasm by Marat Dukhan · 2 years, 9 months ago
  30. f98f58d Lowering to c++11 as c++14 literals was converted to c++11 in #2192 by xbwee · 2 years, 9 months ago
  31. 562112e Fix build error with cmake for src/jit. by xbwee · 2 years, 9 months ago
  32. 9519816 Enable QS8 4x8 LD64 Neon on AArch32 by Frank Barchard · 2 years, 9 months ago
  33. 1e9c5ac Fix CMake build by Marat Dukhan · 2 years, 9 months ago
  34. e48b5c1 QS8 4x8 Neon Lane LD64 IGEMM AArch32 microkernel by Frank Barchard · 2 years, 9 months ago
  35. 4841021 QS8 4x8 dot product LD64 IGEMM AArch32 microkernel by Frank Barchard · 2 years, 9 months ago
  36. 9f3f420 QS8 4x8 LD64 dot product GEMM AArch32 microkernel by Frank Barchard · 2 years, 10 months ago
  37. 98393ad AVX512 QS8->F32 and QU8->F32 VCVT microkernels by Marat Dukhan · 2 years, 10 months ago
  38. fda06cb SSE transpose microkernel by Alan Kelly · 2 years, 10 months ago
  39. 7b5f779 AVX2 QS8->F32 and QU8->F32 VCVT microkernels by Marat Dukhan · 2 years, 10 months ago
  40. cd4089f AVX QS8->F32 and QU8->F32 VCVT microkernels by Marat Dukhan · 2 years, 10 months ago
  41. 2edf863 AVX512 F32->QS8 and F32->QU8 VCVT microkernels by Marat Dukhan · 2 years, 10 months ago
  42. 0d399ca AVX2 F32->QS8 and F32->QU8 VCVT microkernels by Marat Dukhan · 2 years, 10 months ago
  43. b91432c AVX F32->QS8 and F32->QU8 VCVT microkernels by Marat Dukhan · 2 years, 10 months ago
  44. 6883abb JIT memory allocation and integration into Assembler by Zhi An Ng · 2 years, 10 months ago
  45. da7b2e2 QS8 4x8 lane GEMM AArch32 microkernel by Frank Barchard · 2 years, 10 months ago
  46. 710fb42 Benchmark for the Convert (F32->QS8) operator by Marat Dukhan · 2 years, 10 months ago
  47. 914f57b Aarch64 4x8 lane ld64 GEMM/IGEMM microkernels. by Frank Barchard · 2 years, 10 months ago
  48. f92206b QS8->F32 and QU8->F32 Convert NC operators by Marat Dukhan · 2 years, 10 months ago
  49. ad6f2dc Benchmarks for QS8->F32 and QU8->F32 VCVT microkernels by Marat Dukhan · 2 years, 10 months ago
  50. cb052a3 Remove duplicate template line for 1x8c4 NEON dot product. by Frank Barchard · 2 years, 10 months ago
  51. 86bd270 Scalar QS8/QU8 -> F32 VCVT microkernels by Marat Dukhan · 2 years, 10 months ago
  52. d873fa2 SSE2 QS8/QU8->F32 VCVT microkernels by Marat Dukhan · 2 years, 10 months ago
  53. f9cf55d SSE4.1 QS8/QU8->F32 VCVT microkernels by Marat Dukhan · 2 years, 10 months ago
  54. fee66be NEON QS8/QU8 -> F32 VCVT microkernels by Marat Dukhan · 2 years, 10 months ago
  55. 59d6515 Enable FP32 requant variant for QU8 [1,4]x8 Neon MLAL [I]GEMM kernels by Digant Desai · 2 years, 10 months ago
  56. 9982ed3 Enable FP32 requant variant for QU8 NEON dotprod [I]GEMM kernels by Digant Desai · 2 years, 10 months ago
  57. 2e2d179 Enable FP32 requant variant for QU8 4x16c4 NEON asm dotprod [I]GEMM kernels by Digant Desai · 2 years, 10 months ago
  58. 10f9f62 Enable FP32 requant variant for QU8 4x16c4 NEON asm dotprod [I]GEMM kernels for CA55r1 by Digant Desai · 2 years, 10 months ago
  59. b559fe9 Initial AArch32 structure by Zhi An Ng · 2 years, 10 months ago
  60. 8999190 Remove GEMMLOWP requantization from QS8 GEMM/IGEMM templates by Marat Dukhan · 2 years, 10 months ago
  61. 17a9e3f Remove GEMMLOWP requantization from QS8 DWCONV templates by Marat Dukhan · 2 years, 10 months ago
  62. 20483c7 Expose Convert operator in Subgraph API by Marat Dukhan · 2 years, 10 months ago
  63. 430b173 F32->QS8/QU8 VCVT scalar microkernels using FP32 min/max by Marat Dukhan · 2 years, 10 months ago
  64. ed2d776 F32->QS8 and F32->QU8 Convert NC operators by Marat Dukhan · 2 years, 10 months ago
  65. 03f1297 F32->QS8 and F32->QU8 Convert NC operators by XNNPACK Team · 2 years, 10 months ago
  66. 7d2d85c F32->QS8 and F32->QU8 Convert NC operators by Marat Dukhan · 2 years, 10 months ago
  67. 563eee1 Benchmarks for F32->QS8 and F32->QU8 VCVT microkernels by Marat Dukhan · 2 years, 10 months ago
  68. 00a1085 F32->QS8 and F32->QU8 VCVT scalar microkernels by Marat Dukhan · 2 years, 10 months ago
  69. b2d0a2a F32->QS8 and F32->QU8 VCVT NEON microkernels by Marat Dukhan · 2 years, 10 months ago
  70. d24301d F32->QS8/QU8 CVT evaluation stubs for NEON and NEON v8 by Marat Dukhan · 2 years, 10 months ago
  71. 9551075 Fix CMake build by Marat Dukhan · 2 years, 10 months ago
  72. 3df14d3 F32->QS8 and F32->QU8 VCVT NEON V8 microkernels by Marat Dukhan · 2 years, 10 months ago
  73. c5aa242 F32->QS8 and F32->QU8 microkernels for SSE by Marat Dukhan · 2 years, 10 months ago
  74. 5f7cf55 Avoid using gcc-specific intrinsics in NEON microkernels by Marat Dukhan · 2 years, 10 months ago
  75. 27bf92c RNDNU versions of all Neon lane microkernels. by Frank Barchard · 2 years, 10 months ago
  76. 24abe6b Initialize S8/U8 IBILINEAR microkernel pointers by Marat Dukhan · 2 years, 10 months ago
  77. 6a69c8e Scalar versions of S8/U8 IBILINEAR microkernels by Marat Dukhan · 2 years, 10 months ago
  78. 7519eb1 SSE2 & SSE4.1 versions of S8/U8 IBILINEAR microkernels by Marat Dukhan · 2 years, 10 months ago
  79. cdb42a5 NEON versions of S8/U8 IBILINEAR microkernels by Marat Dukhan · 2 years, 10 months ago
  80. 9cdc10d QU8 4x16 lane AArch64 NEON GEMM/IGEMM ld64 microkernel by Frank Barchard · 2 years, 10 months ago
  81. 0bc5801 QC8 AArch32 use NeonV8 when available. by Frank Barchard · 2 years, 10 months ago
  82. 5cffb64 4x16 lane AArch64 NEON GEMM/IGEMM ld64 microkernel by Frank Barchard · 2 years, 10 months ago
  83. 64ab1b7 LD1R and LD2R variants of c4 microkernel by Frank Barchard · 2 years, 10 months ago
  84. 15eec02 LD1R and LD2R variants of c2 microkernel by Frank Barchard · 2 years, 11 months ago
  85. 42f5c50 LOADDUP variant of c2 microkernel by Frank Barchard · 2 years, 11 months ago
  86. e22685a Remove padal from quantized microkernel names. by Frank Barchard · 2 years, 11 months ago
  87. eb704f7 QS8 C4S2 Neon GEMM/IGEMM microkernels by Frank Barchard · 2 years, 11 months ago
  88. a0c6168 F32->F16 Convert operator by Marat Dukhan · 2 years, 11 months ago
  89. e7043ff Enable C2S4 for QC8 GEMM/IGEMM microkernels. by Frank Barchard · 2 years, 11 months ago
  90. c7a032d C2S4 QS8 Neon GEMM/IGEMM microkernels. by Frank Barchard · 2 years, 11 months ago
  91. 1fe8995 Scalar F32->F16 VCVT microkernels by Marat Dukhan · 2 years, 11 months ago
  92. 78f039d Scalar F16->F32 evaluation stubs of bitcast-based and fabsf-based variants by Marat Dukhan · 2 years, 11 months ago
  93. 4edfdbf NEON F32->F16 VCVT microkernels by Marat Dukhan · 2 years, 11 months ago
  94. b4cde5a Fix CMake build on ARM by Marat Dukhan · 2 years, 11 months ago
  95. eb84423 SSE2, SSE4.1, and AVX F32->F16 VCVT microkernels by Marat Dukhan · 2 years, 11 months ago
  96. 056f49d Evaluation stubs for SSE2 & SSE4.1 F32->F16 conversion by Marat Dukhan · 2 years, 11 months ago
  97. a6eb1e5 Evaluation stubs for NEON F32->F16 conversion by Marat Dukhan · 2 years, 11 months ago
  98. 46cc1e1 Evaluation stubs for scalar F32->F16 conversion by Marat Dukhan · 2 years, 11 months ago
  99. 287952a QS8 C4 Neon GEMM/IGEMM microkernels by Frank Barchard · 2 years, 11 months ago
  100. 66ae257 Switch from C2 to S4C2 for qs8 microkernels on 32 bit ARM by Frank Barchard · 2 years, 11 months ago