1. 16c0912 F16 MAXPOOL microkernel for NEON FP16ARITH by Marat Dukhan · 2 years, 4 months ago
  2. 6b72e6c Convert F32 IGEMM for A75 to JIT, add tests by Zhi An Ng · 2 years, 4 months ago
  3. e96b6bc Split qs8-igemm-minmax-rndnu tests into 1 more file (4 total), seeing compile timeouts in coverage runs by Zhi An Ng · 2 years, 4 months ago
  4. 34251d8 QS8 4x8 lane GEMM AArch32 microkernel for Cortex A7 by Frank Barchard · 2 years, 4 months ago
  5. 101271e QC8 4x8 lane GEMM AArch32 microkernel for Cortex A7 by Frank Barchard · 2 years, 4 months ago
  6. 9e4d2aa QS8 4x8 lane GEMM AArch32 microkernel for Cortex A53 by Frank Barchard · 2 years, 4 months ago
  7. cfd947d Add neon zip microkernel generator by Alan Kelly · 2 years, 4 months ago
  8. 348c377 QU8 GEMM/IGEMM WAsm SIMD microkernels with SR=4 by Marat Dukhan · 2 years, 4 months ago
  9. f2b233b Make SSE2 microkernels consistent with neon zip microkernels. - DEC is now MOV by Alan Kelly · 2 years, 4 months ago
  10. c607028 Remove wb from JIT aarch32 instructions, use mem operand and ++ instead by Zhi An Ng · 2 years, 4 months ago
  11. 870108c QS8/QC8 4x8 dot product IGEMM AArch32 microkernel for Cortex A55 by Frank Barchard · 2 years, 4 months ago
  12. c2e2da8 Fix conversion script for aarch64 assembly kernels and convert a single F32 GEMM as a test by Zhi An Ng · 2 years, 4 months ago
  13. ac654f1 QC8 4x8 dot product GEMM AArch32 microkernel for Cortex A55 by Frank Barchard · 2 years, 4 months ago
  14. 0f294ad QS8 4x8 dot product GEMM AArch32 microkernel for Cortex A55 by Frank Barchard · 2 years, 4 months ago
  15. 901845c QU8 4x8 NEON MLA Lane microkernel AArch32 assembly language by Frank Barchard · 2 years, 4 months ago
  16. b26ead1 F16C implementation of F16 GAVGPOOL microkernels by Marat Dukhan · 2 years, 4 months ago
  17. c7c92b0 Generate F16 GAVGPOOL NEONFP16ARITH microkernels from template by Marat Dukhan · 2 years, 4 months ago
  18. 5e1a303 QC8 GEMM/IGEMM assembly microkernels for ARMv7 NEON by Frank Barchard · 2 years, 4 months ago
  19. b1a869d Merge generate transpose scripts by Alan Kelly · 2 years, 4 months ago
  20. 4b23423 Split test generator for qu8-gavgpool by Frank Barchard · 2 years, 4 months ago
  21. 5da6d38 SSE2 transpose microkernel code generator. by Alan Kelly · 2 years, 4 months ago
  22. d19bde9 Add x64 scalar transpose microkernels by Alan Kelly · 2 years, 4 months ago
  23. cd21b02 Add x8 scalar transpose microkernels by Alan Kelly · 2 years, 4 months ago
  24. 84aae41 Add x16 scalar transpose microkernels by Alan Kelly · 2 years, 4 months ago
  25. 8575504 Switch QS8/QU8 GAVGPOOL NEON microkernels to RNDNU requantization by Marat Dukhan · 2 years, 4 months ago
  26. 33a98fa Switch QS8/QU8 VMUL[C] NEON microkernels to RNDNU requantization by Marat Dukhan · 2 years, 4 months ago
  27. d1f53e4 Generate QU8 GAVGPOOL microkernels from QS8 GAVGPOOL templates by Marat Dukhan · 2 years, 4 months ago
  28. d81fa0a Pipeline remaining QS8 AVGPOOL microkernels by Marat Dukhan · 2 years, 4 months ago
  29. 9e258d6 Remove multi-accumulator support in QS8 GAVGPOOL microkernels by Marat Dukhan · 2 years, 4 months ago
  30. d7a4b22 Generate missing QS8 GAVGPOOL microkernels by Marat Dukhan · 2 years, 4 months ago
  31. 847ff5e Refactor naming of QS8 GAVGPOOL microkernels by Marat Dukhan · 2 years, 4 months ago
  32. 53f4106 Switch QS8 GAVGPOOL microkernels to use FP32 requantization by Marat Dukhan · 2 years, 4 months ago
  33. c27f04b Add missing generated unit tests to BUILD and CMakeLists.txt. by Zhi An Ng · 2 years, 4 months ago
  34. bd7f9a4 F16C implementation of F16 PRELU microkernels by Marat Dukhan · 2 years, 4 months ago
  35. 4c1fd6f Allow generate-gemm-test.py to accept multiple output files, and shard the generated tests across all specified output files. by Zhi An Ng · 2 years, 4 months ago
  36. d454545 F16C implementation of F16 VBINARY[C] microkernels by Marat Dukhan · 2 years, 4 months ago
  37. 2780863 Scalar transpose microkernel by Alan Kelly · 2 years, 4 months ago
  38. a248337 Split more of qs8-gemm-minmax-rndnu out into another file, for microkernels with "c4" by Zhi An Ng · 2 years, 4 months ago
  39. d5a5333 Additional tile sizes for QU8 neon lane microkernel. by Frank Barchard · 2 years, 4 months ago
  40. 751f622 F16C implementation of F16 VHSWISH microkernels by Marat Dukhan · 2 years, 4 months ago
  41. 645af97 FMA3 implementation of F16 DWCONV/VCLAMP/VMULCADDC microkernels by Marat Dukhan · 2 years, 4 months ago
  42. cbe478a Generate QU8 GAVGPOOL tests from YAML specification by Marat Dukhan · 2 years, 5 months ago
  43. bf72b54 Split qc8-igemm-minmax-fp32.yaml into 2 files, all microkernels with c go into a separate file. by Zhi An Ng · 2 years, 5 months ago
  44. 49d94ca Split qc8-gemm-minmax-fp32.yaml into 2 files, all the microkernels with c goes into a separate file. by Zhi An Ng · 2 years, 5 months ago
  45. 0e0f726 Split qs8-gemm-minmax-rndnu.yaml into 2 files, all the microkernels with c2 suffix goes into a separate file. by Zhi An Ng · 2 years, 5 months ago
  46. c4302c2 AVX2 implementations of F16 GEMM/IGEMM microkernels by Marat Dukhan · 2 years, 5 months ago
  47. 842bea9 Remove F16 VRELU microkernels by Marat Dukhan · 2 years, 5 months ago
  48. ed73fb6 Add qc8 gemm and igemm JIT microkernels by Zhi An Ng · 2 years, 5 months ago
  49. 13b57dd Add more converted microkernels used in init.c. by Zhi An Ng · 2 years, 5 months ago
  50. 8a9eac6 Amalgamate AVX, AVX2, and FMA3 microkernels by Marat Dukhan · 2 years, 5 months ago
  51. 5999c92 Refactor naming of RADDSTOREEXPMINUSMAX microkernels by Marat Dukhan · 2 years, 5 months ago
  52. 68db12e Amalgamate F16C microkernels by Marat Dukhan · 2 years, 5 months ago
  53. f623740 QC8 NEON lane microkernels by Frank Barchard · 2 years, 5 months ago
  54. 7c1115f Reoptimize microkernel selection for WAsm 1.0 by Marat Dukhan · 2 years, 5 months ago
  55. 7873586 Rename PLD to PRFM for aarch32 microkernels. by Frank Barchard · 2 years, 5 months ago
  56. 580292d Print some usage examples when called without arguments, also add a comment on how to use the script. by Zhi An Ng · 2 years, 5 months ago
  57. 272d4d9 FP32 IMAGIC variants of scalar QC8/QS8/QU8 GEMM/IGEMM/DWCONV microkernels by Marat Dukhan · 2 years, 5 months ago
  58. f721e37 LRINTF variants of scalar F32->QS8 and F32->QU8 VCVT microkernels by Marat Dukhan · 2 years, 5 months ago
  59. bdf1099 Refactor scalar F32->QS8 and F32->QU8 microkernels by Marat Dukhan · 2 years, 5 months ago
  60. 2ac722e Refactor requantization in scalar QS8/QC8/QU8 microkernels by Marat Dukhan · 2 years, 5 months ago
  61. ce834ad Refactor parameters in F32 VSIGMOID microkernels by Marat Dukhan · 2 years, 5 months ago
  62. 134f984 Refactor F16->F32 VCVT microkernels by Marat Dukhan · 2 years, 5 months ago
  63. 87fe410 QC8 quantization for all aarch32 GEMM/IGEMM microkernels by Frank Barchard · 2 years, 5 months ago
  64. b43b47a Add a script to convert existing assembly microkernels to JIT codegen. by Zhi An Ng · 2 years, 5 months ago
  65. 51c6134 Amalgamate SSE and AVX512 microkernels for TFLite build by Marat Dukhan · 2 years, 5 months ago
  66. e48b5c1 QS8 4x8 Neon Lane LD64 IGEMM AArch32 microkernel by Frank Barchard · 2 years, 5 months ago
  67. 4841021 QS8 4x8 dot product LD64 IGEMM AArch32 microkernel by Frank Barchard · 2 years, 5 months ago
  68. 9f3f420 QS8 4x8 LD64 dot product GEMM AArch32 microkernel by Frank Barchard · 2 years, 5 months ago
  69. 98393ad AVX512 QS8->F32 and QU8->F32 VCVT microkernels by Marat Dukhan · 2 years, 5 months ago
  70. 7b5f779 AVX2 QS8->F32 and QU8->F32 VCVT microkernels by Marat Dukhan · 2 years, 5 months ago
  71. cd4089f AVX QS8->F32 and QU8->F32 VCVT microkernels by Marat Dukhan · 2 years, 5 months ago
  72. 2edf863 AVX512 F32->QS8 and F32->QU8 VCVT microkernels by Marat Dukhan · 2 years, 5 months ago
  73. 0d399ca AVX2 F32->QS8 and F32->QU8 VCVT microkernels by Marat Dukhan · 2 years, 5 months ago
  74. b91432c AVX F32->QS8 and F32->QU8 VCVT microkernels by Marat Dukhan · 2 years, 5 months ago
  75. da7b2e2 QS8 4x8 lane GEMM AArch32 microkernel by Frank Barchard · 2 years, 5 months ago
  76. 914f57b Aarch64 4x8 lane ld64 GEMM/IGEMM microkernels. by Frank Barchard · 2 years, 5 months ago
  77. cb052a3 Remove duplicate template line for 1x8c4 NEON dot product. by Frank Barchard · 2 years, 5 months ago
  78. 86bd270 Scalar QS8/QU8 -> F32 VCVT microkernels by Marat Dukhan · 2 years, 6 months ago
  79. d873fa2 SSE2 QS8/QU8->F32 VCVT microkernels by Marat Dukhan · 2 years, 6 months ago
  80. fbf12b0 WAsm SIMD QS8/QU8 -> F32 VCVT microkernels by Marat Dukhan · 2 years, 6 months ago
  81. f9cf55d SSE4.1 QS8/QU8->F32 VCVT microkernels by Marat Dukhan · 2 years, 6 months ago
  82. fee66be NEON QS8/QU8 -> F32 VCVT microkernels by Marat Dukhan · 2 years, 6 months ago
  83. 1a55180 Merge pull request #2036 from digantdesai:enable_fp32_arm_kernels by XNNPACK Team · 2 years, 6 months ago
  84. 0f1ed94 QS8/QC8 GEMM/IGEMM WAsm SIMD microkernels using C2S4 layout by Marat Dukhan · 2 years, 6 months ago
  85. 59d6515 Enable FP32 requant variant for QU8 [1,4]x8 Neon MLAL [I]GEMM kernels by Digant Desai · 2 years, 6 months ago
  86. 9982ed3 Enable FP32 requant variant for QU8 NEON dotprod [I]GEMM kernels by Digant Desai · 2 years, 6 months ago
  87. 2e2d179 Enable FP32 requant variant for QU8 4x16c4 NEON asm dotprod [I]GEMM kernels by Digant Desai · 2 years, 6 months ago
  88. 10f9f62 Enable FP32 requant variant for QU8 4x16c4 NEON asm dotprod [I]GEMM kernels for CA55r1 by Digant Desai · 2 years, 6 months ago
  89. e20a873 Optimize selection of QS8/QU8 VADD[C] microkernels on WAsm SIMD by Marat Dukhan · 2 years, 6 months ago
  90. 8999190 Remove GEMMLOWP requantization from QS8 GEMM/IGEMM templates by Marat Dukhan · 2 years, 6 months ago
  91. 17a9e3f Remove GEMMLOWP requantization from QS8 DWCONV templates by Marat Dukhan · 2 years, 6 months ago
  92. 430b173 F32->QS8/QU8 VCVT scalar microkernels using FP32 min/max by Marat Dukhan · 2 years, 6 months ago
  93. 4bd1de9 F32->QS8 and F32->QU8 VCVT WAsm SIMD microkernels using F32->I32 conversion by Marat Dukhan · 2 years, 6 months ago
  94. 00a1085 F32->QS8 and F32->QU8 VCVT scalar microkernels by Marat Dukhan · 2 years, 6 months ago
  95. 98d5552 F32->QS8 and F32->QU8 VCVT WAsm SIMD microkernels by Marat Dukhan · 2 years, 6 months ago
  96. b2d0a2a F32->QS8 and F32->QU8 VCVT NEON microkernels by Marat Dukhan · 2 years, 6 months ago
  97. 3df14d3 F32->QS8 and F32->QU8 VCVT NEON V8 microkernels by Marat Dukhan · 2 years, 6 months ago
  98. c5aa242 F32->QS8 and F32->QU8 microkernels for SSE by Marat Dukhan · 2 years, 6 months ago
  99. 27bf92c RNDNU versions of all Neon lane microkernels. by Frank Barchard · 2 years, 6 months ago
  100. 6a69c8e Scalar versions of S8/U8 IBILINEAR microkernels by Marat Dukhan · 2 years, 6 months ago