1. f30a859 Port aarch64 F32 IGEMM 1x8 A75 microkernel to JIT, add tests, benchmarks, enable in init.c if JIT is enabled by Zhi An Ng · 2 years, 4 months ago
  2. eb7256b Port F32 GEMM A75 1x8 microkernel to JIT and specialize for min/max, add tests and benchmarks by Zhi An Ng · 2 years, 4 months ago
  3. 1425eb5 Copy IGEMM benchmark code into JIT's IGEMM benchmark code, and add JIT aarch64 generators to benchmarks by Zhi An Ng · 2 years, 4 months ago
  4. 2188833 Fix F32 IGEMM benchmark loop to not require capping NC to NR by Zhi An Ng · 2 years, 4 months ago
  5. 77d2885 QS8 AArch32 GEMM benchmark build fix by Frank Barchard · 2 years, 4 months ago
  6. 6cb0fd0 Add AArch32 GEMM benchmarks for Cortex A53 and Cortex A7 by Frank Barchard · 2 years, 4 months ago
  7. ca51090 QS8 GEMM benchmark for JIT add ISA check by Frank Barchard · 2 years, 4 months ago
  8. 9fd2f3e Fix passing of kc JIT generator in F32 GEMM benchmarks by Zhi An Ng · 2 years, 4 months ago
  9. 34251d8 QS8 4x8 lane GEMM AArch32 microkernel for Cortex A7 by Frank Barchard · 2 years, 4 months ago
  10. f82410d Enable QU8 4x8 lane GEMM AArch32 microkernel for Cortex A53 by Frank Barchard · 2 years, 4 months ago
  11. 9e4d2aa QS8 4x8 lane GEMM AArch32 microkernel for Cortex A53 by Frank Barchard · 2 years, 4 months ago
  12. cfd947d Add neon zip microkernel generator by Alan Kelly · 2 years, 4 months ago
  13. 3deae1d Guard JIT-related structs and functionality behind XNN_PLATFORM_JIT by Zhi An Ng · 2 years, 4 months ago
  14. f9fc9ec Integrate JIT generated GEMM microkernels into create_convolution2d_nhwc by Zhi An Ng · 2 years, 4 months ago
  15. 58cdcf2 Reoptimize QC8/QS8/QU8 GEMM/IGEMM WAsm SIMD microkernel selection by Marat Dukhan · 2 years, 4 months ago
  16. 348c377 QU8 GEMM/IGEMM WAsm SIMD microkernels with SR=4 by Marat Dukhan · 2 years, 4 months ago
  17. fbd67a7 Pad K to a multiple of SR in GEMM/IGEMM microkernels by Marat Dukhan · 2 years, 4 months ago
  18. f2b233b Make SSE2 microkernels consistent with neon zip microkernels. - DEC is now MOV by Alan Kelly · 2 years, 4 months ago
  19. 8b758bf Integrate JIT generated GEMM microkernels into create_convolution2d_nhwc by XNNPACK Team · 2 years, 4 months ago
  20. 64cb10f Guard JIT-related structs and functionality behind XNN_PLATFORM_JIT by XNNPACK Team · 2 years, 4 months ago
  21. c9a2e74 Guard JIT-related structs and functionality behind XNN_PLATFORM_JIT by Zhi An Ng · 2 years, 4 months ago
  22. df51e11 Integrate JIT generated GEMM microkernels into create_convolution2d_nhwc by Zhi An Ng · 2 years, 4 months ago
  23. d236074 Add F32 GEMM 6x8 aarch64 neonfma cortex a75 JIT microkernel to benchmark by Zhi An Ng · 2 years, 4 months ago
  24. 870108c QS8/QC8 4x8 dot product IGEMM AArch32 microkernel for Cortex A55 by Frank Barchard · 2 years, 4 months ago
  25. a1cad4a Add x8 transpose bench by Alan Kelly · 2 years, 4 months ago
  26. ba68f44 Add x64 transpose bench by Alan Kelly · 2 years, 4 months ago
  27. c821ea7 Refactor x16 transpose bench and add missing ukernels. by Alan Kelly · 2 years, 4 months ago
  28. e8bbda0 Re-factor x32 transpose bench by Alan Kelly · 2 years, 4 months ago
  29. 0f294ad QS8 4x8 dot product GEMM AArch32 microkernel for Cortex A55 by Frank Barchard · 2 years, 4 months ago
  30. 70ea0a2 Specialize F32 GEMM A53 JIT microkernel for min/max params by Zhi An Ng · 2 years, 4 months ago
  31. 0ec25cf Duplicate test methods in gemm-microkernel-test for JIT codegen, update IGEMM generator signature and test generation script. by Zhi An Ng · 2 years, 4 months ago
  32. 901845c QU8 4x8 NEON MLA Lane microkernel AArch32 assembly language by Frank Barchard · 2 years, 4 months ago
  33. 83844ae Change JIT generator signature to accept nc and kc to specialize on those values by Zhi An Ng · 2 years, 4 months ago
  34. 5da6d38 SSE2 transpose microkernel code generator. by Alan Kelly · 2 years, 4 months ago
  35. d7111a5 Remove F32 GEMM E2E JIT benchmarks (temporarily) as we are changing the JIT generator interface by Zhi An Ng · 2 years, 4 months ago
  36. 33a98fa Switch QS8/QU8 VMUL[C] NEON microkernels to RNDNU requantization by Marat Dukhan · 2 years, 4 months ago
  37. 717665f Add JIT microkernels to F32 GEMM E2E benchmarks by Zhi An Ng · 2 years, 4 months ago
  38. a30e2df Fix QU8 E2E lane benchmark tile sizes by Frank Barchard · 2 years, 4 months ago
  39. 2780863 Scalar transpose microkernel by Alan Kelly · 2 years, 4 months ago
  40. d5a5333 Additional tile sizes for QU8 neon lane microkernel. by Frank Barchard · 2 years, 4 months ago
  41. 645af97 FMA3 implementation of F16 DWCONV/VCLAMP/VMULCADDC microkernels by Marat Dukhan · 2 years, 4 months ago
  42. 1bef0f2 Add JIT microkernels to QS8 GEMM benchmarks by Zhi An Ng · 2 years, 5 months ago
  43. 665cb23 Add JIT microkernels to F32 IGEMM benchmarks by Zhi An Ng · 2 years, 5 months ago
  44. 25764d8 Add JIT microkernels to bench/f32-gemm by Zhi An Ng · 2 years, 5 months ago
  45. c4302c2 AVX2 implementations of F16 GEMM/IGEMM microkernels by Marat Dukhan · 2 years, 5 months ago
  46. 842bea9 Remove F16 VRELU microkernels by Marat Dukhan · 2 years, 5 months ago
  47. 58b17ba Remove VSCALE microkernels by Marat Dukhan · 2 years, 5 months ago
  48. 4a5c771 Refactor F32 RADDSTOREEXPMINUSMAX microkernels by Marat Dukhan · 2 years, 5 months ago
  49. 5999c92 Refactor naming of RADDSTOREEXPMINUSMAX microkernels by Marat Dukhan · 2 years, 5 months ago
  50. 5876744 Minor refactoring of RADDSTOREEXPMINUSMAX interface by Marat Dukhan · 2 years, 5 months ago
  51. ed90216 aarch64 transpose TBL microkernel by Alan Kelly · 2 years, 5 months ago
  52. 7c1115f Reoptimize microkernel selection for WAsm 1.0 by Marat Dukhan · 2 years, 5 months ago
  53. 7873586 Rename PLD to PRFM for aarch32 microkernels. by Frank Barchard · 2 years, 5 months ago
  54. 440e8ed Add FMAGIC/IMAGIC/LRINTF requantization variants in microkernel benchmarks by Marat Dukhan · 2 years, 5 months ago
  55. f721e37 LRINTF variants of scalar F32->QS8 and F32->QU8 VCVT microkernels by Marat Dukhan · 2 years, 5 months ago
  56. bdf1099 Refactor scalar F32->QS8 and F32->QU8 microkernels by Marat Dukhan · 2 years, 5 months ago
  57. 2ac722e Refactor requantization in scalar QS8/QC8/QU8 microkernels by Marat Dukhan · 2 years, 5 months ago
  58. ce834ad Refactor parameters in F32 VSIGMOID microkernels by Marat Dukhan · 2 years, 5 months ago
  59. 4a79ff2 Refactor parameters in F32 VELU microkernels by Marat Dukhan · 2 years, 5 months ago
  60. 9084fc8 Quantized Sigmoid and ELU benchmarks by Marat Dukhan · 2 years, 5 months ago
  61. 3ddc20c Benchmarks for Abs, Negate, and Square operators by Marat Dukhan · 2 years, 5 months ago
  62. 5c7fd89 Benchmark for Leaky ReLU operator by Marat Dukhan · 2 years, 5 months ago
  63. a0129e9 Refactor benchmarks for elementwise operators by Marat Dukhan · 2 years, 5 months ago
  64. e72b282 Refactor parameters in F32 VSQRT microkernels by Marat Dukhan · 2 years, 5 months ago
  65. 2894e99 Refactor F32 VLRELU microkernels by Marat Dukhan · 2 years, 5 months ago
  66. b7c1b71 Refactor F32->F16 VCVT microkernels by Marat Dukhan · 2 years, 5 months ago
  67. 134f984 Refactor F16->F32 VCVT microkernels by Marat Dukhan · 2 years, 5 months ago
  68. ef0f09c Add cpu clockrate to x16/x32_transpose benchmarks. by Frank Barchard · 2 years, 5 months ago
  69. 1945f0b SSE transpose x16 microkernel (4x8) by Alan Kelly · 2 years, 5 months ago
  70. 0d10cc7 Split VHSWISH parameter initialization functions per ISA by Marat Dukhan · 2 years, 5 months ago
  71. 4c61779 Minimally support WebAssembly Relaxed SIMD builds by Marat Dukhan · 2 years, 5 months ago
  72. e48b5c1 QS8 4x8 Neon Lane LD64 IGEMM AArch32 microkernel by Frank Barchard · 2 years, 5 months ago
  73. 4841021 QS8 4x8 dot product LD64 IGEMM AArch32 microkernel by Frank Barchard · 2 years, 5 months ago
  74. 9f3f420 QS8 4x8 LD64 dot product GEMM AArch32 microkernel by Frank Barchard · 2 years, 5 months ago
  75. 98393ad AVX512 QS8->F32 and QU8->F32 VCVT microkernels by Marat Dukhan · 2 years, 5 months ago
  76. fda06cb SSE transpose microkernel by Alan Kelly · 2 years, 5 months ago
  77. 7b5f779 AVX2 QS8->F32 and QU8->F32 VCVT microkernels by Marat Dukhan · 2 years, 5 months ago
  78. cd4089f AVX QS8->F32 and QU8->F32 VCVT microkernels by Marat Dukhan · 2 years, 5 months ago
  79. 2edf863 AVX512 F32->QS8 and F32->QU8 VCVT microkernels by Marat Dukhan · 2 years, 5 months ago
  80. 0d399ca AVX2 F32->QS8 and F32->QU8 VCVT microkernels by Marat Dukhan · 2 years, 5 months ago
  81. b91432c AVX F32->QS8 and F32->QU8 VCVT microkernels by Marat Dukhan · 2 years, 5 months ago
  82. 9820234 Full set of benchmarks for Convert operator by Marat Dukhan · 2 years, 5 months ago
  83. da7b2e2 QS8 4x8 lane GEMM AArch32 microkernel by Frank Barchard · 2 years, 5 months ago
  84. 710fb42 Benchmark for the Convert (F32->QS8) operator by Marat Dukhan · 2 years, 5 months ago
  85. 914f57b Aarch64 4x8 lane ld64 GEMM/IGEMM microkernels. by Frank Barchard · 2 years, 5 months ago
  86. ad6f2dc Benchmarks for QS8->F32 and QU8->F32 VCVT microkernels by Marat Dukhan · 2 years, 5 months ago
  87. 0f1ed94 QS8/QC8 GEMM/IGEMM WAsm SIMD microkernels using C2S4 layout by Marat Dukhan · 2 years, 6 months ago
  88. 8999190 Remove GEMMLOWP requantization from QS8 GEMM/IGEMM templates by Marat Dukhan · 2 years, 6 months ago
  89. 482508b Optimize FP32 requantization in ARMv7 NEON QS8/QU8 VMUL[C] by Marat Dukhan · 2 years, 6 months ago
  90. 430b173 F32->QS8/QU8 VCVT scalar microkernels using FP32 min/max by Marat Dukhan · 2 years, 6 months ago
  91. 5740f75 Fix trailing whitespace in VCVT benchmarks by Marat Dukhan · 2 years, 6 months ago
  92. 563eee1 Benchmarks for F32->QS8 and F32->QU8 VCVT microkernels by Marat Dukhan · 2 years, 6 months ago
  93. f82ea82 Add PRFM benchmarks for qs8 lane by Frank Barchard · 2 years, 6 months ago
  94. 27bf92c RNDNU versions of all Neon lane microkernels. by Frank Barchard · 2 years, 6 months ago
  95. 9cdc10d QU8 4x16 lane AArch64 NEON GEMM/IGEMM ld64 microkernel by Frank Barchard · 2 years, 6 months ago
  96. 5cffb64 4x16 lane AArch64 NEON GEMM/IGEMM ld64 microkernel by Frank Barchard · 2 years, 6 months ago
  97. 64ab1b7 LD1R and LD2R variants of c4 microkernel by Frank Barchard · 2 years, 6 months ago
  98. 15eec02 LD1R and LD2R variants of c2 microkernel by Frank Barchard · 2 years, 6 months ago
  99. 42f5c50 LOADDUP variant of c2 microkernel by Frank Barchard · 2 years, 6 months ago
  100. e22685a Remove padal from quantized microkernel names. by Frank Barchard · 2 years, 6 months ago