1. 4a6dca9 Specify parameter initialization function in [P]AVGPOOL microkernel tests by Marat Dukhan · 2 years, 7 months ago
  2. 4e5a767 Rename xnn_f32_scaleminmax_params.sse2 to xnn_f32_scaleminmax_params.sse by Marat Dukhan · 2 years, 7 months ago
  3. 1bef0f2 Add JIT microkernels to QS8 GEMM benchmarks by Zhi An Ng · 2 years, 7 months ago
  4. 5d456ce Refactor naming of QS8/QU8 AVGPOOL parameters by Marat Dukhan · 2 years, 7 months ago
  5. 665cb23 Add JIT microkernels to F32 IGEMM benchmarks by Zhi An Ng · 2 years, 7 months ago
  6. cbe478a Generate QU8 GAVGPOOL tests from YAML specification by Marat Dukhan · 2 years, 7 months ago
  7. bf72b54 Split qc8-igemm-minmax-fp32.yaml into 2 files, all microkernels with c go into a separate file. by Zhi An Ng · 2 years, 7 months ago
  8. 49d94ca Split qc8-gemm-minmax-fp32.yaml into 2 files, all the microkernels with c goes into a separate file. by Zhi An Ng · 2 years, 7 months ago
  9. 25764d8 Add JIT microkernels to bench/f32-gemm by Zhi An Ng · 2 years, 7 months ago
  10. 0e0f726 Split qs8-gemm-minmax-rndnu.yaml into 2 files, all the microkernels with c2 suffix goes into a separate file. by Zhi An Ng · 2 years, 7 months ago
  11. c4302c2 AVX2 implementations of F16 GEMM/IGEMM microkernels by Marat Dukhan · 2 years, 7 months ago
  12. 0afdfab Fix incorrect JIT tests in QC8 GEMM FP32 by Zhi An Ng · 2 years, 7 months ago
  13. 842bea9 Remove F16 VRELU microkernels by Marat Dukhan · 2 years, 7 months ago
  14. 14dd8d0 Convert F16 parameter structures to unions by Marat Dukhan · 2 years, 7 months ago
  15. 16b734c Add more QC8 GEMM/IGEMM JIT microkernels. by Zhi An Ng · 2 years, 7 months ago
  16. ddc49c1 Update the list of supported architectures by Marat Dukhan · 2 years, 7 months ago
  17. 58b17ba Remove VSCALE microkernels by Marat Dukhan · 2 years, 7 months ago
  18. ed73fb6 Add qc8 gemm and igemm JIT microkernels by Zhi An Ng · 2 years, 7 months ago
  19. 29d9acd Implement vcvt vcvtn vmul_f32, these are used in qc8 microkernels. by Zhi An Ng · 2 years, 7 months ago
  20. 13b57dd Add more converted microkernels used in init.c. by Zhi An Ng · 2 years, 7 months ago
  21. 8a9eac6 Amalgamate AVX, AVX2, and FMA3 microkernels by Marat Dukhan · 2 years, 7 months ago
  22. 4a5c771 Refactor F32 RADDSTOREEXPMINUSMAX microkernels by Marat Dukhan · 2 years, 7 months ago
  23. 5999c92 Refactor naming of RADDSTOREEXPMINUSMAX microkernels by Marat Dukhan · 2 years, 7 months ago
  24. 5876744 Minor refactoring of RADDSTOREEXPMINUSMAX interface by Marat Dukhan · 2 years, 7 months ago
  25. d858522 Fix aarch64 builds with -fno-lax-vector-conversions by Zhi An Ng · 2 years, 7 months ago
  26. 68db12e Amalgamate F16C microkernels by Marat Dukhan · 2 years, 7 months ago
  27. ed90216 aarch64 transpose TBL microkernel by Alan Kelly · 2 years, 7 months ago
  28. f290a14 Enable QC8 4x8 mla lane assembler microkernel by Frank Barchard · 2 years, 7 months ago
  29. 0a40541 Use FMA instructions for scalar microkernels on RISC-V by Marat Dukhan · 2 years, 7 months ago
  30. f623740 QC8 NEON lane microkernels by Frank Barchard · 2 years, 7 months ago
  31. d8a1dbe Add RISC-V scalar microkernels to CMake build by Marat Dukhan · 2 years, 7 months ago
  32. a198f00 Initialize RISC-V microkernel pointers by Marat Dukhan · 2 years, 7 months ago
  33. 7c1115f Reoptimize microkernel selection for WAsm 1.0 by Marat Dukhan · 2 years, 7 months ago
  34. 7873586 Rename PLD to PRFM for aarch32 microkernels. by Frank Barchard · 2 years, 7 months ago
  35. 0ad4737 Minimally support RISC-V Bazel builds by Marat Dukhan · 2 years, 7 months ago
  36. 580292d Print some usage examples when called without arguments, also add a comment on how to use the script. by Zhi An Ng · 2 years, 7 months ago
  37. bd11e6a Add -fno-math-errno compilation option for scalar microkernels by Marat Dukhan · 2 years, 7 months ago
  38. 440e8ed Add FMAGIC/IMAGIC/LRINTF requantization variants in microkernel benchmarks by Marat Dukhan · 2 years, 7 months ago
  39. cccb012 Apply sort and formatting to ARM code by Frank Barchard · 2 years, 7 months ago
  40. 272d4d9 FP32 IMAGIC variants of scalar QC8/QS8/QU8 GEMM/IGEMM/DWCONV microkernels by Marat Dukhan · 2 years, 7 months ago
  41. f721e37 LRINTF variants of scalar F32->QS8 and F32->QU8 VCVT microkernels by Marat Dukhan · 2 years, 7 months ago
  42. bdf1099 Refactor scalar F32->QS8 and F32->QU8 microkernels by Marat Dukhan · 2 years, 7 months ago
  43. 74ddd27 Run formatting on generate-gemm-test.py by Zhi An Ng · 2 years, 7 months ago
  44. 2ac722e Refactor requantization in scalar QS8/QC8/QU8 microkernels by Marat Dukhan · 2 years, 7 months ago
  45. 0e80137 Refactor parameters in F32 VRND microkernels by Marat Dukhan · 2 years, 7 months ago
  46. d57a5ad Add const keyword for AVX load masks by Marat Dukhan · 2 years, 7 months ago
  47. bbfc27d Refactor NEON/NEONFMA VSIGMOID microkernels by Marat Dukhan · 2 years, 7 months ago
  48. 6853f23 Update amalgamated microkernels by Marat Dukhan · 2 years, 7 months ago
  49. ce834ad Refactor parameters in F32 VSIGMOID microkernels by Marat Dukhan · 2 years, 7 months ago
  50. 05b6cb1 Transpose microkernel tester uses iota instead of rng so that it's easier to debug tests by Alan Kelly · 2 years, 7 months ago
  51. 4a79ff2 Refactor parameters in F32 VELU microkernels by Marat Dukhan · 2 years, 7 months ago
  52. e5efb16 Refactor VUNARY microkernel parameters by Marat Dukhan · 2 years, 7 months ago
  53. a8b3994 Remove mask_table literal from F32 DWCONV AVX microkernels by Marat Dukhan · 2 years, 7 months ago
  54. 9084fc8 Quantized Sigmoid and ELU benchmarks by Marat Dukhan · 2 years, 7 months ago
  55. 3ddc20c Benchmarks for Abs, Negate, and Square operators by Marat Dukhan · 2 years, 7 months ago
  56. 5c7fd89 Benchmark for Leaky ReLU operator by Marat Dukhan · 2 years, 7 months ago
  57. a0129e9 Refactor benchmarks for elementwise operators by Marat Dukhan · 2 years, 7 months ago
  58. e72b282 Refactor parameters in F32 VSQRT microkernels by Marat Dukhan · 2 years, 7 months ago
  59. 98c5215 Move mask_table into VBINARY[C] AVX microkernel parameters by Marat Dukhan · 2 years, 7 months ago
  60. 0f28193 Minor optimization in F32 GEMM/IGEMM AVX512F microkernels by Marat Dukhan · 2 years, 7 months ago
  61. d57186a Refactor F32 VMULCADDC parameters by Marat Dukhan · 2 years, 7 months ago
  62. f600497 Refactor parameter initialization in Vector Binary Elementwise microkernels by Marat Dukhan · 2 years, 7 months ago
  63. b40ee63 Add comment about architecture for each set of inits for ARM. by Frank Barchard · 2 years, 7 months ago
  64. c83ef3b Refactor F32 MINMAX parameters for WAsm SIMD by Marat Dukhan · 2 years, 7 months ago
  65. 2894e99 Refactor F32 VLRELU microkernels by Marat Dukhan · 2 years, 7 months ago
  66. b7c1b71 Refactor F32->F16 VCVT microkernels by Marat Dukhan · 2 years, 7 months ago
  67. 7ddef84 Move mask_table into VCLAMP AVX microkernel parameters by Marat Dukhan · 2 years, 7 months ago
  68. e14e791 Move mask_table into VHSWISH AVX microkernel parameters by Marat Dukhan · 2 years, 7 months ago
  69. 134f984 Refactor F16->F32 VCVT microkernels by Marat Dukhan · 2 years, 7 months ago
  70. 3220551 Use --features wasm_simd rather than --copts -msimd128 for WAsm SIMD builds by Marat Dukhan · 2 years, 7 months ago
  71. 4d738ae Include vcvtnq_s32_f32 polyfill for older gcc versions by Marat Dukhan · 2 years, 7 months ago
  72. dc54e12 Replace vshll_n_u32(v, 0) with vmovl_u32 in C4/C4S2 GEMM/IGEMM microkernels by Marat Dukhan · 2 years, 7 months ago
  73. e1228e3 Fix .clang-format path by Marat Dukhan · 2 years, 7 months ago
  74. 2700809 Specify -mfp16-format=ieee for AArch32 GCC builds by Marat Dukhan · 2 years, 7 months ago
  75. 70137e4 Enable AArch32 QC8 neon dot product by Frank Barchard · 2 years, 7 months ago
  76. 87fe410 QC8 quantization for all aarch32 GEMM/IGEMM microkernels by Frank Barchard · 2 years, 7 months ago
  77. a2f1891 Add _prfm to names on Neon microkernels in a consistent way. by Frank Barchard · 2 years, 7 months ago
  78. ef0f09c Add cpu clockrate to x16/x32_transpose benchmarks. by Frank Barchard · 2 years, 7 months ago
  79. 447aa7b #include allocator.h header to gemm tests. by Frank Barchard · 2 years, 7 months ago
  80. 1945f0b SSE transpose x16 microkernel (4x8) by Alan Kelly · 2 years, 7 months ago
  81. 57719a2 clang-format for microkernels by Alan Kelly · 2 years, 7 months ago
  82. 0d10cc7 Split VHSWISH parameter initialization functions per ISA by Marat Dukhan · 2 years, 7 months ago
  83. b43b47a Add a script to convert existing assembly microkernels to JIT codegen. by Zhi An Ng · 2 years, 7 months ago
  84. 561d068 Refactor parameter initialization for VHSWISH microkernels by Marat Dukhan · 2 years, 7 months ago
  85. e4d3f76 Mark aarch64 microkernels as assembly for tests by Frank Barchard · 2 years, 7 months ago
  86. 7a03a0f Merge pull request #2191 from xbwee1024:bugfix by XNNPACK Team · 2 years, 7 months ago
  87. 1c852c9 Enable PRFM version of QS8 4x8 lane AArch32 microkernels by Frank Barchard · 2 years, 7 months ago
  88. 0db2e4c Support - (minus) operator for creating S/D register lists, this looks closer to native assembly. by Zhi An Ng · 2 years, 7 months ago
  89. 2493de9 WAsmSIMD transpose microkernel by Alan Kelly · 2 years, 7 months ago
  90. 77b694c Fixes style issues with SSE microkernel by Alan Kelly · 2 years, 7 months ago
  91. 691ec40 Use proper intrinsics header in SSE F32 VHSWISH microkernels by Marat Dukhan · 2 years, 7 months ago
  92. c025831 Refactor declarations of parameter initialization functions by Marat Dukhan · 2 years, 7 months ago
  93. 51c6134 Amalgamate SSE and AVX512 microkernels for TFLite build by Marat Dukhan · 2 years, 7 months ago
  94. e0f15ad Split scalar production microkernels into portable, AArch32, and Wasm by Marat Dukhan · 2 years, 7 months ago
  95. c80ffb0 Fix generation of gemm tests for ADJBLOCK and rerun scripts. by Zhi An Ng · 2 years, 8 months ago
  96. f98f58d Lowering to c++11 as c++14 literals was converted to c++11 in #2192 by xbwee · 2 years, 8 months ago
  97. 0fd983b Adds -Wcast-qual flag to detect cast dropping const. by Alan Kelly · 2 years, 8 months ago
  98. f527d56 Avoid using C++14 features in AArch32 assembler test by Marat Dukhan · 2 years, 8 months ago
  99. 562112e Fix build error with cmake for src/jit. by xbwee · 2 years, 8 months ago
  100. 19bfefe Support Relaxed SIMD in xnnpack_cc_library and xnnpack_aggregate_library by Marat Dukhan · 2 years, 8 months ago