  1. 8f2eeee Skip calling __builtin_clear_cache on iOS, iOS uses sys_cache_invalidate by Zhi An Ng · 2 years, 10 months ago
  2. e7242ea Replicate QS8/QU8 ADDSUB WAsm SIMD parameters to 64 bit rather than 128 bit by Marat Dukhan · 2 years, 10 months ago
  3. 48d74c3 Replicate QC8/QS8/QU8 CONV WAsm SIMD parameters to 64 bit rather than 128 bit by Marat Dukhan · 2 years, 10 months ago
  4. b402cbe Bump shard counts for qs8_igemm_minmax_rndnu_test by Zhi An Ng · 2 years, 10 months ago
  5. 44616e1 Bump shard counts for qs8_gemm_minmax_rndnu_test, the test sometimes times out in coverage runs. by Zhi An Ng · 2 years, 10 months ago
  6. 9441d46 clang-format indents case labels within switches, matching existing XNNPACK style. by Alan Kelly · 2 years, 10 months ago
  7. c27f04b Add missing generated unit tests to BUILD and CMakeLists.txt. by Zhi An Ng · 2 years, 10 months ago
  8. d6e2e1a Remove xnn_qu8_quantize_avgpool and xnn_qs8_quantize_avgpool helpers by Marat Dukhan · 2 years, 10 months ago
  9. 50323b8 Combine requantization with parameter initialization in unit tests by Marat Dukhan · 2 years, 10 months ago
  10. bd7f9a4 F16C implementation of F16 PRELU microkernels by Marat Dukhan · 2 years, 10 months ago
  11. 4c1fd6f Allow generate-gemm-test.py to accept multiple output files, and shard the generated tests across all specified output files. by Zhi An Ng · 2 years, 10 months ago
  12. 4897670 Re-enable up to AVX2 microkernels on Android x86/x86-64 & iOS simulator builds by Marat Dukhan · 2 years, 10 months ago
  13. 603ec5f Remove unused declarations for F16 VRELU microkernels by Marat Dukhan · 2 years, 10 months ago
  14. 085102c Reoptimize pointer updates in PRELU microkernels by Marat Dukhan · 2 years, 10 months ago
  15. 3ab63b0 Rollback "Enable up to AVX2 microkernels on Android x86/x86-64 builds" by XNNPACK Team · 2 years, 10 months ago
  16. d454545 F16C implementation of F16 VBINARY[C] microkernels by Marat Dukhan · 2 years, 10 months ago
  17. 717665f Add JIT microkernels to F32 GEMM E2E benchmarks by Zhi An Ng · 2 years, 10 months ago
  18. 1f1ee2c Enable up to AVX2 microkernels on Android x86/x86-64 builds by Marat Dukhan · 2 years, 10 months ago
  19. a0b45e5 Allow overriding logging settings in Bazel by Marat Dukhan · 2 years, 10 months ago
  20. d90af6f Move gemm-microkernel-tester test code into separate cc file by Zhi An Ng · 2 years, 10 months ago
  21. c7e534f Bump shard_count for slow subtract_nd_test by Zhi An Ng · 2 years, 10 months ago
  22. a30e2df Fix QU8 E2E lane benchmark tile sizes by Frank Barchard · 2 years, 10 months ago
  23. 969e61f Enable 2x16 for QU8 neon lane microkernel in AArch32 by Frank Barchard · 2 years, 10 months ago
  24. 2780863 Scalar transpose microkernel by Alan Kelly · 2 years, 10 months ago
  25. 49979b6 Implement vldr for S registers by Zhi An Ng · 2 years, 10 months ago
  26. a72cde3 Reoptimize pointer updates in VMULCADDC microkernels by Marat Dukhan · 2 years, 10 months ago
  27. e8c1979 Add enable_jit to various targets in BUILD by Zhi An Ng · 2 years, 10 months ago
  28. a248337 Split more of qs8-gemm-minmax-rndnu out into another file, for microkernels with "c4" by Zhi An Ng · 2 years, 10 months ago
  29. 4c738f0 Fix wrong WAsm SIMD parameter initialization in f32-spmm-minmax.yaml by Marat Dukhan · 2 years, 10 months ago
  30. d5a5333 Additional tile sizes for QU8 neon lane microkernel. by Frank Barchard · 2 years, 10 months ago
  31. 751f622 F16C implementation of F16 VHSWISH microkernels by Marat Dukhan · 2 years, 10 months ago
  32. 645af97 FMA3 implementation of F16 DWCONV/VCLAMP/VMULCADDC microkernels by Marat Dukhan · 2 years, 10 months ago
  33. 8459822 Split F32 SCALEMINMAX parameter initialization functions by ISA by Marat Dukhan · 2 years, 10 months ago
  34. f2e2edf Round results to FP16 after multiplication by scale in AVX2 F16 GEMM/IGEMM by Marat Dukhan · 2 years, 10 months ago
  35. ef5560d Use ISA-specific parameter initialization functions in F32 PAVGPOOL tests by Marat Dukhan · 2 years, 10 months ago
  36. 3c949a3 Split QS8/QU8 AVGPOOL parameter initialization functions by ISA by Marat Dukhan · 2 years, 10 months ago
  37. 9f8eac7 Avoid _mm_loadu_si16 and _mm_storeu_si16 unsupported on older compilers by Marat Dukhan · 2 years, 10 months ago
  38. 48c5e98 Fix CMake build on x86-64 by Marat Dukhan · 2 years, 10 months ago
  39. da382d1 Refactor parameter initialization for AVGPOOL/GAVGPOOL/PAVGPOOL microkernels by Marat Dukhan · 2 years, 10 months ago
  40. a7d74b1 Specify parameter initialization function in GAVGPOOL microkernel tests by Marat Dukhan · 2 years, 10 months ago
  41. 4a6dca9 Specify parameter initialization function in [P]AVGPOOL microkernel tests by Marat Dukhan · 2 years, 10 months ago
  42. 4e5a767 Rename xnn_f32_scaleminmax_params.sse2 to xnn_f32_scaleminmax_params.sse by Marat Dukhan · 2 years, 10 months ago
  43. 1bef0f2 Add JIT microkernels to QS8 GEMM benchmarks by Zhi An Ng · 2 years, 10 months ago
  44. 5d456ce Refactor naming of QS8/QU8 AVGPOOL parameters by Marat Dukhan · 2 years, 10 months ago
  45. 665cb23 Add JIT microkernels to F32 IGEMM benchmarks by Zhi An Ng · 2 years, 10 months ago
  46. cbe478a Generate QU8 GAVGPOOL tests from YAML specification by Marat Dukhan · 2 years, 10 months ago
  47. bf72b54 Split qc8-igemm-minmax-fp32.yaml into 2 files, all microkernels with c go into a separate file. by Zhi An Ng · 2 years, 10 months ago
  48. 49d94ca Split qc8-gemm-minmax-fp32.yaml into 2 files, all the microkernels with c go into a separate file. by Zhi An Ng · 2 years, 10 months ago
  49. 25764d8 Add JIT microkernels to bench/f32-gemm by Zhi An Ng · 2 years, 11 months ago
  50. 0e0f726 Split qs8-gemm-minmax-rndnu.yaml into 2 files, all the microkernels with c2 suffix go into a separate file. by Zhi An Ng · 2 years, 11 months ago
  51. c4302c2 AVX2 implementations of F16 GEMM/IGEMM microkernels by Marat Dukhan · 2 years, 11 months ago
  52. 0afdfab Fix incorrect JIT tests in QC8 GEMM FP32 by Zhi An Ng · 2 years, 11 months ago
  53. 842bea9 Remove F16 VRELU microkernels by Marat Dukhan · 2 years, 11 months ago
  54. 14dd8d0 Convert F16 parameter structures to unions by Marat Dukhan · 2 years, 11 months ago
  55. 16b734c Add more QC8 GEMM/IGEMM JIT microkernels. by Zhi An Ng · 2 years, 11 months ago
  56. ddc49c1 Update the list of supported architectures by Marat Dukhan · 2 years, 11 months ago
  57. 58b17ba Remove VSCALE microkernels by Marat Dukhan · 2 years, 11 months ago
  58. ed73fb6 Add qc8 gemm and igemm JIT microkernels by Zhi An Ng · 2 years, 11 months ago
  59. 29d9acd Implement vcvt vcvtn vmul_f32, these are used in qc8 microkernels. by Zhi An Ng · 2 years, 11 months ago
  60. 13b57dd Add more converted microkernels used in init.c. by Zhi An Ng · 2 years, 11 months ago
  61. 8a9eac6 Amalgamate AVX, AVX2, and FMA3 microkernels by Marat Dukhan · 2 years, 11 months ago
  62. 4a5c771 Refactor F32 RADDSTOREEXPMINUSMAX microkernels by Marat Dukhan · 2 years, 11 months ago
  63. 5999c92 Refactor naming of RADDSTOREEXPMINUSMAX microkernels by Marat Dukhan · 2 years, 11 months ago
  64. 5876744 Minor refactoring of RADDSTOREEXPMINUSMAX interface by Marat Dukhan · 2 years, 11 months ago
  65. d858522 Fix aarch64 builds with -fno-lax-vector-conversions by Zhi An Ng · 2 years, 11 months ago
  66. 68db12e Amalgamate F16C microkernels by Marat Dukhan · 2 years, 11 months ago
  67. ed90216 aarch64 transpose TBL microkernel by Alan Kelly · 2 years, 11 months ago
  68. f290a14 Enable QC8 4x8 mla lane assembler microkernel by Frank Barchard · 2 years, 11 months ago
  69. 0a40541 Use FMA instructions for scalar microkernels on RISC-V by Marat Dukhan · 2 years, 11 months ago
  70. f623740 QC8 NEON lane microkernels by Frank Barchard · 2 years, 11 months ago
  71. d8a1dbe Add RISC-V scalar microkernels to CMake build by Marat Dukhan · 2 years, 11 months ago
  72. a198f00 Initialize RISC-V microkernel pointers by Marat Dukhan · 2 years, 11 months ago
  73. 7c1115f Reoptimize microkernel selection for WAsm 1.0 by Marat Dukhan · 2 years, 11 months ago
  74. 7873586 Rename PLD to PRFM for aarch32 microkernels. by Frank Barchard · 2 years, 11 months ago
  75. 0ad4737 Minimally support RISC-V Bazel builds by Marat Dukhan · 2 years, 11 months ago
  76. 580292d Print some usage examples when called without arguments, also add a comment on how to use the script. by Zhi An Ng · 2 years, 11 months ago
  77. bd11e6a Add -fno-math-errno compilation option for scalar microkernels by Marat Dukhan · 2 years, 11 months ago
  78. 440e8ed Add FMAGIC/IMAGIC/LRINTF requantization variants in microkernel benchmarks by Marat Dukhan · 2 years, 11 months ago
  79. cccb012 Apply sort and formatting to ARM code by Frank Barchard · 2 years, 11 months ago
  80. 272d4d9 FP32 IMAGIC variants of scalar QC8/QS8/QU8 GEMM/IGEMM/DWCONV microkernels by Marat Dukhan · 2 years, 11 months ago
  81. f721e37 LRINTF variants of scalar F32->QS8 and F32->QU8 VCVT microkernels by Marat Dukhan · 2 years, 11 months ago
  82. bdf1099 Refactor scalar F32->QS8 and F32->QU8 microkernels by Marat Dukhan · 2 years, 11 months ago
  83. 74ddd27 Run formatting on generate-gemm-test.py by Zhi An Ng · 2 years, 11 months ago
  84. 2ac722e Refactor requantization in scalar QS8/QC8/QU8 microkernels by Marat Dukhan · 2 years, 11 months ago
  85. 0e80137 Refactor parameters in F32 VRND microkernels by Marat Dukhan · 2 years, 11 months ago
  86. d57a5ad Add const keyword for AVX load masks by Marat Dukhan · 2 years, 11 months ago
  87. bbfc27d Refactor NEON/NEONFMA VSIGMOID microkernels by Marat Dukhan · 2 years, 11 months ago
  88. 6853f23 Update amalgamated microkernels by Marat Dukhan · 2 years, 11 months ago
  89. ce834ad Refactor parameters in F32 VSIGMOID microkernels by Marat Dukhan · 2 years, 11 months ago
  90. 05b6cb1 Transpose microkernel tester uses iota instead of rng so that it's easier to debug tests by Alan Kelly · 2 years, 11 months ago
  91. 4a79ff2 Refactor parameters in F32 VELU microkernels by Marat Dukhan · 2 years, 11 months ago
  92. e5efb16 Refactor VUNARY microkernel parameters by Marat Dukhan · 2 years, 11 months ago
  93. a8b3994 Remove mask_table literal from F32 DWCONV AVX microkernels by Marat Dukhan · 2 years, 11 months ago
  94. 9084fc8 Quantized Sigmoid and ELU benchmarks by Marat Dukhan · 2 years, 11 months ago
  95. 3ddc20c Benchmarks for Abs, Negate, and Square operators by Marat Dukhan · 2 years, 11 months ago
  96. 5c7fd89 Benchmark for Leaky ReLU operator by Marat Dukhan · 2 years, 11 months ago
  97. a0129e9 Refactor benchmarks for elementwise operators by Marat Dukhan · 2 years, 11 months ago
  98. e72b282 Refactor parameters in F32 VSQRT microkernels by Marat Dukhan · 2 years, 11 months ago
  99. 98c5215 Move mask_table into VBINARY[C] AVX microkernel parameters by Marat Dukhan · 2 years, 11 months ago
  100. 0f28193 Minor optimization in F32 GEMM/IGEMM AVX512F microkernels by Marat Dukhan · 2 years, 11 months ago