1. 5756a92 F16 Max Pooling NHWC operator by Marat Dukhan · 2 years, 4 months ago
  2. 10f2bf8 F16 MAXPOOL microkernel for F16C by Marat Dukhan · 2 years, 4 months ago
  3. 0a756b5 F16 PReLU operator by Marat Dukhan · 2 years, 4 months ago
  4. 88d06fc Disable neondot microkernels on iOS 32 bit by Frank Barchard · 2 years, 4 months ago
  5. 16c0912 F16 MAXPOOL microkernel for NEON FP16ARITH by Marat Dukhan · 2 years, 4 months ago
  6. f30a859 Port aarch64 F32 IGEMM 1x8 A75 microkernel to JIT, add tests, benchmarks, enable in init.c if JIT is enabled by Zhi An Ng · 2 years, 4 months ago
  7. eb7256b Port F32 GEMM A75 1x8 microkernel to JIT and specialize for min/max, add tests and benchmarks by Zhi An Ng · 2 years, 4 months ago
  8. 6b72e6c Convert F32 IGEMM for A75 to JIT, add tests by Zhi An Ng · 2 years, 4 months ago
  9. f0f374f Rename f32-gemm/6x8-aarch64-neonfma-prfm-cortex-a75.cc to remove prfm from file name by Zhi An Ng · 2 years, 4 months ago
  10. 1d5c616 Enable QU8 AAarch microkernels based on uarch by Frank Barchard · 2 years, 4 months ago
  11. 34251d8 QS8 4x8 lane GEMM AArch32 microkernel for Cortex A7 by Frank Barchard · 2 years, 4 months ago
  12. 5ec5591 Fix tfjs build by adding dependency on jit by Zhi An Ng · 2 years, 4 months ago
  13. 101271e QC8 4x8 lane GEMM AArch32 microkernel for Cortex A7 by Frank Barchard · 2 years, 4 months ago
  14. 9e4d2aa QS8 4x8 lane GEMM AArch32 microkernel for Cortex A53 by Frank Barchard · 2 years, 4 months ago
  15. cfd947d Add neon zip microkernel generator by Alan Kelly · 2 years, 4 months ago
  16. a63651c Set F32 GEMM generator function for A75 if XNN_ENABLE_JIT is set (defaults to off) by Zhi An Ng · 2 years, 4 months ago
  17. d9aaf69 Explicitly disable -ffast-math for scalar & WAsm microkernels by Marat Dukhan · 2 years, 4 months ago
  18. f9fc9ec Integrate JIT generated GEMM microkernels into create_convolution2d_nhwc by Zhi An Ng · 2 years, 4 months ago
  19. 348c377 QU8 GEMM/IGEMM WAsm SIMD microkernels with SR=4 by Marat Dukhan · 2 years, 4 months ago
  20. f2b233b Make SSE2 microkernels consistent with neon zip microkernels. - DEC is now MOV by Alan Kelly · 2 years, 4 months ago
  21. 8b758bf Integrate JIT generated GEMM microkernels into create_convolution2d_nhwc by XNNPACK Team · 2 years, 4 months ago
  22. df51e11 Integrate JIT generated GEMM microkernels into create_convolution2d_nhwc by Zhi An Ng · 2 years, 4 months ago
  23. 870108c QS8/QC8 4x8 dot product IGEMM AArch32 microkernel for Cortex A55 by Frank Barchard · 2 years, 4 months ago
  24. c2e2da8 Fix conversion script for aarch64 assembly kernels and convert a single F32 GEMM as a test by Zhi An Ng · 2 years, 4 months ago
  25. a1cad4a Add x8 transpose bench by Alan Kelly · 2 years, 4 months ago
  26. ba68f44 Add x64 transpose bench by Alan Kelly · 2 years, 4 months ago
  27. ac654f1 QC8 4x8 dot product GEMM AArch32 microkernel for Cortex A55 by Frank Barchard · 2 years, 4 months ago
  28. 708874b Add cpu configs to support iOS simulator builds on M1-based macs. by XNNPACK Team · 2 years, 4 months ago
  29. 0f294ad QS8 4x8 dot product GEMM AArch32 microkernel for Cortex A55 by Frank Barchard · 2 years, 4 months ago
  30. 0ba29e7 Implement LDP for aarch64 assembler by Zhi An Ng · 2 years, 4 months ago
  31. 109a5eb Initial aarch64 assembler structure by Zhi An Ng · 2 years, 4 months ago
  32. 8f920a6 Initialize F16 microkernel pointers on x86 by Marat Dukhan · 2 years, 4 months ago
  33. 901845c QU8 4x8 NEON MLA Lane microkernel AArch32 assembly language by Frank Barchard · 2 years, 4 months ago
  34. b26ead1 F16C implementation of F16 GAVGPOOL microkernels by Marat Dukhan · 2 years, 4 months ago
  35. c7c92b0 Generate F16 GAVGPOOL NEONFP16ARITH microkernels from template by Marat Dukhan · 2 years, 4 months ago
  36. e78eb33 Bump shard count for f32_igemm_minmax_test (timing out on coverage runs) by Zhi An Ng · 2 years, 4 months ago
  37. 13599f3 Specialize F32 GEMM (a53) on nc by Zhi An Ng · 2 years, 4 months ago
  38. d2e8d4d Enable QC8 AArch32 4x8 lane GEMM/IGEMM assembly microkernels for ARMv7 NEON by Frank Barchard · 2 years, 4 months ago
  39. 5e1a303 QC8 GEMM/IGEMM assembly microkernels for ARMv7 NEON by Frank Barchard · 2 years, 4 months ago
  40. 5da6d38 SSE2 transpose microkernel code generator. by Alan Kelly · 2 years, 4 months ago
  41. d19bde9 Add x64 scalar transpose microkernels by Alan Kelly · 2 years, 4 months ago
  42. cd21b02 Add x8 scalar transpose microkernels by Alan Kelly · 2 years, 4 months ago
  43. 84aae41 Add x16 scalar transpose microkernels by Alan Kelly · 2 years, 4 months ago
  44. af9ff85 Fix GEMM test templates to use variable n instead of fixed NR and regenerate tests by Zhi An Ng · 2 years, 4 months ago
  45. 8575504 Switch QS8/QU8 GAVGPOOL NEON microkernels to RNDNU requantization by Marat Dukhan · 2 years, 4 months ago
  46. 33a98fa Switch QS8/QU8 VMUL[C] NEON microkernels to RNDNU requantization by Marat Dukhan · 2 years, 4 months ago
  47. d1f53e4 Generate QU8 GAVGPOOL microkernels from QS8 GAVGPOOL templates by Marat Dukhan · 2 years, 4 months ago
  48. 9e258d6 Remove multi-accumulator support in QS8 GAVGPOOL microkernels by Marat Dukhan · 2 years, 4 months ago
  49. 7d45d90 Create a new jit-test for jit-related tests that are not architecture specific by Zhi An Ng · 2 years, 4 months ago
  50. 7781786 Enable QU8 3x8 lane for AArch32 by Frank Barchard · 2 years, 4 months ago
  51. d7a4b22 Generate missing QS8 GAVGPOOL microkernels by Marat Dukhan · 2 years, 4 months ago
  52. 847ff5e Refactor naming of QS8 GAVGPOOL microkernels by Marat Dukhan · 2 years, 4 months ago
  53. 53f4106 Switch QS8 GAVGPOOL microkernels to use FP32 requantization by Marat Dukhan · 2 years, 4 months ago
  54. 1a856c1 Change unit tests to depend on jit_test_mode by Zhi An Ng · 2 years, 4 months ago
  55. 8f2eeee Skip calling __builtin_clear_cache on iOS, iOS uses sys_cache_invalidate by Zhi An Ng · 2 years, 4 months ago
  56. b402cbe Bump shard counts for qs8_igemm_minmax_rndnu_test by Zhi An Ng · 2 years, 4 months ago
  57. 44616e1 Bump shard counts for qs8_gemm_minmax_rndnu_test, the test sometimes timeout in coverage runs. by Zhi An Ng · 2 years, 4 months ago
  58. c27f04b Add missing generated unit tests to BUILD and CMakeLists.txt. by Zhi An Ng · 2 years, 4 months ago
  59. bd7f9a4 F16C implementation of F16 PRELU microkernels by Marat Dukhan · 2 years, 4 months ago
  60. 4c1fd6f Allow generate-gemm-test.py to accept multiple output files, and shard the generated tests across all specified output files. by Zhi An Ng · 2 years, 4 months ago
  61. d454545 F16C implementation of F16 VBINARY[C] microkernels by Marat Dukhan · 2 years, 4 months ago
  62. 717665f Add JIT microkernels to F32 GEMM E2E benchmarks by Zhi An Ng · 2 years, 4 months ago
  63. a0b45e5 Allow overriding logging settings in Bazel by Marat Dukhan · 2 years, 4 months ago
  64. d90af6f Move gemm-microkernel-tester test code into separate cc file by Zhi An Ng · 2 years, 4 months ago
  65. c7e534f Bump shard_count for slow subtract_nd_test by Zhi An Ng · 2 years, 4 months ago
  66. 969e61f Enable 2x16 for QU8 neon lane microkernel in AArch32 by Frank Barchard · 2 years, 4 months ago
  67. 2780863 Scalar transpose microkernel by Alan Kelly · 2 years, 4 months ago
  68. e8c1979 Add enable_jit to various targets in BUILD by Zhi An Ng · 2 years, 4 months ago
  69. a248337 Split more of qs8-gemm-minmax-rndnu out into another file, for microkernels with "c4" by Zhi An Ng · 2 years, 4 months ago
  70. d5a5333 Additional tile sizes for QU8 neon lane microkernel. by Frank Barchard · 2 years, 4 months ago
  71. 751f622 F16C implementation of F16 VHSWISH microkernels by Marat Dukhan · 2 years, 4 months ago
  72. 645af97 FMA3 implementation of F16 DWCONV/VCLAMP/VMULCADDC microkernels by Marat Dukhan · 2 years, 4 months ago
  73. 1bef0f2 Add JIT microkernels to QS8 GEMM benchmarks by Zhi An Ng · 2 years, 5 months ago
  74. bf72b54 Split qc8-igemm-minmax-fp32.yaml into 2 files, all microkernels with c go into a separate file. by Zhi An Ng · 2 years, 5 months ago
  75. 49d94ca Split qc8-gemm-minmax-fp32.yaml into 2 files, all the microkernels with c goes into a separate file. by Zhi An Ng · 2 years, 5 months ago
  76. 25764d8 Add JIT microkernels to bench/f32-gemm by Zhi An Ng · 2 years, 5 months ago
  77. 0e0f726 Split qs8-gemm-minmax-rndnu.yaml into 2 files, all the microkernels with c2 suffix goes into a separate file. by Zhi An Ng · 2 years, 5 months ago
  78. c4302c2 AVX2 implementations of F16 GEMM/IGEMM microkernels by Marat Dukhan · 2 years, 5 months ago
  79. 842bea9 Remove F16 VRELU microkernels by Marat Dukhan · 2 years, 5 months ago
  80. 16b734c Add more QC8 GEMM/IGEMM JIT microkernels. by Zhi An Ng · 2 years, 5 months ago
  81. 58b17ba Remove VSCALE microkernels by Marat Dukhan · 2 years, 5 months ago
  82. ed73fb6 Add qc8 gemm and igemm JIT microkernels by Zhi An Ng · 2 years, 5 months ago
  83. 13b57dd Add more converted microkernels used in init.c. by Zhi An Ng · 2 years, 5 months ago
  84. 8a9eac6 Amalgamate AVX, AVX2, and FMA3 microkernels by Marat Dukhan · 2 years, 5 months ago
  85. 5999c92 Refactor naming of RADDSTOREEXPMINUSMAX microkernels by Marat Dukhan · 2 years, 5 months ago
  86. 68db12e Amalgamate F16C microkernels by Marat Dukhan · 2 years, 5 months ago
  87. ed90216 aarch64 transpose TBL microkernel by Alan Kelly · 2 years, 5 months ago
  88. f290a14 Enable QC8 4x8 mla lane assembler microkernel by Frank Barchard · 2 years, 5 months ago
  89. f623740 QC8 NEON lane microkernels by Frank Barchard · 2 years, 5 months ago
  90. a198f00 Initialize RISC-V microkernel pointers by Marat Dukhan · 2 years, 5 months ago
  91. 7c1115f Reoptimize microkernel selection for WAsm 1.0 by Marat Dukhan · 2 years, 5 months ago
  92. 7873586 Rename PLD to PRFM for aarch32 microkernels. by Frank Barchard · 2 years, 5 months ago
  93. bd11e6a Add -fno-math-errno compilation option for scalar microkernels by Marat Dukhan · 2 years, 5 months ago
  94. cccb012 Apply sort and formatting to ARM code by Frank Barchard · 2 years, 5 months ago
  95. 272d4d9 FP32 IMAGIC variants of scalar QC8/QS8/QU8 GEMM/IGEMM/DWCONV microkernels by Marat Dukhan · 2 years, 5 months ago
  96. f721e37 LRINTF variants of scalar F32->QS8 and F32->QU8 VCVT microkernels by Marat Dukhan · 2 years, 5 months ago
  97. bdf1099 Refactor scalar F32->QS8 and F32->QU8 microkernels by Marat Dukhan · 2 years, 5 months ago
  98. 2ac722e Refactor requantization in scalar QS8/QC8/QU8 microkernels by Marat Dukhan · 2 years, 5 months ago
  99. ce834ad Refactor parameters in F32 VSIGMOID microkernels by Marat Dukhan · 2 years, 5 months ago
  100. 3ddc20c Benchmarks for Abs, Negate, and Square operators by Marat Dukhan · 2 years, 5 months ago