1. 801d2c2 Fix QS8 IGEMM with FP32 requantization for SSE/AVX/XOP by Marat Dukhan · 3 years, 5 months ago
  2. 0b04374 Support QC8 GEMM microkernels by Marat Dukhan · 3 years, 5 months ago
  3. 8b0e381 Remove bias_n accessor in GemmMicrokernelTester by Marat Dukhan · 3 years, 5 months ago
  4. e695791 4x16C4 QS8 IGEMM Cortex A55 microkernel reuse X10 to save push by Frank Barchard · 3 years, 5 months ago
  5. 4a2d255 Remove redundant SSSE3 microkernels with FP32 requantization by Marat Dukhan · 3 years, 5 months ago
  6. caf4831 FP32 requantization in QS8 DWCONV microkernels for SSE/AVX/XOP by Marat Dukhan · 3 years, 5 months ago
  7. c46e671 FP32 requantization in QS8 GEMM/IGEMM microkernels for SSE/AVX/XOP by Marat Dukhan · 3 years, 5 months ago
  8. c6e6ee0 Refactor RNDNA and RNDNU reference requantization by Marat Dukhan · 3 years, 5 months ago
  9. 062bee3 Evaluation stubs for RNDNU requantization by Marat Dukhan · 3 years, 5 months ago
  10. 0671624 Rename PRECISE requantization schema to RNDNA by Marat Dukhan · 3 years, 5 months ago
  11. 71855ee Support FP32 requantization in AVX512 QS8 microkernels by Marat Dukhan · 3 years, 5 months ago
  12. d4c7d82 AVX512-specific parameters for QS8 microkernels by Marat Dukhan · 3 years, 5 months ago
  13. 77ded05 Use byte-wide MIN/MAX in AVX512 QS8 DWCONV microkernels by Marat Dukhan · 3 years, 5 months ago
  14. 9b474cf Support FP32 requantization in AVX2 QS8 microkernels by Marat Dukhan · 3 years, 5 months ago
  15. a5d1261 Explicitly specify requantization in GEMM/IGEMM/DWCONV tests by Marat Dukhan · 3 years, 5 months ago
  16. 5ca0d8d Consolidate requantization structures and functions in a single header by Marat Dukhan · 3 years, 5 months ago
  17. 9976cd8 Rename Q31 requantization to GEMMLOWP requantization by Marat Dukhan · 3 years, 5 months ago
  18. e3d17bf Rename microkernel-related types and structures by Marat Dukhan · 3 years, 5 months ago
  19. b07c26a Rename QS8 GEMM/IGEMM/DWCONV microkernels by Marat Dukhan · 3 years, 5 months ago
  20. a0acc15 Use pointer to parameter initialization function in VMULCADDC microkernel tests by Marat Dukhan · 3 years, 5 months ago
  21. 104ae5e Use ISA-specific layouts in F32 [I]GEMM & DWCONV microkernels by Marat Dukhan · 3 years, 5 months ago
  22. 748fd12 Use specialized layouts in SSE4/AVX2 QS8 [I]GEMM & DWCONV microkernels by Marat Dukhan · 3 years, 5 months ago
  23. 725f47e Split QS8/QU8 GEMM parameter initialization by datatype by Marat Dukhan · 3 years, 5 months ago
  24. d5694df Use pointer to parameter initialization function in GEMM/IGEMM/DWCONV microkernel tests by Marat Dukhan · 3 years, 5 months ago
  25. d4416d6 4x16 QS8 microkernel for Cortex A53 by Frank Barchard · 3 years, 5 months ago
  26. f56f4c4 Refactor interface of microkernel parameter initialization by Marat Dukhan · 3 years, 5 months ago
  27. a6c0516 Migrate remaining CLAMP and HSWISH tests to VUNARY test gen by Marat Dukhan · 3 years, 5 months ago
  28. 6eaab71 Remove pointer casting in generated vector unary tests by Marat Dukhan · 3 years, 5 months ago
  29. 10f1fe0 Rename VBinOpMicrokernelTester -> VBinaryMicrokernelTester by Marat Dukhan · 3 years, 5 months ago
  30. 87ed45c Rename VUnOpMicrokernelTester -> VUnaryMicrokernelTester by Marat Dukhan · 3 years, 5 months ago
  31. 60d3f24 Migrate F32 VCLAMP microkernel tests to VUNARY test gen by Marat Dukhan · 3 years, 5 months ago
  32. 949b6e7 Migrate F32 HSWISH microkernel tests to VUNARY test gen by Marat Dukhan · 3 years, 5 months ago
  33. 4ed1488 QS8 DWCONV25 microkernels by Marat Dukhan · 3 years, 5 months ago
  34. d481c28 QS8 VADD microkernels by Marat Dukhan · 3 years, 6 months ago
  35. 047b620 Scalar QS8 GAVGPOOL microkernels by Marat Dukhan · 3 years, 6 months ago
  36. 4454288 Scalar QS8 DWCONV microkernels by Marat Dukhan · 3 years, 6 months ago
  37. a1a4e78 Scalar QS8 GEMM and IGEMM microkernels by Marat Dukhan · 3 years, 6 months ago
  38. 938ea81 Code generate 1x8C8 nicrokernel for Cortex A75 with and without prfm by Frank Barchard · 3 years, 6 months ago
  39. a91559a Move declarations of VHSWISH microkernels into vunary.h by Marat Dukhan · 3 years, 6 months ago
  40. 6674d69 Refactor naming of unary elementwise microkernels by Marat Dukhan · 3 years, 6 months ago
  41. dddb38f QS8 1x8C8 IGEMM microkernel for Cortex A53 by Frank Barchard · 3 years, 6 months ago
  42. 46a69c9 QS8 1x8C8 GEMM microkernel for Cortex A53 by Frank Barchard · 3 years, 6 months ago
  43. 5549735 4X8 and 4x16 mla lane microkernels for A53 by Frank Barchard · 3 years, 6 months ago
  44. d68e114 Cortex A53 tuned C8 gemm/igemm microkernels by Frank Barchard · 3 years, 6 months ago
  45. 1f51d38 Add prefetch to MLA lane microkernel by Frank Barchard · 3 years, 6 months ago
  46. 4a35204 PRFM variant of QS8 C8 Neon microkernel. by Frank Barchard · 3 years, 6 months ago
  47. 6e8c0ce Disable compilation of neondot microkernels for AArch32 iOS by Marat Dukhan · 3 years, 6 months ago
  48. 3fd4e27 XOP versions of QS8 DWCONV MUL32 microkernels by Marat Dukhan · 3 years, 7 months ago
  49. 2e42787 2x4c2/3x4c2 microkernels for SSE2/SSSE3/SSE4.1/AVX/XOP by Marat Dukhan · 3 years, 7 months ago
  50. 07feec8 MUL32 versions of SSE4.1 & AVX QS8 DWCONV microkernels by Marat Dukhan · 3 years, 7 months ago
  51. fa0ab85 AVX versions of QS8 DWCONV microkernels by Marat Dukhan · 3 years, 7 months ago
  52. e9c4b96 AVX versions of QS8 VADD/VADDC microkernels by Marat Dukhan · 3 years, 7 months ago
  53. a3c1633 AVX versions of QS8 GEMM/IGEMM microkernels by Marat Dukhan · 3 years, 7 months ago
  54. d23cb6e Fully Connected operator for QS8 datatype by Marat Dukhan · 3 years, 7 months ago
  55. b3ffd58 Implement bilinear upsampling for SSE architecture by Artsiom Ablavatski · 3 years, 7 months ago
  56. 6e35de5 QS8 1X8C8 IGEMM microkernel by Frank Barchard · 3 years, 7 months ago
  57. b876263 QS8 1X8C8 GEMM microkernel by Frank Barchard · 3 years, 7 months ago
  58. 967712d Limit range of test values for f16 binary minmax ops. by Frank Barchard · 3 years, 7 months ago
  59. cbb8e70 QS8 2x8c8-aarch64-neon-mlal-padal IGEMM microkernel by Frank Barchard · 3 years, 8 months ago
  60. 7ca54df QS8 2x8c16-aarch64-neondot-ld64 IGEMM microkernel by Frank Barchard · 3 years, 8 months ago
  61. 2f06150 xnn_qs8_gemm_minmax_ukernel_2x8c8__aarch64_neon_mlal_padal GEMM microkernel by Frank Barchard · 3 years, 8 months ago
  62. 1dc9fef QS8 2x8c8-aarch64 GEMM microkernel by Frank Barchard · 3 years, 8 months ago
  63. 671d1b0 QS8 4x16c4-aarch64-neondot-ld64 IGEMM microkernel by Frank Barchard · 3 years, 8 months ago
  64. 5655cb7 QS8 GEMM 2x8c16 MLAL PADAL assembly microkernel for AArch64 by Frank Barchard · 3 years, 8 months ago
  65. 89e12f8 QS8 IGEMM for Cortex A55 by Frank Barchard · 3 years, 8 months ago
  66. fb8d1f1 Increase minimum value to avoid f16_vrdivc producing inf by Frank Barchard · 3 years, 8 months ago
  67. 62b4ff7 Remove 12x8 QS8 GEMM and IGEMM Neon dotproduct microkernels. by Frank Barchard · 3 years, 8 months ago
  68. da78da1 QS8 C8 Neon microkernels with MUL and MLA versions. by Frank Barchard · 3 years, 8 months ago
  69. 4a4be4e QS8 1x16c4 ld32 GEMM microkernel using NEON dot product by Frank Barchard · 3 years, 8 months ago
  70. 02121ca QS8 Neon IGEMM microkernels with 8 bit MUL using DUP by Frank Barchard · 3 years, 8 months ago
  71. 77e93a2 Fix mismatch in block layout in mixed-layout Depth-To-Space operator by Marat Dukhan · 3 years, 8 months ago
  72. a5e242c QS8 LD32 GEMM microkernel for big cores with dotproduct by Frank Barchard · 3 years, 8 months ago
  73. 36f95cf QS8 Neon IGEMM C16 microkernel with two 8 bit multiplies and vpadal to accumulate. by Frank Barchard · 3 years, 8 months ago
  74. 71c4d1a QS8 Neon GEMM C16 microkernel with two 8 bit multiplies and vpadal to accumulate. by Frank Barchard · 3 years, 8 months ago
  75. 6d138db Remove scalar C4 QS8 and QU8 gemm microkernels. by Frank Barchard · 3 years, 8 months ago
  76. 6fa8078 QS8 C2 Neon igemm by Frank Barchard · 3 years, 8 months ago
  77. d79391d QS8 C8 Neon igemm by Frank Barchard · 3 years, 8 months ago
  78. c8532ae Unroll KC loop to do MULL and then MLAL to 16 bit before lengthening to 32 bit. by Frank Barchard · 3 years, 9 months ago
  79. 8247e21 C2 QS8 microkernel using mull then mlal with KC loop of 16 by Frank Barchard · 3 years, 9 months ago
  80. 5899012 QS8 Neon GEMM C8 microkernel with 8 bit multiply and vpadal to accumulate. by Frank Barchard · 3 years, 9 months ago
  81. 6d490f7 Change isfinite() to std::isfinite() by Anush Elangovan · 3 years, 9 months ago
  82. 2202c81 Implement bilinear upsampling (CHW layout) for ARM architecture by Artsiom Ablavatski · 3 years, 9 months ago
  83. 2302ffd QS8 Neon GEMM microkernel with 8 bit multiply and vpadal to accumulate by Frank Barchard · 3 years, 9 months ago
  84. ec0bf14 QS8 GEMM and IGEMM 3x8 3x16 and IGEMM 4x8 and 4x16 by Frank Barchard · 3 years, 9 months ago
  85. 4ecae2e QS8 Neon GEMM microkernel with 8 bit multiply by Frank Barchard · 3 years, 9 months ago
  86. cfbc849 Add 4x8 and 4x16 qs8 gemm microkernels by Frank Barchard · 3 years, 9 months ago
  87. c5704bf WebAssembly DWConv2D 3x3 stride 2 loadsplat by Frank Barchard · 3 years, 10 months ago
  88. c6889b3 WebAssembly DWConv2D 5x5 stride 2 loadsplat by Frank Barchard · 3 years, 10 months ago
  89. 02bb429 WebAssembly DWConv2D 3x3p1 adapted from NEON by Frank Barchard · 3 years, 10 months ago
  90. b20dcd6 WASMSIMD dwconv2d 5x5p2 use loadsplat by Frank Barchard · 3 years, 10 months ago
  91. 4eddb9c Fix incompatibility with Apple Clang in Subgraph tester by Marat Dukhan · 3 years, 10 months ago
  92. 802fcae Additional SSE/SSE2 GEMM/IGEMM microkernels by Marat Dukhan · 3 years, 11 months ago
  93. 412e2f4 Rename WASMSIMD dwconv2d functions to splat or loadsplat by Frank Barchard · 3 years, 11 months ago
  94. 3de5dfa Remove PSIMD dependency by Marat Dukhan · 3 years, 11 months ago
  95. b36582b Enable sparse inference by default by Marat Dukhan · 3 years, 11 months ago
  96. c10585f Minor refactoring of SubgraphTester by Marat Dukhan · 3 years, 11 months ago
  97. 54b2d54 Disable sparse graph rewriting for clusters with <= 2/3 zeroes by Marat Dukhan · 3 years, 11 months ago
  98. c763488 CONV2D HWC2CHW microkernel for ARM NEON by Marat Dukhan · 3 years, 11 months ago
  99. 0725b8d Rename WebAssembly SIMD source files and functions with x86 or arm suffix after wasmsimd by Frank Barchard · 3 years, 11 months ago
  100. 5d7ca1a Remove duplicate WASMSIMD dwconv2d 5x5s2 tests by Frank Barchard · 3 years, 11 months ago