1. 3eac69c Optimized QU8 VADD[C] microkernels for SSE4/AVX/XOP/AVX2 by Marat Dukhan · 3 years, 4 months ago
  2. db007cd QU8 Add ND operator by Marat Dukhan · 3 years, 4 months ago
  3. 76e78c8 Generalize QS8 VADD[C] templates to cover QU8 VADD[C] microkernels by Marat Dukhan · 3 years, 4 months ago
  4. 7679b1e Optimize QS8 VADD[C] microkernels for SSE4/AVX/XOP/AVX2 by Marat Dukhan · 3 years, 4 months ago
  5. 6691324 Split initialization function for QS8 VADD parameters by Marat Dukhan · 3 years, 4 months ago
  6. 288ecd4 Use function pointer to initialize microkernel parameters in QS8 Addition operator by Marat Dukhan · 3 years, 4 months ago
  7. 6e0fc39 Relax initialization of Quantized Addition microkernel parameters by Marat Dukhan · 3 years, 4 months ago
  8. 4ba70b7 QS8/QC8 NEON microkernels using 8x8->16-bit multiplication by Marat Dukhan · 3 years, 4 months ago
  9. e903dff QS8 GEMM/IGEMM microkernels with RNDNU requantization by Marat Dukhan · 3 years, 4 months ago
  10. be18f5c QS8 DWCONV microkernels with RNDNU requantization by Marat Dukhan · 3 years, 4 months ago
  11. 43b46ee Use generated QU8 GEMM/IGEMM/DWCONV microkernels on ARM by Marat Dukhan · 3 years, 4 months ago
  12. 3d76e55 Reoptimize microkernel selection for WAsm MVP by Marat Dukhan · 3 years, 4 months ago
  13. 8172135 Use generated QU8 GEMM/IGEMM/DWCONV microkernels on ARM64 by Marat Dukhan · 3 years, 4 months ago
  14. a97e975 Initialize QU8 microkernels for WebAssembly SIMD by Marat Dukhan · 3 years, 4 months ago
  15. abee3a7 Enable optimized QU8 microkernels on x86/x86-64 by Marat Dukhan · 3 years, 4 months ago
  16. 3cf2e22 QU8 GEMM/IGEMM microkernels for AVX512 by Marat Dukhan · 3 years, 4 months ago
  17. 902ef7f QU8 GEMM/IGEMM AVX2 microkernels with FP32 requantization by Marat Dukhan · 3 years, 5 months ago
  18. ef47f8d QU8 GEMM/IGEMM microkernels for SSE/AVX/XOP with FP32 requantization by Marat Dukhan · 3 years, 5 months ago
  19. cdbe9a3 Code-generate QU8 GEMM and IGEMM microkernels for SSE2/SSSE3/SSE4.1 by Marat Dukhan · 3 years, 5 months ago
  20. c698c11 Refactor xnn_qu8_conv_minmax_params by Marat Dukhan · 3 years, 5 months ago
  21. c2e8f66 Unify naming of QU8 GEMM/IGEMM/DWCONV microkernels with QS8/QC8 by Marat Dukhan · 3 years, 5 months ago
  22. 75d1b79 Enable assembly implementations of QC8 microkernels for ARM64 by Marat Dukhan · 3 years, 5 months ago
  23. 898d585 Initialize QC8 microkernels on ARM/ARM64/WAsm/WAsm SIMD by Marat Dukhan · 3 years, 5 months ago
  24. 79cd5f9 FP32 LD128 IGEMM for Cortex X1 by Frank Barchard · 3 years, 5 months ago
  25. 0ae35f2 QS8 LD128 GEMM/IGEMM dot product 4x16 microkernel by Frank Barchard · 3 years, 5 months ago
  26. 143a110 Rename GEMM/IGEMM microkernels from Cortex-A57/A75 to prfm_cortex_a75 by Frank Barchard · 3 years, 5 months ago
  27. 5e35386 Initialize QC8 microkernels on x86/x86-64 by Marat Dukhan · 3 years, 5 months ago
  28. 990b2af Enable LD128 GEMM microkernel for Cortex X1 by Frank Barchard · 3 years, 5 months ago
  29. 7b48ddc Enable FP16 GEMM/IGEMM To use A75 microkernel on Cortex X1 by Frank Barchard · 3 years, 5 months ago
  30. d2f454e Enable LD64 GEMM microkernel for Cortex A55r0 by Frank Barchard · 3 years, 5 months ago
  31. 07f4a89 Enable A75 GEMM microkernel for FP16 by Frank Barchard · 3 years, 5 months ago
  32. 7c3826e Initialize FP16 GEMM microkernel for Cortex A55r1 by Frank Barchard · 3 years, 5 months ago
  33. 4a2d255 Remove redundant SSSE3 microkernels with FP32 requantization by Marat Dukhan · 3 years, 6 months ago
  34. caf4831 FP32 requantization in QS8 DWCONV microkernels for SSE/AVX/XOP by Marat Dukhan · 3 years, 6 months ago
  35. c46e671 FP32 requantization in QS8 GEMM/IGEMM microkernels for SSE/AVX/XOP by Marat Dukhan · 3 years, 6 months ago
  36. df94d98 Explicitly cast arguments in _InterlockedCompareExchangePointer call by Marat Dukhan · 3 years, 6 months ago
  37. 71855ee Support FP32 requantization in AVX512 QS8 microkernels by Marat Dukhan · 3 years, 6 months ago
  38. d4c7d82 AVX512-specific parameters for QS8 microkernels by Marat Dukhan · 3 years, 6 months ago
  39. 77ded05 Use byte-wide MIN/MAX in AVX512 QS8 DWCONV microkernels by Marat Dukhan · 3 years, 6 months ago
  40. 9b474cf Support FP32 requantization in AVX2 QS8 microkernels by Marat Dukhan · 3 years, 6 months ago
  41. e3d17bf Rename microkernel-related types and structures by Marat Dukhan · 3 years, 6 months ago
  42. b07c26a Rename QS8 GEMM/IGEMM/DWCONV microkernels by Marat Dukhan · 3 years, 6 months ago
  43. c58453f Initialize VMULCADDC microkernel parameters through function pointers by Marat Dukhan · 3 years, 6 months ago
  44. 104ae5e Use ISA-specific layouts in F32 [I]GEMM & DWCONV microkernels by Marat Dukhan · 3 years, 6 months ago
  45. 748fd12 Use specialized layouts in SSE4/AVX2 QS8 [I]GEMM & DWCONV microkernels by Marat Dukhan · 3 years, 6 months ago
  46. 725f47e Split QS8/QU8 GEMM parameter initialization by datatype by Marat Dukhan · 3 years, 6 months ago
  47. c5a7a39 Initialize GEMM/IGEMM/DWCONV microkernel parameters through function pointers by Marat Dukhan · 3 years, 6 months ago
  48. d4416d6 4x16 QS8 microkernel for Cortex A53 by Frank Barchard · 3 years, 6 months ago
  49. 32ff5ba Update init.c by Yuriy Chernyshov · 3 years, 6 months ago
  50. 814c249 Workaround -Wincompatible-pointer-types by Yuriy Chernyshov · 3 years, 6 months ago
  51. 60d3f24 Migrate F32 VCLAMP microkernel tests to VUNARY test gen by Marat Dukhan · 3 years, 6 months ago
  52. 4ed1488 QS8 DWCONV25 microkernels by Marat Dukhan · 3 years, 6 months ago
  53. 15a35c0 Optimize QS8 GEMM/IGEMM selection on RISC-V by Marat Dukhan · 3 years, 6 months ago
  54. 803c1f8 Enable QS8 microkernels on WAsm and RISC-V by Marat Dukhan · 3 years, 6 months ago
  55. b77be26 Enable 1x8C8 a75 nicrokernel for Cortex A75 by Frank Barchard · 3 years, 6 months ago
  56. e3cb19b Minimal RISC-V support by Marat Dukhan · 3 years, 6 months ago
  57. a91559a Move declarations of VHSWISH microkernels into vunary.h by Marat Dukhan · 3 years, 6 months ago
  58. 6674d69 Refactor naming of unary elementwise microkernels by Marat Dukhan · 3 years, 6 months ago
  59. ec51a4e Enable QS8 1x8C8 IGEMM microkernel for Cortex A53 by Frank Barchard · 3 years, 7 months ago
  60. 21acdd0 Enable QS8 1x8C8 GEMM microkernel for Cortex A53. by Frank Barchard · 3 years, 7 months ago
  61. 184a8e1 Enable A53 C8 microkernel load A with ldr/ldr/ins by Frank Barchard · 3 years, 7 months ago
  62. 90f520b Enable Cortex A53 tuned C8 gemm/igemm microkernels for Cortex A53 and Cortex A55r0 by Frank Barchard · 3 years, 7 months ago
  63. fb5983d Enable prefetch to MLA lane microkernel on Cortex A53 by Frank Barchard · 3 years, 7 months ago
  64. 3dd80b3 Fix allocator initialize issue on Windows by Larry Liu · 3 years, 7 months ago
  65. 6ac1d18 Cortex A53 used MLAL lane by Frank Barchard · 3 years, 7 months ago
  66. c77fc4c Bug fix add missing break for qs8 select on big core. by Frank Barchard · 3 years, 7 months ago
  67. ec56b7e Avoid selection of NEON-DOT microkernels on AArch32 iOS by Marat Dukhan · 3 years, 7 months ago
  68. 2a995e7 Enable PRFM variant of QS8 C8 Neon microkernel on Cortex A53, A72, A73 and Kryo. by Frank Barchard · 3 years, 7 months ago
  69. 3fd4e27 XOP versions of QS8 DWCONV MUL32 microkernels by Marat Dukhan · 3 years, 7 months ago
  70. 4181f94 Optimize QS8 GEMM/IGEMM microkernel selection for AVX by Marat Dukhan · 3 years, 7 months ago
  71. 496389f Make xnn_initialize thread-safe by Marat Dukhan · 3 years, 7 months ago
  72. 07feec8 MUL32 versions of SSE4.1 & AVX QS8 DWCONV microkernels by Marat Dukhan · 3 years, 8 months ago
  73. fa0ab85 AVX versions of QS8 DWCONV microkernels by Marat Dukhan · 3 years, 8 months ago
  74. e9c4b96 AVX versions of QS8 VADD/VADDC microkernels by Marat Dukhan · 3 years, 8 months ago
  75. a3c1633 AVX versions of QS8 GEMM/IGEMM microkernels by Marat Dukhan · 3 years, 8 months ago
  76. b3ffd58 Implement bilinear upsampling for SSE architecture by Artsiom Ablavatski · 3 years, 8 months ago
  77. bbf5182 Enable QS8 2x8c8-aarch64-neon-mlal-padal GEMM / IGEMM microkernels by Frank Barchard · 3 years, 8 months ago
  78. 4610854 Disable QS8 1x16c4-aarch64-neondot-ld64 IGEMM microkernel by Frank Barchard · 3 years, 8 months ago
  79. 3522c0a Enable QS8 4x16c4-aarch64-neondot-ld64 IGEMM microkernel by Frank Barchard · 3 years, 8 months ago
  80. b75840f Enable QS8 IGEMM for Cortex A55 by Frank Barchard · 3 years, 8 months ago
  81. fb0ab0b QS8 enable 4x8c4__neondot for ARM32 by Frank Barchard · 3 years, 9 months ago
  82. a414daa Enable Quantized C2 microkernel for Neon by Frank Barchard · 3 years, 9 months ago
  83. 4baa2ac Process 32 pixels at a time in ARM64 SpMM microkernels by Marat Dukhan · 3 years, 9 months ago
  84. 2d6bcbb Reorder a few gemm1 initializations to match end to end order of gemm, igemm, gemm1, igemm1 by Jared Duke · 3 years, 9 months ago
  85. 9b7562b Reorder a few gemm1 initializations to match end to end order of gemm, igemm, gemm1, igemm1 by Frank Barchard · 3 years, 9 months ago
  86. 2202c81 Implement bilinear upsampling (CHW layout) for ARM architecture by Artsiom Ablavatski · 3 years, 10 months ago
  87. b94e34b QS8 GEMM select 2x16 for Neon MLAL. by Frank Barchard · 3 years, 10 months ago
  88. dfe47b9 Use iOS microkernels for Apple Silicon Macs by Marat Dukhan · 4 years ago
  89. 412e2f4 Rename WASMSIMD dwconv2d functions to splat or loadsplat by Frank Barchard · 4 years ago
  90. cfbed0a Disable sparse graph rewriting on x86 with AVX+ by Marat Dukhan · 4 years ago
  91. 6f7d4a2 Remove unused input_width_tile from dwconv2d_chw_parameters by Marat Dukhan · 4 years ago
  92. 4ddfab4 Optimize CHW microkernel selection for pre-NEON AArch32 by Marat Dukhan · 4 years ago
  93. c763488 CONV2D HWC2CHW microkernel for ARM NEON by Marat Dukhan · 4 years ago
  94. 3e91338 Initialize pointers to NEON CHW microkernels by Marat Dukhan · 4 years ago
  95. 0725b8d Rename WebAssembly SIMD source files and functions with x86 or arm suffix after wasmsimd by Frank Barchard · 4 years ago
  96. ff0624e Add WebAssembly dwconv2d_chw_3x3s2p1 benchmark by Frank Barchard · 4 years ago
  97. b6bd4bc Implement ELU operator by Marat Dukhan · 4 years ago
  98. 048931b Extract memcpy wrapper used by Copy operator into a microkernel by Marat Dukhan · 4 years ago
  99. 2213606 xnn_f32_conv_hwc2chw_ukernel_3x3s2p1c3x4__wasmsimd_2x2 based on SSE version by Frank Barchard · 4 years ago
  100. 97883b8 Enable dwconv2d_chw_3x3p1__wasmsimd_x86_2x4 microkernel by Frank Barchard · 4 years ago