  1. 48d74c3 Replicate QC8/QS8/QU8 CONV WAsm SIMD parameters to 64 bit rather than 128 bit by Marat Dukhan · 2 years, 9 months ago
  2. 7c1115f Reoptimize microkernel selection for WAsm 1.0 by Marat Dukhan · 2 years, 9 months ago
  3. 272d4d9 FP32 IMAGIC variants of scalar QC8/QS8/QU8 GEMM/IGEMM/DWCONV microkernels by Marat Dukhan · 2 years, 9 months ago
  4. 2ac722e Refactor requantization in scalar QS8/QC8/QU8 microkernels by Marat Dukhan · 2 years, 9 months ago
  5. 6150425 Disable MSan in AVX512SKX QS8/QC8/QU8 DWCONV microkernels by Marat Dukhan · 2 years, 9 months ago
  6. 7be427a Disable MSan and TSan in most microkernels with Out-of-Bounds reads by Marat Dukhan · 2 years, 9 months ago
  7. 03efa0f Reoptimize FP32 requantization in NEON QS8/QC8/QU8 GEMM/IGEMM/DWCONV by Marat Dukhan · 2 years, 10 months ago
  8. 5a31dc6 Optimize FP32 requantization in NEON QS8/QC8/QU8 GEMM/IGEMM/DWCONV by Marat Dukhan · 2 years, 10 months ago
  9. 7988a18 Refactoring xnn_qs8_minmax_params for NEON/NEONv8 by Marat Dukhan · 2 years, 10 months ago
  10. 8978ac2 Support requantization scale greater than 1 in RNDNU NEON microkernels by Marat Dukhan · 2 years, 10 months ago
  11. 13c9f8d Support requantization scale over 1 in SSE/AVX GEMM/IGEMM/DWCONV by Marat Dukhan · 2 years, 10 months ago
  12. 17a9e3f Remove GEMMLOWP requantization from QS8 DWCONV templates by Marat Dukhan · 2 years, 10 months ago
  13. 411c18d Optimize FP32 requantization in WAsm SIMD QS8/QC8/QU8 GEMM/IGEMM/DWCONV by Marat Dukhan · 2 years, 10 months ago
  14. 5f7cf55 Avoid using gcc-specific intrinsics in NEON microkernels by Marat Dukhan · 2 years, 10 months ago
  15. 0bf8afa Leverage f32x4.pmin and f32x4.pmax WAsm SIMD instructions by Marat Dukhan · 3 years ago
  16. 2aa2e2a q8 dwconv add channel tiles of 24 and 32 for mul16 rndnu microkernels by Frank Barchard · 3 years ago
  17. 4c49494 Fix crash on AArch32 in scalar quantized microkernels by Marat Dukhan · 3 years, 1 month ago
  18. 7a8dd87 Work around generating v128.storeXX_lane for quantized WAsm SIMD microkernels by Marat Dukhan · 3 years, 1 month ago
  19. 9cedb59 Accumulate in 16 bits once in WAsm SIMD MUL16 QS8/QC8 DWCONV before extending to 32 bits by Marat Dukhan · 3 years, 1 month ago
  20. 36fe5aa Remove WAsm SIMD QS8 DWCONV microkernels with GEMMLOWP requantization by Marat Dukhan · 3 years, 2 months ago
  21. 07706f6 Replace generic shuffle with narrow instructions in WAsm SIMD QS8/QU8/QC8 microkernels by Marat Dukhan · 3 years, 2 months ago
  22. 1e6fc21 Fix incompatible pointer type in QU8 DWCONV NEON microkernels by Marat Dukhan · 3 years, 2 months ago
  23. 60bb7ec Accumulate in 16 bits once in AVX2 MUL16 VPUNPCK QS8/QC8 DWCONV before extending to 32 bits by Marat Dukhan · 3 years, 2 months ago
  24. 881ab02 AVX2 MUL16 QS8/QC8 DWCONV microkernels using VPUNPCK instructions to extend the product by Marat Dukhan · 3 years, 2 months ago
  25. 0966856 Accumulate in 16 bits once in SSE2/SSE4/AVX/XOP MUL16 QS8/QC8 DWCONV before extending to 32 bits by Marat Dukhan · 3 years, 2 months ago
  26. ab952f1 Remove multiplication in QS8/QC8 DWCONV MUL16 microkernels for SSE4 by Marat Dukhan · 3 years, 2 months ago
  27. 5f2939f QS8/QC8 DWCONV NEON MUL8/MLA8 microkernels using 128-bit loads by Marat Dukhan · 3 years, 2 months ago
  28. caccd8e Accumulate in 16 bits once in NEON QS8/QC8 DWCONV before extending to 32 bits by Marat Dukhan · 3 years, 2 months ago
  29. 575dfb9 Disable MSan in quantized DWCONV microkernels by Marat Dukhan · 3 years, 2 months ago
  30. 4ba70b7 QS8/QC8 NEON microkernels using 8x8->16-bit multiplication by Marat Dukhan · 3 years, 2 months ago
  31. 5c92195 Fix incompatibilities with GCC on ARM by Marat Dukhan · 3 years, 2 months ago
  32. be18f5c QS8 DWCONV microkernels with RNDNU requantization by Marat Dukhan · 3 years, 2 months ago
  33. 605696a NEON implementations of QU8 DWCONV microkernels by Marat Dukhan · 3 years, 2 months ago
  34. 1f71428 Scalar implementations of QU8 DWCONV microkernels by Marat Dukhan · 3 years, 2 months ago
  35. f601135 WAsm SIMD implementations of QU8 DWCONV microkernels by Marat Dukhan · 3 years, 2 months ago
  36. cfd606b QU8 DWCONV microkernels for AVX512 by Marat Dukhan · 3 years, 3 months ago
  37. 09c312b QU8 DWCONV microkernels for AVX2 by Marat Dukhan · 3 years, 3 months ago
  38. f0f2881 QS8 DWCONV microkernels for SSE2/SSE4.1/AVX by Marat Dukhan · 3 years, 3 months ago
  39. 3c35f7a QU8 DWCONV microkernels for SSE4.1/AVX/XOP by Marat Dukhan · 3 years, 3 months ago
  40. e5eee46 Refactor pre-SSE4 versions of QS8/QC8 GEMM/IGEMM microkernels by Marat Dukhan · 3 years, 3 months ago
  41. b3336d9 Refactoring in QS8/QC8 GEMM/IGEMM/DWCONV microkernels by Marat Dukhan · 3 years, 3 months ago
  42. 9ca2333 Minor refactoring in QC8 microkernels by Marat Dukhan · 3 years, 3 months ago
  43. 0fae3bc Include polyfill in NEON QS8/QC8 DWCONV microkernels by Marat Dukhan · 3 years, 3 months ago
  44. 5754706 Scalar implementation of QC8 DWCONV microkernels by Marat Dukhan · 3 years, 3 months ago
  45. 313eef7 WAsm SIMD implementations of QC8 DWCONV microkernels by Marat Dukhan · 3 years, 3 months ago
  46. 69aa623 WAsm SIMD QS8 DWCONV with FP32 requantization by Marat Dukhan · 3 years, 3 months ago
  47. ee029b2 Replace deprecated wasm_simd128.h intrinsics with new versions by Marat Dukhan · 3 years, 3 months ago
  48. 85d772b QS8 DWCONV microkernels with FP32 requantization by Marat Dukhan · 3 years, 3 months ago
  49. 59af581 NEON implementations of QC8 DWCONV microkernels by Marat Dukhan · 3 years, 3 months ago
  50. aef9091 Minor optimization for AArch64 NEON QS8 microkernels with FP32 requantization by Marat Dukhan · 3 years, 3 months ago
  51. 6f90529 QS8 DWCONV microkernels for ARM NEON with FP32 requantization by Marat Dukhan · 3 years, 3 months ago
  52. 98042f2 QC8 DWCONV microkernels for SSE/AVX/XOP/AVX512 by Marat Dukhan · 3 years, 3 months ago
  53. 8228689 Support QC8 DWCONV microkernels by Marat Dukhan · 3 years, 4 months ago
  54. 4a2d255 Remove redundant SSSE3 microkernels with FP32 requantization by Marat Dukhan · 3 years, 4 months ago
  55. caf4831 FP32 requantization in QS8 DWCONV microkernels for SSE/AVX/XOP by Marat Dukhan · 3 years, 4 months ago
  56. 71855ee Support FP32 requantization in AVX512 QS8 microkernels by Marat Dukhan · 3 years, 4 months ago
  57. d4c7d82 AVX512-specific parameters for QS8 microkernels by Marat Dukhan · 3 years, 4 months ago
  58. 77ded05 Use byte-wide MIN/MAX in AVX512 QS8 DWCONV microkernels by Marat Dukhan · 3 years, 4 months ago
  59. 9b474cf Support FP32 requantization in AVX2 QS8 microkernels by Marat Dukhan · 3 years, 4 months ago
  60. f86ee8b Refactor requantization helper functions by Marat Dukhan · 3 years, 4 months ago
  61. e3d17bf Rename microkernel-related types and structures by Marat Dukhan · 3 years, 4 months ago
  62. b07c26a Rename QS8 GEMM/IGEMM/DWCONV microkernels by Marat Dukhan · 3 years, 4 months ago
  63. e1ff248 Rename QS8 DWCONV microkernel filenames by Marat Dukhan · 3 years, 4 months ago
  64. 748fd12 Use specialized layouts in SSE4/AVX2 QS8 [I]GEMM & DWCONV microkernels by Marat Dukhan · 3 years, 4 months ago
  65. 4ed1488 QS8 DWCONV25 microkernels by Marat Dukhan · 3 years, 4 months ago
  66. a24cc08 Small refactoring of scalar QS8 microkernels by Marat Dukhan · 3 years, 5 months ago
  67. 4454288 Scalar QS8 DWCONV microkernels by Marat Dukhan · 3 years, 5 months ago
  68. 3fd4e27 XOP versions of QS8 DWCONV MUL32 microkernels by Marat Dukhan · 3 years, 6 months ago
  69. 60fc613 Polyfill _mm_loadu_si32 in MUL32 QS8 DWCONV SSE4.1/AVX microkernels by Marat Dukhan · 3 years, 6 months ago
  70. 07feec8 MUL32 versions of SSE4.1 & AVX QS8 DWCONV microkernels by Marat Dukhan · 3 years, 6 months ago
  71. fa0ab85 AVX versions of QS8 DWCONV microkernels by Marat Dukhan · 3 years, 6 months ago
  72. 66ccf64 Rename QS8 generator templates by Marat Dukhan · 4 years ago
  73. 2ffc5e6 AVX512 versions of QS8 DWCONV microkernels by Marat Dukhan · 4 years, 1 month ago
  74. 023bcf9 NEON variant of QS8 DWCONV microkernels by Marat Dukhan · 4 years, 2 months ago
  75. c15aa4e Remove XOP variants of QS8 DWCONV by Marat Dukhan · 4 years, 2 months ago
  76. 4013552 AVX2 versions of QS8 DWCONV microkernels using 16-bit multiplication by Marat Dukhan · 4 years, 2 months ago
  77. cc8f34c WAsm SIMD variants of QS8 DWCONV microkernels by Marat Dukhan · 4 years, 2 months ago
  78. 23848db Reoptimize x86 requantization by Marat Dukhan · 4 years, 2 months ago
  79. d65a152 AVX2 versions of QS8 DWCONV microkernels by Marat Dukhan · 4 years, 2 months ago
  80. f62bbdc SSE2/SSSE3/SSE4.1/XOP implementation of QS8 DWCONV microkernels by Marat Dukhan · 4 years, 2 months ago