1. 2848059 Optimize QC8 DWCONV microkernel selection on AVX and XOP by Marat Dukhan · 3 years, 2 months ago
  2. cc96770 Evaluate MUL32 XOP QS8 DWCONV microkernels in E2E benchmark by Marat Dukhan · 3 years, 2 months ago
  3. 195b72f Split microkernel lists in CMakeLists into production and non-production by Marat Dukhan · 3 years, 2 months ago
  4. 2c72495 Split microkernel lists in BUILD file into production and non-production by Marat Dukhan · 3 years, 2 months ago
  5. 02f06e3 Fix QS8 DWCONV microkernel selection for XOP processors by Marat Dukhan · 3 years, 2 months ago
  6. db3b0a7 Refactor microkernel lists in BUILD and CMakeLists.txt by Marat Dukhan · 3 years, 2 months ago
  7. caa7fc7 Optimize selection of QU8 DWCONV microkernel on AVX processors by Marat Dukhan · 3 years, 2 months ago
  8. 6084fb8 E2E benchmark for QU8 DWCONV microkernels by Marat Dukhan · 3 years, 2 months ago
  9. afd2ed9 Use NEON microkernels with RNDNU requantization in QU8 GEMM benchmark by Marat Dukhan · 3 years, 2 months ago
  10. 73a899a QU8 DWCONV NEON microkernels with RNDNU requantization by Marat Dukhan · 3 years, 2 months ago
  11. 173661d QU8 GEMM/IGEMM NEON microkernels with RNDNU requantization by Marat Dukhan · 3 years, 2 months ago
  12. d8e2d71 Update QU8 GEMM microkernel benchmarks by Marat Dukhan · 3 years, 2 months ago
  13. 336974c Merge pull request #1703 from slowy07:minor-fixing by XNNPACK Team · 3 years, 2 months ago
  14. 0744fa0 QS8 DWCONV microkernel benchmark by Marat Dukhan · 3 years, 2 months ago
  15. e13e639 Align packed weights on 64 bytes in microkernel benchmarks by Marat Dukhan · 3 years, 2 months ago
  16. b657605 Fix random number generation in QS8 GEMM benchmark by Marat Dukhan · 3 years, 2 months ago
  17. bbfc6d3 E2E benchmark for QS8 DWCONV microkernels by Marat Dukhan · 3 years, 2 months ago
  18. ab1127f docs: spelling grammar by slowy07 · 3 years, 2 months ago
  19. 42b441b Specify parameter initialization function in F32 DWCONV benchmark by Marat Dukhan · 3 years, 2 months ago
  20. 510b8e0 Code generator for RNDNU quantization mode on neon-mull-addw-dup microkernel by Frank Barchard · 3 years, 2 months ago
  21. 88780d3 Specify parameter initialization function in F32 DWCONV E2E benchmark by Marat Dukhan · 3 years, 2 months ago
  22. 0966856 Accumulate in 16 bits once in SSE2/SSE4/AVX/XOP MUL16 QS8/QC8 DWCONV before extending to 32 bits by Marat Dukhan · 3 years, 2 months ago
  23. 26e8378 Reduce register pressure in GEMMLOWP quantization on NEON by Frank Barchard · 3 years, 2 months ago
  24. 28407b2 Support zeroes in shape dimensions in binary elementwise operators by Marat Dukhan · 3 years, 2 months ago
  25. ab952f1 Remove multiplication in QS8/QC8 DWCONV MUL16 microkernels for SSE4 by Marat Dukhan · 3 years, 2 months ago
  26. 5f2939f QS8/QC8 DWCONV NEON MUL8/MLA8 microkernels using 128-bit loads by Marat Dukhan · 3 years, 2 months ago
  27. 476eb84 Fix CMake build by Marat Dukhan · 3 years, 2 months ago
  28. caccd8e Accumulate in 16 bits once in NEON QS8/QC8 DWCONV before extending to 32 bits by Marat Dukhan · 3 years, 2 months ago
  29. 1a2dbe1 RNDNU scalar GEMM/IGEMM microkernel by Frank Barchard · 3 years, 2 months ago
  30. e76049a AVX512 implementation of QS8/QU8 VADD[C] microkernels by Marat Dukhan · 3 years, 2 months ago
  31. efa123d Update Neon code with generators for added comment by Frank Barchard · 3 years, 2 months ago
  32. 6c7b9e8 Disable MSan in quantized addition microkernels by Marat Dukhan · 3 years, 2 months ago
  33. 9670626 Support QU8 Fully Connected operator in the Subgraph API by Marat Dukhan · 3 years, 2 months ago
  34. 28c82b2 Fix CMake build by Marat Dukhan · 3 years, 2 months ago
  35. 09a1f65 Support QU8 Add operator in Subgraph API by Marat Dukhan · 3 years, 2 months ago
  36. 22f9a9f Enable RNDNU requantization for NEON QS8 GEMM/IGEMM by Frank Barchard · 3 years, 2 months ago
  37. 3eac69c Optimized QU8 VADD[C] microkernels for SSE4/AVX/XOP/AVX2 by Marat Dukhan · 3 years, 2 months ago
  38. 036b2b1 Add QU8 MobileNet v2 model to end-to-end benchmark by Marat Dukhan · 3 years, 2 months ago
  39. db007cd QU8 Add ND operator by Marat Dukhan · 3 years, 2 months ago
  40. 76e78c8 Generalize QS8 VADD[C] templates to cover QU8 VADD[C] microkernels by Marat Dukhan · 3 years, 2 months ago
  41. 7679b1e Optimize QS8 VADD[C] microkernels for SSE4/AVX/XOP/AVX2 by Marat Dukhan · 3 years, 2 months ago
  42. 6691324 Split initialization function for QS8 VADD parameters by Marat Dukhan · 3 years, 2 months ago
  43. 22fbe77 RNDNU quantized 1x16 and 4x16 Neon lane GEMM/IGEMM microkernels. by Frank Barchard · 3 years, 2 months ago
  44. 288ecd4 Use function pointer to initialize microkernel parameters in QS8 Addition operator by Marat Dukhan · 3 years, 2 months ago
  45. 13db60f RNDNU quantized Neon assembly GEMM/IGEMM microkernels. by Frank Barchard · 3 years, 2 months ago
  46. 8a04565 Use RNDNU requantization in QS8 VADD[C] microkernels by Marat Dukhan · 3 years, 2 months ago
  47. c3f69fd Simplify requantization in WAsm SIMD QS8 VADD[C] microkernels by Marat Dukhan · 3 years, 2 months ago
  48. 947c298 Simplify requantization in SSE2/SSE4/AVX/XOP/AVX2 QS8 VADD[C] microkernels by Marat Dukhan · 3 years, 2 months ago
  49. c0612f0 Simplify requantization in NEON QS8 VADD[C] microkernels by Marat Dukhan · 3 years, 2 months ago
  50. 60729d0 4x16c4 RNDNU quantized Neon assembly GEMM/IGEMM microkernel. by Frank Barchard · 3 years, 2 months ago
  51. a842fef Rename zero_point_product parameter to bias in Quantized Add microkernels by Marat Dukhan · 3 years, 2 months ago
  52. e6a4805 Simplify requantization in scalar QS8 VADD[C] microkernels by Marat Dukhan · 3 years, 2 months ago
  53. f0ebd4b Reduce multiplier precision in quantized addition by Marat Dukhan · 3 years, 2 months ago
  54. 49d9005 Refactor QS8 VADD[C] parameters by Marat Dukhan · 3 years, 2 months ago
  55. af5843d Optimize QS8 VADD[C] microkernels for SSE2 by Marat Dukhan · 3 years, 2 months ago
  56. d4c478b Restrict input-to-output scale ratio in quantized addition by Marat Dukhan · 3 years, 2 months ago
  57. 076bcfe Refactor argument names in QS8 VADD[C] microkernels by Marat Dukhan · 3 years, 2 months ago
  58. 6e0fc39 Relax initialization of Quantized Addition microkernel parameters by Marat Dukhan · 3 years, 2 months ago
  59. 575dfb9 Disable MSan in quantized DWCONV microkernels by Marat Dukhan · 3 years, 2 months ago
  60. 4ba70b7 QS8/QC8 NEON microkernels using 8x8->16-bit multiplication by Marat Dukhan · 3 years, 2 months ago
  61. 5c92195 Fix incompatibilities with GCC on ARM by Marat Dukhan · 3 years, 2 months ago
  62. 2bb448c Fix polyfill for vcvtnq_s32_f32 on AArch32 GCC by Marat Dukhan · 3 years, 2 months ago
  63. 20c36d4 Fix CMake build by Marat Dukhan · 3 years, 2 months ago
  64. e903dff QS8 GEMM/IGEMM microkernels with RNDNU requantization by Marat Dukhan · 3 years, 2 months ago
  65. be18f5c QS8 DWCONV microkernels with RNDNU requantization by Marat Dukhan · 3 years, 2 months ago
  66. f975d7f Fix instruction listings in NEON requantization stubs by Marat Dukhan · 3 years, 2 months ago
  67. d3d818c Fix requantization stubs for Ruy requantization schema by Marat Dukhan · 3 years, 2 months ago
  68. 7b1aeb9 Evaluation stubs for Ruy requantization schema by Marat Dukhan · 3 years, 2 months ago
  69. 2837e8b Remove 0 offset from loads. by Frank Barchard · 3 years, 2 months ago
  70. d194311 4x16c4-aarch64-neondot-ld32 use LD1R instead of lanes by Frank Barchard · 3 years, 2 months ago
  71. 89cd59b Remove legacy QU8 GEMM/IGEMM/DWCONV microkernels by Marat Dukhan · 3 years, 2 months ago
  72. 43b46ee Use generated QU8 GEMM/IGEMM/DWCONV microkernels on ARM by Marat Dukhan · 3 years, 2 months ago
  73. 3d76e55 Reoptimize microkernel selection for WAsm MVP by Marat Dukhan · 3 years, 2 months ago
  74. 8172135 Use generated QU8 GEMM/IGEMM/DWCONV microkernels on ARM64 by Marat Dukhan · 3 years, 2 months ago
  75. 605696a NEON implementations of QU8 DWCONV microkernels by Marat Dukhan · 3 years, 2 months ago
  76. a97e975 Initialize QU8 microkernels for WebAssembly SIMD by Marat Dukhan · 3 years, 2 months ago
  77. bd3c9aa Add cpufreq to requantization benchmarks by Frank Barchard · 3 years, 2 months ago
  78. 1f71428 Scalar implementations of QU8 DWCONV microkernels by Marat Dukhan · 3 years, 2 months ago
  79. f601135 WAsm SIMD implementations of QU8 DWCONV microkernels by Marat Dukhan · 3 years, 2 months ago
  80. 927d474 Scalar implementations of QU8 GEMM/IGEMM microkernels by Marat Dukhan · 3 years, 2 months ago
  81. 43bee05 WAsm SIMD implementation of QU8 GEMM/IGEMM microkernels by Marat Dukhan · 3 years, 2 months ago
  82. 69c8a29 NEON-MLAL implementations of QU8 GEMM/IGEMM microkernels by Marat Dukhan · 3 years, 2 months ago
  83. f6f6209 Refactoring in QS8/QC8 GEMM/IGEMM NEON-MLAL microkernels by Marat Dukhan · 3 years, 2 months ago
  84. 3ddd61f Merge pull request #1631 from malfet:patch-1 by XNNPACK Team · 3 years, 2 months ago
  85. a98a109 Fix missing braces around initializer warning by Nikita Shulga · 3 years, 2 months ago
  86. 8c8c159 Expose QU8 [Depthwise] Convolution 2D operators in Subgraph API by Marat Dukhan · 3 years, 2 months ago
  87. ac67ae8 Fix CMake build for ARM64 by Marat Dukhan · 3 years, 2 months ago
  88. 8c8ce5d Include vcvtnq_s32_f32 polyfill in NEON requantization stubs by Marat Dukhan · 3 years, 2 months ago
  89. abee3a7 Enable optimized QU8 microkernels on x86/x86-64 by Marat Dukhan · 3 years, 3 months ago
  90. cfd606b QU8 DWCONV microkernels for AVX512 by Marat Dukhan · 3 years, 3 months ago
  91. 09c312b QU8 DWCONV microkernels for AVX2 by Marat Dukhan · 3 years, 3 months ago
  92. f0f2881 QS8 DWCONV microkernels for SSE2/SSE4.1/AVX by Marat Dukhan · 3 years, 3 months ago
  93. a3fb629 Merge pull request #1621 from mitchellspryn:msvc_intrinsic_polyfill by XNNPACK Team · 3 years, 3 months ago
  94. 3c35f7a QU8 DWCONV microkernels for SSE4.1/AVX/XOP by Marat Dukhan · 3 years, 3 months ago
  95. f317528 Add MSVC version check. by Mitchell Spryn · 3 years, 3 months ago
  96. f31f242 Update comment in intrinsics-polyfill.h by Mitchell Spryn · 3 years, 3 months ago
  97. 6a99b5b Add MSVC polyfill flag by Mitchell Spryn · 3 years, 3 months ago
  98. 3cf2e22 QU8 GEMM/IGEMM microkernels for AVX512 by Marat Dukhan · 3 years, 3 months ago
  99. cf277de Fix out-of-bounds bug in initializing QU8 conv params by Marat Dukhan · 3 years, 3 months ago
  100. 902ef7f QU8 GEMM/IGEMM AVX2 microkernels with FP32 requantization by Marat Dukhan · 3 years, 3 months ago