1. 5093cbc Exclude unused parts of xnn_params by Marat Dukhan · 4 years, 9 months ago
  2. e64f91a Pipelined 6x8 GEMM for Cortex A53 by Frank Barchard · 4 years, 9 months ago
  3. 9fab3f9 Support input offset in BILINEAR micro-kernels by Marat Dukhan · 4 years, 9 months ago
  4. 38709a6 Add scalar chw 5x5p2 and 5x5s2p2 kernels by Erich Elsen · 4 years, 9 months ago
  5. 2a64a1a Fix incompatibility with ARM gcc by Marat Dukhan · 4 years, 9 months ago
  6. 0f06b5c Fix gcc incompatibility in SSE PReLU microkernels by Marat Dukhan · 4 years, 9 months ago
  7. 35dacfb BILINEAR micro-kernels by Marat Dukhan · 4 years, 9 months ago
  8. 5098c3e Refactor DWCONV micro-kernels by Marat Dukhan · 4 years, 9 months ago
  9. 49e6ee9 Refactor VMulCAddC micro-kernel by Marat Dukhan · 4 years, 9 months ago
  10. 69c3f2c Refactor PReLU microkernels by Marat Dukhan · 4 years, 9 months ago
  11. d5208d6 Remove a_sum buffer by Marat Dukhan · 4 years, 9 months ago
  12. 1898b91 Move adjustment_* arguments of Deconvolution into setup by Marat Dukhan · 4 years, 9 months ago
  13. 70ad409 Make ARM microkernels compatible with gcc by Marat Dukhan · 4 years, 9 months ago
  14. fb60914 Make F32 CLAMP NEON micro-kernel compatible with gcc on AArch32 by Marat Dukhan · 4 years, 9 months ago
  15. bd41971 A57 branch a version of A53 kernel by Frank Barchard · 4 years, 9 months ago
  16. 63ba2ed Fix typos in AVX2 ExtExp micro-kernels by Marat Dukhan · 4 years, 9 months ago
  17. 64a5bfe A53 6x8 IGEMM kernel prefetch by Frank Barchard · 4 years, 9 months ago
  18. bd1d5d9 6x8 A53 GEMM use prefetch. by Frank Barchard · 4 years, 9 months ago
  19. f568f08 Support Convolution, Deconvolution, and Fully Connected operators without bias by Marat Dukhan · 4 years, 9 months ago
  20. 263bb09 Cortex A76 use 6x8 micro kernel by Frank Barchard · 4 years, 9 months ago
  21. feb4923 AVX512F exp implementation based on PERM2 by Marat Dukhan · 4 years, 9 months ago
  22. ba7c3bb Merge generate-f32-gemminc.sh script into generate-f32-gemm.sh by Marat Dukhan · 4 years, 9 months ago
  23. 00bf68e A53 6x8 GEMM unrolled by Frank Barchard · 4 years, 9 months ago
  24. c452eb1 Re-generate SpMM micro-kernels by Marat Dukhan · 4 years, 9 months ago
  25. ae777b4 4x8 a53 eliminate pushes to stack by Frank Barchard · 4 years, 9 months ago
  26. e0601b5 Sort include order for params-init.h and log.h by Frank Barchard · 4 years, 9 months ago
  27. eeaa7bd Refactor initialization of micro-kernel parameters by Marat Dukhan · 4 years, 9 months ago
  28. 6f8d4d3 RADDEXTEXP and VSCALEEXTEXP micro-kernels for AVX2 and AVX512F by Marat Dukhan · 4 years, 9 months ago
  29. b3c6c6e 6x8 A53 remove pushes for NEON by Frank Barchard · 4 years, 9 months ago
  30. 46fb807 4x8 A53 GEMM, and GEMMINC unpipelined microkernels. by Frank Barchard · 4 years, 9 months ago
  31. cd945c6 Re-enable swizzle GEMM/IGEMM micro-kernels in WAsm SIMD on ARM by Marat Dukhan · 4 years, 9 months ago
  32. c4ae7de Propagate IGEMM SR argument to weights packing in Deconvolution operator by Marat Dukhan · 4 years, 9 months ago
  33. c6afd9b Add blocked scalar spmm kernels. by Erich Elsen · 4 years, 9 months ago
  34. 8440fde Support TF-style SAME padding via explicit flag by Marat Dukhan · 4 years, 9 months ago
  35. bff791e Use 8x1 SpMM micro-kernel on WebAssembly by Marat Dukhan · 4 years, 9 months ago
  36. 14fe0b2 Enable sparse MobileNet v1/v2 operators on WebAssembly by Marat Dukhan · 4 years, 9 months ago
  37. a7fb855 6x8 A53 GEMM, GEMMINC and IGEMM unpipelined microkernels. by Frank Barchard · 4 years, 10 months ago
  38. 563df5f Add scalar version of hwc2spchw convolution. by Erich Elsen · 4 years, 10 months ago
  39. 98ba441 Vectorized extexp functions by Marat Dukhan · 4 years, 10 months ago
  40. cb80197 Disable GEMM/IGEMM micro-kernels with swizzle by Marat Dukhan · 4 years, 10 months ago
  41. 31a98d7 Remove warnings about inefficient padding parameters in Convolution by Marat Dukhan · 4 years, 10 months ago
  42. 1756f9e Propagate GEMM/IGEMM SR argument to weights packing in Fully Connected operator by Marat Dukhan · 4 years, 10 months ago
  43. e0df831 Remove trailing whitespace by Marat Dukhan · 4 years, 10 months ago
  44. 07cb676 Refactor initialization of even/odd masks in parameters for SpCHW micro-kernels by Marat Dukhan · 4 years, 10 months ago
  45. 838c8e3 Refactor initialization of masks in parameters for SpCHW micro-kernels by Marat Dukhan · 4 years, 10 months ago
  46. caf8544 LD64/LD128 kernels remove all pushes (d8-d15) Remap d12-d15 to d16-d19 by Frank Barchard · 4 years, 10 months ago
  47. fcfdc0e Automated g4 rollback of changelist 274728310. by Frank Barchard · 4 years, 10 months ago
  48. 05ac8e3 VSCALE microkernel and SoftMax Three-Pass algorithm with Reloading by Marat Dukhan · 4 years, 10 months ago
  49. 8e3c551 1x8 a53 kernel refactor based on a57. by Frank Barchard · 4 years, 10 months ago
  50. baa9ead Update assembly Copyright notice to // comment by Frank Barchard · 4 years, 10 months ago
  51. 9757953 Refactor and open-source Three-Pass Softmax micro-kernels by Marat Dukhan · 4 years, 10 months ago
  52. 459c9fc 6x8 and a53 kernel comments. by Frank Barchard · 4 years, 10 months ago
  53. 515c977 Refactor and open-source vectorized expminus function by Marat Dukhan · 4 years, 10 months ago
  54. f6839e1 Refactor vectorized exp functions by Marat Dukhan · 4 years, 10 months ago
  55. 2af471b Switch default intrinsics kernel to 6x8 by Frank Barchard · 4 years, 10 months ago
  56. 9cdade3 Add prefetch instructions to 16x1, 16x2, 16x4 kernels. by Erich Elsen · 4 years, 10 months ago
  57. 34dc2c0 Add gavgpool_spchw_scalar__x1 kernel. by Erich Elsen · 4 years, 10 months ago
  58. ac4de80 Add chw 3x3s2p1_scalar kernels. by Erich Elsen · 4 years, 10 months ago
  59. 0cc2c53 add 3x3p1_scalar kernel by Erich Elsen · 4 years, 10 months ago
  60. a5ca10e Neon intrinsics clamping - Replace 2 LD1R with 1 LD2R by Frank Barchard · 4 years, 10 months ago
  61. 6adff4e Vectorized implementations of expf function for AVX2 and AVX512F by Marat Dukhan · 4 years, 10 months ago
  62. bd9e495 Remove 4x12 intrinsics kernels. by Frank Barchard · 4 years, 10 months ago
  63. 8fe54e4 Extra :xnnpack_operators_nhwc_f32 target with only F32 operators in NHWC layout by Marat Dukhan · 4 years, 10 months ago
  64. 21be34f 1x8 A53 GEMM, GEMMINC and IGEMM microkernels. by Frank Barchard · 4 years, 10 months ago
  65. db45b6a 1x8 neonfma IGEMM microkernel and 1x8 benchmarks. by Frank Barchard · 4 years, 10 months ago
  66. 466b523 Use GEMM/IGEMM micro-kernels with Swizzle on WAsm SIMD by Marat Dukhan · 4 years, 10 months ago
  67. afbca9a Remove unused x21 and switch x20 to x8 to avoid push. by Frank Barchard · 4 years, 10 months ago
  68. d343c22 Avoid cpuinfo dependency in Emscripten builds by Marat Dukhan · 4 years, 10 months ago
  69. dd69f0b Rename XNN_CONVOLUTION_FLAG_DEPTHWISE to XNN_FLAG_DEPTHWISE_CONVOLUTION by Marat Dukhan · 4 years, 10 months ago
  70. c8e00eb Disable logging in optimized builds, limit logging in fastbuild by Marat Dukhan · 4 years, 10 months ago
  71. 1b0421b 1x12 and 1x8 assembly kernel cleanup by Frank Barchard · 4 years, 10 months ago
  72. f8c8046 Cleanup redundant #includes by Marat Dukhan · 4 years, 10 months ago
  73. 341c321 Remove unused XNN_INTERNAL_EXTRA_BYTES constant by Marat Dukhan · 4 years, 10 months ago
  74. df6985f Map Exynos-M[1-4] micro-architectures to equivalents in OSS cpuinfo by Marat Dukhan · 4 years, 10 months ago
  75. 22f38e4 Refactor architecture identification macros by Marat Dukhan · 4 years, 10 months ago
  76. 1dadbf7 Limit direct dependencies on cpuinfo by Marat Dukhan · 4 years, 10 months ago
  77. 629a33e Fix incompatibilities with open-source Bazel-based build by Marat Dukhan · 4 years, 10 months ago
  78. 4c2637d LD2R for loading clamp parameters by Frank Barchard · 4 years, 10 months ago
  79. 80fc932 Unify comments style by Marat Dukhan · 4 years, 10 months ago
  80. b455b12 Initial open-source release by XNNPACK Team · 4 years, 10 months ago