  1. 2bd2bd2 X8 & X16 Copy NC operators by Marat Dukhan · 2 years, 4 months ago
  2. 5756a92 F16 Max Pooling NHWC operator by Marat Dukhan · 2 years, 4 months ago
  3. af1671a Support FP32 weights in F16 PReLU operator by Marat Dukhan · 2 years, 4 months ago
  4. 10f2bf8 F16 MAXPOOL microkernel for F16C by Marat Dukhan · 2 years, 4 months ago
  5. 0a756b5 F16 PReLU operator by Marat Dukhan · 2 years, 4 months ago
  6. ba05c64 Fix MSVC compilation issues by Marat Dukhan · 2 years, 4 months ago
  7. 6b45a7f 16-bit Constant Pad ND operator by Marat Dukhan · 2 years, 4 months ago
  8. 16c0912 F16 MAXPOOL microkernel for NEON FP16ARITH by Marat Dukhan · 2 years, 4 months ago
  9. f30a859 Port aarch64 F32 IGEMM 1x8 A75 microkernel to JIT, add tests, benchmarks, enable in init.c if JIT is enabled by Zhi An Ng · 2 years, 4 months ago
  10. f672851 Implement str (s register, post index) for aarch64 assembler by Zhi An Ng · 2 years, 4 months ago
  11. 1738f11 Implement ldr (post-index) for aarch64 assembler by Zhi An Ng · 2 years, 4 months ago
  12. eb7256b Port F32 GEMM A75 1x8 microkernel to JIT and specialize for min/max, add tests and benchmarks by Zhi An Ng · 2 years, 4 months ago
  13. 6b72e6c Convert F32 IGEMM for A75 to JIT, add tests by Zhi An Ng · 2 years, 4 months ago
  14. e96b6bc Split qs8-igemm-minmax-rndnu tests into 1 more file (4 total), seeing compile timeouts in coverage runs by Zhi An Ng · 2 years, 4 months ago
  15. 4decc8e Implement mov (x registers) for aarch64 assembler by Zhi An Ng · 2 years, 4 months ago
  16. 8ceeebe Implement stp (x registers) for aarch64 assembler by Zhi An Ng · 2 years, 4 months ago
  17. 9e51ad6 Implement cmp (x registers) for aarch64 assembler by Zhi An Ng · 2 years, 4 months ago
  18. 18f71e0 Support vld1r_32 with 1 or 2 register(s) in list by Zhi An Ng · 2 years, 4 months ago
  19. 60c9bcb Fix incorrect k argument to QC8/QS8 GEMM microkernel test by Zhi An Ng · 2 years, 4 months ago
  20. 34251d8 QS8 4x8 lane GEMM AArch32 microkernel for Cortex A7 by Frank Barchard · 2 years, 4 months ago
  21. 101271e QC8 4x8 lane GEMM AArch32 microkernel for Cortex A7 by Frank Barchard · 2 years, 4 months ago
  22. 9e4d2aa QS8 4x8 lane GEMM AArch32 microkernel for Cortex A53 by Frank Barchard · 2 years, 4 months ago
  23. cfd947d Add neon zip microkernel generator by Alan Kelly · 2 years, 4 months ago
  24. f9fc9ec Integrate JIT generated GEMM microkernels into create_convolution2d_nhwc by Zhi An Ng · 2 years, 4 months ago
  25. 348c377 QU8 GEMM/IGEMM WAsm SIMD microkernels with SR=4 by Marat Dukhan · 2 years, 4 months ago
  26. fbd67a7 Pad K to a multiple of SR in GEMM/IGEMM microkernels by Marat Dukhan · 2 years, 4 months ago
  27. 4ef2352 Improve test coverage for quantized Depthwise Convolutions in TFLite weight layout by Marat Dukhan · 2 years, 4 months ago
  28. f2b233b Make SSE2 microkernels consistent with neon zip microkernels. - DEC is now MOV by Alan Kelly · 2 years, 4 months ago
  29. 8b758bf Integrate JIT generated GEMM microkernels into create_convolution2d_nhwc by XNNPACK Team · 2 years, 4 months ago
  30. df51e11 Integrate JIT generated GEMM microkernels into create_convolution2d_nhwc by Zhi An Ng · 2 years, 4 months ago
  31. 15dd611 Check code_buffer capacity before attempting to release it by Zhi An Ng · 2 years, 4 months ago
  32. c607028 Remove wb from JIT aarch32 instructions, use mem operand and ++ instead by Zhi An Ng · 2 years, 4 months ago
  33. fc67a86 Fix encoding of prfm by Zhi An Ng · 2 years, 4 months ago
  34. 870108c QS8/QC8 4x8 dot product IGEMM AArch32 microkernel for Cortex A55 by Frank Barchard · 2 years, 4 months ago
  35. 773458c Change return type for assembler functions to void to simplify code, move emit32 into common assembler by Zhi An Ng · 2 years, 4 months ago
  36. c2e2da8 Fix conversion script for aarch64 assembly kernels and convert a single F32 GEMM as a test by Zhi An Ng · 2 years, 4 months ago
  37. 4a1c6a8 Implement ldp (d registers) offset and post index for aarch64 assembler by Zhi An Ng · 2 years, 4 months ago
  38. 048704d Implement stp (q registers) offset and post indexed for aarch64 assembler by Zhi An Ng · 2 years, 4 months ago
  39. 3cec451 Implement tst (immediate) for aarch64 assembler by Zhi An Ng · 2 years, 4 months ago
  40. 8709ac9 Implement csel for aarch64 assembler by Zhi An Ng · 2 years, 4 months ago
  41. 35d8e68 Implement stp (d register) offset and pre-index for aarch64 assembler by Zhi An Ng · 2 years, 4 months ago
  42. 658a67d Implement add (x registers) for aarch64 assembler by Zhi An Ng · 2 years, 4 months ago
  43. 80eac62 Implement cmp (immediate) for aarch64 assembler by Zhi An Ng · 2 years, 4 months ago
  44. ac654f1 QC8 4x8 dot product GEMM AArch32 microkernel for Cortex A55 by Frank Barchard · 2 years, 4 months ago
  45. 491e9e0 Implement ldr for s and d registers and str for d registers (post-indexed) for aarch64 assembler by Zhi An Ng · 2 years, 4 months ago
  46. 0f294ad QS8 4x8 dot product GEMM AArch32 microkernel for Cortex A55 by Frank Barchard · 2 years, 4 months ago
  47. 2f24c3e Implement dup (vector) for aarch64 assembler by Zhi An Ng · 2 years, 4 months ago
  48. f761632 Implement str (q register, post-indexed) and str (s register, offset) for aarch64 assembler by Zhi An Ng · 2 years, 4 months ago
  49. 5a5c9e1 Implement mov (VRegister) for aarch64 assembler by Zhi An Ng · 2 years, 4 months ago
  50. 5e31395 Implement stp (post-indexed) for aarch64 assembler by Zhi An Ng · 2 years, 4 months ago
  51. 4915509 Implement add with immediate for aarch64 assembler by Zhi An Ng · 2 years, 4 months ago
  52. b10677e Implement unconditional branch for aarch64 assembler by Zhi An Ng · 2 years, 4 months ago
  53. 56e8b91 Implement tbz for aarch64 assembler by Zhi An Ng · 2 years, 4 months ago
  54. cdfff79 Implement ret for aarch64 assembler by Zhi An Ng · 2 years, 4 months ago
  55. 3176868 Implement sub (x register) for aarch64 assembler by Zhi An Ng · 2 years, 4 months ago
  56. 3f34299 Implement st1 for aarch64 assembler by Zhi An Ng · 2 years, 4 months ago
  57. 544d73d Implement fmax and fmin (vector) for aarch64 assembler by Zhi An Ng · 2 years, 4 months ago
  58. ecfb1f0 Implement fadd (vector) for aarch64 assembler by Zhi An Ng · 2 years, 4 months ago
  59. 0981080 Implement tbnz for aarch64 assembler by Zhi An Ng · 2 years, 4 months ago
  60. 6a1151b Implement fmla for aarch64 assembler by Zhi An Ng · 2 years, 4 months ago
  61. 157b0f4 Implement ldr ldp for q registers in aarch64 assembler by Zhi An Ng · 2 years, 4 months ago
  62. f67f1be Implement labels and B.cond for aarch64 assembler by Zhi An Ng · 2 years, 4 months ago
  63. e2dc2ec Implement subs for aarch64 assembler by Zhi An Ng · 2 years, 4 months ago
  64. 234d6b4 Implement prfm (only PLDL1KEEP) on aarch64 assembler by Zhi An Ng · 2 years, 4 months ago
  65. 65ccb13 Implement movi for aarch64 assembler by Zhi An Ng · 2 years, 4 months ago
  66. 6e68f54 Implement ld1 for 1, 2, and 3 registers for aarch64 assembler by Zhi An Ng · 2 years, 4 months ago
  67. 5702efb Implement ld2r for aarch64 assembler by Zhi An Ng · 2 years, 4 months ago
  68. 04cdc41 Implement ldr for aarch64 assembler by Zhi An Ng · 2 years, 4 months ago
  69. 0ba29e7 Implement LDP for aarch64 assembler by Zhi An Ng · 2 years, 4 months ago
  70. 70ea0a2 Specialize F32 GEMM A53 JIT microkernel for min/max params by Zhi An Ng · 2 years, 4 months ago
  71. 109a5eb Initial aarch64 assembler structure by Zhi An Ng · 2 years, 4 months ago
  72. 66eb508 Add missing declarations and unit tests for F16 DWCONV microkernels by Marat Dukhan · 2 years, 4 months ago
  73. 0ec25cf Duplicate test methods in gemm-microkernel-test for JIT codegen, update IGEMM generator signature and test generation script. by Zhi An Ng · 2 years, 4 months ago
  74. e7225eb Specialize F32 GEMM (a53) on kc by Zhi An Ng · 2 years, 4 months ago
  75. 901845c QU8 4x8 NEON MLA Lane microkernel AArch32 assembly language by Frank Barchard · 2 years, 4 months ago
  76. b26ead1 F16C implementation of F16 GAVGPOOL microkernels by Marat Dukhan · 2 years, 4 months ago
  77. c7c92b0 Generate F16 GAVGPOOL NEONFP16ARITH microkernels from template by Marat Dukhan · 2 years, 4 months ago
  78. 1d6b7c9 Support FP32 weights in FP16 NC Fully Connected operator by Marat Dukhan · 2 years, 4 months ago
  79. 6989ec4 Support FP32 weights in FP16 NHWC Convolution operator by Marat Dukhan · 2 years, 4 months ago
  80. 5e1a303 QC8 GEMM/IGEMM assembly microkernels for ARMv7 NEON by Frank Barchard · 2 years, 4 months ago
  81. 83844ae Change JIT generator signature to accept nc and kc to specialize on those values by Zhi An Ng · 2 years, 4 months ago
  82. 667e0f1 Regenerate transpose tests by Alan Kelly · 2 years, 4 months ago
  83. 5da6d38 SSE2 transpose microkernel code generator. by Alan Kelly · 2 years, 4 months ago
  84. d19bde9 Add x64 scalar transpose microkernels by Alan Kelly · 2 years, 4 months ago
  85. cd21b02 Add x8 scalar transpose microkernels by Alan Kelly · 2 years, 4 months ago
  86. 84aae41 Add x16 scalar transpose microkernels by Alan Kelly · 2 years, 4 months ago
  87. af9ff85 Fix GEMM test templates to use variable n instead of fixed NR and regenerate tests by Zhi An Ng · 2 years, 4 months ago
  88. 8575504 Switch QS8/QU8 GAVGPOOL NEON microkernels to RNDNU requantization by Marat Dukhan · 2 years, 4 months ago
  89. 33a98fa Switch QS8/QU8 VMUL[C] NEON microkernels to RNDNU requantization by Marat Dukhan · 2 years, 4 months ago
  90. d1f53e4 Generate QU8 GAVGPOOL microkernels from QS8 GAVGPOOL templates by Marat Dukhan · 2 years, 4 months ago
  91. 9e258d6 Remove multi-accumulator support in QS8 GAVGPOOL microkernels by Marat Dukhan · 2 years, 4 months ago
  92. 7d45d90 Create a new jit-test for jit-related tests that are not architecture specific by Zhi An Ng · 2 years, 4 months ago
  93. d7a4b22 Generate missing QS8 GAVGPOOL microkernels by Marat Dukhan · 2 years, 4 months ago
  94. 847ff5e Refactor naming of QS8 GAVGPOOL microkernels by Marat Dukhan · 2 years, 4 months ago
  95. 53f4106 Switch QS8 GAVGPOOL microkernels to use FP32 requantization by Marat Dukhan · 2 years, 4 months ago
  96. 8f2eeee Skip calling __builtin_clear_cache on iOS, iOS uses sys_cache_invalidate by Zhi An Ng · 2 years, 4 months ago
  97. c27f04b Add missing generated unit tests to BUILD and CMakeLists.txt. by Zhi An Ng · 2 years, 4 months ago
  98. d6e2e1a Remove xnn_qu8_quantize_avgpool and xnn_qs8_quantize_avgpool helpers by Marat Dukhan · 2 years, 4 months ago
  99. 50323b8 Combine requantization with parameter initialization in unit tests by Marat Dukhan · 2 years, 4 months ago
  100. bd7f9a4 F16C implementation of F16 PRELU microkernels by Marat Dukhan · 2 years, 4 months ago