1. b10677e Implement unconditional branch for aarch64 assembler by Zhi An Ng · 2 years, 5 months ago
  2. 56e8b91 Implement tbz for aarch64 assembler by Zhi An Ng · 2 years, 5 months ago
  3. cdfff79 Implement ret for aarch64 assembler by Zhi An Ng · 2 years, 5 months ago
  4. 039a388 Exclude quantized AVX512 microkernels from mobile builds by Marat Dukhan · 2 years, 5 months ago
  5. 3176868 Implement sub (x register) for aarch64 assembler by Zhi An Ng · 2 years, 5 months ago
  6. 3f34299 Implement st1 for aarch64 assembler by Zhi An Ng · 2 years, 5 months ago
  7. 544d73d Implement fmax and fmin (vector) for aarch64 assembler by Zhi An Ng · 2 years, 5 months ago
  8. ecfb1f0 Implement fadd (vector) for aarch64 assembler by Zhi An Ng · 2 years, 5 months ago
  9. 0981080 Implement tbnz for aarch64 assembler by Zhi An Ng · 2 years, 5 months ago
  10. 6a1151b Implement fmla for aarch64 assembler by Zhi An Ng · 2 years, 5 months ago
  11. 157b0f4 Implement ldr ldp for q registers in aarch64 assembler by Zhi An Ng · 2 years, 5 months ago
  12. f67f1be Implement labels and B.cond for aarch64 assembler by Zhi An Ng · 2 years, 5 months ago
  13. e2dc2ec Implement subs for aarch64 assembler by Zhi An Ng · 2 years, 5 months ago
  14. 234d6b4 Implement prfm (only PLDL1KEEP) on aarch64 assembler by Zhi An Ng · 2 years, 5 months ago
  15. 65ccb13 Implement movi for aarch64 assembler by Zhi An Ng · 2 years, 5 months ago
  16. 6e68f54 Implement ld1 for 1, 2, and 3 registers for aarch64 assembler by Zhi An Ng · 2 years, 5 months ago
  17. 5702efb Implement ld2r for aarch64 assembler by Zhi An Ng · 2 years, 5 months ago
  18. 04cdc41 Implement ldr for aarch64 assembler by Zhi An Ng · 2 years, 5 months ago
  19. 0ba29e7 Implement LDP for aarch64 assembler by Zhi An Ng · 2 years, 5 months ago
  20. 70ea0a2 Specialize F32 GEMM A53 JIT microkernel for min/max params by Zhi An Ng · 2 years, 5 months ago
  21. 109a5eb Initial aarch64 assembler structure by Zhi An Ng · 2 years, 5 months ago
  22. 8f920a6 Initialize F16 microkernel pointers on x86 by Marat Dukhan · 2 years, 5 months ago
  23. 66eb508 Add missing declarations and unit tests for F16 DWCONV microkernels by Marat Dukhan · 2 years, 5 months ago
  24. 0ec25cf Duplicate test methods in gemm-microkernel-test for JIT codegen, update IGEMM generator signature and test generation script. by Zhi An Ng · 2 years, 5 months ago
  25. e7225eb Specialize F32 GEMM (a53) on kc by Zhi An Ng · 2 years, 5 months ago
  26. 8d07e40 Enable QU8 4x8 NEON MLA Lane microkernel AArch32 assembly language by Frank Barchard · 2 years, 5 months ago
  27. 901845c QU8 4x8 NEON MLA Lane microkernel AArch32 assembly language by Frank Barchard · 2 years, 5 months ago
  28. b26ead1 F16C implementation of F16 GAVGPOOL microkernels by Marat Dukhan · 2 years, 5 months ago
  29. c7c92b0 Generate F16 GAVGPOOL NEONFP16ARITH microkernels from template by Marat Dukhan · 2 years, 5 months ago
  30. 01f6aee Add unreachable check for F32 GEMM a53 generator by Zhi An Ng · 2 years, 5 months ago
  31. 13599f3 Specialize F32 GEMM (a53) on nc by Zhi An Ng · 2 years, 5 months ago
  32. 1d6b7c9 Support FP32 weights in FP16 NC Fully Connected operator by Marat Dukhan · 2 years, 6 months ago
  33. d2e8d4d Enable QC8 AArch32 4x8 lane GEMM/IGEMM assembly microkernels for ARMv7 NEON by Frank Barchard · 2 years, 6 months ago
  34. 6989ec4 Support FP32 weights in FP16 NHWC Convolution operator by Marat Dukhan · 2 years, 6 months ago
  35. 5e1a303 QC8 GEMM/IGEMM assembly microkernels for ARMv7 NEON by Frank Barchard · 2 years, 6 months ago
  36. 83844ae Change JIT generator signature to accept nc and kc to specialize on those values by Zhi An Ng · 2 years, 6 months ago
  37. 9dfdfb5 Remove unused transpose function declarations. by Alan Kelly · 2 years, 6 months ago
  38. 5da6d38 SSE2 transpose microkernel code generator. by Alan Kelly · 2 years, 6 months ago
  39. d19bde9 Add x64 scalar transpose microkernels by Alan Kelly · 2 years, 6 months ago
  40. cd21b02 Add x8 scalar transpose microkernels by Alan Kelly · 2 years, 6 months ago
  41. 84aae41 Add x16 scalar transpose microkernels by Alan Kelly · 2 years, 6 months ago
  42. 6315472 Remove declarations for scalar transpose microkernels that don't exist by Alan Kelly · 2 years, 6 months ago
  43. 58fe65e Change default JIT code buffer size to 16kb by Zhi An Ng · 2 years, 6 months ago
  44. 8575504 Switch QS8/QU8 GAVGPOOL NEON microkernels to RNDNU requantization by Marat Dukhan · 2 years, 6 months ago
  45. 33a98fa Switch QS8/QU8 VMUL[C] NEON microkernels to RNDNU requantization by Marat Dukhan · 2 years, 6 months ago
  46. d1f53e4 Generate QU8 GAVGPOOL microkernels from QS8 GAVGPOOL templates by Marat Dukhan · 2 years, 6 months ago
  47. d81fa0a Pipeline remaining QS8 AVGPOOL microkernels by Marat Dukhan · 2 years, 6 months ago
  48. 139337c Include vcvtnq_f32 polyfill in QS8 GAVGPOOL NEONV8 microkernels by Marat Dukhan · 2 years, 6 months ago
  49. 9e258d6 Remove multi-accumulator support in QS8 GAVGPOOL microkernels by Marat Dukhan · 2 years, 6 months ago
  50. 7d45d90 Create a new jit-test for jit-related tests that are not architecture specific by Zhi An Ng · 2 years, 6 months ago
  51. 7781786 Enable QU8 3x8 lane for AArch32 by Frank Barchard · 2 years, 6 months ago
  52. d7a4b22 Generate missing QS8 GAVGPOOL microkernels by Marat Dukhan · 2 years, 6 months ago
  53. 6faf955 Reoptimize SSE QS8 GAVGPOOL microkernels by Marat Dukhan · 2 years, 6 months ago
  54. 847ff5e Refactor naming of QS8 GAVGPOOL microkernels by Marat Dukhan · 2 years, 6 months ago
  55. 53f4106 Switch QS8 GAVGPOOL microkernels to use FP32 requantization by Marat Dukhan · 2 years, 6 months ago
  56. e9e9708 Add missing asserts for requantization scale in MUL parameters initialization by Marat Dukhan · 2 years, 6 months ago
  57. 34cb23f Re-generate amalgamated microkernels by Marat Dukhan · 2 years, 6 months ago
  58. 1581248 Specify 8-byte alignment for packed WAsm SIMD parameters by Marat Dukhan · 2 years, 6 months ago
  59. 90a10b8 Replicate QS8/QU8 MUL WAsm SIMD parameters to 64 bit rather than 128 bit by Marat Dukhan · 2 years, 6 months ago
  60. 3b32963 Fix bug in not changing memory to be executable when we have unused capacity. by Zhi An Ng · 2 years, 6 months ago
  61. 8f2eeee Skip calling __builtin_clear_cache on iOS, iOS uses sys_cache_invalidate by Zhi An Ng · 2 years, 6 months ago
  62. e7242ea Replicate QS8/QU8 ADDSUB WAsm SIMD parameters to 64 bit rather than 128 bit by Marat Dukhan · 2 years, 6 months ago
  63. 48d74c3 Replicate QC8/QS8/QU8 CONV WAsm SIMD parameters to 64 bit rather than 128 bit by Marat Dukhan · 2 years, 6 months ago
  64. d6e2e1a Remove xnn_qu8_quantize_avgpool and xnn_qs8_quantize_avgpool helpers by Marat Dukhan · 2 years, 6 months ago
  65. 50323b8 Combine requantization with parameter initialization in unit tests by Marat Dukhan · 2 years, 6 months ago
  66. bd7f9a4 F16C implementation of F16 PRELU microkernels by Marat Dukhan · 2 years, 6 months ago
  67. 4897670 Re-enable up to AVX2 microkernels on Android x86/x86-64 & iOS simulator builds by Marat Dukhan · 2 years, 6 months ago
  68. 603ec5f Remove unused declarations for F16 VRELU microkernels by Marat Dukhan · 2 years, 6 months ago
  69. 085102c Reoptimize pointer updates in PRELU microkernels by Marat Dukhan · 2 years, 6 months ago
  70. 3ab63b0 Rollback "Enable up to AVX2 microkernels on Android x86/x86-64 builds" by XNNPACK Team · 2 years, 6 months ago
  71. d454545 F16C implementation of F16 VBINARY[C] microkernels by Marat Dukhan · 2 years, 6 months ago
  72. 1f1ee2c Enable up to AVX2 microkernels on Android x86/x86-64 builds by Marat Dukhan · 2 years, 6 months ago
  73. d90af6f Move gemm-microkernel-tester test code into separate cc file by Zhi An Ng · 2 years, 6 months ago
  74. 969e61f Enable 2x16 for QU8 neon lane microkernel in AArch32 by Frank Barchard · 2 years, 6 months ago
  75. 2780863 Scalar transpose microkernel by Alan Kelly · 2 years, 6 months ago
  76. 49979b6 Implement vldr for S registers by Zhi An Ng · 2 years, 6 months ago
  77. a72cde3 Reoptimize pointer updates in VMULCADDC microkernels by Marat Dukhan · 2 years, 6 months ago
  78. d5a5333 Additional tile sizes for QU8 neon lane microkernel. by Frank Barchard · 2 years, 6 months ago
  79. 751f622 F16C implementation of F16 VHSWISH microkernels by Marat Dukhan · 2 years, 6 months ago
  80. 645af97 FMA3 implementation of F16 DWCONV/VCLAMP/VMULCADDC microkernels by Marat Dukhan · 2 years, 6 months ago
  81. 8459822 Split F32 SCALEMINMAX parameter initialization functions by ISA by Marat Dukhan · 2 years, 6 months ago
  82. f2e2edf Round results to FP16 after multiplication by scale in AVX2 F16 GEMM/IGEMM by Marat Dukhan · 2 years, 6 months ago
  83. 3c949a3 Split QS8/QU8 AVGPOOL parameter initialization functions by ISA by Marat Dukhan · 2 years, 6 months ago
  84. 9f8eac7 Avoid _mm_loadu_si16 and _mm_storeu_si16 unsupported on older compilers by Marat Dukhan · 2 years, 6 months ago
  85. da382d1 Refactor parameter initialization for AVGPOOL/GAVGPOOL/PAVGPOOL microkernels by Marat Dukhan · 2 years, 6 months ago
  86. 4a6dca9 Specify parameter initialization function in [P]AVGPOOL microkernel tests by Marat Dukhan · 2 years, 6 months ago
  87. 4e5a767 Rename xnn_f32_scaleminmax_params.sse2 to xnn_f32_scaleminmax_params.sse by Marat Dukhan · 2 years, 6 months ago
  88. 5d456ce Refactor naming of QS8/QU8 AVGPOOL parameters by Marat Dukhan · 2 years, 6 months ago
  89. 25764d8 Add JIT microkernels to bench/f32-gemm by Zhi An Ng · 2 years, 6 months ago
  90. c4302c2 AVX2 implementations of F16 GEMM/IGEMM microkernels by Marat Dukhan · 2 years, 6 months ago
  91. 842bea9 Remove F16 VRELU microkernels by Marat Dukhan · 2 years, 6 months ago
  92. 14dd8d0 Convert F16 parameter structures to unions by Marat Dukhan · 2 years, 6 months ago
  93. 16b734c Add more QC8 GEMM/IGEMM JIT microkernels. by Zhi An Ng · 2 years, 6 months ago
  94. 58b17ba Remove VSCALE microkernels by Marat Dukhan · 2 years, 6 months ago
  95. ed73fb6 Add qc8 gemm and igemm JIT microkernels by Zhi An Ng · 2 years, 6 months ago
  96. 29d9acd Implement vcvt vcvtn vmul_f32, these are used in qc8 microkernels. by Zhi An Ng · 2 years, 6 months ago
  97. 13b57dd Add more converted microkernels used in init.c. by Zhi An Ng · 2 years, 6 months ago
  98. 8a9eac6 Amalgamate AVX, AVX2, and FMA3 microkernels by Marat Dukhan · 2 years, 6 months ago
  99. 4a5c771 Refactor F32 RADDSTOREEXPMINUSMAX microkernels by Marat Dukhan · 2 years, 6 months ago
  100. 5999c92 Refactor naming of RADDSTOREEXPMINUSMAX microkernels by Marat Dukhan · 2 years, 6 months ago