1. 272d4d9 FP32 IMAGIC variants of scalar QC8/QS8/QU8 GEMM/IGEMM/DWCONV microkernels by Marat Dukhan · 2 years, 9 months ago
  2. f721e37 LRINTF variants of scalar F32->QS8 and F32->QU8 VCVT microkernels by Marat Dukhan · 2 years, 9 months ago
  3. bdf1099 Refactor scalar F32->QS8 and F32->QU8 microkernels by Marat Dukhan · 2 years, 9 months ago
  4. 2ac722e Refactor requantization in scalar QS8/QC8/QU8 microkernels by Marat Dukhan · 2 years, 9 months ago
  5. 0e80137 Refactor parameters in F32 VRND microkernels by Marat Dukhan · 2 years, 9 months ago
  6. bbfc27d Refactor NEON/NEONFMA VSIGMOID microkernels by Marat Dukhan · 2 years, 9 months ago
  7. ce834ad Refactor parameters in F32 VSIGMOID microkernels by Marat Dukhan · 2 years, 9 months ago
  8. 05b6cb1 Transpose microkernel tester uses iota instead of rng so that it's easier to debug tests by Alan Kelly · 2 years, 9 months ago
  9. 4a79ff2 Refactor parameters in F32 VELU microkernels by Marat Dukhan · 2 years, 9 months ago
  10. e5efb16 Refactor VUNARY microkernel parameters by Marat Dukhan · 2 years, 9 months ago
  11. e72b282 Refactor parameters in F32 VSQRT microkernels by Marat Dukhan · 2 years, 9 months ago
  12. 98c5215 Move mask_table into VBINARY[C] AVX microkernel parameters by Marat Dukhan · 2 years, 9 months ago
  13. d57186a Refactor F32 VMULCADDC parameters by Marat Dukhan · 2 years, 9 months ago
  14. f600497 Refactor parameter initialization in Vector Binary Elementwise microkernels by Marat Dukhan · 2 years, 9 months ago
  15. c83ef3b Refactor F32 MINMAX parameters for WAsm SIMD by Marat Dukhan · 2 years, 9 months ago
  16. 2894e99 Refactor F32 VLRELU microkernels by Marat Dukhan · 2 years, 9 months ago
  17. b7c1b71 Refactor F32->F16 VCVT microkernels by Marat Dukhan · 2 years, 9 months ago
  18. 134f984 Refactor F16->F32 VCVT microkernels by Marat Dukhan · 2 years, 9 months ago
  19. 87fe410 QC8 quantization for all aarch32 GEMM/IGEMM microkernels by Frank Barchard · 2 years, 9 months ago
  20. 447aa7b #include allocator.h header to gemm tests. by Frank Barchard · 2 years, 9 months ago
  21. 1945f0b SSE transpose x16 microkernel (4x8) by Alan Kelly · 2 years, 9 months ago
  22. 0d10cc7 Split VHSWISH parameter initialization functions per ISA by Marat Dukhan · 2 years, 9 months ago
  23. b43b47a Add a script to convert existing assembly microkernels to JIT codegen. by Zhi An Ng · 2 years, 9 months ago
  24. e4d3f76 Mark aarch64 microkernels as assembly for tests by Frank Barchard · 2 years, 9 months ago
  25. 0db2e4c Support - (minus) operator for creating S/D register lists, this looks closer to native assembly. by Zhi An Ng · 2 years, 9 months ago
  26. 2493de9 WAsmSIMD transpose microkernel by Alan Kelly · 2 years, 9 months ago
  27. c80ffb0 Fix generation of gemm tests for ADJBLOCK and rerun scripts. by Zhi An Ng · 2 years, 9 months ago
  28. e31f29e Declare assembly for QS8 microkernels by Frank Barchard · 2 years, 9 months ago
  29. 4c61779 Minimally support WebAssembly Relaxed SIMD builds by Marat Dukhan · 2 years, 9 months ago
  30. 50b0bd9 Fix encoding and supported immediate values for vldr and vstr. by Zhi An Ng · 2 years, 9 months ago
  31. 1aac8e8 Implement vmrs (FPSCR) by Zhi An Ng · 2 years, 9 months ago
  32. 0a1b7b6 Implement ldrd (immediate) by Zhi An Ng · 2 years, 9 months ago
  33. 26e55ed Implement vstr instruction by Zhi An Ng · 2 years, 9 months ago
  34. 932e823 Implement str (imm) by Zhi An Ng · 2 years, 9 months ago
  35. 4ebd680 Implement moveq, cmp (imm), sub (imm). by Zhi An Ng · 2 years, 9 months ago
  36. 2b74ddd Implement vld1_8 with offset register by Zhi An Ng · 2 years, 9 months ago
  37. fea422d Implement vld1_32 (single element to one lane). by Zhi An Ng · 2 years, 9 months ago
  38. e48b5c1 QS8 4x8 Neon Lane LD64 IGEMM AArch32 microkernel by Frank Barchard · 2 years, 9 months ago
  39. 4841021 QS8 4x8 dot product LD64 IGEMM AArch32 microkernel by Frank Barchard · 2 years, 9 months ago
  40. 938ee9b Implement bic, vld1_8 and vld1_32 for QRegisterList, assert encodings don't error out in tests. by Zhi An Ng · 2 years, 9 months ago
  41. 9364bdc Implement vsdot_s8 instruction by Zhi An Ng · 2 years, 9 months ago
  42. a251f87 Implement vqmovn_s16, and_, adds. by Zhi An Ng · 2 years, 9 months ago
  43. 7c8090d Implement vcmpe_f32, vmovpl_f32, vmovmi_f32. by Zhi An Ng · 2 years, 9 months ago
  44. 2d8180c Implement 2-argument add, vmla_f32, vmov_f32, vmov_f64, vstm. by Zhi An Ng · 2 years, 9 months ago
  45. 9f3f420 QS8 4x8 LD64 dot product GEMM AArch32 microkernel by Frank Barchard · 2 years, 10 months ago
  46. b63e84c Implement b (unconditional branch) by Zhi An Ng · 2 years, 10 months ago
  47. be4e6a5 Add align for aligning instructions (similar to .align in assembly) by Zhi An Ng · 2 years, 10 months ago
  48. ec17e99 Add license to files by Zhi An Ng · 2 years, 10 months ago
  49. fda06cb SSE transpose microkernel by Alan Kelly · 2 years, 10 months ago
  50. 7b5f779 AVX2 QS8->F32 and QU8->F32 VCVT microkernels by Marat Dukhan · 2 years, 10 months ago
  51. cd4089f AVX QS8->F32 and QU8->F32 VCVT microkernels by Marat Dukhan · 2 years, 10 months ago
  52. 2edf863 AVX512 F32->QS8 and F32->QU8 VCVT microkernels by Marat Dukhan · 2 years, 10 months ago
  53. 0d399ca AVX2 F32->QS8 and F32->QU8 VCVT microkernels by Marat Dukhan · 2 years, 10 months ago
  54. 3bdbe9f Fix xnn_release_code_memory to unmap entire capacity of buffer by Zhi An Ng · 2 years, 10 months ago
  55. b91432c AVX F32->QS8 and F32->QU8 VCVT microkernels by Marat Dukhan · 2 years, 10 months ago
  56. 6fac719 Implement vqmovn_s32 and vext_8 by Zhi An Ng · 2 years, 10 months ago
  57. 4a58583 Implement vdup_8, vdup_16, vdup_32 by Zhi An Ng · 2 years, 10 months ago
  58. 2649014 Implement vmax_s8, vmin_s8, vqadd_s16, vqdmulh_s32, vqshl_s32, vrshl_s32 by Zhi An Ng · 2 years, 10 months ago
  59. 4ef8d51 Implement vst1_16, add some more test cases by Zhi An Ng · 2 years, 10 months ago
  60. 00a929f Implement vst1_8 and fix vst1_32 encoding by Zhi An Ng · 2 years, 10 months ago
  61. a38a161 Implement vld1_8, vmlal_s16, vmovl_s8 by Zhi An Ng · 2 years, 10 months ago
  62. 6883abb JIT memory allocation and integration into Assembler by Zhi An Ng · 2 years, 10 months ago
  63. da7b2e2 QS8 4x8 lane GEMM AArch32 microkernel by Frank Barchard · 2 years, 10 months ago
  64. 4f36e85 Fully quality std::isnormal in ConvertOperatorTester by Marat Dukhan · 2 years, 10 months ago
  65. 914f57b Aarch64 4x8 lane ld64 GEMM/IGEMM microkernels. by Frank Barchard · 2 years, 10 months ago
  66. f92206b QS8->F32 and QU8->F32 Convert NC operators by Marat Dukhan · 2 years, 10 months ago
  67. 86bd270 Scalar QS8/QU8 -> F32 VCVT microkernels by Marat Dukhan · 2 years, 10 months ago
  68. d873fa2 SSE2 QS8/QU8->F32 VCVT microkernels by Marat Dukhan · 2 years, 10 months ago
  69. fbf12b0 WAsm SIMD QS8/QU8 -> F32 VCVT microkernels by Marat Dukhan · 2 years, 10 months ago
  70. f9cf55d SSE4.1 QS8/QU8->F32 VCVT microkernels by Marat Dukhan · 2 years, 10 months ago
  71. fee66be NEON QS8/QU8 -> F32 VCVT microkernels by Marat Dukhan · 2 years, 10 months ago
  72. 4bdc9f5 Refactor VCVT microkernels by Marat Dukhan · 2 years, 10 months ago
  73. 10475ec Implement bx instruction by Zhi An Ng · 2 years, 10 months ago
  74. 16f3548 Implement pop and vpop (for D registers) by Zhi An Ng · 2 years, 10 months ago
  75. fe4a750 Implement vst1_32 (multiple single elements) and vst1_32 (single element from one lane) by Zhi An Ng · 2 years, 10 months ago
  76. ea612bc Implement vmax_f32 and vmin_f32 by Zhi An Ng · 2 years, 10 months ago
  77. 2fce75b Implement tst with immediate by Zhi An Ng · 2 years, 10 months ago
  78. f73e55b Implement add with immediate (drive-by fix for missing return when error in push) by Zhi An Ng · 2 years, 10 months ago
  79. c9f70f7 Implement vmla.f32, add DRegisterLane for lane-indexed DRegister by Zhi An Ng · 2 years, 10 months ago
  80. 1a55180 Merge pull request #2036 from digantdesai:enable_fp32_arm_kernels by XNNPACK Team · 2 years, 10 months ago
  81. 0f1ed94 QS8/QC8 GEMM/IGEMM WAsm SIMD microkernels using C2S4 layout by Marat Dukhan · 2 years, 10 months ago
  82. dfe8929 Implement vld1 (multiple single element) and vld1r (single element to all lanes) by Zhi An Ng · 2 years, 10 months ago
  83. 737ad01 Add .clang-format and reformat jit related files by Zhi An Ng · 2 years, 10 months ago
  84. 477bdbb Implement vldr instruction by Zhi An Ng · 2 years, 10 months ago
  85. f4beaf1 Implement vmov (q to q, d to d, s to s, core to d) by Zhi An Ng · 2 years, 10 months ago
  86. 637becf Implement vldm instruction by Zhi An Ng · 2 years, 10 months ago
  87. 68c27d3 Implement vpush, add SIMD registers and register lists. by Zhi An Ng · 2 years, 10 months ago
  88. 59d6515 Enable FP32 requant variant for QU8 [1,4]x8 Neon MLAL [I]GEMM kernels by Digant Desai · 2 years, 10 months ago
  89. 9982ed3 Enable FP32 requant variant for QU8 NEON dotprod [I]GEMM kernels by Digant Desai · 2 years, 10 months ago
  90. 65584bd Implement labels and branches by Zhi An Ng · 2 years, 10 months ago
  91. 2e2d179 Enable FP32 requant variant for QU8 4x16c4 NEON asm dotprod [I]GEMM kernels by Digant Desai · 2 years, 10 months ago
  92. 10f9f62 Enable FP32 requant variant for QU8 4x16c4 NEON asm dotprod [I]GEMM kernels for CA55r1 by Digant Desai · 2 years, 10 months ago
  93. e20a873 Optimize selection of QS8/QU8 VADD[C] microkernels on WAsm SIMD by Marat Dukhan · 2 years, 10 months ago
  94. d221c54 Better formatting for instruction encoding test errors by Zhi An Ng · 2 years, 10 months ago
  95. 591b917 Implement pld instruction. by Zhi An Ng · 2 years, 10 months ago
  96. 4ab7b93 Implement sub and subs instructions. by Zhi An Ng · 2 years, 10 months ago
  97. ff2e8b2 Implement mov instruction. by Zhi An Ng · 2 years, 10 months ago
  98. 663b4fe Implement cmp instruction. by Zhi An Ng · 2 years, 10 months ago
  99. c9ffad7 Add support for MemOperand with addressing mode and ldr instruction. by Zhi An Ng · 2 years, 10 months ago
  100. 7988a18 Refactoring xnn_qs8_minmax_params for NEON/NEONv8 by Marat Dukhan · 2 years, 10 months ago