- eb7256b Port F32 GEMM A75 1x8 microkernel to JIT and specialize for min/max, add tests and benchmarks by Zhi An Ng · 2 years, 8 months ago
- 6b72e6c Convert F32 IGEMM for A75 to JIT, add tests by Zhi An Ng · 2 years, 8 months ago
- f0f374f Rename f32-gemm/6x8-aarch64-neonfma-prfm-cortex-a75.cc to remove prfm from file name by Zhi An Ng · 2 years, 8 months ago
- 1d5c616 Enable QU8 AAarch microkernels based on uarch by Frank Barchard · 2 years, 8 months ago
- 043c1f5 Include JIT_SRCS in XNNPACK build by Marat Dukhan · 2 years, 8 months ago
- 34251d8 QS8 4x8 lane GEMM AArch32 microkernel for Cortex A7 by Frank Barchard · 2 years, 8 months ago
- 101271e QC8 4x8 lane GEMM AArch32 microkernel for Cortex A7 by Frank Barchard · 2 years, 8 months ago
- 9e4d2aa QS8 4x8 lane GEMM AArch32 microkernel for Cortex A53 by Frank Barchard · 2 years, 8 months ago
- cfd947d Add neon zip microkernel generator by Alan Kelly · 2 years, 8 months ago
- 9dc0452 Link LibM to indirection target in CMake build by Marat Dukhan · 2 years, 8 months ago
- f2b233b Make SSE2 microkernels consistent with neon zip microkernels. - DEC is now MOV by Alan Kelly · 2 years, 8 months ago
- c2f62ea Remove redundant closing brace in CMakeLists by Marat Dukhan · 2 years, 8 months ago
- 870108c QS8/QC8 4x8 dot product IGEMM AArch32 microkernel for Cortex A55 by Frank Barchard · 2 years, 8 months ago
- c2e2da8 Fix conversion script for aarch64 assembly kernels and convert a single F32 GEMM as a test by Zhi An Ng · 2 years, 8 months ago
- a1cad4a Add x8 transpose bench by Alan Kelly · 2 years, 8 months ago
- ba68f44 Add x64 transpose bench by Alan Kelly · 2 years, 8 months ago
- ac654f1 QC8 4x8 dot product GEMM AArch32 microkernel for Cortex A55 by Frank Barchard · 2 years, 8 months ago
- 0f294ad QS8 4x8 dot product GEMM AArch32 microkernel for Cortex A55 by Frank Barchard · 2 years, 8 months ago
- 109a5eb Initial aarch64 assembler structure by Zhi An Ng · 2 years, 8 months ago
- 8f920a6 Initialize F16 microkernel pointers on x86 by Marat Dukhan · 2 years, 8 months ago
- ffbf7ff Cleanup transpose microkernels in BUILD & CMakeLists by Marat Dukhan · 2 years, 8 months ago
- 901845c QU8 4x8 NEON MLA Lane microkernel AArch32 assembly language by Frank Barchard · 2 years, 8 months ago
- b26ead1 F16C implementation of F16 GAVGPOOL microkernels by Marat Dukhan · 2 years, 8 months ago
- c7c92b0 Generate F16 GAVGPOOL NEONFP16ARITH microkernels from template by Marat Dukhan · 2 years, 8 months ago
- d2e8d4d Enable QC8 AArch32 4x8 lane GEMM/IGEMM assembly microkernels for ARMv7 NEON by Frank Barchard · 2 years, 9 months ago
- 5e1a303 QC8 GEMM/IGEMM assembly microkernels for ARMv7 NEON by Frank Barchard · 2 years, 9 months ago
- 5da6d38 SSE2 transpose microkernel code generator. by Alan Kelly · 2 years, 9 months ago
- d19bde9 Add x64 scalar transpose microkernels by Alan Kelly · 2 years, 9 months ago
- cd21b02 Add x8 scalar transpose microkernels by Alan Kelly · 2 years, 9 months ago
- 84aae41 Add x16 scalar transpose microkernels by Alan Kelly · 2 years, 9 months ago
- 2d38e3c Fix more errors in CMakeLists by Marat Dukhan · 2 years, 9 months ago
- 1e074d7 Fix CMake build by Marat Dukhan · 2 years, 9 months ago
- 8575504 Switch QS8/QU8 GAVGPOOL NEON microkernels to RNDNU requantization by Marat Dukhan · 2 years, 9 months ago
- 33a98fa Switch QS8/QU8 VMUL[C] NEON microkernels to RNDNU requantization by Marat Dukhan · 2 years, 9 months ago
- d1f53e4 Generate QU8 GAVGPOOL microkernels from QS8 GAVGPOOL templates by Marat Dukhan · 2 years, 9 months ago
- 9e258d6 Remove multi-accumulator support in QS8 GAVGPOOL microkernels by Marat Dukhan · 2 years, 9 months ago
- 7d45d90 Create a new jit-test for jit-related tests that are not architecture specific by Zhi An Ng · 2 years, 9 months ago
- 7781786 Enable QU8 3x8 lane for AArch32 by Frank Barchard · 2 years, 9 months ago
- d7a4b22 Generate missing QS8 GAVGPOOL microkernels by Marat Dukhan · 2 years, 9 months ago
- 847ff5e Refactor naming of QS8 GAVGPOOL microkernels by Marat Dukhan · 2 years, 9 months ago
- 53f4106 Switch QS8 GAVGPOOL microkernels to use FP32 requantization by Marat Dukhan · 2 years, 9 months ago
- 1789a3c Fix CMake builds by Zhi An Ng · 2 years, 9 months ago
- c27f04b Add missing generated unit tests to BUILD and CMakeLists.txt. by Zhi An Ng · 2 years, 9 months ago
- bd7f9a4 F16C implementation of F16 PRELU microkernels by Marat Dukhan · 2 years, 9 months ago
- 4c1fd6f Allow generate-gemm-test.py to accept multiple output files, and shard the generated tests across all specified output files. by Zhi An Ng · 2 years, 9 months ago
- d454545 F16C implementation of F16 VBINARY[C] microkernels by Marat Dukhan · 2 years, 9 months ago
- 717665f Add JIT microkernels to F32 GEMM E2E benchmarks by Zhi An Ng · 2 years, 9 months ago
- d90af6f Move gemm-microkernel-tester test code into separate cc file by Zhi An Ng · 2 years, 9 months ago
- 969e61f Enable 2x16 for QU8 neon lane microkernel in AArch32 by Frank Barchard · 2 years, 9 months ago
- 2780863 Scalar transpose microkernel by Alan Kelly · 2 years, 9 months ago
- a248337 Split more of qs8-gemm-minmax-rndnu out into another file, for microkernels with "c4" by Zhi An Ng · 2 years, 9 months ago
- d5a5333 Additional tile sizes for QU8 neon lane microkernel. by Frank Barchard · 2 years, 9 months ago
- 751f622 F16C implementation of F16 VHSWISH microkernels by Marat Dukhan · 2 years, 9 months ago
- 645af97 FMA3 implementation of F16 DWCONV/VCLAMP/VMULCADDC microkernels by Marat Dukhan · 2 years, 9 months ago
- 48c5e98 Fix CMake build on x86-64 by Marat Dukhan · 2 years, 9 months ago
- 1bef0f2 Add JIT microkernels to QS8 GEMM benchmarks by Zhi An Ng · 2 years, 9 months ago
- bf72b54 Split qc8-igemm-minmax-fp32.yaml into 2 files, all microkernels with c go into a separate file. by Zhi An Ng · 2 years, 9 months ago
- 49d94ca Split qc8-gemm-minmax-fp32.yaml into 2 files, all the microkernels with c goes into a separate file. by Zhi An Ng · 2 years, 9 months ago
- 25764d8 Add JIT microkernels to bench/f32-gemm by Zhi An Ng · 2 years, 9 months ago
- 0e0f726 Split qs8-gemm-minmax-rndnu.yaml into 2 files, all the microkernels with c2 suffix goes into a separate file. by Zhi An Ng · 2 years, 9 months ago
- c4302c2 AVX2 implementations of F16 GEMM/IGEMM microkernels by Marat Dukhan · 2 years, 9 months ago
- 842bea9 Remove F16 VRELU microkernels by Marat Dukhan · 2 years, 9 months ago
- 16b734c Add more QC8 GEMM/IGEMM JIT microkernels. by Zhi An Ng · 2 years, 9 months ago
- 58b17ba Remove VSCALE microkernels by Marat Dukhan · 2 years, 9 months ago
- ed73fb6 Add qc8 gemm and igemm JIT microkernels by Zhi An Ng · 2 years, 9 months ago
- 13b57dd Add more converted microkernels used in init.c. by Zhi An Ng · 2 years, 9 months ago
- 5999c92 Refactor naming of RADDSTOREEXPMINUSMAX microkernels by Marat Dukhan · 2 years, 9 months ago
- ed90216 aarch64 transpose TBL microkernel by Alan Kelly · 2 years, 9 months ago
- f290a14 Enable QC8 4x8 mla lane assembler microkernel by Frank Barchard · 2 years, 9 months ago
- f623740 QC8 NEON lane microkernels by Frank Barchard · 2 years, 9 months ago
- d8a1dbe Add RISC-V scalar microkernels to CMake build by Marat Dukhan · 2 years, 9 months ago
- 7873586 Rename PLD to PRFM for aarch32 microkernels. by Frank Barchard · 2 years, 9 months ago
- bd11e6a Add -fno-math-errno compilation option for scalar microkernels by Marat Dukhan · 2 years, 9 months ago
- cccb012 Apply sort and formatting to ARM code by Frank Barchard · 2 years, 9 months ago
- 272d4d9 FP32 IMAGIC variants of scalar QC8/QS8/QU8 GEMM/IGEMM/DWCONV microkernels by Marat Dukhan · 2 years, 9 months ago
- f721e37 LRINTF variants of scalar F32->QS8 and F32->QU8 VCVT microkernels by Marat Dukhan · 2 years, 9 months ago
- bdf1099 Refactor scalar F32->QS8 and F32->QU8 microkernels by Marat Dukhan · 2 years, 9 months ago
- 2ac722e Refactor requantization in scalar QS8/QC8/QU8 microkernels by Marat Dukhan · 2 years, 9 months ago
- ce834ad Refactor parameters in F32 VSIGMOID microkernels by Marat Dukhan · 2 years, 9 months ago
- 3ddc20c Benchmarks for Abs, Negate, and Square operators by Marat Dukhan · 2 years, 9 months ago
- 5c7fd89 Benchmark for Leaky ReLU operator by Marat Dukhan · 2 years, 9 months ago
- 134f984 Refactor F16->F32 VCVT microkernels by Marat Dukhan · 2 years, 9 months ago
- 2700809 Specify -mfp16-format=ieee for AArch32 GCC builds by Marat Dukhan · 2 years, 9 months ago
- 87fe410 QC8 quantization for all aarch32 GEMM/IGEMM microkernels by Frank Barchard · 2 years, 9 months ago
- 1945f0b SSE transpose x16 microkernel (4x8) by Alan Kelly · 2 years, 9 months ago
- b43b47a Add a script to convert existing assembly microkernels to JIT codegen. by Zhi An Ng · 2 years, 9 months ago
- 7a03a0f Merge pull request #2191 from xbwee1024:bugfix by XNNPACK Team · 2 years, 9 months ago
- e0f15ad Split scalar production microkernels into portable, AArch32, and Wasm by Marat Dukhan · 2 years, 9 months ago
- f98f58d Lowering to c++11 as c++14 literals was converted to c++11 in #2192 by xbwee · 2 years, 9 months ago
- 562112e Fix build error with cmake for src/jit. by xbwee · 2 years, 9 months ago
- 9519816 Enable QS8 4x8 LD64 Neon on AArch32 by Frank Barchard · 2 years, 9 months ago
- 1e9c5ac Fix CMake build by Marat Dukhan · 2 years, 9 months ago
- e48b5c1 QS8 4x8 Neon Lane LD64 IGEMM AArch32 microkernel by Frank Barchard · 2 years, 9 months ago
- 4841021 QS8 4x8 dot product LD64 IGEMM AArch32 microkernel by Frank Barchard · 2 years, 9 months ago
- 9f3f420 QS8 4x8 LD64 dot product GEMM AArch32 microkernel by Frank Barchard · 2 years, 10 months ago
- 98393ad AVX512 QS8->F32 and QU8->F32 VCVT microkernels by Marat Dukhan · 2 years, 10 months ago
- fda06cb SSE transpose microkernel by Alan Kelly · 2 years, 10 months ago
- 7b5f779 AVX2 QS8->F32 and QU8->F32 VCVT microkernels by Marat Dukhan · 2 years, 10 months ago
- cd4089f AVX QS8->F32 and QU8->F32 VCVT microkernels by Marat Dukhan · 2 years, 10 months ago
- 2edf863 AVX512 F32->QS8 and F32->QU8 VCVT microkernels by Marat Dukhan · 2 years, 10 months ago