- f30a859 Port aarch64 F32 IGEMM 1x8 A75 microkernel to JIT, add tests, benchmarks, enable in init.c if JIT is enabled by Zhi An Ng · 2 years, 8 months ago
- eb7256b Port F32 GEMM A75 1x8 microkernel to JIT and specialize for min/max, add tests and benchmarks by Zhi An Ng · 2 years, 8 months ago
- 1425eb5 Copy IGEMM benchmark code into JIT's IGEMM benchmark code, and add JIT aarch64 generators to benchmarks by Zhi An Ng · 2 years, 8 months ago
- 2188833 Fix F32 IGEMM benchmark loop to not require capping NC to NR by Zhi An Ng · 2 years, 8 months ago
- 77d2885 QS8 AArch32 GEMM benchmark build fix by Frank Barchard · 2 years, 8 months ago
- 6cb0fd0 Add AArch32 GEMM benchmarks for Cortex A53 and Cortex A7 by Frank Barchard · 2 years, 8 months ago
- ca51090 QS8 GEMM benchmark for JIT add ISA check by Frank Barchard · 2 years, 8 months ago
- 9fd2f3e Fix passing of kc JIT generator in F32 GEMM benchmarks by Zhi An Ng · 2 years, 8 months ago
- 34251d8 QS8 4x8 lane GEMM AArch32 microkernel for Cortex A7 by Frank Barchard · 2 years, 8 months ago
- f82410d Enable QU8 4x8 lane GEMM AArch32 microkernel for Cortex A53 by Frank Barchard · 2 years, 8 months ago
- 9e4d2aa QS8 4x8 lane GEMM AArch32 microkernel for Cortex A53 by Frank Barchard · 2 years, 8 months ago
- cfd947d Add neon zip microkernel generator by Alan Kelly · 2 years, 8 months ago
- 3deae1d Guard JIT-related structs and functionality behind XNN_PLATFORM_JIT by Zhi An Ng · 2 years, 8 months ago
- f9fc9ec Integrate JIT generated GEMM microkernels into create_convolution2d_nhwc by Zhi An Ng · 2 years, 8 months ago
- 58cdcf2 Reoptimize QC8/QS8/QU8 GEMM/IGEMM WAsm SIMD microkernel selection by Marat Dukhan · 2 years, 8 months ago
- 348c377 QU8 GEMM/IGEMM WAsm SIMD microkernels with SR=4 by Marat Dukhan · 2 years, 8 months ago
- fbd67a7 Pad K to a multiple of SR in GEMM/IGEMM microkernels by Marat Dukhan · 2 years, 8 months ago
- f2b233b Make SSE2 microkernels consistent with neon zip microkernels. - DEC is now MOV by Alan Kelly · 2 years, 8 months ago
- 8b758bf Integrate JIT generated GEMM microkernels into create_convolution2d_nhwc by XNNPACK Team · 2 years, 8 months ago
- 64cb10f Guard JIT-related structs and functionality behind XNN_PLATFORM_JIT by XNNPACK Team · 2 years, 8 months ago
- c9a2e74 Guard JIT-related structs and functionality behind XNN_PLATFORM_JIT by Zhi An Ng · 2 years, 8 months ago
- df51e11 Integrate JIT generated GEMM microkernels into create_convolution2d_nhwc by Zhi An Ng · 2 years, 8 months ago
- d236074 Add F32 GEMM 6x8 aarch64 neonfma cortex a75 JIT microkernel to benchmark by Zhi An Ng · 2 years, 8 months ago
- 870108c QS8/QC8 4x8 dot product IGEMM AArch32 microkernel for Cortex A55 by Frank Barchard · 2 years, 8 months ago
- a1cad4a Add x8 transpose bench by Alan Kelly · 2 years, 8 months ago
- ba68f44 Add x64 transpose bench by Alan Kelly · 2 years, 8 months ago
- c821ea7 Refactor x16 transpose bench and add missing ukernels. by Alan Kelly · 2 years, 8 months ago
- e8bbda0 Re-factor x32 transpose bench by Alan Kelly · 2 years, 8 months ago
- 0f294ad QS8 4x8 dot product GEMM AArch32 microkernel for Cortex A55 by Frank Barchard · 2 years, 8 months ago
- 70ea0a2 Specialize F32 GEMM A53 JIT microkernel for min/max params by Zhi An Ng · 2 years, 8 months ago
- 0ec25cf Duplicate test methods in gemm-microkernel-test for JIT codegen, update IGEMM generator signature and test generation script. by Zhi An Ng · 2 years, 8 months ago
- 901845c QU8 4x8 NEON MLA Lane microkernel AArch32 assembly language by Frank Barchard · 2 years, 8 months ago
- 83844ae Change JIT generator signature to accept nc and kc to specialize on those values by Zhi An Ng · 2 years, 9 months ago
- 5da6d38 SSE2 transpose microkernel code generator. by Alan Kelly · 2 years, 9 months ago
- d7111a5 Remove F32 GEMM E2E JIT benchmarks (temporarily) as we are changing the JIT generator interface by Zhi An Ng · 2 years, 9 months ago
- 33a98fa Switch QS8/QU8 VMUL[C] NEON microkernels to RNDNU requantization by Marat Dukhan · 2 years, 9 months ago
- 717665f Add JIT microkernels to F32 GEMM E2E benchmarks by Zhi An Ng · 2 years, 9 months ago
- a30e2df Fix QU8 E2E lane benchmark tile sizes by Frank Barchard · 2 years, 9 months ago
- 2780863 Scalar transpose microkernel by Alan Kelly · 2 years, 9 months ago
- d5a5333 Additional tile sizes for QU8 neon lane microkernel. by Frank Barchard · 2 years, 9 months ago
- 645af97 FMA3 implementation of F16 DWCONV/VCLAMP/VMULCADDC microkernels by Marat Dukhan · 2 years, 9 months ago
- 1bef0f2 Add JIT microkernels to QS8 GEMM benchmarks by Zhi An Ng · 2 years, 9 months ago
- 665cb23 Add JIT microkernels to F32 IGEMM benchmarks by Zhi An Ng · 2 years, 9 months ago
- 25764d8 Add JIT microkernels to bench/f32-gemm by Zhi An Ng · 2 years, 9 months ago
- c4302c2 AVX2 implementations of F16 GEMM/IGEMM microkernels by Marat Dukhan · 2 years, 9 months ago
- 842bea9 Remove F16 VRELU microkernels by Marat Dukhan · 2 years, 9 months ago
- 58b17ba Remove VSCALE microkernels by Marat Dukhan · 2 years, 9 months ago
- 4a5c771 Refactor F32 RADDSTOREEXPMINUSMAX microkernels by Marat Dukhan · 2 years, 9 months ago
- 5999c92 Refactor naming of RADDSTOREEXPMINUSMAX microkernels by Marat Dukhan · 2 years, 9 months ago
- 5876744 Minor refactoring of RADDSTOREEXPMINUSMAX interface by Marat Dukhan · 2 years, 9 months ago
- ed90216 aarch64 transpose TBL microkernel by Alan Kelly · 2 years, 9 months ago
- 7c1115f Reoptimize microkernel selection for WAsm 1.0 by Marat Dukhan · 2 years, 9 months ago
- 7873586 Rename PLD to PRFM for aarch32 microkernels. by Frank Barchard · 2 years, 9 months ago
- 440e8ed Add FMAGIC/IMAGIC/LRINTF requantization variants in microkernel benchmarks by Marat Dukhan · 2 years, 9 months ago
- f721e37 LRINTF variants of scalar F32->QS8 and F32->QU8 VCVT microkernels by Marat Dukhan · 2 years, 9 months ago
- bdf1099 Refactor scalar F32->QS8 and F32->QU8 microkernels by Marat Dukhan · 2 years, 9 months ago
- 2ac722e Refactor requantization in scalar QS8/QC8/QU8 microkernels by Marat Dukhan · 2 years, 9 months ago
- ce834ad Refactor parameters in F32 VSIGMOID microkernels by Marat Dukhan · 2 years, 9 months ago
- 4a79ff2 Refactor parameters in F32 VELU microkernels by Marat Dukhan · 2 years, 9 months ago
- 9084fc8 Quantized Sigmoid and ELU benchmarks by Marat Dukhan · 2 years, 9 months ago
- 3ddc20c Benchmarks for Abs, Negate, and Square operators by Marat Dukhan · 2 years, 9 months ago
- 5c7fd89 Benchmark for Leaky ReLU operator by Marat Dukhan · 2 years, 9 months ago
- a0129e9 Refactor benchmarks for elementwise operators by Marat Dukhan · 2 years, 9 months ago
- e72b282 Refactor parameters in F32 VSQRT microkernels by Marat Dukhan · 2 years, 9 months ago
- 2894e99 Refactor F32 VLRELU microkernels by Marat Dukhan · 2 years, 9 months ago
- b7c1b71 Refactor F32->F16 VCVT microkernels by Marat Dukhan · 2 years, 9 months ago
- 134f984 Refactor F16->F32 VCVT microkernels by Marat Dukhan · 2 years, 9 months ago
- ef0f09c Add cpu clockrate to x16/x32_transpose benchmarks. by Frank Barchard · 2 years, 9 months ago
- 1945f0b SSE transpose x16 microkernel (4x8) by Alan Kelly · 2 years, 9 months ago
- 0d10cc7 Split VHSWISH parameter initialization functions per ISA by Marat Dukhan · 2 years, 9 months ago
- 4c61779 Minimally support WebAssembly Relaxed SIMD builds by Marat Dukhan · 2 years, 9 months ago
- e48b5c1 QS8 4x8 Neon Lane LD64 IGEMM AArch32 microkernel by Frank Barchard · 2 years, 9 months ago
- 4841021 QS8 4x8 dot product LD64 IGEMM AArch32 microkernel by Frank Barchard · 2 years, 9 months ago
- 9f3f420 QS8 4x8 LD64 dot product GEMM AArch32 microkernel by Frank Barchard · 2 years, 9 months ago
- 98393ad AVX512 QS8->F32 and QU8->F32 VCVT microkernels by Marat Dukhan · 2 years, 10 months ago
- fda06cb SSE transpose microkernel by Alan Kelly · 2 years, 10 months ago
- 7b5f779 AVX2 QS8->F32 and QU8->F32 VCVT microkernels by Marat Dukhan · 2 years, 10 months ago
- cd4089f AVX QS8->F32 and QU8->F32 VCVT microkernels by Marat Dukhan · 2 years, 10 months ago
- 2edf863 AVX512 F32->QS8 and F32->QU8 VCVT microkernels by Marat Dukhan · 2 years, 10 months ago
- 0d399ca AVX2 F32->QS8 and F32->QU8 VCVT microkernels by Marat Dukhan · 2 years, 10 months ago
- b91432c AVX F32->QS8 and F32->QU8 VCVT microkernels by Marat Dukhan · 2 years, 10 months ago
- 9820234 Full set of benchmarks for Convert operator by Marat Dukhan · 2 years, 10 months ago
- da7b2e2 QS8 4x8 lane GEMM AArch32 microkernel by Frank Barchard · 2 years, 10 months ago
- 710fb42 Benchmark for the Convert (F32->QS8) operator by Marat Dukhan · 2 years, 10 months ago
- 914f57b Aarch64 4x8 lane ld64 GEMM/IGEMM microkernels. by Frank Barchard · 2 years, 10 months ago
- ad6f2dc Benchmarks for QS8->F32 and QU8->F32 VCVT microkernels by Marat Dukhan · 2 years, 10 months ago
- 0f1ed94 QS8/QC8 GEMM/IGEMM WAsm SIMD microkernels using C2S4 layout by Marat Dukhan · 2 years, 10 months ago
- 8999190 Remove GEMMLOWP requantization from QS8 GEMM/IGEMM templates by Marat Dukhan · 2 years, 10 months ago
- 482508b Optimize FP32 requantization in ARMv7 NEON QS8/QU8 VMUL[C] by Marat Dukhan · 2 years, 10 months ago
- 430b173 F32->QS8/QU8 VCVT scalar microkernels using FP32 min/max by Marat Dukhan · 2 years, 10 months ago
- 5740f75 Fix trailing whitespace in VCVT benchmarks by Marat Dukhan · 2 years, 10 months ago
- 563eee1 Benchmarks for F32->QS8 and F32->QU8 VCVT microkernels by Marat Dukhan · 2 years, 10 months ago
- f82ea82 Add PRFM benchmarks for qs8 lane by Frank Barchard · 2 years, 10 months ago
- 27bf92c RNDNU versions of all Neon lane microkernels. by Frank Barchard · 2 years, 10 months ago
- 9cdc10d QU8 4x16 lane AArch64 NEON GEMM/IGEMM ld64 microkernel by Frank Barchard · 2 years, 10 months ago
- 5cffb64 4x16 lane AArch64 NEON GEMM/IGEMM ld64 microkernel by Frank Barchard · 2 years, 10 months ago
- 64ab1b7 LD1R and LD2R variants of c4 microkernel by Frank Barchard · 2 years, 10 months ago
- 15eec02 LD1R and LD2R variants of c2 microkernel by Frank Barchard · 2 years, 10 months ago
- 42f5c50 LOADDUP variant of c2 microkernel by Frank Barchard · 2 years, 10 months ago
- e22685a Remove padal from quantized microkernel names. by Frank Barchard · 2 years, 11 months ago