- 9fe932e Partial port to Windows and MSVC by Marat Dukhan · 4 years, 3 months ago
- 36b76b6 1x16 LD32 F16 GEMM by Frank Barchard · 4 years, 3 months ago
- 3cb54f9 1x8 LD64 F32 GEMM by Frank Barchard · 4 years, 3 months ago
- 683f559 FP16 4x16 and 6x16 GEMM ld32 microkernels by Frank Barchard · 4 years, 3 months ago
- 355ab43 Rename SpMM micro-kernels by Marat Dukhan · 4 years, 3 months ago
- aefaef3 Prepare xnn_params for variations in fused activations by Marat Dukhan · 4 years, 3 months ago
- 163a7e6 Scalar & WAsm GEMM/IGEMM/DWCONV micro-kernels without activation by Marat Dukhan · 4 years, 3 months ago
- de06f49 Add MINMAX suffix to GEMM/IGEMM/DWCONV/PPMM micro-kernel names by Marat Dukhan · 4 years, 3 months ago
- eb09a6b Rename F32/U8 output params to minmax params by Marat Dukhan · 4 years, 3 months ago
- 05702cf Dynamically choose micro-kernel depending on active core by Marat Dukhan · 4 years, 3 months ago
- b038fdc Adapt XNNPACK to the move of ruy to its own GitHub repository. by Benoit Jacob · 4 years, 3 months ago
- 7d3a8c3 Make e2e DWCONV benchmark compatible with older gcc by Marat Dukhan · 4 years, 3 months ago
- 0d1052c iOS 6x8 microkernel based on Cortex-A75 but with X18 avoided. by Frank Barchard · 4 years, 3 months ago
- 8fb9055 4x8 GEMM and IGEMM microkernels for Cortex A55. 7.8% faster for e2e mobile net v2. by Frank Barchard · 4 years, 4 months ago
- 99103dc Improve compatibility with older gcc versions by Marat Dukhan · 4 years, 4 months ago
- b7dd29e 4x8 GEMM and IGEMM microkernels for AARCH32 Cortex A55. 11.5% faster end to end: by Frank Barchard · 4 years, 4 months ago
- 7a16d8b Update Average Pooling operator benchmark by Marat Dukhan · 4 years, 4 months ago
- fe7acb6 Targets for requantization tests and benchmarks by Marat Dukhan · 4 years, 4 months ago
- 91e1999 6x8 GEMM and IGEMM microkernels for Cortex A55. 9% faster end to end: by Frank Barchard · 4 years, 4 months ago
- 9fd7e25 Use cpuinfo_get_max_cache_size in bench-utils by Marat Dukhan · 4 years, 4 months ago
- b6862d1 Cleanup unused SoftMax benchmark presets by Marat Dukhan · 4 years, 4 months ago
- 8d3c693 Optional SoftMax benchmarks vs DNNL by Marat Dukhan · 4 years, 4 months ago
- 5b3185b Merge pull request #381 from mattn:build-windows by XNNPACK Team · 4 years, 4 months ago
- 462be05 Build on Windows by Yasuhiro Matsumoto · 4 years, 4 months ago
- c8230a4 Remove output_min & output_max arguments in PReLU operator by Marat Dukhan · 4 years, 4 months ago
- c87a8fd Cortex A53 IGEMM 32 bit ARM by Frank Barchard · 4 years, 5 months ago
- 90ce789 Cortex A75 IGEMM 32 bit ARM. by Frank Barchard · 4 years, 5 months ago
- dc38f07 LD64 IGEMM 32 bit ARM by Frank Barchard · 4 years, 5 months ago
- bdb56f5 FP16 versions of SpMM micro-kernels by Marat Dukhan · 4 years, 5 months ago
- a7b22c1 Fix wrong MR specifications in IGEMM benchmark by Marat Dukhan · 4 years, 5 months ago
- 9c0db96 F32 Softmax operator benchmark by Marat Dukhan · 4 years, 5 months ago
- fd8e689 Rename SoftArgMax operator to SoftMax by Marat Dukhan · 4 years, 5 months ago
- e104aa3 Remove extraneous argument from mobilenetv3 igemm benchmark. by Frank Barchard · 4 years, 5 months ago
- 8137e4c NEON/NEONFMA RAddStoreExpMinusMax micro-kernels by Marat Dukhan · 4 years, 5 months ago
- b39689d SSE2/PSIMD RAddStoreExpMinusMax micro-kernels by Marat Dukhan · 4 years, 5 months ago
- f46f675 Scalar RAddStoreExpMinusMax micro-kernels by Marat Dukhan · 4 years, 5 months ago
- fa0a432 F32 Sigmoid micro-kernels in AVX2 implementation by Marat Dukhan · 4 years, 6 months ago
- 4a24a58 Use 1-step range reduction in NEONFMA Sigmoid micro-kernels by Marat Dukhan · 4 years, 6 months ago
- 68b3b45 Complete set of NEON F32 Sigmoid micro-kernels by Marat Dukhan · 4 years, 6 months ago
- 1f5d9bc Apply patch to cpuinfo by Marat Dukhan · 4 years, 6 months ago
- 8d3c07e Additional Sigmoid micro-kernels and accuracy evaluation stub by Marat Dukhan · 4 years, 6 months ago
- 3a77ea7 Scalar F32 Sigmoid micro-kernels by Marat Dukhan · 4 years, 6 months ago
- 387c2d1 Generate A57 micro-kernels from A75 source. by Frank Barchard · 4 years, 7 months ago
- 9f7d555 Prefetch version of the aarch32 a75 GEMM kernel by Frank Barchard · 4 years, 7 months ago
- 1391604 Initial Cortex A53 kernel for aarch32 by Frank Barchard · 4 years, 7 months ago
- 77b78a6 Fix compiler warning in end2end.h microbenchmarking header by Marat Dukhan · 4 years, 7 months ago
- 2712132 FMA3 microkernels with 4-wide shuffle by Marat Dukhan · 4 years, 7 months ago
- c08cdf5 Randomized end-to-end MobileNet v3 benchmark by Marat Dukhan · 4 years, 7 months ago
- 4c4eb00 Additional variants of Softmax microkernels by Marat Dukhan · 4 years, 7 months ago
- eccfd71 NR=16 GEMM and IGEMM micro-kernels in AVX and FMA3 implementations by Marat Dukhan · 4 years, 7 months ago
- 03ff294 Fix incorrect indirection size computation for DWCONV by Marat Dukhan · 4 years, 7 months ago
- ad74a7b Fix out-of-bounds reads in F32 DWCONV benchmark by Marat Dukhan · 4 years, 7 months ago
- 3e237f2 AARCH32 4x8 for Cortex A75 by Frank Barchard · 4 years, 7 months ago
- cab9493 Add E2E aarch32 GEMM kernel. by Frank Barchard · 4 years, 7 months ago
- 8b0f026 AARCH32 4x8 NEON GEMM Assembly version of 4x8 for 32 bit ARM. Based on LD64. by Frank Barchard · 4 years, 7 months ago
- 479f87e AVX512F implementation of DWCONV micro-kernels by Marat Dukhan · 4 years, 7 months ago
- 0f349c4 AVX512F implementation of GEMM & IGEMM micro-kernels by Marat Dukhan · 4 years, 7 months ago
- 69172d9 6x8 ld128 GEMM microkernels by Frank Barchard · 4 years, 7 months ago
- c8466f5 Add checks for target ISA in microbenchmarks by Marat Dukhan · 4 years, 7 months ago
- 54a9d9d f16_gemm benchmark renamed from hgemm by Frank Barchard · 4 years, 7 months ago
- b186463 Fix F16-GEMM benchmark by Marat Dukhan · 4 years, 7 months ago
- 5243bb0 DUP Neon GEMM kernels for Exynos by Frank Barchard · 4 years, 7 months ago
- 17ec5f3 AVX and FMA3 microkernels for DWCONV by Marat Dukhan · 4 years, 7 months ago
- 91317c5 Rename neon intrinsics to lane. by Frank Barchard · 4 years, 7 months ago
- 1e782c4 Rename vunop and vbinop functions by Marat Dukhan · 4 years, 7 months ago
- 496e735 SSE4.1 Sigmoid microkernels by Marat Dukhan · 4 years, 7 months ago
- fda12b8 AVX and FMA3 microkernels for GEMM/GEMMINC/IGEMM by Marat Dukhan · 4 years, 7 months ago
- df06d80 Neon shuffle GEMM and IGEMM kernels. by Frank Barchard · 4 years, 7 months ago
- 04f03be Support overriding memory allocation functions by Marat Dukhan · 4 years, 7 months ago
- 7bee751 SSE2 Sigmoid micro-kernels by Marat Dukhan · 4 years, 7 months ago
- 581c1ac CMake targets for f32-sigmoid-test and f32-sigmoid-bench by Marat Dukhan · 4 years, 7 months ago
- 14bec50 Benchmark for F32 Sigmoid micro-kernels by Marat Dukhan · 4 years, 7 months ago
- 1b09229 Fix issues in PReLU benchmark by Marat Dukhan · 4 years, 7 months ago
- c3b9e86 Benchmark Sigmoid operator in TFLite implementation by Marat Dukhan · 4 years, 7 months ago
- 95bebc9 Benchmarks rename sgemm and sppmm to f32_gemm and f32_ppmm by Frank Barchard · 4 years, 8 months ago
- 346a9e5 Sigmoid evaluation stubs, micro-kernels, and operator by Marat Dukhan · 4 years, 8 months ago
- 38709a6 Add scalar chw 5x5p2 and 5x5s2p2 kernels by Erich Elsen · 4 years, 8 months ago
- 5098c3e Refactor DWCONV micro-kernels by Marat Dukhan · 4 years, 8 months ago
- 1898b91 Move adjustment_* arguments of Deconvolution into setup by Marat Dukhan · 4 years, 8 months ago
- bad48fe Vary number of threads in the End-to-End benchmark by Marat Dukhan · 4 years, 8 months ago
- c712fa4 Add Freq to end2end benchmark. by Frank Barchard · 4 years, 8 months ago
- ef4416e End-to-end benchmarks for DWCONV microkernels by Marat Dukhan · 4 years, 8 months ago
- e72e287 Add Freq to E2E benchmark by Frank Barchard · 4 years, 8 months ago
- 5f18d26 End-to-end benchmarks for GEMM/IGEMM microkernels by Marat Dukhan · 4 years, 8 months ago
- 4fa0fbe Add missing <chrono> header in f32-softargmax benchmark by Marat Dukhan · 4 years, 8 months ago
- 95b2243 PReLU benchmark on characteristic arguments of ImageNet classifiers by Marat Dukhan · 4 years, 8 months ago
- e0601b5 Sort include order for params-init.h and log.h by Frank Barchard · 4 years, 8 months ago
- 4a2bbc6 Benchmark for Two-Pass Softmax algorithm by Marat Dukhan · 4 years, 8 months ago
- eeaa7bd Refactor initialization of micro-kernel parameters by Marat Dukhan · 4 years, 8 months ago
- 46fb807 4x8 A53 GEMM, and GEMMINC unpipelined microkernels. by Frank Barchard · 4 years, 8 months ago
- c6afd9b Add blocked scalar spmm kernels. by Erich Elsen · 4 years, 8 months ago
- a7fb855 6x8 A53 GEMM, GEMMINC and IGEMM unpipelined microkernels. by Frank Barchard · 4 years, 8 months ago
- 563df5f Add scalar version of hwc2spchw convolution. by Erich Elsen · 4 years, 8 months ago
- 4232323 Unify naming of functions in benchmark::utils:: by Marat Dukhan · 4 years, 8 months ago
- 05ac8e3 VSCALE microkernel and SoftMax Three-Pass algorithm with Reloading by Marat Dukhan · 4 years, 8 months ago
- 4a4a7fa Three-Pass Softargmax benchmark (recomputing version) by Marat Dukhan · 4 years, 8 months ago
- ac4de80 Add chw 3x3s2p1_scalar kernels. by Erich Elsen · 4 years, 9 months ago
- 0cc2c53 add 3x3p1_scalar kernel by Erich Elsen · 4 years, 9 months ago
- bd9e495 Remove 4x12 intrinsics kernels. by Frank Barchard · 4 years, 9 months ago
- 7e95597 Pass XNN_ENABLE_ASSEMBLY for all tests and kernel benchmarks by Frank Barchard · 4 years, 9 months ago