- d18cec3 Add more missing "assembly: true" specifications by Marat Dukhan · 4 years, 3 months ago
- ce7a3f8 Auto-generate NEON CONV micro-kernels by Marat Dukhan · 4 years, 3 months ago
- 32f9381 4x4 LD64 GEMM microkernel in AArch32+VFP assembly by Marat Dukhan · 4 years, 3 months ago
- d536072 DWCONV add input_offset and zero parameters by Frank Barchard · 4 years, 3 months ago
- 3b98f6b 4x4 LD64 GEMM+MINMAX microkernel in AArch32+VFP assembly by Marat Dukhan · 4 years, 3 months ago
- f606806 Add missing "assembly: true" specifications by Marat Dukhan · 4 years, 3 months ago
- 1f29b80 Refactor CHW micro-kernels by Marat Dukhan · 4 years, 3 months ago
- 7e4ca40 3x3s2c3 CONV NEONFMA microkernel with 0+1 padding by Marat Dukhan · 4 years, 3 months ago
- 6ddfc60 Implement "Greedy by size planner" memory optimization by Chao Mei · 4 years, 3 months ago
- bf31e3f FP16 vbinary and gemm microkernel testers remove scalar variant by Frank Barchard · 4 years, 3 months ago
- b196659 FP16 hswish, clamp and prelu microkernels by Frank Barchard · 4 years, 3 months ago
- bcdb1c1 Remove xnn_q8_dwconv_minmax_ukernel_up8x9__aarch32_neon by Frank Barchard · 4 years, 3 months ago
- eda9c11 Update CHW DWCONV to pass in input_height and not output_height. by Erich Elsen · 4 years, 3 months ago
- 6aa7e04 Fix typos in NCHW Convolution tests by Marat Dukhan · 4 years, 3 months ago
- d793f6c FP16 vbinary ops by Frank Barchard · 4 years, 3 months ago
- 4e5db3d CHW DWCONV with implicit padding by Erich Elsen · 4 years, 3 months ago
- 3f9f99f Nx16 FP16 intrinsic GEMM and IGEMM ukernels by Frank Barchard · 4 years, 3 months ago
- e4b8e57 Support TF SAME padding in Argmax Pooling operator by Marat Dukhan · 4 years, 3 months ago
- b5d9bb0 gemm tests replace k block loops with k index by Frank Barchard · 4 years, 3 months ago
- 6709097 Remove unnecessary cast to void * in f16_igemm tester. by Frank Barchard · 4 years, 3 months ago
- b0e4fae FP16 IGEMM microkernels by Frank Barchard · 4 years, 3 months ago
- fa9e20c f32 IGEMM test use float instead of double by Frank Barchard · 4 years, 3 months ago
- 7a6cae2 Remove fused-nc from f16 gemm tests. fused-nc is default now. by Frank Barchard · 4 years, 3 months ago
- 99003a8 Add xnn_init_f16_scaleminmax_params helper by Frank Barchard · 4 years, 3 months ago
- e92f859 Rename xnn_f16_gemm_ukernel_function to xnn_f16_gemm_minmax_ukernel_function by Frank Barchard · 4 years, 3 months ago
- e70dbeb Rename minmax_params to params for variables. by Frank Barchard · 4 years, 3 months ago
- 77acbf2 Rename output_params to params by Frank Barchard · 4 years, 3 months ago
- 142268b F16 pack functions by Frank Barchard · 4 years, 3 months ago
- 5871703 Support TF-SAME padding in Deconvolution by Marat Dukhan · 4 years, 3 months ago
- f5425ea Additional NEON/NEONFMA DWCONV microkernels by Marat Dukhan · 4 years, 4 months ago
- 3b8e566 F16 8x8 GEMM ld64 microkernels by Frank Barchard · 4 years, 4 months ago
- 875be77 Change xnn_f16_output_params to xnn_f16_scaleminmax_params by Frank Barchard · 4 years, 4 months ago
- bddfbcd FP16 4x8, 6x8 and 1x8 GEMM ld64 microkernels by Frank Barchard · 4 years, 4 months ago
- 1d62336 Fix typo in VScaleExpMinusMaxMicrokernelTester by Marat Dukhan · 4 years, 4 months ago
- 29c6b26 Exclude PSIMD micro-kernels from the MSVC/ICC build by Marat Dukhan · 4 years, 4 months ago
- 5ce30d9 Work around non-standard std::uniform_int_distribution<uint8_t> by Marat Dukhan · 4 years, 4 months ago
- 57dccd8 NEON and SSE2 implementations of X32 UNPOOL micro-kernel by Marat Dukhan · 4 years, 4 months ago
- 0183625 Increase error tolerance in IBilinearMicrokernelTester by Marat Dukhan · 4 years, 4 months ago
- 1f4e461 F16 1x8 GEMM ld64 microkernel by Frank Barchard · 4 years, 4 months ago
- c5ee9ff Include missing <numeric> header in BinaryElementwiseOperatorTester by Marat Dukhan · 4 years, 4 months ago
- 9993660 Add MINMAX suffix to remaining micro-kernels by Marat Dukhan · 4 years, 4 months ago
- 36b76b6 1x16 LD32 F16 GEMM by Frank Barchard · 4 years, 4 months ago
- 3cb54f9 1x8 LD64 F32 GEMM by Frank Barchard · 4 years, 4 months ago
- 683f559 FP16 4x16 and 6x16 GEMM ld32 microkernels by Frank Barchard · 4 years, 4 months ago
- 91cd2b7 Rename binary operation micro-kernels by Marat Dukhan · 4 years, 4 months ago
- 355ab43 Rename SpMM micro-kernels by Marat Dukhan · 4 years, 4 months ago
- 869c62d Auto-switch to LINEAR GEMM/IGEMM/DWCONV micro-kernels by Marat Dukhan · 4 years, 4 months ago
- 163a7e6 Scalar & WAsm GEMM/IGEMM/DWCONV micro-kernels without activation by Marat Dukhan · 4 years, 4 months ago
- de06f49 Add MINMAX suffix to GEMM/IGEMM/DWCONV/PPMM micro-kernel names by Marat Dukhan · 4 years, 4 months ago
- 8452ff5 Refactor AVGPOOL micro-kernel parameters by Marat Dukhan · 4 years, 4 months ago
- 1c58711 Add MINMAX suffix to filenames of GEMM/IGEMM/PPMM/DWCONV micro-kernels by Marat Dukhan · 4 years, 4 months ago
- eb09a6b Rename F32/U8 output params to minmax params by Marat Dukhan · 4 years, 4 months ago
- 0d1052c iOS 6x8 microkernel based on Cortex-A75 but with X18 avoided. by Frank Barchard · 4 years, 5 months ago
- 1c8bc0c Fix bug in Average Pooling operator by Marat Dukhan · 4 years, 5 months ago
- c58bd34 Fix bug in dilated max pooling with padding by Marat Dukhan · 4 years, 5 months ago
- 5868d80 Automatically switch to GAVGPOOL micro-kernels in Average Pooling operator by Marat Dukhan · 4 years, 5 months ago
- 8fb9055 4x8 GEMM and IGEMM microkernels for Cortex A55. 7.8% faster for e2e mobile net v2. by Frank Barchard · 4 years, 5 months ago
- b7dd29e 4x8 GEMM and IGEMM microkernels for AARCH32 Cortex A55. 11.5% faster end to end: by Frank Barchard · 4 years, 5 months ago
- d9e92eb Fix AVX and AVX512F PReLU microkernels by Marat Dukhan · 4 years, 5 months ago
- 90eca0a AVX and AVX512F versions of PReLU micro-kernel by Marat Dukhan · 4 years, 5 months ago
- 5c5fa96 Auto-generate CLAMP micro-kernels by Marat Dukhan · 4 years, 5 months ago
- a63a6fc Refactor GAVGPOOL micro-kernels by Marat Dukhan · 4 years, 5 months ago
- 660fd19 Rename BILINEAR microkernels into IBILINEAR by Marat Dukhan · 4 years, 5 months ago
- 20c3b92 Size test for Subgraph API by Marat Dukhan · 4 years, 5 months ago
- fe7acb6 Targets for requantization tests and benchmarks by Marat Dukhan · 4 years, 5 months ago
- 91e1999 6x8 GEMM and IGEMM microkernels for Cortex A55. 9% faster end to end: by Frank Barchard · 4 years, 5 months ago
- 4245f43 Update size_test with newly added operators by Marat Dukhan · 4 years, 5 months ago
- f092a4a f32-maxpool microkernel for ARM Neon. by Frank Barchard · 4 years, 5 months ago
- 466da75 Support TF SAME padding in Average Pooling operator by Marat Dukhan · 4 years, 5 months ago
- bee7825 Support TF SAME padding in Max Pooling operator by Marat Dukhan · 4 years, 5 months ago
- ee1f63e Support input_offset in AVGPOOL and PAVGPOOL micro-kernels by Marat Dukhan · 4 years, 5 months ago
- f5fec4b Fix MAXPOOL test generator for two-pass subtile and multi-pass test cases by Marat Dukhan · 4 years, 5 months ago
- 6ee435a Refactor AVGPOOL & PAVGPOOL micro-kernels & unit tests by Marat Dukhan · 4 years, 5 months ago
- 5cb8ff0 Re-generate F32 MAXPOOL micro-kernel tests by Marat Dukhan · 4 years, 5 months ago
- c8230a4 Remove output_min & output_max arguments in PReLU operator by Marat Dukhan · 4 years, 6 months ago
- 2995427 Optimized Indirect Deconvolution algorithm for 1x1 subconvolutions by Marat Dukhan · 4 years, 6 months ago
- b00004d 4x2c4 GEMM micro-kernels for PSIMD and SSE by Marat Dukhan · 4 years, 6 months ago
- c87a8fd Cortex A53 IGEMM 32 bit ARM by Frank Barchard · 4 years, 6 months ago
- 90ce789 Cortex A75 IGEMM 32 bit ARM. by Frank Barchard · 4 years, 6 months ago
- dc38f07 LD64 IGEMM 32 bit ARM by Frank Barchard · 4 years, 6 months ago
- bdb56f5 FP16 versions of SpMM micro-kernels by Marat Dukhan · 4 years, 6 months ago
- ca27b40 Add pipelined to gemm tests for aarch32. by Frank Barchard · 4 years, 6 months ago
- fd8e689 Rename SoftArgMax operator to SoftMax by Marat Dukhan · 4 years, 6 months ago
- 1edc454 SoftArgMax operator by Marat Dukhan · 4 years, 7 months ago
- 8137e4c NEON/NEONFMA RAddStoreExpMinusMax micro-kernels by Marat Dukhan · 4 years, 7 months ago
- b39689d SSE2/PSIMD RAddStoreExpMinusMax micro-kernels by Marat Dukhan · 4 years, 7 months ago
- f46f675 Scalar RAddStoreExpMinusMax micro-kernels by Marat Dukhan · 4 years, 7 months ago
- fa0a432 F32 Sigmoid micro-kernels in AVX2 implementation by Marat Dukhan · 4 years, 7 months ago
- 4a24a58 Use 1-step range reduction in NEONFMA Sigmoid micro-kernels by Marat Dukhan · 4 years, 7 months ago
- 68b3b45 Complete set of NEON F32 Sigmoid micro-kernels by Marat Dukhan · 4 years, 7 months ago
- 8d3c07e Additional Sigmoid micro-kernels and accuracy evaluation stub by Marat Dukhan · 4 years, 7 months ago
- 3a77ea7 Scalar F32 Sigmoid micro-kernels by Marat Dukhan · 4 years, 8 months ago
- 387c2d1 Generate A57 micro-kernels from A75 source. by Frank Barchard · 4 years, 8 months ago
- 9f7d555 Prefetch version of the aarch32 a75 GEMM kernel by Frank Barchard · 4 years, 8 months ago
- 1391604 Initial Cortex A53 kernel for aarch32 by Frank Barchard · 4 years, 8 months ago
- 9a88efe AVX & AVX512F versions of binary elementwise micro-kernels by Marat Dukhan · 4 years, 8 months ago
- 662faa0 Refactor HardSwish micro-kernels by Marat Dukhan · 4 years, 8 months ago
- 2712132 FMA3 microkernels with 4-wide shuffle by Marat Dukhan · 4 years, 8 months ago
- 4c4eb00 Additional variants of Softmax microkernels by Marat Dukhan · 4 years, 8 months ago
- eccfd71 NR=16 GEMM and IGEMM micro-kernels in AVX and FMA3 implementations by Marat Dukhan · 4 years, 8 months ago