Gitiles
Code Review
Sign In
gerrit-public.fairphone.software
/
platform
/
external
/
XNNPACK
/
dffaa4657728d7840a8ba3917017546c2c75b5de
dffaa46
Remove sse2-p5-div.c.in Sigmoid template
by Marat Dukhan
· 5 years ago
5739f70
Evaluation stubs for Sigmoid function in scalar implementation
by Marat Dukhan
· 5 years ago
5e9a91e
Evaluation stubs for ExpMinus function in scalar implementation
by Marat Dukhan
· 5 years ago
066c983
Merge pull request #284 from AshkanAliabadi:build
by XNNPACK Team
· 5 years ago
9520fc9
Tighten the compiler detection guards in intrinsics-polyfill.h.
by Ashkan Aliabadi
· 5 years ago
387c2d1
Generate A57 micro-kernels from A75 source.
by Frank Barchard
· 5 years ago
005feb8
A53 push r1, r2 so they can be used as scratch. Reorder FMA by B
by Frank Barchard
· 5 years ago
0090f5b
4x8 FMA sorted by B to match load order
by Frank Barchard
· 5 years ago
5cc1cc2
Select a default kernel optimized for big cores.
by Frank Barchard
· 5 years ago
abf8154
Code generator for PLD and non-PLD versions of aarch32 4x8 Cortex-A75 kernel
by Frank Barchard
· 5 years ago
9e0e8ee
Fix gcc-6 build
by Marat Dukhan
· 5 years ago
4d281a5
Enable PLD prefetch version of a75 microkernel for aarch32
by Frank Barchard
· 5 years ago
07efec4
Run generator for A73 kernel NOP
by Frank Barchard
· 5 years ago
03b51ee
Add Raspberry Pi performance on MobileNets to README
by Marat Dukhan
· 5 years ago
f9a3484
Detect A53 in aarch32 and use a53 specific micro kernel.
by Frank Barchard
· 5 years ago
9f7d555
Prefetch version of the aarch32 a75 GEMM kernel
by Frank Barchard
· 5 years ago
73ccfb4
Move SUBS to 2nd instruction of clamp code.
by Frank Barchard
· 5 years ago
a84e40b
Pass proper CXX flags to built-in randomized models
by Marat Dukhan
· 5 years ago
c659140
a73 kernel move SUBS before clamp and add NOP before branch
by Frank Barchard
· 5 years ago
1391604
Initial Cortex A53 kernel for aarch32
by Frank Barchard
· 5 years ago
77b78a6
Fix compiler warning in end2end.h microbenchmarking header
by Marat Dukhan
· 5 years ago
0126feb
Merge pull request #267 from AshkanAliabadi:build
by XNNPACK Team
· 5 years ago
d94b856
Rename strided gemm and igemm fma3 broadcasts.
by Ashkan Aliabadi
· 5 years ago
9a88efe
AVX & AVX512F versions of binary elementwise micro-kernels
by Marat Dukhan
· 5 years ago
17e1628
Define bench-models as static library in CMake
by Marat Dukhan
· 5 years ago
f52eff5
Fix typos in AVX/AVX512 HSWISH micro-kernels
by Marat Dukhan
· 5 years ago
b738ad2
fix for linux arm 32 bit
by Frank Barchard
· 5 years ago
662faa0
Refactor HardSwish micro-kernels
by Marat Dukhan
· 5 years ago
f10168c
CMake toolchain for arm-linux-gnueabihf
by Marat Dukhan
· 5 years ago
2712132
FMA3 microkernels with 4-wide shuffle
by Marat Dukhan
· 5 years ago
c08cdf5
Randomized end-to-end MobileNet v3 benchmark
by Marat Dukhan
· 5 years ago
4c4eb00
Additional variants of Softmax microkernels
by Marat Dukhan
· 5 years ago
eccfd71
NR=16 GEMM and IGEMM micro-kernels in AVX and FMA3 implementations
by Marat Dukhan
· 5 years ago
53873d0
Fix typos in some micro-kernel test names
by Marat Dukhan
· 5 years ago
6918050
ND Divide operator with broadcasting support
by Marat Dukhan
· 5 years ago
77ca630
Elementwise Divide micro-kernels
by Marat Dukhan
· 5 years ago
bd8a962
Fix incorrect indirection buffer size in pooling operators
by Marat Dukhan
· 5 years ago
79e7f84
ND Maximum and Minimum operators (with broadcasting support)
by Marat Dukhan
· 5 years ago
9594db0
Align static weights in micro-kernel unit tests on 64 bytes
by Marat Dukhan
· 5 years ago
403b7d4
Elementwise minimum and maximum micro-kernels
by Marat Dukhan
· 5 years ago
65a0139
Polyfill _mm512_reduce_add_ps and _mm512_reduce_max_ps for old gcc
by Marat Dukhan
· 5 years ago
cfb3134
Polyfill missing _cvtu32_mask16 intrinsic on old gcc
by Marat Dukhan
· 5 years ago
03ff294
Fix incorrect indirection size computation for DWCONV
by Marat Dukhan
· 5 years ago
ad74a7b
Fix out-of-bounds reads in F32 DWCONV benchmark
by Marat Dukhan
· 5 years ago
3e237f2
AARCH32 4x8 for Cortex A75
by Frank Barchard
· 5 years ago
f917cbd
AARCH32 4x8 LD64 stores simplified
by Frank Barchard
· 5 years ago
6383f49
Assembly GEMM kernel NC loop use SUBS instead of CMP+SUBS
by Frank Barchard
· 5 years ago
441e221
Scalar 3x3s2c3 HWC dconv micro-kernel with 0+1 width padding
by Marat Dukhan
· 5 years ago
fd659bf
Document new ND elementwise operators in README
by Marat Dukhan
· 5 years ago
6b7dfae
Refactor scalar 3x3s2c3 HWC->HWC2CHW dconv, add 3x3s2c3 HWC dconv
by Marat Dukhan
· 5 years ago
436ebe6
Separate WAsm micro-kernels and scalar micro-kernels
by Marat Dukhan
· 5 years ago
05f3f6d
Subtract ND operator
by Marat Dukhan
· 5 years ago
c4f0ff9
Support built-in transposition of static weights in Fully Connected operator
by Marat Dukhan
· 5 years ago
e95d037
Merge pull request #231 from AshkanAliabadi:build
by XNNPACK Team
· 5 years ago
d255a31
Fix xnnpack.h casing in CMakeLists.txt
by Ashkan Aliabadi
· 5 years ago
fc2b96e
Support up to 6 dimensions in ND operators with broadcasting
by Marat Dukhan
· 5 years ago
ab4af57
Harden unit tests for ND Add & Multiply operators
by Marat Dukhan
· 5 years ago
cab9493
Add E2E aarch32 GEMM kernel.
by Frank Barchard
· 5 years ago
bfd02cd
Fix assert in AVX/AVX512 Clamp micro-kernels
by Marat Dukhan
· 5 years ago
b1a0fc3
Add ND operator with broadcasting
by Marat Dukhan
· 5 years ago
61cad89
AARCH32 4x8 GEMM kernel fully register based.
by Frank Barchard
· 5 years ago
3267092
Enable AARCH32 4x8 GEMM kernel
by Frank Barchard
· 5 years ago
72d6afb
AARCH32 4x8 kernel code clean up.
by Frank Barchard
· 5 years ago
191e5cd
Fix typo in CMakeLists
by Marat Dukhan
· 5 years ago
8b0f026
AARCH32 4x8 NEON GEMM Assembly version of 4x8 for 32 bit ARM. Based on LD64.
by Frank Barchard
· 5 years ago
e2c3f29
F32 CLAMP micro-kernels in AVX and AVX512F implementations
by Marat Dukhan
· 5 years ago
479f87e
AVX512F implementation of DWCONV micro-kernels
by Marat Dukhan
· 5 years ago
91f8d86
Accuracy evaluation stubs for LUT-based Sigmoid implementations
by Marat Dukhan
· 5 years ago
0f349c4
AVX512F implementation of GEMM & IGEMM micro-kernels
by Marat Dukhan
· 5 years ago
c72fa1e
Use XNN_ARCH_* macros for architecture-specific parts in micro-kernels
by Marat Dukhan
· 5 years ago
69172d9
6x8 ld128 GEMM microkernels
by Frank Barchard
· 5 years ago
189ae80
Additional implementation of expminus function
by Marat Dukhan
· 5 years ago
c8466f5
Add checks for target ISA in microbenchmarks
by Marat Dukhan
· 5 years ago
54a9d9d
f16_gemm benchmark renamed from hgemm
by Frank Barchard
· 5 years ago
b186463
Fix F16-GEMM benchmark
by Marat Dukhan
· 5 years ago
1390be0
Add reference to "Fast Sparse ConvNets" paper
by Marat Dukhan
· 5 years ago
40a672f
Move generated micro-kernels into a subdirectory
by Marat Dukhan
· 5 years ago
9f08af4
Fix remaining incompatibility with ARM64 gcc in DWCONV SpCHW micro-kernel
by Marat Dukhan
· 5 years ago
a71520b
Update README with new functionality
by Marat Dukhan
· 5 years ago
36aecb5
Fix typos and mixed tabulation in CMakeLists
by Marat Dukhan
· 5 years ago
22aae13
Evaluation stubs for NEONFMA Sigmoid with alternative division implementations
by Marat Dukhan
· 5 years ago
5243bb0
DUP Neon GEMM kernels for Exynos
by Frank Barchard
· 5 years ago
17ec5f3
AVX and FMA3 microkernels for DWCONV
by Marat Dukhan
· 5 years ago
e3fad19
Fix wrong vreinterpret intrinsics in NEONFMA SpCHW DWCONV micro-kernels
by Marat Dukhan
· 5 years ago
8c3b028
Fix wrong vget_low/high intrinsics in FP16 GEMM micro-kernels
by Marat Dukhan
· 5 years ago
91317c5
Rename neon intrinsics to lane.
by Frank Barchard
· 5 years ago
709ac2f
Fully qualify std::isfinite in VScaleExpMinusMax microkernel tester
by Marat Dukhan
· 5 years ago
5743193
Fix typos in END_FUNCTION arguments in ARM64 assembly kernels
by Marat Dukhan
· 5 years ago
1e782c4
Rename vunop and vbinop functions
by Marat Dukhan
· 5 years ago
1025ea3
Enable AVX and FMA3 GEMM micro-kernels for non-mobile x86
by Marat Dukhan
· 5 years ago
496e735
SSE4.1 Sigmoid microkernels
by Marat Dukhan
· 5 years ago
fda12b8
AVX and FMA3 microkernels for GEMM/GEMMINC/IGEMM
by Marat Dukhan
· 5 years ago
5480997
Replace IDLETTERS with ABC
by Frank Barchard
· 5 years ago
fd58293
Fix minor bug in MobileNet v1/v2 models
by Marat Dukhan
· 5 years ago
d42bdf7
SSE shuffle kernel GEMM tests added
by Frank Barchard
· 5 years ago
866f7b3
Fix bugs in Depthwise Convolution and Average Pooling
by Marat Dukhan
· 5 years ago
d7a2c5f
Remove erroneous assert in NEON DWCONV micro-kernels
by Marat Dukhan
· 5 years ago
202eed0
Test changes in input buffer in NHWC Convolution
by Marat Dukhan
· 5 years ago
df06d80
Neon shuffle GEMM and IGEMM kernels.
by Frank Barchard
· 5 years ago
93d29a3
Merge pull request #185 from AshkanAliabadi:build
by Marat Dukhan
· 5 years ago
Next »