Gitiles
Code Review
Sign In
gerrit-public.fairphone.software
/
platform
/
external
/
XNNPACK
/
479f87e6fbdc2477e5cfff8c7e89fce5cdf040a8
/
src
479f87e
AVX512F implementation of DWCONV micro-kernels
by Marat Dukhan
· 5 years ago
91f8d86
Accuracy evaluation stubs for LUT-based Sigmoid implementations
by Marat Dukhan
· 5 years ago
0f349c4
AVX512F implementation of GEMM & IGEMM micro-kernels
by Marat Dukhan
· 5 years ago
c72fa1e
Use XNN_ARCH_* macros for architecture-specific parts in micro-kernels
by Marat Dukhan
· 5 years ago
69172d9
6x8 ld128 GEMM microkernels
by Frank Barchard
· 5 years ago
189ae80
Additional implementation of expminus function
by Marat Dukhan
· 5 years ago
40a672f
Move generated micro-kernels into a subdirectory
by Marat Dukhan
· 5 years ago
9f08af4
Fix remaining incompatibility with ARM64 gcc in DWCONV SpCHW micro-kernel
by Marat Dukhan
· 5 years ago
22aae13
Evaluation stubs for NEONFMA Sigmoid with alternative division implementations
by Marat Dukhan
· 5 years ago
5243bb0
DUP Neon GEMM kernels for Exynos
by Frank Barchard
· 5 years ago
17ec5f3
AVX and FMA3 microkernels for DWCONV
by Marat Dukhan
· 5 years ago
e3fad19
Fix wrong vreinterpret intrinsics in NEONFMA SpCHW DWCONV micro-kernels
by Marat Dukhan
· 5 years ago
8c3b028
Fix wrong vget_low/high intrinsics in FP16 GEMM micro-kernels
by Marat Dukhan
· 5 years ago
91317c5
Rename neon intrinsics to lane.
by Frank Barchard
· 5 years ago
5743193
Fix typos in END_FUNCTION arguments in ARM64 assembly kernels
by Marat Dukhan
· 5 years ago
1e782c4
Rename vunop and vbinop functions
by Marat Dukhan
· 5 years ago
1025ea3
Enable AVX and FMA3 GEMM micro-kernels for non-mobile x86
by Marat Dukhan
· 5 years ago
496e735
SSE4.1 Sigmoid microkernels
by Marat Dukhan
· 5 years ago
fda12b8
AVX and FMA3 microkernels for GEMM/GEMMINC/IGEMM
by Marat Dukhan
· 5 years ago
5480997
Replace IDLETTERS with ABC
by Frank Barchard
· 5 years ago
866f7b3
Fix bugs in Depthwise Convolution and Average Pooling
by Marat Dukhan
· 5 years ago
d7a2c5f
Remove erroneous assert in NEON DWCONV micro-kernels
by Marat Dukhan
· 5 years ago
df06d80
Neon shuffle GEMM and IGEMM kernels.
by Frank Barchard
· 5 years ago
329da64
Refactor MaxPool and ArgMaxPool micro-kernels
by Marat Dukhan
· 5 years ago
7ccaab6
IGEMM kernels add asserts for a, c, and w pointers.
by Frank Barchard
· 5 years ago
4ad5115
Process 7 input rows / 3 output rows at a time in 5x5p2 chw dwconv. ~10% speedup.
by Erich Elsen
· 5 years ago
04f03be
Support overriding memory allocation functions
by Marat Dukhan
· 5 years ago
7bee751
SSE2 Sigmoid micro-kernels
by Marat Dukhan
· 5 years ago
14bec50
Benchmark for F32 Sigmoid micro-kernels
by Marat Dukhan
· 5 years ago
80b537a
6x8 IGEMM for Cortex A53 pipelined.
by Frank Barchard
· 5 years ago
8fd7b5f
Add slightly less accurate sigmoid approximation that is approximately 50%
by Erich Elsen
· 5 years ago
80bafd2
SSE2 Sigmoid implementation and evaluation stub
by Marat Dukhan
· 5 years ago
efc47b8
Unify operator implementations filenames and tests
by Marat Dukhan
· 5 years ago
44e06a6
Fix Sigmoid microkernel incompatibilities with ARM gcc
by Marat Dukhan
· 5 years ago
7c8e0c7
4x8 IGEMM for Cortex-A53 pipelined
by Frank Barchard
· 5 years ago
684bbb0
CMP 2 instructions earlier in A/C clamping.
by Frank Barchard
· 5 years ago
ca2733c
ND elementwise Multiply operator with broadcasting support
by Marat Dukhan
· 5 years ago
de9b0fb
Process 6 rows at a time in 5x5s2 spchw dwconv kernel.
by Erich Elsen
· 5 years ago
179ac85
Add tail to depthwise 5x5s2p2_chw kernels.
by Erich Elsen
· 5 years ago
9efaed7
A53 GEMM and IGEMM pipelined kernels prefetch C in epilogue
by Frank Barchard
· 5 years ago
ffd6840
SSE2 exp and expminus evaluation stubs
by Marat Dukhan
· 5 years ago
19418b5
GEMM 4x8 and 4x12 kernels use forward stores for C.
by Frank Barchard
· 5 years ago
82cfe18
4x8 a53 epilogue NOPs in group 5
by Frank Barchard
· 5 years ago
a99918a
Support 5x5 Depthwise Convolution in SpNCHW layout
by Marat Dukhan
· 5 years ago
346a9e5
Sigmoid evaluation stubs, micro-kernels, and operator
by Marat Dukhan
· 5 years ago
797a8fe
NEONFMA versions of vector expf function
by Marat Dukhan
· 5 years ago
c07cb7f
Binary elementwise microkernels
by Marat Dukhan
· 5 years ago
0ecc2ab
4x8 GEMM for Cortex A53
by Frank Barchard
· 5 years ago
5abe43c
ST1 post increment for Cortex A53 GEMM/IGEMM microkernels
by Frank Barchard
· 5 years ago
72b3250
Provide compiler hints to generate branch-free code for math helper functions
by Marat Dukhan
· 5 years ago
e67b783
ST1 post increment for ld64/ld128 GEMM/IGEMM microkernels
by Frank Barchard
· 5 years ago
e334cc6
Revert "Exclude unused parts of xnn_params"
by Marat Dukhan
· 5 years ago
6972249
Resize Bilinear operator
by Marat Dukhan
· 5 years ago
5093cbc
Exclude unused parts of xnn_params
by Marat Dukhan
· 5 years ago
e64f91a
Pipelined 6x8 GEMM for Cortex A53
by Frank Barchard
· 5 years ago
9fab3f9
Support input offset in BILINEAR micro-kernels
by Marat Dukhan
· 5 years ago
38709a6
Add scalar chw 5x5p2 and 5x5s2p2 kernels
by Erich Elsen
· 5 years ago
2a64a1a
Fix incompatibility with ARM gcc
by Marat Dukhan
· 5 years ago
0f06b5c
Fix gcc incompatibility in SSE PReLU microkernels
by Marat Dukhan
· 5 years ago
35dacfb
BILINEAR micro-kernels
by Marat Dukhan
· 5 years ago
5098c3e
Refactor DWCONV micro-kernels
by Marat Dukhan
· 5 years ago
49e6ee9
Refactor VMulCAddC micro-kernel
by Marat Dukhan
· 5 years ago
69c3f2c
Refactor PReLU microkernels
by Marat Dukhan
· 5 years ago
d5208d6
Remove a_sum buffer
by Marat Dukhan
· 5 years ago
1898b91
Move adjustment_* arguments of Deconvolution into setup
by Marat Dukhan
· 5 years ago
70ad409
Make ARM microkernels compatible with gcc
by Marat Dukhan
· 5 years ago
fb60914
Make F32 CLAMP NEON micro-kernel compatible with gcc on AArch32
by Marat Dukhan
· 5 years ago
bd41971
A57 branch a version of A53 kernel
by Frank Barchard
· 5 years ago
63ba2ed
Fix typos in AVX2 ExtExp micro-kernels
by Marat Dukhan
· 5 years ago
64a5bfe
A53 6x8 IGEMM kernel prefetch
by Frank Barchard
· 5 years ago
bd1d5d9
6x8 A53 GEMM use prefetch.
by Frank Barchard
· 5 years ago
f568f08
Support Convolution, Deconvolution, and Fully Connected operators without bias
by Marat Dukhan
· 5 years ago
263bb09
Cortex A76 use 6x8 micro kernel
by Frank Barchard
· 5 years ago
feb4923
AVX512F exp implementation based on PERM2
by Marat Dukhan
· 5 years ago
ba7c3bb
Merge generate-f32-gemminc.sh script into generate-f32-gemm.sh
by Marat Dukhan
· 5 years ago
00bf68e
A53 6x8 GEMM unrolled
by Frank Barchard
· 5 years ago
c452eb1
Re-generate SpMM micro-kernels
by Marat Dukhan
· 5 years ago
ae777b4
4x8 a53 eliminate pushes to stack
by Frank Barchard
· 5 years ago
e0601b5
Sort include order for params-init.h and log.h
by Frank Barchard
· 5 years ago
eeaa7bd
Refactor initialization of micro-kernel parameters
by Marat Dukhan
· 5 years ago
6f8d4d3
RADDEXTEXP and VSCALEEXTEXP micro-kernels for AVX2 and AVX512F
by Marat Dukhan
· 5 years ago
b3c6c6e
6x8 A53 remove pushes for NEON
by Frank Barchard
· 5 years ago
46fb807
4x8 A53 GEMM, and GEMMINC unpipelined microkernels.
by Frank Barchard
· 5 years ago
cd945c6
Re-enable swizzle GEMM/IGEMM micro-kernels in WAsm SIMD on ARM
by Marat Dukhan
· 5 years ago
c4ae7de
Propagate IGEMM SR argument to weights packing in Deconvolution operator
by Marat Dukhan
· 5 years ago
c6afd9b
Add blocked scalar spmm kernels.
by Erich Elsen
· 5 years ago
8440fde
Support TF-style SAME padding via explicit flag
by Marat Dukhan
· 5 years ago
bff791e
Use 8x1 SpMM micro-kernel on WebAssembly
by Marat Dukhan
· 5 years ago
14fe0b2
Enable sparse MobileNet v1/v2 operators on WebAssembly
by Marat Dukhan
· 5 years ago
a7fb855
6x8 A53 GEMM, GEMMINC and IGEMM unpipelined microkernels.
by Frank Barchard
· 5 years ago
563df5f
Add scalar version of hwc2spchw convolution.
by Erich Elsen
· 5 years ago
98ba441
Vectorized extexp functions
by Marat Dukhan
· 5 years ago
cb80197
Disable GEMM/IGEMM micro-kernels with swizzle
by Marat Dukhan
· 5 years ago
31a98d7
Remove warnings about inefficient padding parameters in Convolution
by Marat Dukhan
· 5 years ago
1756f9e
Propagate GEMM/IGEMM SR argument to weights packing in Fully Connected operator
by Marat Dukhan
· 5 years ago
e0df831
Remove trailing whitespace
by Marat Dukhan
· 5 years ago
07cb676
Refactor initialization of even/odd masks in parameters for SpCHW micro-kernels
by Marat Dukhan
· 5 years ago
838c8e3
Refactor initialization of masks in parameters for SpCHW micro-kernels
by Marat Dukhan
· 5 years ago
caf8544
LD64/LD128 kernels remove all pushes (d8-d15) Remap d12-d15 to d16-d19
by Frank Barchard
· 5 years ago
fcfdc0e
Automated g4 rollback of changelist 274728310.
by Frank Barchard
· 5 years ago
Next »