Gitiles
Code Review
Sign In
gerrit-public.fairphone.software
/
platform
/
external
/
XNNPACK
/
479f87e6fbdc2477e5cfff8c7e89fce5cdf040a8
479f87e
AVX512F implementation of DWCONV micro-kernels
by Marat Dukhan
· 5 years ago
91f8d86
Accuracy evaluation stubs for LUT-based Sigmoid implementations
by Marat Dukhan
· 5 years ago
0f349c4
AVX512F implementation of GEMM & IGEMM micro-kernels
by Marat Dukhan
· 5 years ago
c72fa1e
Use XNN_ARCH_* macros for architecture-specific parts in micro-kernels
by Marat Dukhan
· 5 years ago
69172d9
6x8 ld128 GEMM microkernels
by Frank Barchard
· 5 years ago
189ae80
Additional implementation of expminus function
by Marat Dukhan
· 5 years ago
c8466f5
Add checks for target ISA in microbenchmarks
by Marat Dukhan
· 5 years ago
54a9d9d
f16_gemm benchmark renamed from hgemm
by Frank Barchard
· 5 years ago
b186463
Fix F16-GEMM benchmark
by Marat Dukhan
· 5 years ago
1390be0
Add reference to "Fast Sparse ConvNets" paper
by Marat Dukhan
· 5 years ago
40a672f
Move generated micro-kernels into a subdirectory
by Marat Dukhan
· 5 years ago
9f08af4
Fix remaining incompatibility with ARM64 gcc in DWCONV SpCHW micro-kernel
by Marat Dukhan
· 5 years ago
a71520b
Update README with new functionality
by Marat Dukhan
· 5 years ago
36aecb5
Fix typos and mixed tabulation in CMakeLists
by Marat Dukhan
· 5 years ago
22aae13
Evaluation stubs for NEONFMA Sigmoid with alternative division implementations
by Marat Dukhan
· 5 years ago
5243bb0
DUP Neon GEMM kernels for Exynos
by Frank Barchard
· 5 years ago
17ec5f3
AVX and FMA3 microkernels for DWCONV
by Marat Dukhan
· 5 years ago
e3fad19
Fix wrong vreinterpret intrinsics in NEONFMA SpCHW DWCONV micro-kernels
by Marat Dukhan
· 5 years ago
8c3b028
Fix wrong vget_low/high intrinsics in FP16 GEMM micro-kernels
by Marat Dukhan
· 5 years ago
91317c5
Rename neon intrinsics to lane.
by Frank Barchard
· 5 years ago
709ac2f
Fully qualify std::isfinite in VScaleExpMinusMax microkernel tester
by Marat Dukhan
· 5 years ago
5743193
Fix typos in END_FUNCTION arguments in ARM64 assembly kernels
by Marat Dukhan
· 5 years ago
1e782c4
Rename vunop and vbinop functions
by Marat Dukhan
· 5 years ago
1025ea3
Enable AVX and FMA3 GEMM micro-kernels for non-mobile x86
by Marat Dukhan
· 5 years ago
496e735
SSE4.1 Sigmoid microkernels
by Marat Dukhan
· 5 years ago
fda12b8
AVX and FMA3 microkernels for GEMM/GEMMINC/IGEMM
by Marat Dukhan
· 5 years ago
5480997
Replace IDLETTERS with ABC
by Frank Barchard
· 5 years ago
fd58293
Fix minor bug in MobileNet v1/v2 models
by Marat Dukhan
· 5 years ago
d42bdf7
SSE shuffle kernel GEMM tests added
by Frank Barchard
· 5 years ago
866f7b3
Fix bugs in Depthwise Convolution and Average Pooling
by Marat Dukhan
· 5 years ago
d7a2c5f
Remove erroneous assert in NEON DWCONV micro-kernels
by Marat Dukhan
· 5 years ago
202eed0
Test changes in input buffer in NHWC Convolution
by Marat Dukhan
· 5 years ago
df06d80
Neon shuffle GEMM and IGEMM kernels.
by Frank Barchard
· 5 years ago
93d29a3
Merge pull request #185 from AshkanAliabadi:build
by Marat Dukhan
· 5 years ago
1f8a2b8
Add memory.c to CMake.
by Ashkan Aliabadi
· 5 years ago
f866a45
Rename f32-argmaxpool source files in CMake.
by Ashkan Aliabadi
· 5 years ago
329da64
Refactor MaxPool and ArgMaxPool micro-kernels
by Marat Dukhan
· 5 years ago
7ccaab6
IGEMM kernels add asserts for a, c, and w pointers.
by Frank Barchard
· 5 years ago
4ad5115
Process 7 input rows / 3 output rows at a time in 5x5p2 chw dwconv. ~10% speedup.
by Erich Elsen
· 5 years ago
04f03be
Support overriding memory allocation functions
by Marat Dukhan
· 5 years ago
7bee751
SSE2 Sigmoid micro-kernels
by Marat Dukhan
· 5 years ago
581c1ac
CMake targets for f32-sigmoid-test and f32-sigmoid-bench
by Marat Dukhan
· 5 years ago
7c6c3b2
Fix CMake build of sigmoid test and benchmark
by Marat Dukhan
· 5 years ago
14bec50
Benchmark for F32 Sigmoid micro-kernels
by Marat Dukhan
· 5 years ago
80b537a
6x8 IGEMM for Cortex A53 pipelined.
by Frank Barchard
· 5 years ago
8fd7b5f
Add slightly less accurate sigmoid approximation that is approximately 50%
by Erich Elsen
· 5 years ago
80bafd2
SSE2 Sigmoid implementation and evaluation stub
by Marat Dukhan
· 5 years ago
efc47b8
Unify operator implementations filenames and tests
by Marat Dukhan
· 5 years ago
1b09229
Fix issues in PReLU benchmark
by Marat Dukhan
· 5 years ago
44e06a6
Fix Sigmoid microkernel incompatibilities with ARM gcc
by Marat Dukhan
· 5 years ago
c3b9e86
Benchmark Sigmoid operator in TFLite implementation
by Marat Dukhan
· 5 years ago
7c8e0c7
4x8 IGEMM for Cortex-A53 pipelined
by Frank Barchard
· 5 years ago
cb10f26
Build accuracy evaluation benchmarks with CMake
by Marat Dukhan
· 5 years ago
684bbb0
CMP 2 instructions earlier in A/C clamping.
by Frank Barchard
· 5 years ago
9d501d5
Add new micro-kernels, operators, and unit tests to CMake build
by Marat Dukhan
· 5 years ago
ca2733c
ND elementwise Multiply operator with broadcasting support
by Marat Dukhan
· 5 years ago
de9b0fb
Process 6 rows at a time in 5x5s2 spchw dwconv kernel.
by Erich Elsen
· 5 years ago
95bebc9
Benchmarks rename sgemm and sppmm to f32_gemm and f32_ppmm
by Frank Barchard
· 5 years ago
179ac85
Add tail to depthwise 5x5s2p2_chw kernels.
by Erich Elsen
· 5 years ago
9efaed7
A53 GEMM and IGEMM pipelined kernels prefetch C in epilogue
by Frank Barchard
· 5 years ago
ffd6840
SSE2 exp and expminus evaluation stubs
by Marat Dukhan
· 5 years ago
19418b5
GEMM 4x8 and 4x12 kernels use forward stores for C.
by Frank Barchard
· 5 years ago
82cfe18
4x8 a53 epilogue NOPs in group 5
by Frank Barchard
· 5 years ago
a99918a
Support 5x5 Depthwise Convolution in SpNCHW layout
by Marat Dukhan
· 5 years ago
430efbc
Merge pull request #153 from Maratyszcza:readme-fix
by XNNPACK Team
· 5 years ago
346a9e5
Sigmoid evaluation stubs, micro-kernels, and operator
by Marat Dukhan
· 5 years ago
21a57c9
Minor grammatical fix to READ
by Andrew Tulloch
· 5 years ago
797a8fe
NEONFMA versions of vector expf function
by Marat Dukhan
· 5 years ago
c76065f
Fix CMake build of elementwise microkernels
by Marat Dukhan
· 5 years ago
c07cb7f
Binary elementwise microkernels
by Marat Dukhan
· 5 years ago
0ecc2ab
4x8 GEMM for Cortex A53
by Frank Barchard
· 5 years ago
5abe43c
ST1 post increment for Cortex A53 GEMM/IGEMM microkernels
by Frank Barchard
· 5 years ago
e097028
Python3 compatibility for xngen
by Marat Dukhan
· 5 years ago
72b3250
Provide compiler hints to generate branch-free code for math helper functions
by Marat Dukhan
· 5 years ago
e67b783
ST1 post increment for ld64/ld128 GEMM/IGEMM microkernels
by Frank Barchard
· 5 years ago
e334cc6
Revert "Exclude unused parts of xnn_params"
by Marat Dukhan
· 5 years ago
6972249
Resize Bilinear operator
by Marat Dukhan
· 5 years ago
5093cbc
Exclude unused parts of xnn_params
by Marat Dukhan
· 5 years ago
e64f91a
Pipelined 6x8 GEMM for Cortex A53
by Frank Barchard
· 5 years ago
9fab3f9
Support input offset in BILINEAR micro-kernels
by Marat Dukhan
· 5 years ago
38709a6
Add scalar chw 5x5p2 and 5x5s2p2 kernels
by Erich Elsen
· 5 years ago
2a64a1a
Fix incompatibility with ARM gcc
by Marat Dukhan
· 5 years ago
0f06b5c
Fix gcc incompatibility in SSE PReLU microkernels
by Marat Dukhan
· 5 years ago
c465fc2
Add missed f32-bilinear-test to CMake build
by Marat Dukhan
· 5 years ago
35dacfb
BILINEAR micro-kernels
by Marat Dukhan
· 5 years ago
5098c3e
Refactor DWCONV micro-kernels
by Marat Dukhan
· 5 years ago
49e6ee9
Refactor VMulCAddC micro-kernel
by Marat Dukhan
· 5 years ago
69c3f2c
Refactor PReLU microkernels
by Marat Dukhan
· 5 years ago
d5208d6
Remove a_sum buffer
by Marat Dukhan
· 5 years ago
1898b91
Move adjustment_* arguments of Deconvolution into setup
by Marat Dukhan
· 5 years ago
a41533d
Reduce image sizes in Deconvolution unit tests
by Marat Dukhan
· 5 years ago
bad48fe
Vary number of threads in the End-to-End benchmark
by Marat Dukhan
· 5 years ago
70ad409
Make ARM microkernels compatible with gcc
by Marat Dukhan
· 5 years ago
fb60914
Make F32 CLAMP NEON micro-kernel compatible with gcc on AArch32
by Marat Dukhan
· 5 years ago
bd41971
A57 branch a version of A53 kernel
by Frank Barchard
· 5 years ago
8e6e997
Fix ARM64 build with CMake
by Marat Dukhan
· 5 years ago
c9d2f3f
Fix CMake build of End-to-End DWCONV & GEMM benchmarks
by Marat Dukhan
· 5 years ago
c712fa4
Add Freq to end2end benchmark.
by Frank Barchard
· 5 years ago
ef4416e
End-to-end benchmarks for DWCONV microkernels
by Marat Dukhan
· 5 years ago
e72e287
Add Freq to E2E benchmark
by Frank Barchard
· 5 years ago
Next »