- 17a9e3f Remove GEMMLOWP requantization from QS8 DWCONV templates by Marat Dukhan · 3 years ago
- 482508b Optimize FP32 requantization in ARMv7 NEON QS8/QU8 VMUL[C] by Marat Dukhan · 3 years ago
- 20483c7 Expose Convert operator in Subgraph API by Marat Dukhan · 3 years ago
- d52d20b Use the same F32->QS8/QU8 VCVT WAsm SIMD microkernels on ARM and x86 by Marat Dukhan · 3 years ago
- 411c18d Optimize FP32 requantization in WAsm SIMD QS8/QC8/QU8 GEMM/IGEMM/DWCONV by Marat Dukhan · 3 years ago
- af9c4e1 Optimize FP32 requantization in WAsm SIMD QS8/QU8 VMUL[C] by Marat Dukhan · 3 years ago
- 430b173 F32->QS8/QU8 VCVT scalar microkernels using FP32 min/max by Marat Dukhan · 3 years ago
- d5ff6ae Remove erroneous assertions from ConvertOperatorTester by Marat Dukhan · 3 years ago
- ed2d776 F32->QS8 and F32->QU8 Convert NC operators by Marat Dukhan · 3 years ago
- 03f1297 F32->QS8 and F32->QU8 Convert NC operators by XNNPACK Team · 3 years ago
- 21d9ac1 Fix debug build of XNNPACK by Marat Dukhan · 3 years ago
- 7d2d85c F32->QS8 and F32->QU8 Convert NC operators by Marat Dukhan · 3 years ago
- 19c8644 Fix prefetch offset for QS8 lane prfm GEMM/IGEMM microkernels/ by Frank Barchard · 3 years ago
- 5740f75 Fix trailing whitespace in VCVT benchmarks by Marat Dukhan · 3 years ago
- 563eee1 Benchmarks for F32->QS8 and F32->QU8 VCVT microkernels by Marat Dukhan · 3 years ago
- 4bd1de9 F32->QS8 and F32->QU8 VCVT WAsm SIMD microkernels using F32->I32 conversion by Marat Dukhan · 3 years ago
- 00a1085 F32->QS8 and F32->QU8 VCVT scalar microkernels by Marat Dukhan · 3 years ago
- 98d5552 F32->QS8 and F32->QU8 VCVT WAsm SIMD microkernels by Marat Dukhan · 3 years ago
- b2d0a2a F32->QS8 and F32->QU8 VCVT NEON microkernels by Marat Dukhan · 3 years ago
- d24301d F32->QS8/QU8 CVT evaluation stubs for NEON and NEON v8 by Marat Dukhan · 3 years ago
- 9551075 Fix CMake build by Marat Dukhan · 3 years ago
- f82ea82 Add PRFM benchmarks for qs8 lane by Frank Barchard · 3 years ago
- 3df14d3 F32->QS8 and F32->QU8 VCVT NEON V8 microkernels by Marat Dukhan · 3 years ago
- c5aa242 F32->QS8 and F32->QU8 microkernels for SSE by Marat Dukhan · 3 years ago
- 0d6a119 Expose quantized Global Average Pooling in Subgraph API by Marat Dukhan · 3 years ago
- c345718 Fix crash in X8 LUT unit tests by Marat Dukhan · 3 years ago
- 5ef4519 Fix test failure under asan in ResizeBilinearTester by Marat Dukhan · 3 years ago
- 5f7cf55 Avoid using gcc-specific intrinsics in NEON microkernels by Marat Dukhan · 3 years ago
- 4cec842 Support quantized Resize Bilinear 2D Node in Subgraph API by Marat Dukhan · 3 years ago
- 0ab7553 S8 & U8 Resize Bilinear NHWC operators by Marat Dukhan · 3 years ago
- 27bf92c RNDNU versions of all Neon lane microkernels. by Frank Barchard · 3 years ago
- 24abe6b Initialize S8/U8 IBILINEAR microkernel pointers by Marat Dukhan · 3 years ago
- 6a69c8e Scalar versions of S8/U8 IBILINEAR microkernels by Marat Dukhan · 3 years ago
- 266a47b WAsm SIMD versions of S8/U8 IBILINEAR microkernels by Marat Dukhan · 3 years ago
- cfcc99d Minor optimization in NEON S8/U8 IBILINEAR microkernels on ARM64 by Marat Dukhan · 3 years ago
- 7519eb1 SSE2 & SSE4.1 versions of S8/U8 IBILINEAR microkernels by Marat Dukhan · 3 years ago
- 6621058 Adds shard_count to long running tests as Coverage tests are timing out. by Alan Kelly · 3 years ago
- cdb42a5 NEON versions of S8/U8 IBILINEAR microkernels by Marat Dukhan · 3 years ago
- b654abf Fix broken QC8 Convolution on AArch32 ARMv8 processors by Marat Dukhan · 3 years ago
- 9cdc10d QU8 4x16 lane AArch64 NEON GEMM/IGEMM ld64 microkernel by Frank Barchard · 3 years ago
- 0bc5801 QC8 AArch32 use NeonV8 when available. by Frank Barchard · 3 years ago
- 6c34dbf Enable 4x16 lane AArch64 NEON GEMM/IGEMM ld64 microkernel on Cortex-A35 by Frank Barchard · 3 years ago
- 5cffb64 4x16 lane AArch64 NEON GEMM/IGEMM ld64 microkernel by Frank Barchard · 3 years ago
- 64ab1b7 LD1R and LD2R variants of c4 microkernel by Frank Barchard · 3 years ago
- 15eec02 LD1R and LD2R variants of c2 microkernel by Frank Barchard · 3 years ago
- 42f5c50 LOADDUP variant of c2 microkernel by Frank Barchard · 3 years ago
- 94c11e6 Initialize compute type in Bankers' Rounding node by Marat Dukhan · 3 years ago
- d2ad6d0 Disable NHWC->NCHW graph rewriting for non-FP32 nodes by Marat Dukhan · 3 years ago
- 8605333 Initialize xnn_compute_type in remaining Subgraph Nodes by Marat Dukhan · 3 years ago
- b1325b9 Introduce xnn_compute_type in Subgraph Nodes by Marat Dukhan · 3 years ago
- e22685a Remove padal from quantized microkernel names. by Frank Barchard · 3 years ago
- eb704f7 QS8 C4S2 Neon GEMM/IGEMM microkernels by Frank Barchard · 3 years ago
- 9eb52c7 Fix build with older gcc versions on x86-64 by Marat Dukhan · 3 years ago
- 4133313 Remove duplicate e2e benchmark. by Frank Barchard · 3 years ago
- a0c6168 F32->F16 Convert operator by Marat Dukhan · 3 years ago
- e7043ff Enable C2S4 for QC8 GEMM/IGEMM microkernels. by Frank Barchard · 3 years ago
- 07228a3 Remove E2E MR=1 benchmarks by Frank Barchard · 3 years ago
- c7a032d C2S4 QS8 Neon GEMM/IGEMM microkernels. by Frank Barchard · 3 years ago
- 1fe8995 Scalar F32->F16 VCVT microkernels by Marat Dukhan · 3 years ago
- 78f039d Scalar F16->F32 evaluation stubs of bitcast-based and fabsf-based variants by Marat Dukhan · 3 years ago
- 4edfdbf NEON F32->F16 VCVT microkernels by Marat Dukhan · 3 years ago
- b4cde5a Fix CMake build on ARM by Marat Dukhan · 3 years ago
- 22e31c8 WAsm SIMD F32->F16 VCVT microkernels by Marat Dukhan · 3 years ago
- eb84423 SSE2, SSE4.1, and AVX F32->F16 VCVT microkernels by Marat Dukhan · 3 years ago
- 79c78b2 Evaluation stubs for WAsm SIMD F32->F16 conversion by Marat Dukhan · 3 years ago
- 056f49d Evaluation stubs for SSE2 & SSE4.1 F32->F16 conversion by Marat Dukhan · 3 years ago
- a6eb1e5 Evaluation stubs for NEON F32->F16 conversion by Marat Dukhan · 3 years ago
- 5132010 QS8 C4 Neon GEMM and E2E benchmarks by Frank Barchard · 3 years ago
- f975ee0 Cortex A35 use A55 microkernels by Frank Barchard · 3 years ago
- 46cc1e1 Evaluation stubs for scalar F32->F16 conversion by Marat Dukhan · 3 years ago
- cefc376 Fixes asan error in dwconv-microkernel-tester.h. by Alan Kelly · 3 years ago
- 287952a QS8 C4 Neon GEMM/IGEMM microkernels by Frank Barchard · 3 years ago
- 05f6e17 Expose quantized deconvolution via the subgraph API by Yury Kartynnik · 3 years ago
- 66ae257 Switch from C2 to S4C2 for qs8 microkernels on 32 bit ARM by Frank Barchard · 3 years ago
- 758b979 Expose XNNPACK transpose convolution implementation as TRANSPOSE_CONV builtin op by Yury Kartynnik · 3 years ago
- 0214d86 Expose XNNPACK transpose convolution implementation as TRANSPOSE_CONV builtin op by XNNPACK Team · 3 years ago
- 47a74db Add specific microkernel for 1D convolutions with 1x3 kernel size for Android backend by Artsiom Ablavatski · 3 years ago
- dcdc2a2 Expose the optionality of bias in 2D deconvolution by Yury Kartynnik · 3 years ago
- 1f31f99 Expose XNNPACK transpose convolution implementation as TRANSPOSE_CONV builtin op by Yury Kartynnik · 3 years ago
- 494cd2b S4 variant of C2 Neon GEMM/IGEMM microkernel by Frank Barchard · 3 years ago
- 952cb51 S4 variant of C2 Neon GEMM/IGEMM mull microkernel by Frank Barchard · 3 years ago
- fa4daf0 Add ISA check to QU8 GEMM benchmark by Frank Barchard · 3 years, 1 month ago
- ccbaedf C2 Neon microkernel remove duplicate DUP instructions from NR loop. by Frank Barchard · 3 years, 1 month ago
- 1d41247 Neon C2 microkernels switch to rndnu from gemmlowp by Frank Barchard · 3 years, 1 month ago
- 8e9a66f Parse shuffle after channels for test names by Frank Barchard · 3 years, 1 month ago
- 582e184 Evaluation stubs and tests for FP16->FP32 conversion by Marat Dukhan · 3 years, 1 month ago
- ddb3d16 F16 Fully Connected operator by Marat Dukhan · 3 years, 1 month ago
- d77f77d F32->F16 VCVT microkernels for NEON-FP16, F16C, and AVX512 by Marat Dukhan · 3 years, 1 month ago
- af2ba00 F16->F32 Convert operator by Marat Dukhan · 3 years, 1 month ago
- ade893c Support unary elementwise ops on 0-dimensional tensors (scalars) by Marat Dukhan · 3 years, 1 month ago
- c9f9d67 Add Channel Tile of 16 for float and 32 for half float. by Frank Barchard · 3 years, 1 month ago
- dbe781b Enable 8x4, 8x9, 8x25 f32 dwconv by Frank Barchard · 3 years, 1 month ago
- e2c0001 Scalar FP16->FP32 VCVT microkernels by Marat Dukhan · 3 years, 1 month ago
- 434352f Benchmarks for FP16->FP32 VCVT microkernels by Marat Dukhan · 3 years, 1 month ago
- f6507f8 WAsm SIMD FP16->FP32 VCVT microkernels by Marat Dukhan · 3 years, 1 month ago
- 322ed6f NEON FP16->FP32 VCVT microkernels by Marat Dukhan · 3 years, 1 month ago
- 1227adb SSE2/SSE4.1/AVX FP16->FP32 VCVT microkernels by Marat Dukhan · 3 years, 1 month ago
- 2dd18fd Parse ROW_TILE field in multipass DWCONV variant by Frank Barchard · 3 years, 1 month ago
- 3c6d6b4 Update performance data on Raspberry Pi by Marat Dukhan · 3 years, 1 month ago
- 1851410 f32 dwconv load params first by Frank Barchard · 3 years, 1 month ago