1. 645af97 FMA3 implementation of F16 DWCONV/VCLAMP/VMULCADDC microkernels by Marat Dukhan · 2 years, 9 months ago
  2. c9f9d67 Add Channel Tile of 16 for float and 32 for half float. by Frank Barchard · 3 years ago
  3. e13e639 Align packed weights on 64 bytes in microkernel benchmarks by Marat Dukhan · 3 years, 2 months ago
  4. 8228689 Support QC8 DWCONV microkernels by Marat Dukhan · 3 years, 4 months ago
  5. f56f4c4 Refactor interface of microkernel parameter initialization by Marat Dukhan · 3 years, 4 months ago
  6. d713e8a Refactor microbenchmarks by Marat Dukhan · 3 years, 10 months ago
  7. c79427c Avoid batch-replication of indirection buffer in DW Conv and Avg Pooling by Marat Dukhan · 4 years ago
  8. 44f0ca7 Bind RNG by reference in microbenchmarks by Marat Dukhan · 4 years, 2 months ago
  9. b42f866 Unify interface of weights packing functions by Marat Dukhan · 4 years, 3 months ago
  10. 5a599a6 FP16 DWCONV microkernel by Frank Barchard · 4 years, 4 months ago