SSE2 SIMD implementation of Huffman encoding Full-color compression speedups relative to libjpeg-turbo 1.4.2: 2.8 GHz Intel Xeon W3530, Linux, 64-bit: 2.2-18% (avg. 9.5%) 2.8 GHz Intel Xeon W3530, Linux, 32-bit: 10-25% (avg. 17%) 2.3 GHz AMD A10-4600M APU, Linux, 64-bit: 4.9-17% (avg. 11%) 2.3 GHz AMD A10-4600M APU, Linux, 32-bit: 8.8-19% (avg. 15%) 3.0 GHz Intel Core i7, OS X, 64-bit: 3.5-16% (avg. 10%) 3.0 GHz Intel Core i7, OS X, 32-bit: 4.8-14% (avg. 11%) 2.6 GHz AMD Athlon 64 X2 5050e: Performance-neutral (give or take a few percent) Full-color compression speedups relative to IPP: 2.8 GHz Intel Xeon W3530, Linux, 64-bit: 4.8-34% (avg. 19%) 2.8 GHz Intel Xeon W3530, Linux, 32-bit: -19%-7.0% (avg. -7.0%) Refer to #42 for discussion. Numerous other approaches were attempted, but this one proved to be the most performant across all platforms. This commit also fixes #3 (works around, really-- the clang-compiled version of jchuff.c still performs 20% worse than its GCC-compiled counterpart, but that code is now bypassed by the new SSE2 Huffman algorithm.) Based on: https://github.com/mayeut/libjpeg-turbo/commit/2cb4d41330e1edc4469f6b97ba73b73abfbeb02f https://github.com/mayeut/libjpeg-turbo/commit/36c94e050d117912adbff9fbcc6fe307df240168

commit: f3a8684cd1c28e557d394470962a7a224c76ddbc [log] [tgz]
author: DRC <information@libjpeg-turbo.org> Thu Jan 07 00:19:43 2016 -0600
committer: DRC <information@libjpeg-turbo.org> Tue Jan 12 03:03:49 2016 -0600
tree: 6d3a1b20ccd56bc503233385e9ddc8faba6771d3
parent: eb59b6e72d8098a1f7b8c7e0c710b32eb6f5dc45 [diff] [blame]
diff --git a/jsimd_none.c b/jsimd_none.c
index 34aefc9..65e3f8f 100644
--- a/jsimd_none.c
+++ b/jsimd_none.c

@@ -3,6 +3,7 @@
  *
  * Copyright 2009 Pierre Ossman <ossman@cendio.se> for Cendio AB
  * Copyright 2009-2011, 2014 D. R. Commander
+ * Copyright 2015 Matthieu Darbois
  *
  * Based on the x86 SIMD extension for IJG JPEG library,
  * Copyright (C) 1999-2006, MIYASAKA Masaru.
@@ -387,3 +388,16 @@
 {
 }
 
+GLOBAL(int)
+jsimd_can_huff_encode_one_block (void)
+{
+  return 0;
+}
+
+GLOBAL(JOCTET*)
+jsimd_huff_encode_one_block (void * state, JOCTET *buffer, JCOEFPTR block,
+                             int last_dc_val, c_derived_tbl *dctbl,
+                             c_derived_tbl *actbl)
+{
+  return NULL;
+}
commit	f3a8684cd1c28e557d394470962a7a224c76ddbc	[log] [tgz]
author	DRC <information@libjpeg-turbo.org>	Thu Jan 07 00:19:43 2016 -0600
committer	DRC <information@libjpeg-turbo.org>	Tue Jan 12 03:03:49 2016 -0600
tree	6d3a1b20ccd56bc503233385e9ddc8faba6771d3
parent	eb59b6e72d8098a1f7b8c7e0c710b32eb6f5dc45 [diff] [blame]