blob: 7dbb35423f2ab8998a6693e3c8b512b71660eed1 [file] [log] [blame]
Thomas G. Lane36a4ccc1994-09-24 00:00:00 +00001USING THE IJG JPEG LIBRARY
2
DRCa73e8702012-12-31 02:52:30 +00003This file was part of the Independent JPEG Group's software:
Guido Vollbeding5829cb22012-01-15 00:00:00 +00004Copyright (C) 1994-2011, Thomas G. Lane, Guido Vollbeding.
DRCda13af62014-05-18 17:52:06 +00005libjpeg-turbo Modifications:
DRC8940e6c2014-05-11 09:46:28 +00006Copyright (C) 2010, 2014, D. R. Commander.
Thomas G. Lane36a4ccc1994-09-24 00:00:00 +00007For conditions of distribution and use, see the accompanying README file.
8
9
10This file describes how to use the IJG JPEG library within an application
11program. Read it if you want to write a program that uses the library.
12
13The file example.c provides heavily commented skeleton code for calling the
14JPEG library. Also see jpeglib.h (the include file to be used by application
15programs) for full details about data structures and function parameter lists.
16The library source code, of course, is the ultimate reference.
17
18Note that there have been *major* changes from the application interface
19presented by IJG version 4 and earlier versions. The old design had several
20inherent limitations, and it had accumulated a lot of cruft as we added
21features while trying to minimize application-interface changes. We have
22sacrificed backward compatibility in the version 5 rewrite, but we think the
23improvements justify this.
24
25
26TABLE OF CONTENTS
27-----------------
28
29Overview:
DRCb7753512014-05-11 09:36:25 +000030 Functions provided by the library
31 Outline of typical usage
Thomas G. Lane36a4ccc1994-09-24 00:00:00 +000032Basic library usage:
DRCb7753512014-05-11 09:36:25 +000033 Data formats
34 Compression details
35 Decompression details
36 Mechanics of usage: include files, linking, etc
Thomas G. Lane36a4ccc1994-09-24 00:00:00 +000037Advanced features:
DRCb7753512014-05-11 09:36:25 +000038 Compression parameter selection
39 Decompression parameter selection
40 Special color spaces
41 Error handling
42 Compressed data handling (source and destination managers)
43 I/O suspension
44 Progressive JPEG support
45 Buffered-image mode
46 Abbreviated datastreams and multiple images
47 Special markers
48 Raw (downsampled) image data
49 Really raw data: DCT coefficients
50 Progress monitoring
51 Memory management
52 Memory usage
53 Library compile-time options
54 Portability considerations
Thomas G. Lane36a4ccc1994-09-24 00:00:00 +000055
56You should read at least the overview and basic usage sections before trying
57to program with the library. The sections on advanced features can be read
58if and when you need them.
59
60
61OVERVIEW
62========
63
64Functions provided by the library
65---------------------------------
66
67The IJG JPEG library provides C code to read and write JPEG-compressed image
68files. The surrounding application program receives or supplies image data a
69scanline at a time, using a straightforward uncompressed image format. All
70details of color conversion and other preprocessing/postprocessing can be
71handled by the library.
72
73The library includes a substantial amount of code that is not covered by the
74JPEG standard but is necessary for typical applications of JPEG. These
75functions preprocess the image before JPEG compression or postprocess it after
76decompression. They include colorspace conversion, downsampling/upsampling,
77and color quantization. The application indirectly selects use of this code
78by specifying the format in which it wishes to supply or receive image data.
79For example, if colormapped output is requested, then the decompression
80library automatically invokes color quantization.
81
82A wide range of quality vs. speed tradeoffs are possible in JPEG processing,
83and even more so in decompression postprocessing. The decompression library
84provides multiple implementations that cover most of the useful tradeoffs,
85ranging from very-high-quality down to fast-preview operation. On the
86compression side we have generally not provided low-quality choices, since
87compression is normally less time-critical. It should be understood that the
88low-quality modes may not meet the JPEG standard's accuracy requirements;
89nonetheless, they are useful for viewers.
90
91A word about functions *not* provided by the library. We handle a subset of
Thomas G. Lanebc79e061995-08-02 00:00:00 +000092the ISO JPEG standard; most baseline, extended-sequential, and progressive
93JPEG processes are supported. (Our subset includes all features now in common
94use.) Unsupported ISO options include:
DRCb7753512014-05-11 09:36:25 +000095 * Hierarchical storage
96 * Lossless JPEG
97 * DNL marker
98 * Nonintegral subsampling ratios
Thomas G. Lane36a4ccc1994-09-24 00:00:00 +000099We support both 8- and 12-bit data precision, but this is a compile-time
100choice rather than a run-time choice; hence it is difficult to use both
101precisions in a single application.
102
103By itself, the library handles only interchange JPEG datastreams --- in
104particular the widely used JFIF file format. The library can be used by
105surrounding code to process interchange or abbreviated JPEG datastreams that
Thomas G. Lanebc79e061995-08-02 00:00:00 +0000106are embedded in more complex file formats. (For example, this library is
107used by the free LIBTIFF library to support JPEG compression in TIFF.)
Thomas G. Lane36a4ccc1994-09-24 00:00:00 +0000108
109
110Outline of typical usage
111------------------------
112
113The rough outline of a JPEG compression operation is:
114
DRCb7753512014-05-11 09:36:25 +0000115 Allocate and initialize a JPEG compression object
116 Specify the destination for the compressed data (eg, a file)
117 Set parameters for compression, including image size & colorspace
118 jpeg_start_compress(...);
119 while (scan lines remain to be written)
120 jpeg_write_scanlines(...);
121 jpeg_finish_compress(...);
122 Release the JPEG compression object
Thomas G. Lane36a4ccc1994-09-24 00:00:00 +0000123
124A JPEG compression object holds parameters and working state for the JPEG
125library. We make creation/destruction of the object separate from starting
126or finishing compression of an image; the same object can be re-used for a
127series of image compression operations. This makes it easy to re-use the
128same parameter settings for a sequence of images. Re-use of a JPEG object
129also has important implications for processing abbreviated JPEG datastreams,
130as discussed later.
131
132The image data to be compressed is supplied to jpeg_write_scanlines() from
133in-memory buffers. If the application is doing file-to-file compression,
134reading image data from the source file is the application's responsibility.
135The library emits compressed data by calling a "data destination manager",
136which typically will write the data into a file; but the application can
137provide its own destination manager to do something else.
138
139Similarly, the rough outline of a JPEG decompression operation is:
140
DRCb7753512014-05-11 09:36:25 +0000141 Allocate and initialize a JPEG decompression object
142 Specify the source of the compressed data (eg, a file)
143 Call jpeg_read_header() to obtain image info
144 Set parameters for decompression
145 jpeg_start_decompress(...);
146 while (scan lines remain to be read)
147 jpeg_read_scanlines(...);
148 jpeg_finish_decompress(...);
149 Release the JPEG decompression object
Thomas G. Lane36a4ccc1994-09-24 00:00:00 +0000150
151This is comparable to the compression outline except that reading the
152datastream header is a separate step. This is helpful because information
153about the image's size, colorspace, etc is available when the application
154selects decompression parameters. For example, the application can choose an
155output scaling ratio that will fit the image into the available screen size.
156
157The decompression library obtains compressed data by calling a data source
158manager, which typically will read the data from a file; but other behaviors
159can be obtained with a custom source manager. Decompressed data is delivered
160into in-memory buffers passed to jpeg_read_scanlines().
161
162It is possible to abort an incomplete compression or decompression operation
163by calling jpeg_abort(); or, if you do not need to retain the JPEG object,
164simply release it by calling jpeg_destroy().
165
166JPEG compression and decompression objects are two separate struct types.
167However, they share some common fields, and certain routines such as
168jpeg_destroy() can work on either type of object.
169
170The JPEG library has no static variables: all state is in the compression
171or decompression object. Therefore it is possible to process multiple
172compression and decompression operations concurrently, using multiple JPEG
173objects.
174
175Both compression and decompression can be done in an incremental memory-to-
Thomas G. Lanebc79e061995-08-02 00:00:00 +0000176memory fashion, if suitable source/destination managers are used. See the
177section on "I/O suspension" for more details.
Thomas G. Lane36a4ccc1994-09-24 00:00:00 +0000178
179
180BASIC LIBRARY USAGE
181===================
182
183Data formats
184------------
185
186Before diving into procedural details, it is helpful to understand the
187image data format that the JPEG library expects or returns.
188
189The standard input image format is a rectangular array of pixels, with each
Thomas G. Lane489583f1996-02-07 00:00:00 +0000190pixel having the same number of "component" or "sample" values (color
191channels). You must specify how many components there are and the colorspace
192interpretation of the components. Most applications will use RGB data
193(three components per pixel) or grayscale data (one component per pixel).
194PLEASE NOTE THAT RGB DATA IS THREE SAMPLES PER PIXEL, GRAYSCALE ONLY ONE.
195A remarkable number of people manage to miss this, only to find that their
196programs don't work with grayscale JPEG files.
Thomas G. Lane36a4ccc1994-09-24 00:00:00 +0000197
Thomas G. Lane489583f1996-02-07 00:00:00 +0000198There is no provision for colormapped input. JPEG files are always full-color
199or full grayscale (or sometimes another colorspace such as CMYK). You can
200feed in a colormapped image by expanding it to full-color format. However
201JPEG often doesn't work very well with source data that has been colormapped,
202because of dithering noise. This is discussed in more detail in the JPEG FAQ
203and the other references mentioned in the README file.
Thomas G. Lane36a4ccc1994-09-24 00:00:00 +0000204
205Pixels are stored by scanlines, with each scanline running from left to
206right. The component values for each pixel are adjacent in the row; for
207example, R,G,B,R,G,B,R,G,B,... for 24-bit RGB color. Each scanline is an
208array of data type JSAMPLE --- which is typically "unsigned char", unless
209you've changed jmorecfg.h. (You can also change the RGB pixel layout, say
210to B,G,R order, by modifying jmorecfg.h. But see the restrictions listed in
211that file before doing so.)
212
213A 2-D array of pixels is formed by making a list of pointers to the starts of
214scanlines; so the scanlines need not be physically adjacent in memory. Even
215if you process just one scanline at a time, you must make a one-element
Thomas G. Lanebc79e061995-08-02 00:00:00 +0000216pointer array to conform to this structure. Pointers to JSAMPLE rows are of
217type JSAMPROW, and the pointer to the pointer array is of type JSAMPARRAY.
Thomas G. Lane36a4ccc1994-09-24 00:00:00 +0000218
219The library accepts or supplies one or more complete scanlines per call.
220It is not possible to process part of a row at a time. Scanlines are always
221processed top-to-bottom. You can process an entire image in one call if you
222have it all in memory, but usually it's simplest to process one scanline at
223a time.
224
225For best results, source data values should have the precision specified by
226BITS_IN_JSAMPLE (normally 8 bits). For instance, if you choose to compress
227data that's only 6 bits/channel, you should left-justify each value in a
228byte before passing it to the compressor. If you need to compress data
229that has more than 8 bits/channel, compile with BITS_IN_JSAMPLE = 12.
230(See "Library compile-time options", later.)
231
Thomas G. Lane489583f1996-02-07 00:00:00 +0000232
Thomas G. Lane36a4ccc1994-09-24 00:00:00 +0000233The data format returned by the decompressor is the same in all details,
Thomas G. Lane489583f1996-02-07 00:00:00 +0000234except that colormapped output is supported. (Again, a JPEG file is never
235colormapped. But you can ask the decompressor to perform on-the-fly color
236quantization to deliver colormapped output.) If you request colormapped
Thomas G. Lane36a4ccc1994-09-24 00:00:00 +0000237output then the returned data array contains a single JSAMPLE per pixel;
238its value is an index into a color map. The color map is represented as
239a 2-D JSAMPARRAY in which each row holds the values of one color component,
240that is, colormap[i][j] is the value of the i'th color component for pixel
241value (map index) j. Note that since the colormap indexes are stored in
242JSAMPLEs, the maximum number of colors is limited by the size of JSAMPLE
243(ie, at most 256 colors for an 8-bit JPEG library).
244
245
246Compression details
247-------------------
248
249Here we revisit the JPEG compression outline given in the overview.
250
2511. Allocate and initialize a JPEG compression object.
252
Thomas G. Lanebc79e061995-08-02 00:00:00 +0000253A JPEG compression object is a "struct jpeg_compress_struct". (It also has
254a bunch of subsidiary structures which are allocated via malloc(), but the
255application doesn't control those directly.) This struct can be just a local
256variable in the calling routine, if a single routine is going to execute the
257whole JPEG compression sequence. Otherwise it can be static or allocated
258from malloc().
Thomas G. Lane36a4ccc1994-09-24 00:00:00 +0000259
260You will also need a structure representing a JPEG error handler. The part
261of this that the library cares about is a "struct jpeg_error_mgr". If you
262are providing your own error handler, you'll typically want to embed the
263jpeg_error_mgr struct in a larger structure; this is discussed later under
264"Error handling". For now we'll assume you are just using the default error
265handler. The default error handler will print JPEG error/warning messages
266on stderr, and it will call exit() if a fatal error occurs.
267
268You must initialize the error handler structure, store a pointer to it into
269the JPEG object's "err" field, and then call jpeg_create_compress() to
270initialize the rest of the JPEG object.
271
272Typical code for this step, if you are using the default error handler, is
273
DRCb7753512014-05-11 09:36:25 +0000274 struct jpeg_compress_struct cinfo;
275 struct jpeg_error_mgr jerr;
276 ...
277 cinfo.err = jpeg_std_error(&jerr);
278 jpeg_create_compress(&cinfo);
Thomas G. Lane36a4ccc1994-09-24 00:00:00 +0000279
280jpeg_create_compress allocates a small amount of memory, so it could fail
281if you are out of memory. In that case it will exit via the error handler;
282that's why the error handler must be initialized first.
283
284
2852. Specify the destination for the compressed data (eg, a file).
286
287As previously mentioned, the JPEG library delivers compressed data to a
288"data destination" module. The library includes one data destination
289module which knows how to write to a stdio stream. You can use your own
290destination module if you want to do something else, as discussed later.
291
292If you use the standard destination module, you must open the target stdio
293stream beforehand. Typical code for this step looks like:
294
DRCb7753512014-05-11 09:36:25 +0000295 FILE * outfile;
296 ...
297 if ((outfile = fopen(filename, "wb")) == NULL) {
298 fprintf(stderr, "can't open %s\n", filename);
299 exit(1);
300 }
301 jpeg_stdio_dest(&cinfo, outfile);
Thomas G. Lane36a4ccc1994-09-24 00:00:00 +0000302
303where the last line invokes the standard destination module.
304
305WARNING: it is critical that the binary compressed data be delivered to the
306output file unchanged. On non-Unix systems the stdio library may perform
307newline translation or otherwise corrupt binary data. To suppress this
308behavior, you may need to use a "b" option to fopen (as shown above), or use
309setmode() or another routine to put the stdio stream in binary mode. See
310cjpeg.c and djpeg.c for code that has been found to work on many systems.
311
312You can select the data destination after setting other parameters (step 3),
313if that's more convenient. You may not change the destination between
314calling jpeg_start_compress() and jpeg_finish_compress().
315
316
3173. Set parameters for compression, including image size & colorspace.
318
319You must supply information about the source image by setting the following
320fields in the JPEG object (cinfo structure):
321
DRCb7753512014-05-11 09:36:25 +0000322 image_width Width of image, in pixels
323 image_height Height of image, in pixels
324 input_components Number of color channels (samples per pixel)
325 in_color_space Color space of source image
Thomas G. Lane36a4ccc1994-09-24 00:00:00 +0000326
327The image dimensions are, hopefully, obvious. JPEG supports image dimensions
328of 1 to 64K pixels in either direction. The input color space is typically
329RGB or grayscale, and input_components is 3 or 1 accordingly. (See "Special
330color spaces", later, for more info.) The in_color_space field must be
331assigned one of the J_COLOR_SPACE enum constants, typically JCS_RGB or
332JCS_GRAYSCALE.
333
334JPEG has a large number of compression parameters that determine how the
335image is encoded. Most applications don't need or want to know about all
336these parameters. You can set all the parameters to reasonable defaults by
337calling jpeg_set_defaults(); then, if there are particular values you want
338to change, you can do so after that. The "Compression parameter selection"
339section tells about all the parameters.
340
341You must set in_color_space correctly before calling jpeg_set_defaults(),
342because the defaults depend on the source image colorspace. However the
343other three source image parameters need not be valid until you call
344jpeg_start_compress(). There's no harm in calling jpeg_set_defaults() more
345than once, if that happens to be convenient.
346
347Typical code for a 24-bit RGB source image is
348
DRCb7753512014-05-11 09:36:25 +0000349 cinfo.image_width = Width; /* image width and height, in pixels */
350 cinfo.image_height = Height;
351 cinfo.input_components = 3; /* # of color components per pixel */
352 cinfo.in_color_space = JCS_RGB; /* colorspace of input image */
Thomas G. Lane36a4ccc1994-09-24 00:00:00 +0000353
DRCb7753512014-05-11 09:36:25 +0000354 jpeg_set_defaults(&cinfo);
355 /* Make optional parameter settings here */
Thomas G. Lane36a4ccc1994-09-24 00:00:00 +0000356
357
3584. jpeg_start_compress(...);
359
360After you have established the data destination and set all the necessary
361source image info and other parameters, call jpeg_start_compress() to begin
362a compression cycle. This will initialize internal state, allocate working
363storage, and emit the first few bytes of the JPEG datastream header.
364
365Typical code:
366
DRCb7753512014-05-11 09:36:25 +0000367 jpeg_start_compress(&cinfo, TRUE);
Thomas G. Lane36a4ccc1994-09-24 00:00:00 +0000368
369The "TRUE" parameter ensures that a complete JPEG interchange datastream
370will be written. This is appropriate in most cases. If you think you might
371want to use an abbreviated datastream, read the section on abbreviated
372datastreams, below.
373
374Once you have called jpeg_start_compress(), you may not alter any JPEG
375parameters or other fields of the JPEG object until you have completed
376the compression cycle.
377
378
3795. while (scan lines remain to be written)
DRCb7753512014-05-11 09:36:25 +0000380 jpeg_write_scanlines(...);
Thomas G. Lane36a4ccc1994-09-24 00:00:00 +0000381
382Now write all the required image data by calling jpeg_write_scanlines()
383one or more times. You can pass one or more scanlines in each call, up
384to the total image height. In most applications it is convenient to pass
385just one or a few scanlines at a time. The expected format for the passed
386data is discussed under "Data formats", above.
387
388Image data should be written in top-to-bottom scanline order. The JPEG spec
389contains some weasel wording about how top and bottom are application-defined
390terms (a curious interpretation of the English language...) but if you want
391your files to be compatible with everyone else's, you WILL use top-to-bottom
392order. If the source data must be read in bottom-to-top order, you can use
393the JPEG library's virtual array mechanism to invert the data efficiently.
394Examples of this can be found in the sample application cjpeg.
395
396The library maintains a count of the number of scanlines written so far
397in the next_scanline field of the JPEG object. Usually you can just use
398this variable as the loop counter, so that the loop test looks like
399"while (cinfo.next_scanline < cinfo.image_height)".
400
401Code for this step depends heavily on the way that you store the source data.
402example.c shows the following code for the case of a full-size 2-D source
403array containing 3-byte RGB pixels:
404
DRCb7753512014-05-11 09:36:25 +0000405 JSAMPROW row_pointer[1]; /* pointer to a single row */
406 int row_stride; /* physical row width in buffer */
Thomas G. Lane36a4ccc1994-09-24 00:00:00 +0000407
DRCb7753512014-05-11 09:36:25 +0000408 row_stride = image_width * 3; /* JSAMPLEs per row in image_buffer */
Thomas G. Lane36a4ccc1994-09-24 00:00:00 +0000409
DRCb7753512014-05-11 09:36:25 +0000410 while (cinfo.next_scanline < cinfo.image_height) {
411 row_pointer[0] = & image_buffer[cinfo.next_scanline * row_stride];
412 jpeg_write_scanlines(&cinfo, row_pointer, 1);
413 }
Thomas G. Lane36a4ccc1994-09-24 00:00:00 +0000414
415jpeg_write_scanlines() returns the number of scanlines actually written.
416This will normally be equal to the number passed in, so you can usually
417ignore the return value. It is different in just two cases:
418 * If you try to write more scanlines than the declared image height,
419 the additional scanlines are ignored.
420 * If you use a suspending data destination manager, output buffer overrun
421 will cause the compressor to return before accepting all the passed lines.
422 This feature is discussed under "I/O suspension", below. The normal
423 stdio destination manager will NOT cause this to happen.
424In any case, the return value is the same as the change in the value of
425next_scanline.
426
427
4286. jpeg_finish_compress(...);
429
430After all the image data has been written, call jpeg_finish_compress() to
431complete the compression cycle. This step is ESSENTIAL to ensure that the
432last bufferload of data is written to the data destination.
433jpeg_finish_compress() also releases working memory associated with the JPEG
434object.
435
436Typical code:
437
DRCb7753512014-05-11 09:36:25 +0000438 jpeg_finish_compress(&cinfo);
Thomas G. Lane36a4ccc1994-09-24 00:00:00 +0000439
440If using the stdio destination manager, don't forget to close the output
Thomas G. Lane5ead57a1998-03-27 00:00:00 +0000441stdio stream (if necessary) afterwards.
Thomas G. Lane36a4ccc1994-09-24 00:00:00 +0000442
443If you have requested a multi-pass operating mode, such as Huffman code
444optimization, jpeg_finish_compress() will perform the additional passes using
445data buffered by the first pass. In this case jpeg_finish_compress() may take
446quite a while to complete. With the default compression parameters, this will
447not happen.
448
449It is an error to call jpeg_finish_compress() before writing the necessary
450total number of scanlines. If you wish to abort compression, call
451jpeg_abort() as discussed below.
452
453After completing a compression cycle, you may dispose of the JPEG object
454as discussed next, or you may use it to compress another image. In that case
455return to step 2, 3, or 4 as appropriate. If you do not change the
456destination manager, the new datastream will be written to the same target.
457If you do not change any JPEG parameters, the new datastream will be written
458with the same parameters as before. Note that you can change the input image
459dimensions freely between cycles, but if you change the input colorspace, you
460should call jpeg_set_defaults() to adjust for the new colorspace; and then
461you'll need to repeat all of step 3.
462
463
4647. Release the JPEG compression object.
465
466When you are done with a JPEG compression object, destroy it by calling
Thomas G. Lane5ead57a1998-03-27 00:00:00 +0000467jpeg_destroy_compress(). This will free all subsidiary memory (regardless of
468the previous state of the object). Or you can call jpeg_destroy(), which
469works for either compression or decompression objects --- this may be more
470convenient if you are sharing code between compression and decompression
471cases. (Actually, these routines are equivalent except for the declared type
472of the passed pointer. To avoid gripes from ANSI C compilers, jpeg_destroy()
473should be passed a j_common_ptr.)
Thomas G. Lane36a4ccc1994-09-24 00:00:00 +0000474
475If you allocated the jpeg_compress_struct structure from malloc(), freeing
476it is your responsibility --- jpeg_destroy() won't. Ditto for the error
477handler structure.
478
479Typical code:
480
DRCb7753512014-05-11 09:36:25 +0000481 jpeg_destroy_compress(&cinfo);
Thomas G. Lane36a4ccc1994-09-24 00:00:00 +0000482
483
4848. Aborting.
485
486If you decide to abort a compression cycle before finishing, you can clean up
487in either of two ways:
488
489* If you don't need the JPEG object any more, just call
490 jpeg_destroy_compress() or jpeg_destroy() to release memory. This is
491 legitimate at any point after calling jpeg_create_compress() --- in fact,
492 it's safe even if jpeg_create_compress() fails.
493
Thomas G. Lane5ead57a1998-03-27 00:00:00 +0000494* If you want to re-use the JPEG object, call jpeg_abort_compress(), or call
Thomas G. Lane36a4ccc1994-09-24 00:00:00 +0000495 jpeg_abort() which works on both compression and decompression objects.
496 This will return the object to an idle state, releasing any working memory.
497 jpeg_abort() is allowed at any time after successful object creation.
498
499Note that cleaning up the data destination, if required, is your
Thomas G. Lane5ead57a1998-03-27 00:00:00 +0000500responsibility; neither of these routines will call term_destination().
501(See "Compressed data handling", below, for more about that.)
502
503jpeg_destroy() and jpeg_abort() are the only safe calls to make on a JPEG
504object that has reported an error by calling error_exit (see "Error handling"
505for more info). The internal state of such an object is likely to be out of
506whack. Either of these two routines will return the object to a known state.
Thomas G. Lane36a4ccc1994-09-24 00:00:00 +0000507
508
509Decompression details
510---------------------
511
512Here we revisit the JPEG decompression outline given in the overview.
513
5141. Allocate and initialize a JPEG decompression object.
515
516This is just like initialization for compression, as discussed above,
517except that the object is a "struct jpeg_decompress_struct" and you
518call jpeg_create_decompress(). Error handling is exactly the same.
519
520Typical code:
521
DRCb7753512014-05-11 09:36:25 +0000522 struct jpeg_decompress_struct cinfo;
523 struct jpeg_error_mgr jerr;
524 ...
525 cinfo.err = jpeg_std_error(&jerr);
526 jpeg_create_decompress(&cinfo);
Thomas G. Lane36a4ccc1994-09-24 00:00:00 +0000527
528(Both here and in the IJG code, we usually use variable name "cinfo" for
529both compression and decompression objects.)
530
531
5322. Specify the source of the compressed data (eg, a file).
533
534As previously mentioned, the JPEG library reads compressed data from a "data
535source" module. The library includes one data source module which knows how
536to read from a stdio stream. You can use your own source module if you want
537to do something else, as discussed later.
538
539If you use the standard source module, you must open the source stdio stream
540beforehand. Typical code for this step looks like:
541
DRCb7753512014-05-11 09:36:25 +0000542 FILE * infile;
543 ...
544 if ((infile = fopen(filename, "rb")) == NULL) {
545 fprintf(stderr, "can't open %s\n", filename);
546 exit(1);
547 }
548 jpeg_stdio_src(&cinfo, infile);
Thomas G. Lane36a4ccc1994-09-24 00:00:00 +0000549
550where the last line invokes the standard source module.
551
552WARNING: it is critical that the binary compressed data be read unchanged.
553On non-Unix systems the stdio library may perform newline translation or
554otherwise corrupt binary data. To suppress this behavior, you may need to use
555a "b" option to fopen (as shown above), or use setmode() or another routine to
556put the stdio stream in binary mode. See cjpeg.c and djpeg.c for code that
557has been found to work on many systems.
558
559You may not change the data source between calling jpeg_read_header() and
560jpeg_finish_decompress(). If you wish to read a series of JPEG images from
561a single source file, you should repeat the jpeg_read_header() to
562jpeg_finish_decompress() sequence without reinitializing either the JPEG
563object or the data source module; this prevents buffered input data from
564being discarded.
565
566
5673. Call jpeg_read_header() to obtain image info.
568
569Typical code for this step is just
570
DRCb7753512014-05-11 09:36:25 +0000571 jpeg_read_header(&cinfo, TRUE);
Thomas G. Lane36a4ccc1994-09-24 00:00:00 +0000572
573This will read the source datastream header markers, up to the beginning
574of the compressed data proper. On return, the image dimensions and other
575info have been stored in the JPEG object. The application may wish to
576consult this information before selecting decompression parameters.
577
578More complex code is necessary if
579 * A suspending data source is used --- in that case jpeg_read_header()
580 may return before it has read all the header data. See "I/O suspension",
581 below. The normal stdio source manager will NOT cause this to happen.
582 * Abbreviated JPEG files are to be processed --- see the section on
583 abbreviated datastreams. Standard applications that deal only in
584 interchange JPEG files need not be concerned with this case either.
585
586It is permissible to stop at this point if you just wanted to find out the
587image dimensions and other header info for a JPEG file. In that case,
588call jpeg_destroy() when you are done with the JPEG object, or call
589jpeg_abort() to return it to an idle state before selecting a new data
590source and reading another header.
591
592
5934. Set parameters for decompression.
594
595jpeg_read_header() sets appropriate default decompression parameters based on
596the properties of the image (in particular, its colorspace). However, you
597may well want to alter these defaults before beginning the decompression.
598For example, the default is to produce full color output from a color file.
599If you want colormapped output you must ask for it. Other options allow the
600returned image to be scaled and allow various speed/quality tradeoffs to be
601selected. "Decompression parameter selection", below, gives details.
602
603If the defaults are appropriate, nothing need be done at this step.
604
605Note that all default values are set by each call to jpeg_read_header().
606If you reuse a decompression object, you cannot expect your parameter
607settings to be preserved across cycles, as you can for compression.
Thomas G. Lanebc79e061995-08-02 00:00:00 +0000608You must set desired parameter values each time.
Thomas G. Lane36a4ccc1994-09-24 00:00:00 +0000609
610
6115. jpeg_start_decompress(...);
612
613Once the parameter values are satisfactory, call jpeg_start_decompress() to
614begin decompression. This will initialize internal state, allocate working
615memory, and prepare for returning data.
616
617Typical code is just
618
DRCb7753512014-05-11 09:36:25 +0000619 jpeg_start_decompress(&cinfo);
Thomas G. Lane36a4ccc1994-09-24 00:00:00 +0000620
621If you have requested a multi-pass operating mode, such as 2-pass color
622quantization, jpeg_start_decompress() will do everything needed before data
623output can begin. In this case jpeg_start_decompress() may take quite a while
Thomas G. Lanebc79e061995-08-02 00:00:00 +0000624to complete. With a single-scan (non progressive) JPEG file and default
Thomas G. Lane36a4ccc1994-09-24 00:00:00 +0000625decompression parameters, this will not happen; jpeg_start_decompress() will
626return quickly.
627
628After this call, the final output image dimensions, including any requested
629scaling, are available in the JPEG object; so is the selected colormap, if
630colormapped output has been requested. Useful fields include
631
DRCb7753512014-05-11 09:36:25 +0000632 output_width image width and height, as scaled
633 output_height
634 out_color_components # of color components in out_color_space
635 output_components # of color components returned per pixel
636 colormap the selected colormap, if any
637 actual_number_of_colors number of entries in colormap
Thomas G. Lane36a4ccc1994-09-24 00:00:00 +0000638
639output_components is 1 (a colormap index) when quantizing colors; otherwise it
640equals out_color_components. It is the number of JSAMPLE values that will be
641emitted per pixel in the output arrays.
642
643Typically you will need to allocate data buffers to hold the incoming image.
644You will need output_width * output_components JSAMPLEs per scanline in your
645output buffer, and a total of output_height scanlines will be returned.
646
647Note: if you are using the JPEG library's internal memory manager to allocate
648data buffers (as djpeg does), then the manager's protocol requires that you
649request large buffers *before* calling jpeg_start_decompress(). This is a
650little tricky since the output_XXX fields are not normally valid then. You
651can make them valid by calling jpeg_calc_output_dimensions() after setting the
652relevant parameters (scaling, output color space, and quantization flag).
653
654
6556. while (scan lines remain to be read)
DRCb7753512014-05-11 09:36:25 +0000656 jpeg_read_scanlines(...);
Thomas G. Lane36a4ccc1994-09-24 00:00:00 +0000657
658Now you can read the decompressed image data by calling jpeg_read_scanlines()
659one or more times. At each call, you pass in the maximum number of scanlines
660to be read (ie, the height of your working buffer); jpeg_read_scanlines()
661will return up to that many lines. The return value is the number of lines
662actually read. The format of the returned data is discussed under "Data
Thomas G. Lanea8b67c41995-03-15 00:00:00 +0000663formats", above. Don't forget that grayscale and color JPEGs will return
664different data formats!
Thomas G. Lane36a4ccc1994-09-24 00:00:00 +0000665
666Image data is returned in top-to-bottom scanline order. If you must write
667out the image in bottom-to-top order, you can use the JPEG library's virtual
668array mechanism to invert the data efficiently. Examples of this can be
669found in the sample application djpeg.
670
671The library maintains a count of the number of scanlines returned so far
672in the output_scanline field of the JPEG object. Usually you can just use
673this variable as the loop counter, so that the loop test looks like
674"while (cinfo.output_scanline < cinfo.output_height)". (Note that the test
675should NOT be against image_height, unless you never use scaling. The
676image_height field is the height of the original unscaled image.)
Thomas G. Lane9ba2f5e1994-12-07 00:00:00 +0000677The return value always equals the change in the value of output_scanline.
Thomas G. Lane36a4ccc1994-09-24 00:00:00 +0000678
679If you don't use a suspending data source, it is safe to assume that
680jpeg_read_scanlines() reads at least one scanline per call, until the
Thomas G. Lane489583f1996-02-07 00:00:00 +0000681bottom of the image has been reached.
682
683If you use a buffer larger than one scanline, it is NOT safe to assume that
Thomas G. Lane5ead57a1998-03-27 00:00:00 +0000684jpeg_read_scanlines() fills it. (The current implementation returns only a
685few scanlines per call, no matter how large a buffer you pass.) So you must
686always provide a loop that calls jpeg_read_scanlines() repeatedly until the
687whole image has been read.
Thomas G. Lane36a4ccc1994-09-24 00:00:00 +0000688
689
6907. jpeg_finish_decompress(...);
691
692After all the image data has been read, call jpeg_finish_decompress() to
693complete the decompression cycle. This causes working memory associated
694with the JPEG object to be released.
695
696Typical code:
697
DRCb7753512014-05-11 09:36:25 +0000698 jpeg_finish_decompress(&cinfo);
Thomas G. Lane36a4ccc1994-09-24 00:00:00 +0000699
700If using the stdio source manager, don't forget to close the source stdio
701stream if necessary.
702
703It is an error to call jpeg_finish_decompress() before reading the correct
Thomas G. Lane5ead57a1998-03-27 00:00:00 +0000704total number of scanlines. If you wish to abort decompression, call
Thomas G. Lane36a4ccc1994-09-24 00:00:00 +0000705jpeg_abort() as discussed below.
706
707After completing a decompression cycle, you may dispose of the JPEG object as
708discussed next, or you may use it to decompress another image. In that case
709return to step 2 or 3 as appropriate. If you do not change the source
710manager, the next image will be read from the same source.
711
712
7138. Release the JPEG decompression object.
714
715When you are done with a JPEG decompression object, destroy it by calling
716jpeg_destroy_decompress() or jpeg_destroy(). The previous discussion of
717destroying compression objects applies here too.
718
719Typical code:
720
DRCb7753512014-05-11 09:36:25 +0000721 jpeg_destroy_decompress(&cinfo);
Thomas G. Lane36a4ccc1994-09-24 00:00:00 +0000722
723
7249. Aborting.
725
726You can abort a decompression cycle by calling jpeg_destroy_decompress() or
727jpeg_destroy() if you don't need the JPEG object any more, or
728jpeg_abort_decompress() or jpeg_abort() if you want to reuse the object.
729The previous discussion of aborting compression cycles applies here too.
730
731
732Mechanics of usage: include files, linking, etc
733-----------------------------------------------
734
735Applications using the JPEG library should include the header file jpeglib.h
736to obtain declarations of data types and routines. Before including
737jpeglib.h, include system headers that define at least the typedefs FILE and
738size_t. On ANSI-conforming systems, including <stdio.h> is sufficient; on
739older Unix systems, you may need <sys/types.h> to define size_t.
740
741If the application needs to refer to individual JPEG library error codes, also
742include jerror.h to define those symbols.
743
744jpeglib.h indirectly includes the files jconfig.h and jmorecfg.h. If you are
745installing the JPEG header files in a system directory, you will want to
746install all four files: jpeglib.h, jerror.h, jconfig.h, jmorecfg.h.
747
748The most convenient way to include the JPEG code into your executable program
749is to prepare a library file ("libjpeg.a", or a corresponding name on non-Unix
750machines) and reference it at your link step. If you use only half of the
751library (only compression or only decompression), only that much code will be
752included from the library, unless your linker is hopelessly brain-damaged.
Guido Vollbeding5996a252009-06-27 00:00:00 +0000753The supplied makefiles build libjpeg.a automatically (see install.txt).
Thomas G. Lane36a4ccc1994-09-24 00:00:00 +0000754
Thomas G. Lane5ead57a1998-03-27 00:00:00 +0000755While you can build the JPEG library as a shared library if the whim strikes
756you, we don't really recommend it. The trouble with shared libraries is that
757at some point you'll probably try to substitute a new version of the library
758without recompiling the calling applications. That generally doesn't work
759because the parameter struct declarations usually change with each new
760version. In other words, the library's API is *not* guaranteed binary
761compatible across versions; we only try to ensure source-code compatibility.
762(In hindsight, it might have been smarter to hide the parameter structs from
763applications and introduce a ton of access functions instead. Too late now,
764however.)
765
Thomas G. Lane36a4ccc1994-09-24 00:00:00 +0000766It may be worth pointing out that the core JPEG library does not actually
767require the stdio library: only the default source/destination managers and
768error handler need it. You can use the library in a stdio-less environment
769if you replace those modules and use jmemnobs.c (or another memory manager of
770your own devising). More info about the minimum system library requirements
771may be found in jinclude.h.
772
773
774ADVANCED FEATURES
775=================
776
777Compression parameter selection
778-------------------------------
779
780This section describes all the optional parameters you can set for JPEG
781compression, as well as the "helper" routines provided to assist in this
782task. Proper setting of some parameters requires detailed understanding
783of the JPEG standard; if you don't know what a parameter is for, it's best
784not to mess with it! See REFERENCES in the README file for pointers to
785more info about JPEG.
786
787It's a good idea to call jpeg_set_defaults() first, even if you plan to set
788all the parameters; that way your code is more likely to work with future JPEG
789libraries that have additional parameters. For the same reason, we recommend
790you use a helper routine where one is provided, in preference to twiddling
791cinfo fields directly.
792
793The helper routines are:
794
795jpeg_set_defaults (j_compress_ptr cinfo)
DRCb7753512014-05-11 09:36:25 +0000796 This routine sets all JPEG parameters to reasonable defaults, using
797 only the input image's color space (field in_color_space, which must
798 already be set in cinfo). Many applications will only need to use
799 this routine and perhaps jpeg_set_quality().
Thomas G. Lane36a4ccc1994-09-24 00:00:00 +0000800
801jpeg_set_colorspace (j_compress_ptr cinfo, J_COLOR_SPACE colorspace)
DRCb7753512014-05-11 09:36:25 +0000802 Sets the JPEG file's colorspace (field jpeg_color_space) as specified,
803 and sets other color-space-dependent parameters appropriately. See
804 "Special color spaces", below, before using this. A large number of
805 parameters, including all per-component parameters, are set by this
806 routine; if you want to twiddle individual parameters you should call
807 jpeg_set_colorspace() before rather than after.
Thomas G. Lane36a4ccc1994-09-24 00:00:00 +0000808
809jpeg_default_colorspace (j_compress_ptr cinfo)
DRCb7753512014-05-11 09:36:25 +0000810 Selects an appropriate JPEG colorspace based on cinfo->in_color_space,
811 and calls jpeg_set_colorspace(). This is actually a subroutine of
812 jpeg_set_defaults(). It's broken out in case you want to change
813 just the colorspace-dependent JPEG parameters.
Thomas G. Lane36a4ccc1994-09-24 00:00:00 +0000814
815jpeg_set_quality (j_compress_ptr cinfo, int quality, boolean force_baseline)
DRCb7753512014-05-11 09:36:25 +0000816 Constructs JPEG quantization tables appropriate for the indicated
817 quality setting. The quality value is expressed on the 0..100 scale
818 recommended by IJG (cjpeg's "-quality" switch uses this routine).
819 Note that the exact mapping from quality values to tables may change
820 in future IJG releases as more is learned about DCT quantization.
821 If the force_baseline parameter is TRUE, then the quantization table
822 entries are constrained to the range 1..255 for full JPEG baseline
823 compatibility. In the current implementation, this only makes a
824 difference for quality settings below 25, and it effectively prevents
825 very small/low quality files from being generated. The IJG decoder
826 is capable of reading the non-baseline files generated at low quality
827 settings when force_baseline is FALSE, but other decoders may not be.
Thomas G. Lane36a4ccc1994-09-24 00:00:00 +0000828
829jpeg_set_linear_quality (j_compress_ptr cinfo, int scale_factor,
DRCb7753512014-05-11 09:36:25 +0000830 boolean force_baseline)
831 Same as jpeg_set_quality() except that the generated tables are the
832 sample tables given in the JPEC spec section K.1, multiplied by the
833 specified scale factor (which is expressed as a percentage; thus
834 scale_factor = 100 reproduces the spec's tables). Note that larger
835 scale factors give lower quality. This entry point is useful for
836 conforming to the Adobe PostScript DCT conventions, but we do not
837 recommend linear scaling as a user-visible quality scale otherwise.
838 force_baseline again constrains the computed table entries to 1..255.
Thomas G. Lane36a4ccc1994-09-24 00:00:00 +0000839
840int jpeg_quality_scaling (int quality)
DRCb7753512014-05-11 09:36:25 +0000841 Converts a value on the IJG-recommended quality scale to a linear
842 scaling percentage. Note that this routine may change or go away
843 in future releases --- IJG may choose to adopt a scaling method that
844 can't be expressed as a simple scalar multiplier, in which case the
845 premise of this routine collapses. Caveat user.
Thomas G. Lane36a4ccc1994-09-24 00:00:00 +0000846
Guido Vollbeding5996a252009-06-27 00:00:00 +0000847jpeg_default_qtables (j_compress_ptr cinfo, boolean force_baseline)
DRCb7753512014-05-11 09:36:25 +0000848 [libjpeg v7+ API/ABI emulation only]
849 Set default quantization tables with linear q_scale_factor[] values
850 (see below).
Guido Vollbeding5996a252009-06-27 00:00:00 +0000851
Thomas G. Lane36a4ccc1994-09-24 00:00:00 +0000852jpeg_add_quant_table (j_compress_ptr cinfo, int which_tbl,
DRCb7753512014-05-11 09:36:25 +0000853 const unsigned int *basic_table,
854 int scale_factor, boolean force_baseline)
855 Allows an arbitrary quantization table to be created. which_tbl
856 indicates which table slot to fill. basic_table points to an array
857 of 64 unsigned ints given in normal array order. These values are
858 multiplied by scale_factor/100 and then clamped to the range 1..65535
859 (or to 1..255 if force_baseline is TRUE).
860 CAUTION: prior to library version 6a, jpeg_add_quant_table expected
861 the basic table to be given in JPEG zigzag order. If you need to
862 write code that works with either older or newer versions of this
863 routine, you must check the library version number. Something like
864 "#if JPEG_LIB_VERSION >= 61" is the right test.
Thomas G. Lane36a4ccc1994-09-24 00:00:00 +0000865
Thomas G. Lanebc79e061995-08-02 00:00:00 +0000866jpeg_simple_progression (j_compress_ptr cinfo)
DRCb7753512014-05-11 09:36:25 +0000867 Generates a default scan script for writing a progressive-JPEG file.
868 This is the recommended method of creating a progressive file,
869 unless you want to make a custom scan sequence. You must ensure that
870 the JPEG color space is set correctly before calling this routine.
Thomas G. Lanebc79e061995-08-02 00:00:00 +0000871
Thomas G. Lane36a4ccc1994-09-24 00:00:00 +0000872
873Compression parameters (cinfo fields) include:
874
Thomas G. Lane36a4ccc1994-09-24 00:00:00 +0000875J_DCT_METHOD dct_method
DRCb7753512014-05-11 09:36:25 +0000876 Selects the algorithm used for the DCT step. Choices are:
877 JDCT_ISLOW: slow but accurate integer algorithm
878 JDCT_IFAST: faster, less accurate integer method
879 JDCT_FLOAT: floating-point method
880 JDCT_DEFAULT: default method (normally JDCT_ISLOW)
881 JDCT_FASTEST: fastest method (normally JDCT_IFAST)
DRC8940e6c2014-05-11 09:46:28 +0000882 In libjpeg-turbo, JDCT_IFAST is generally about 5-15% faster than
883 JDCT_ISLOW when using the x86/x86-64 SIMD extensions (results may vary
884 with other SIMD implementations, or when using libjpeg-turbo without
885 SIMD extensions.) For quality levels of 90 and below, there should be
886 little or no perceptible difference between the two algorithms. For
887 quality levels above 90, however, the difference between JDCT_IFAST and
888 JDCT_ISLOW becomes more pronounced. With quality=97, for instance,
889 JDCT_IFAST incurs generally about a 1-3 dB loss (in PSNR) relative to
890 JDCT_ISLOW, but this can be larger for some images. Do not use
891 JDCT_IFAST with quality levels above 97. The algorithm often
892 degenerates at quality=98 and above and can actually produce a more
DRC05524e62014-05-11 23:14:43 +0000893 lossy image than if lower quality levels had been used. Also, in
894 libjpeg-turbo, JDCT_IFAST is not fully accelerated for quality levels
895 above 97, so it will be slower than JDCT_ISLOW. JDCT_FLOAT is mainly a
896 legacy feature. It does not produce significantly more accurate
897 results than the ISLOW method, and it is much slower. The FLOAT method
898 may also give different results on different machines due to varying
899 roundoff behavior, whereas the integer methods should give the same
900 results on all machines.
Thomas G. Lane36a4ccc1994-09-24 00:00:00 +0000901
Thomas G. Lanebc79e061995-08-02 00:00:00 +0000902J_COLOR_SPACE jpeg_color_space
903int num_components
DRCb7753512014-05-11 09:36:25 +0000904 The JPEG color space and corresponding number of components; see
905 "Special color spaces", below, for more info. We recommend using
906 jpeg_set_color_space() if you want to change these.
Thomas G. Lanebc79e061995-08-02 00:00:00 +0000907
908boolean optimize_coding
DRCb7753512014-05-11 09:36:25 +0000909 TRUE causes the compressor to compute optimal Huffman coding tables
910 for the image. This requires an extra pass over the data and
911 therefore costs a good deal of space and time. The default is
912 FALSE, which tells the compressor to use the supplied or default
913 Huffman tables. In most cases optimal tables save only a few percent
914 of file size compared to the default tables. Note that when this is
915 TRUE, you need not supply Huffman tables at all, and any you do
916 supply will be overwritten.
Thomas G. Lanebc79e061995-08-02 00:00:00 +0000917
Thomas G. Lane36a4ccc1994-09-24 00:00:00 +0000918unsigned int restart_interval
919int restart_in_rows
DRCb7753512014-05-11 09:36:25 +0000920 To emit restart markers in the JPEG file, set one of these nonzero.
921 Set restart_interval to specify the exact interval in MCU blocks.
922 Set restart_in_rows to specify the interval in MCU rows. (If
923 restart_in_rows is not 0, then restart_interval is set after the
924 image width in MCUs is computed.) Defaults are zero (no restarts).
925 One restart marker per MCU row is often a good choice.
926 NOTE: the overhead of restart markers is higher in grayscale JPEG
927 files than in color files, and MUCH higher in progressive JPEGs.
928 If you use restarts, you may want to use larger intervals in those
929 cases.
Thomas G. Lane36a4ccc1994-09-24 00:00:00 +0000930
Thomas G. Lanebc79e061995-08-02 00:00:00 +0000931const jpeg_scan_info * scan_info
932int num_scans
DRCb7753512014-05-11 09:36:25 +0000933 By default, scan_info is NULL; this causes the compressor to write a
934 single-scan sequential JPEG file. If not NULL, scan_info points to
935 an array of scan definition records of length num_scans. The
936 compressor will then write a JPEG file having one scan for each scan
937 definition record. This is used to generate noninterleaved or
938 progressive JPEG files. The library checks that the scan array
939 defines a valid JPEG scan sequence. (jpeg_simple_progression creates
940 a suitable scan definition array for progressive JPEG.) This is
941 discussed further under "Progressive JPEG support".
Thomas G. Lanebc79e061995-08-02 00:00:00 +0000942
943int smoothing_factor
DRCb7753512014-05-11 09:36:25 +0000944 If non-zero, the input image is smoothed; the value should be 1 for
945 minimal smoothing to 100 for maximum smoothing. Consult jcsample.c
946 for details of the smoothing algorithm. The default is zero.
Thomas G. Lane36a4ccc1994-09-24 00:00:00 +0000947
948boolean write_JFIF_header
DRCb7753512014-05-11 09:36:25 +0000949 If TRUE, a JFIF APP0 marker is emitted. jpeg_set_defaults() and
950 jpeg_set_colorspace() set this TRUE if a JFIF-legal JPEG color space
951 (ie, YCbCr or grayscale) is selected, otherwise FALSE.
Thomas G. Lane36a4ccc1994-09-24 00:00:00 +0000952
Thomas G. Lane5ead57a1998-03-27 00:00:00 +0000953UINT8 JFIF_major_version
954UINT8 JFIF_minor_version
DRCb7753512014-05-11 09:36:25 +0000955 The version number to be written into the JFIF marker.
956 jpeg_set_defaults() initializes the version to 1.01 (major=minor=1).
957 You should set it to 1.02 (major=1, minor=2) if you plan to write
958 any JFIF 1.02 extension markers.
Thomas G. Lane5ead57a1998-03-27 00:00:00 +0000959
Thomas G. Lane36a4ccc1994-09-24 00:00:00 +0000960UINT8 density_unit
961UINT16 X_density
962UINT16 Y_density
DRCb7753512014-05-11 09:36:25 +0000963 The resolution information to be written into the JFIF marker;
964 not used otherwise. density_unit may be 0 for unknown,
965 1 for dots/inch, or 2 for dots/cm. The default values are 0,1,1
966 indicating square pixels of unknown size.
Thomas G. Lane36a4ccc1994-09-24 00:00:00 +0000967
968boolean write_Adobe_marker
DRCb7753512014-05-11 09:36:25 +0000969 If TRUE, an Adobe APP14 marker is emitted. jpeg_set_defaults() and
970 jpeg_set_colorspace() set this TRUE if JPEG color space RGB, CMYK,
971 or YCCK is selected, otherwise FALSE. It is generally a bad idea
972 to set both write_JFIF_header and write_Adobe_marker. In fact,
973 you probably shouldn't change the default settings at all --- the
974 default behavior ensures that the JPEG file's color space can be
975 recognized by the decoder.
Thomas G. Lane36a4ccc1994-09-24 00:00:00 +0000976
977JQUANT_TBL * quant_tbl_ptrs[NUM_QUANT_TBLS]
DRCb7753512014-05-11 09:36:25 +0000978 Pointers to coefficient quantization tables, one per table slot,
979 or NULL if no table is defined for a slot. Usually these should
980 be set via one of the above helper routines; jpeg_add_quant_table()
981 is general enough to define any quantization table. The other
982 routines will set up table slot 0 for luminance quality and table
983 slot 1 for chrominance.
Thomas G. Lane36a4ccc1994-09-24 00:00:00 +0000984
Guido Vollbeding5996a252009-06-27 00:00:00 +0000985int q_scale_factor[NUM_QUANT_TBLS]
DRCb7753512014-05-11 09:36:25 +0000986 [libjpeg v7+ API/ABI emulation only]
987 Linear quantization scaling factors (0-100, default 100)
988 for use with jpeg_default_qtables().
989 See rdswitch.c and cjpeg.c for an example of usage.
990 Note that the q_scale_factor[] values use "linear" scales, so JPEG
991 quality levels chosen by the user must be converted to these scales
992 using jpeg_quality_scaling(). Here is an example that corresponds to
993 cjpeg -quality 90,70:
Guido Vollbeding5996a252009-06-27 00:00:00 +0000994
DRCb7753512014-05-11 09:36:25 +0000995 jpeg_set_defaults(cinfo);
Guido Vollbeding5996a252009-06-27 00:00:00 +0000996
DRCb7753512014-05-11 09:36:25 +0000997 /* Set luminance quality 90. */
998 cinfo->q_scale_factor[0] = jpeg_quality_scaling(90);
999 /* Set chrominance quality 70. */
1000 cinfo->q_scale_factor[1] = jpeg_quality_scaling(70);
Guido Vollbeding5996a252009-06-27 00:00:00 +00001001
DRCb7753512014-05-11 09:36:25 +00001002 jpeg_default_qtables(cinfo, force_baseline);
Guido Vollbeding5996a252009-06-27 00:00:00 +00001003
DRCb7753512014-05-11 09:36:25 +00001004 CAUTION: Setting separate quality levels for chrominance and luminance
1005 is mainly only useful if chrominance subsampling is disabled. 2x2
1006 chrominance subsampling (AKA "4:2:0") is the default, but you can
1007 explicitly disable subsampling as follows:
Guido Vollbeding5996a252009-06-27 00:00:00 +00001008
DRCb7753512014-05-11 09:36:25 +00001009 cinfo->comp_info[0].v_samp_factor = 1;
1010 cinfo->comp_info[0].h_samp_factor = 1;
Guido Vollbeding5996a252009-06-27 00:00:00 +00001011
Thomas G. Lane36a4ccc1994-09-24 00:00:00 +00001012JHUFF_TBL * dc_huff_tbl_ptrs[NUM_HUFF_TBLS]
1013JHUFF_TBL * ac_huff_tbl_ptrs[NUM_HUFF_TBLS]
DRCb7753512014-05-11 09:36:25 +00001014 Pointers to Huffman coding tables, one per table slot, or NULL if
1015 no table is defined for a slot. Slots 0 and 1 are filled with the
1016 JPEG sample tables by jpeg_set_defaults(). If you need to allocate
1017 more table structures, jpeg_alloc_huff_table() may be used.
1018 Note that optimal Huffman tables can be computed for an image
1019 by setting optimize_coding, as discussed above; there's seldom
1020 any need to mess with providing your own Huffman tables.
Thomas G. Lane36a4ccc1994-09-24 00:00:00 +00001021
Guido Vollbeding5996a252009-06-27 00:00:00 +00001022
DRC30913542012-01-27 09:53:33 +00001023[libjpeg v7+ API/ABI emulation only]
1024The actual dimensions of the JPEG image that will be written to the file are
1025given by the following fields. These are computed from the input image
1026dimensions and the compression parameters by jpeg_start_compress(). You can
1027also call jpeg_calc_jpeg_dimensions() to obtain the values that will result
Guido Vollbeding5996a252009-06-27 00:00:00 +00001028from the current parameter settings. This can be useful if you are trying
1029to pick a scaling ratio that will get close to a desired target size.
Guido Vollbeding5996a252009-06-27 00:00:00 +00001030
DRCb7753512014-05-11 09:36:25 +00001031JDIMENSION jpeg_width Actual dimensions of output image.
Guido Vollbeding5996a252009-06-27 00:00:00 +00001032JDIMENSION jpeg_height
Thomas G. Lane36a4ccc1994-09-24 00:00:00 +00001033
1034
Thomas G. Lane36a4ccc1994-09-24 00:00:00 +00001035Per-component parameters are stored in the struct cinfo.comp_info[i] for
1036component number i. Note that components here refer to components of the
1037JPEG color space, *not* the source image color space. A suitably large
1038comp_info[] array is allocated by jpeg_set_defaults(); if you choose not
1039to use that routine, it's up to you to allocate the array.
1040
1041int component_id
DRCb7753512014-05-11 09:36:25 +00001042 The one-byte identifier code to be recorded in the JPEG file for
1043 this component. For the standard color spaces, we recommend you
1044 leave the default values alone.
Thomas G. Lane36a4ccc1994-09-24 00:00:00 +00001045
1046int h_samp_factor
1047int v_samp_factor
DRCb7753512014-05-11 09:36:25 +00001048 Horizontal and vertical sampling factors for the component; must
1049 be 1..4 according to the JPEG standard. Note that larger sampling
1050 factors indicate a higher-resolution component; many people find
1051 this behavior quite unintuitive. The default values are 2,2 for
1052 luminance components and 1,1 for chrominance components, except
1053 for grayscale where 1,1 is used.
Thomas G. Lane36a4ccc1994-09-24 00:00:00 +00001054
1055int quant_tbl_no
DRCb7753512014-05-11 09:36:25 +00001056 Quantization table number for component. The default value is
1057 0 for luminance components and 1 for chrominance components.
Thomas G. Lane36a4ccc1994-09-24 00:00:00 +00001058
1059int dc_tbl_no
1060int ac_tbl_no
DRCb7753512014-05-11 09:36:25 +00001061 DC and AC entropy coding table numbers. The default values are
1062 0 for luminance components and 1 for chrominance components.
Thomas G. Lane36a4ccc1994-09-24 00:00:00 +00001063
1064int component_index
DRCb7753512014-05-11 09:36:25 +00001065 Must equal the component's index in comp_info[]. (Beginning in
1066 release v6, the compressor library will fill this in automatically;
1067 you don't have to.)
Thomas G. Lane36a4ccc1994-09-24 00:00:00 +00001068
1069
1070Decompression parameter selection
1071---------------------------------
1072
1073Decompression parameter selection is somewhat simpler than compression
1074parameter selection, since all of the JPEG internal parameters are
1075recorded in the source file and need not be supplied by the application.
1076(Unless you are working with abbreviated files, in which case see
1077"Abbreviated datastreams", below.) Decompression parameters control
1078the postprocessing done on the image to deliver it in a format suitable
1079for the application's use. Many of the parameters control speed/quality
1080tradeoffs, in which faster decompression may be obtained at the price of
1081a poorer-quality image. The defaults select the highest quality (slowest)
1082processing.
1083
1084The following fields in the JPEG object are set by jpeg_read_header() and
1085may be useful to the application in choosing decompression parameters:
1086
DRCb7753512014-05-11 09:36:25 +00001087JDIMENSION image_width Width and height of image
Thomas G. Lane36a4ccc1994-09-24 00:00:00 +00001088JDIMENSION image_height
DRCb7753512014-05-11 09:36:25 +00001089int num_components Number of color components
1090J_COLOR_SPACE jpeg_color_space Colorspace of image
1091boolean saw_JFIF_marker TRUE if a JFIF APP0 marker was seen
1092 UINT8 JFIF_major_version Version information from JFIF marker
Thomas G. Lane5ead57a1998-03-27 00:00:00 +00001093 UINT8 JFIF_minor_version
DRCb7753512014-05-11 09:36:25 +00001094 UINT8 density_unit Resolution data from JFIF marker
Thomas G. Lane36a4ccc1994-09-24 00:00:00 +00001095 UINT16 X_density
1096 UINT16 Y_density
DRCb7753512014-05-11 09:36:25 +00001097boolean saw_Adobe_marker TRUE if an Adobe APP14 marker was seen
1098 UINT8 Adobe_transform Color transform code from Adobe marker
Thomas G. Lane36a4ccc1994-09-24 00:00:00 +00001099
1100The JPEG color space, unfortunately, is something of a guess since the JPEG
1101standard proper does not provide a way to record it. In practice most files
1102adhere to the JFIF or Adobe conventions, and the decoder will recognize these
1103correctly. See "Special color spaces", below, for more info.
1104
1105
1106The decompression parameters that determine the basic properties of the
1107returned image are:
1108
1109J_COLOR_SPACE out_color_space
DRCb7753512014-05-11 09:36:25 +00001110 Output color space. jpeg_read_header() sets an appropriate default
1111 based on jpeg_color_space; typically it will be RGB or grayscale.
1112 The application can change this field to request output in a different
1113 colorspace. For example, set it to JCS_GRAYSCALE to get grayscale
1114 output from a color file. (This is useful for previewing: grayscale
1115 output is faster than full color since the color components need not
1116 be processed.) Note that not all possible color space transforms are
1117 currently implemented; you may need to extend jdcolor.c if you want an
1118 unusual conversion.
Thomas G. Lane36a4ccc1994-09-24 00:00:00 +00001119
1120unsigned int scale_num, scale_denom
DRCb7753512014-05-11 09:36:25 +00001121 Scale the image by the fraction scale_num/scale_denom. Default is
1122 1/1, or no scaling. Currently, the only supported scaling ratios
1123 are M/8 with all M from 1 to 16, or any reduced fraction thereof (such
1124 as 1/2, 3/4, etc.) (The library design allows for arbitrary
1125 scaling ratios but this is not likely to be implemented any time soon.)
1126 Smaller scaling ratios permit significantly faster decoding since
1127 fewer pixels need be processed and a simpler IDCT method can be used.
Thomas G. Lane36a4ccc1994-09-24 00:00:00 +00001128
1129boolean quantize_colors
DRCb7753512014-05-11 09:36:25 +00001130 If set TRUE, colormapped output will be delivered. Default is FALSE,
1131 meaning that full-color output will be delivered.
Thomas G. Lane36a4ccc1994-09-24 00:00:00 +00001132
1133The next three parameters are relevant only if quantize_colors is TRUE.
1134
1135int desired_number_of_colors
DRCb7753512014-05-11 09:36:25 +00001136 Maximum number of colors to use in generating a library-supplied color
1137 map (the actual number of colors is returned in a different field).
1138 Default 256. Ignored when the application supplies its own color map.
Thomas G. Lane36a4ccc1994-09-24 00:00:00 +00001139
1140boolean two_pass_quantize
DRCb7753512014-05-11 09:36:25 +00001141 If TRUE, an extra pass over the image is made to select a custom color
1142 map for the image. This usually looks a lot better than the one-size-
1143 fits-all colormap that is used otherwise. Default is TRUE. Ignored
1144 when the application supplies its own color map.
Thomas G. Lane36a4ccc1994-09-24 00:00:00 +00001145
1146J_DITHER_MODE dither_mode
DRCb7753512014-05-11 09:36:25 +00001147 Selects color dithering method. Supported values are:
1148 JDITHER_NONE no dithering: fast, very low quality
1149 JDITHER_ORDERED ordered dither: moderate speed and quality
1150 JDITHER_FS Floyd-Steinberg dither: slow, high quality
1151 Default is JDITHER_FS. (At present, ordered dither is implemented
1152 only in the single-pass, standard-colormap case. If you ask for
1153 ordered dither when two_pass_quantize is TRUE or when you supply
1154 an external color map, you'll get F-S dithering.)
Thomas G. Lane36a4ccc1994-09-24 00:00:00 +00001155
1156When quantize_colors is TRUE, the target color map is described by the next
1157two fields. colormap is set to NULL by jpeg_read_header(). The application
1158can supply a color map by setting colormap non-NULL and setting
1159actual_number_of_colors to the map size. Otherwise, jpeg_start_decompress()
1160selects a suitable color map and sets these two fields itself.
1161[Implementation restriction: at present, an externally supplied colormap is
1162only accepted for 3-component output color spaces.]
1163
1164JSAMPARRAY colormap
DRCb7753512014-05-11 09:36:25 +00001165 The color map, represented as a 2-D pixel array of out_color_components
1166 rows and actual_number_of_colors columns. Ignored if not quantizing.
1167 CAUTION: if the JPEG library creates its own colormap, the storage
1168 pointed to by this field is released by jpeg_finish_decompress().
1169 Copy the colormap somewhere else first, if you want to save it.
Thomas G. Lane36a4ccc1994-09-24 00:00:00 +00001170
1171int actual_number_of_colors
DRCb7753512014-05-11 09:36:25 +00001172 The number of colors in the color map.
Thomas G. Lane36a4ccc1994-09-24 00:00:00 +00001173
1174Additional decompression parameters that the application may set include:
1175
1176J_DCT_METHOD dct_method
DRC8940e6c2014-05-11 09:46:28 +00001177 Selects the algorithm used for the DCT step. Choices are:
1178 JDCT_ISLOW: slow but accurate integer algorithm
1179 JDCT_IFAST: faster, less accurate integer method
1180 JDCT_FLOAT: floating-point method
1181 JDCT_DEFAULT: default method (normally JDCT_ISLOW)
1182 JDCT_FASTEST: fastest method (normally JDCT_IFAST)
1183 In libjpeg-turbo, JDCT_IFAST is generally about 5-15% faster than
1184 JDCT_ISLOW when using the x86/x86-64 SIMD extensions (results may vary
1185 with other SIMD implementations, or when using libjpeg-turbo without
1186 SIMD extensions.) If the JPEG image was compressed using a quality
1187 level of 85 or below, then there should be little or no perceptible
1188 difference between the two algorithms. When decompressing images that
1189 were compressed using quality levels above 85, however, the difference
1190 between JDCT_IFAST and JDCT_ISLOW becomes more pronounced. With images
1191 compressed using quality=97, for instance, JDCT_IFAST incurs generally
1192 about a 4-6 dB loss (in PSNR) relative to JDCT_ISLOW, but this can be
1193 larger for some images. If you can avoid it, do not use JDCT_IFAST
1194 when decompressing images that were compressed using quality levels
1195 above 97. The algorithm often degenerates for such images and can
1196 actually produce a more lossy output image than if the JPEG image had
DRC05524e62014-05-11 23:14:43 +00001197 been compressed using lower quality levels. JDCT_FLOAT is mainly a
DRC8940e6c2014-05-11 09:46:28 +00001198 legacy feature. It does not produce significantly more accurate
1199 results than the ISLOW method, and it is much slower. The FLOAT method
1200 may also give different results on different machines due to varying
1201 roundoff behavior, whereas the integer methods should give the same
1202 results on all machines.
Thomas G. Lane36a4ccc1994-09-24 00:00:00 +00001203
1204boolean do_fancy_upsampling
DRCb7753512014-05-11 09:36:25 +00001205 If TRUE, do careful upsampling of chroma components. If FALSE,
1206 a faster but sloppier method is used. Default is TRUE. The visual
1207 impact of the sloppier method is often very small.
Thomas G. Lane36a4ccc1994-09-24 00:00:00 +00001208
Thomas G. Lanebc79e061995-08-02 00:00:00 +00001209boolean do_block_smoothing
DRCb7753512014-05-11 09:36:25 +00001210 If TRUE, interblock smoothing is applied in early stages of decoding
1211 progressive JPEG files; if FALSE, not. Default is TRUE. Early
1212 progression stages look "fuzzy" with smoothing, "blocky" without.
1213 In any case, block smoothing ceases to be applied after the first few
1214 AC coefficients are known to full accuracy, so it is relevant only
1215 when using buffered-image mode for progressive images.
Thomas G. Lanebc79e061995-08-02 00:00:00 +00001216
1217boolean enable_1pass_quant
1218boolean enable_external_quant
1219boolean enable_2pass_quant
DRCb7753512014-05-11 09:36:25 +00001220 These are significant only in buffered-image mode, which is
1221 described in its own section below.
Thomas G. Lanebc79e061995-08-02 00:00:00 +00001222
Thomas G. Lane36a4ccc1994-09-24 00:00:00 +00001223
1224The output image dimensions are given by the following fields. These are
1225computed from the source image dimensions and the decompression parameters
1226by jpeg_start_decompress(). You can also call jpeg_calc_output_dimensions()
1227to obtain the values that will result from the current parameter settings.
1228This can be useful if you are trying to pick a scaling ratio that will get
1229close to a desired target size. It's also important if you are using the
1230JPEG library's memory manager to allocate output buffer space, because you
1231are supposed to request such buffers *before* jpeg_start_decompress().
1232
DRCb7753512014-05-11 09:36:25 +00001233JDIMENSION output_width Actual dimensions of output image.
Thomas G. Lane36a4ccc1994-09-24 00:00:00 +00001234JDIMENSION output_height
DRCb7753512014-05-11 09:36:25 +00001235int out_color_components Number of color components in out_color_space.
1236int output_components Number of color components returned.
1237int rec_outbuf_height Recommended height of scanline buffer.
Thomas G. Lane36a4ccc1994-09-24 00:00:00 +00001238
1239When quantizing colors, output_components is 1, indicating a single color map
1240index per pixel. Otherwise it equals out_color_components. The output arrays
1241are required to be output_width * output_components JSAMPLEs wide.
1242
1243rec_outbuf_height is the recommended minimum height (in scanlines) of the
1244buffer passed to jpeg_read_scanlines(). If the buffer is smaller, the
1245library will still work, but time will be wasted due to unnecessary data
1246copying. In high-quality modes, rec_outbuf_height is always 1, but some
1247faster, lower-quality modes set it to larger values (typically 2 to 4).
1248If you are going to ask for a high-speed processing mode, you may as well
1249go to the trouble of honoring rec_outbuf_height so as to avoid data copying.
Thomas G. Lane5ead57a1998-03-27 00:00:00 +00001250(An output buffer larger than rec_outbuf_height lines is OK, but won't
1251provide any material speed improvement over that height.)
Thomas G. Lane36a4ccc1994-09-24 00:00:00 +00001252
1253
1254Special color spaces
1255--------------------
1256
1257The JPEG standard itself is "color blind" and doesn't specify any particular
1258color space. It is customary to convert color data to a luminance/chrominance
1259color space before compressing, since this permits greater compression. The
1260existing de-facto JPEG file format standards specify YCbCr or grayscale data
1261(JFIF), or grayscale, RGB, YCbCr, CMYK, or YCCK (Adobe). For special
1262applications such as multispectral images, other color spaces can be used,
1263but it must be understood that such files will be unportable.
1264
1265The JPEG library can handle the most common colorspace conversions (namely
1266RGB <=> YCbCr and CMYK <=> YCCK). It can also deal with data of an unknown
1267color space, passing it through without conversion. If you deal extensively
1268with an unusual color space, you can easily extend the library to understand
1269additional color spaces and perform appropriate conversions.
1270
1271For compression, the source data's color space is specified by field
1272in_color_space. This is transformed to the JPEG file's color space given
1273by jpeg_color_space. jpeg_set_defaults() chooses a reasonable JPEG color
1274space depending on in_color_space, but you can override this by calling
1275jpeg_set_colorspace(). Of course you must select a supported transformation.
1276jccolor.c currently supports the following transformations:
DRCb7753512014-05-11 09:36:25 +00001277 RGB => YCbCr
1278 RGB => GRAYSCALE
1279 YCbCr => GRAYSCALE
1280 CMYK => YCCK
Thomas G. Lane36a4ccc1994-09-24 00:00:00 +00001281plus the null transforms: GRAYSCALE => GRAYSCALE, RGB => RGB,
1282YCbCr => YCbCr, CMYK => CMYK, YCCK => YCCK, and UNKNOWN => UNKNOWN.
1283
1284The de-facto file format standards (JFIF and Adobe) specify APPn markers that
1285indicate the color space of the JPEG file. It is important to ensure that
1286these are written correctly, or omitted if the JPEG file's color space is not
1287one of the ones supported by the de-facto standards. jpeg_set_colorspace()
1288will set the compression parameters to include or omit the APPn markers
1289properly, so long as it is told the truth about the JPEG color space.
1290For example, if you are writing some random 3-component color space without
1291conversion, don't try to fake out the library by setting in_color_space and
1292jpeg_color_space to JCS_YCbCr; use JCS_UNKNOWN. You may want to write an
1293APPn marker of your own devising to identify the colorspace --- see "Special
1294markers", below.
1295
1296When told that the color space is UNKNOWN, the library will default to using
1297luminance-quality compression parameters for all color components. You may
1298well want to change these parameters. See the source code for
1299jpeg_set_colorspace(), in jcparam.c, for details.
1300
1301For decompression, the JPEG file's color space is given in jpeg_color_space,
1302and this is transformed to the output color space out_color_space.
1303jpeg_read_header's setting of jpeg_color_space can be relied on if the file
1304conforms to JFIF or Adobe conventions, but otherwise it is no better than a
1305guess. If you know the JPEG file's color space for certain, you can override
1306jpeg_read_header's guess by setting jpeg_color_space. jpeg_read_header also
1307selects a default output color space based on (its guess of) jpeg_color_space;
1308set out_color_space to override this. Again, you must select a supported
1309transformation. jdcolor.c currently supports
DRCb7753512014-05-11 09:36:25 +00001310 YCbCr => RGB
1311 YCbCr => GRAYSCALE
1312 RGB => GRAYSCALE
1313 GRAYSCALE => RGB
1314 YCCK => CMYK
Thomas G. Lane5ead57a1998-03-27 00:00:00 +00001315as well as the null transforms. (Since GRAYSCALE=>RGB is provided, an
1316application can force grayscale JPEGs to look like color JPEGs if it only
1317wants to handle one case.)
Thomas G. Lane36a4ccc1994-09-24 00:00:00 +00001318
1319The two-pass color quantizer, jquant2.c, is specialized to handle RGB data
1320(it weights distances appropriately for RGB colors). You'll need to modify
1321the code if you want to use it for non-RGB output color spaces. Note that
1322jquant2.c is used to map to an application-supplied colormap as well as for
1323the normal two-pass colormap selection process.
1324
1325CAUTION: it appears that Adobe Photoshop writes inverted data in CMYK JPEG
1326files: 0 represents 100% ink coverage, rather than 0% ink as you'd expect.
1327This is arguably a bug in Photoshop, but if you need to work with Photoshop
1328CMYK files, you will have to deal with it in your application. We cannot
1329"fix" this in the library by inverting the data during the CMYK<=>YCCK
1330transform, because that would break other applications, notably Ghostscript.
1331Photoshop versions prior to 3.0 write EPS files containing JPEG-encoded CMYK
1332data in the same inverted-YCCK representation used in bare JPEG files, but
1333the surrounding PostScript code performs an inversion using the PS image
1334operator. I am told that Photoshop 3.0 will write uninverted YCCK in
1335EPS/JPEG files, and will omit the PS-level inversion. (But the data
1336polarity used in bare JPEG files will not change in 3.0.) In either case,
1337the JPEG library must not invert the data itself, or else Ghostscript would
1338read these EPS files incorrectly.
1339
1340
1341Error handling
1342--------------
1343
1344When the default error handler is used, any error detected inside the JPEG
1345routines will cause a message to be printed on stderr, followed by exit().
1346You can supply your own error handling routines to override this behavior
1347and to control the treatment of nonfatal warnings and trace/debug messages.
1348The file example.c illustrates the most common case, which is to have the
1349application regain control after an error rather than exiting.
1350
1351The JPEG library never writes any message directly; it always goes through
1352the error handling routines. Three classes of messages are recognized:
1353 * Fatal errors: the library cannot continue.
1354 * Warnings: the library can continue, but the data is corrupt, and a
1355 damaged output image is likely to result.
1356 * Trace/informational messages. These come with a trace level indicating
1357 the importance of the message; you can control the verbosity of the
1358 program by adjusting the maximum trace level that will be displayed.
1359
1360You may, if you wish, simply replace the entire JPEG error handling module
1361(jerror.c) with your own code. However, you can avoid code duplication by
1362only replacing some of the routines depending on the behavior you need.
1363This is accomplished by calling jpeg_std_error() as usual, but then overriding
1364some of the method pointers in the jpeg_error_mgr struct, as illustrated by
1365example.c.
1366
1367All of the error handling routines will receive a pointer to the JPEG object
1368(a j_common_ptr which points to either a jpeg_compress_struct or a
1369jpeg_decompress_struct; if you need to tell which, test the is_decompressor
1370field). This struct includes a pointer to the error manager struct in its
1371"err" field. Frequently, custom error handler routines will need to access
1372additional data which is not known to the JPEG library or the standard error
1373handler. The most convenient way to do this is to embed either the JPEG
1374object or the jpeg_error_mgr struct in a larger structure that contains
1375additional fields; then casting the passed pointer provides access to the
Thomas G. Lane5ead57a1998-03-27 00:00:00 +00001376additional fields. Again, see example.c for one way to do it. (Beginning
1377with IJG version 6b, there is also a void pointer "client_data" in each
1378JPEG object, which the application can also use to find related data.
1379The library does not touch client_data at all.)
Thomas G. Lane36a4ccc1994-09-24 00:00:00 +00001380
1381The individual methods that you might wish to override are:
1382
1383error_exit (j_common_ptr cinfo)
DRCb7753512014-05-11 09:36:25 +00001384 Receives control for a fatal error. Information sufficient to
1385 generate the error message has been stored in cinfo->err; call
1386 output_message to display it. Control must NOT return to the caller;
1387 generally this routine will exit() or longjmp() somewhere.
1388 Typically you would override this routine to get rid of the exit()
1389 default behavior. Note that if you continue processing, you should
1390 clean up the JPEG object with jpeg_abort() or jpeg_destroy().
Thomas G. Lane36a4ccc1994-09-24 00:00:00 +00001391
1392output_message (j_common_ptr cinfo)
DRCb7753512014-05-11 09:36:25 +00001393 Actual output of any JPEG message. Override this to send messages
1394 somewhere other than stderr. Note that this method does not know
1395 how to generate a message, only where to send it.
Thomas G. Lane36a4ccc1994-09-24 00:00:00 +00001396
1397format_message (j_common_ptr cinfo, char * buffer)
DRCb7753512014-05-11 09:36:25 +00001398 Constructs a readable error message string based on the error info
1399 stored in cinfo->err. This method is called by output_message. Few
1400 applications should need to override this method. One possible
1401 reason for doing so is to implement dynamic switching of error message
1402 language.
Thomas G. Lane36a4ccc1994-09-24 00:00:00 +00001403
1404emit_message (j_common_ptr cinfo, int msg_level)
DRCb7753512014-05-11 09:36:25 +00001405 Decide whether or not to emit a warning or trace message; if so,
1406 calls output_message. The main reason for overriding this method
1407 would be to abort on warnings. msg_level is -1 for warnings,
1408 0 and up for trace messages.
Thomas G. Lane36a4ccc1994-09-24 00:00:00 +00001409
1410Only error_exit() and emit_message() are called from the rest of the JPEG
1411library; the other two are internal to the error handler.
1412
1413The actual message texts are stored in an array of strings which is pointed to
1414by the field err->jpeg_message_table. The messages are numbered from 0 to
1415err->last_jpeg_message, and it is these code numbers that are used in the
1416JPEG library code. You could replace the message texts (for instance, with
1417messages in French or German) by changing the message table pointer. See
1418jerror.h for the default texts. CAUTION: this table will almost certainly
1419change or grow from one library version to the next.
1420
1421It may be useful for an application to add its own message texts that are
1422handled by the same mechanism. The error handler supports a second "add-on"
1423message table for this purpose. To define an addon table, set the pointer
1424err->addon_message_table and the message numbers err->first_addon_message and
1425err->last_addon_message. If you number the addon messages beginning at 1000
1426or so, you won't have to worry about conflicts with the library's built-in
1427messages. See the sample applications cjpeg/djpeg for an example of using
1428addon messages (the addon messages are defined in cderror.h).
1429
1430Actual invocation of the error handler is done via macros defined in jerror.h:
DRCb7753512014-05-11 09:36:25 +00001431 ERREXITn(...) for fatal errors
1432 WARNMSn(...) for corrupt-data warnings
1433 TRACEMSn(...) for trace and informational messages.
Thomas G. Lane36a4ccc1994-09-24 00:00:00 +00001434These macros store the message code and any additional parameters into the
1435error handler struct, then invoke the error_exit() or emit_message() method.
1436The variants of each macro are for varying numbers of additional parameters.
1437The additional parameters are inserted into the generated message using
1438standard printf() format codes.
1439
1440See jerror.h and jerror.c for further details.
1441
1442
1443Compressed data handling (source and destination managers)
1444----------------------------------------------------------
1445
1446The JPEG compression library sends its compressed data to a "destination
1447manager" module. The default destination manager just writes the data to a
Guido Vollbeding989630f2010-01-10 00:00:00 +00001448memory buffer or to a stdio stream, but you can provide your own manager to
1449do something else. Similarly, the decompression library calls a "source
1450manager" to obtain the compressed data; you can provide your own source
1451manager if you want the data to come from somewhere other than a memory
1452buffer or a stdio stream.
Thomas G. Lane36a4ccc1994-09-24 00:00:00 +00001453
1454In both cases, compressed data is processed a bufferload at a time: the
1455destination or source manager provides a work buffer, and the library invokes
1456the manager only when the buffer is filled or emptied. (You could define a
1457one-character buffer to force the manager to be invoked for each byte, but
1458that would be rather inefficient.) The buffer's size and location are
Guido Vollbeding989630f2010-01-10 00:00:00 +00001459controlled by the manager, not by the library. For example, the memory
1460source manager just makes the buffer pointer and length point to the original
1461data in memory. In this case the buffer-reload procedure will be invoked
1462only if the decompressor ran off the end of the datastream, which would
1463indicate an erroneous datastream.
Thomas G. Lane36a4ccc1994-09-24 00:00:00 +00001464
1465The work buffer is defined as an array of datatype JOCTET, which is generally
1466"char" or "unsigned char". On a machine where char is not exactly 8 bits
1467wide, you must define JOCTET as a wider data type and then modify the data
1468source and destination modules to transcribe the work arrays into 8-bit units
1469on external storage.
1470
1471A data destination manager struct contains a pointer and count defining the
1472next byte to write in the work buffer and the remaining free space:
1473
DRCb7753512014-05-11 09:36:25 +00001474 JOCTET * next_output_byte; /* => next byte to write in buffer */
1475 size_t free_in_buffer; /* # of byte spaces remaining in buffer */
Thomas G. Lane36a4ccc1994-09-24 00:00:00 +00001476
1477The library increments the pointer and decrements the count until the buffer
1478is filled. The manager's empty_output_buffer method must reset the pointer
1479and count. The manager is expected to remember the buffer's starting address
1480and total size in private fields not visible to the library.
1481
1482A data destination manager provides three methods:
1483
1484init_destination (j_compress_ptr cinfo)
DRCb7753512014-05-11 09:36:25 +00001485 Initialize destination. This is called by jpeg_start_compress()
1486 before any data is actually written. It must initialize
1487 next_output_byte and free_in_buffer. free_in_buffer must be
1488 initialized to a positive value.
Thomas G. Lane36a4ccc1994-09-24 00:00:00 +00001489
1490empty_output_buffer (j_compress_ptr cinfo)
DRCb7753512014-05-11 09:36:25 +00001491 This is called whenever the buffer has filled (free_in_buffer
1492 reaches zero). In typical applications, it should write out the
1493 *entire* buffer (use the saved start address and buffer length;
1494 ignore the current state of next_output_byte and free_in_buffer).
1495 Then reset the pointer & count to the start of the buffer, and
1496 return TRUE indicating that the buffer has been dumped.
1497 free_in_buffer must be set to a positive value when TRUE is
1498 returned. A FALSE return should only be used when I/O suspension is
1499 desired (this operating mode is discussed in the next section).
Thomas G. Lane36a4ccc1994-09-24 00:00:00 +00001500
1501term_destination (j_compress_ptr cinfo)
DRCb7753512014-05-11 09:36:25 +00001502 Terminate destination --- called by jpeg_finish_compress() after all
1503 data has been written. In most applications, this must flush any
1504 data remaining in the buffer. Use either next_output_byte or
1505 free_in_buffer to determine how much data is in the buffer.
Thomas G. Lane36a4ccc1994-09-24 00:00:00 +00001506
1507term_destination() is NOT called by jpeg_abort() or jpeg_destroy(). If you
1508want the destination manager to be cleaned up during an abort, you must do it
1509yourself.
1510
1511You will also need code to create a jpeg_destination_mgr struct, fill in its
1512method pointers, and insert a pointer to the struct into the "dest" field of
1513the JPEG compression object. This can be done in-line in your setup code if
1514you like, but it's probably cleaner to provide a separate routine similar to
Guido Vollbeding989630f2010-01-10 00:00:00 +00001515the jpeg_stdio_dest() or jpeg_mem_dest() routines of the supplied destination
1516managers.
Thomas G. Lane36a4ccc1994-09-24 00:00:00 +00001517
1518Decompression source managers follow a parallel design, but with some
1519additional frammishes. The source manager struct contains a pointer and count
1520defining the next byte to read from the work buffer and the number of bytes
1521remaining:
1522
DRCb7753512014-05-11 09:36:25 +00001523 const JOCTET * next_input_byte; /* => next byte to read from buffer */
1524 size_t bytes_in_buffer; /* # of bytes remaining in buffer */
Thomas G. Lane36a4ccc1994-09-24 00:00:00 +00001525
1526The library increments the pointer and decrements the count until the buffer
1527is emptied. The manager's fill_input_buffer method must reset the pointer and
1528count. In most applications, the manager must remember the buffer's starting
1529address and total size in private fields not visible to the library.
1530
1531A data source manager provides five methods:
1532
1533init_source (j_decompress_ptr cinfo)
DRCb7753512014-05-11 09:36:25 +00001534 Initialize source. This is called by jpeg_read_header() before any
1535 data is actually read. Unlike init_destination(), it may leave
1536 bytes_in_buffer set to 0 (in which case a fill_input_buffer() call
1537 will occur immediately).
Thomas G. Lane36a4ccc1994-09-24 00:00:00 +00001538
1539fill_input_buffer (j_decompress_ptr cinfo)
DRCb7753512014-05-11 09:36:25 +00001540 This is called whenever bytes_in_buffer has reached zero and more
1541 data is wanted. In typical applications, it should read fresh data
1542 into the buffer (ignoring the current state of next_input_byte and
1543 bytes_in_buffer), reset the pointer & count to the start of the
1544 buffer, and return TRUE indicating that the buffer has been reloaded.
1545 It is not necessary to fill the buffer entirely, only to obtain at
1546 least one more byte. bytes_in_buffer MUST be set to a positive value
1547 if TRUE is returned. A FALSE return should only be used when I/O
1548 suspension is desired (this mode is discussed in the next section).
Thomas G. Lane36a4ccc1994-09-24 00:00:00 +00001549
1550skip_input_data (j_decompress_ptr cinfo, long num_bytes)
DRCb7753512014-05-11 09:36:25 +00001551 Skip num_bytes worth of data. The buffer pointer and count should
1552 be advanced over num_bytes input bytes, refilling the buffer as
1553 needed. This is used to skip over a potentially large amount of
1554 uninteresting data (such as an APPn marker). In some applications
1555 it may be possible to optimize away the reading of the skipped data,
1556 but it's not clear that being smart is worth much trouble; large
1557 skips are uncommon. bytes_in_buffer may be zero on return.
1558 A zero or negative skip count should be treated as a no-op.
Thomas G. Lane36a4ccc1994-09-24 00:00:00 +00001559
Thomas G. Lanebc79e061995-08-02 00:00:00 +00001560resync_to_restart (j_decompress_ptr cinfo, int desired)
DRCb7753512014-05-11 09:36:25 +00001561 This routine is called only when the decompressor has failed to find
1562 a restart (RSTn) marker where one is expected. Its mission is to
1563 find a suitable point for resuming decompression. For most
1564 applications, we recommend that you just use the default resync
1565 procedure, jpeg_resync_to_restart(). However, if you are able to back
1566 up in the input data stream, or if you have a-priori knowledge about
1567 the likely location of restart markers, you may be able to do better.
1568 Read the read_restart_marker() and jpeg_resync_to_restart() routines
1569 in jdmarker.c if you think you'd like to implement your own resync
1570 procedure.
Thomas G. Lane36a4ccc1994-09-24 00:00:00 +00001571
1572term_source (j_decompress_ptr cinfo)
DRCb7753512014-05-11 09:36:25 +00001573 Terminate source --- called by jpeg_finish_decompress() after all
1574 data has been read. Often a no-op.
Thomas G. Lane36a4ccc1994-09-24 00:00:00 +00001575
1576For both fill_input_buffer() and skip_input_data(), there is no such thing
1577as an EOF return. If the end of the file has been reached, the routine has
1578a choice of exiting via ERREXIT() or inserting fake data into the buffer.
1579In most cases, generating a warning message and inserting a fake EOI marker
1580is the best course of action --- this will allow the decompressor to output
1581however much of the image is there. In pathological cases, the decompressor
1582may swallow the EOI and again demand data ... just keep feeding it fake EOIs.
1583jdatasrc.c illustrates the recommended error recovery behavior.
1584
1585term_source() is NOT called by jpeg_abort() or jpeg_destroy(). If you want
1586the source manager to be cleaned up during an abort, you must do it yourself.
1587
1588You will also need code to create a jpeg_source_mgr struct, fill in its method
1589pointers, and insert a pointer to the struct into the "src" field of the JPEG
1590decompression object. This can be done in-line in your setup code if you
1591like, but it's probably cleaner to provide a separate routine similar to the
Guido Vollbeding989630f2010-01-10 00:00:00 +00001592jpeg_stdio_src() or jpeg_mem_src() routines of the supplied source managers.
Thomas G. Lane36a4ccc1994-09-24 00:00:00 +00001593
Guido Vollbeding989630f2010-01-10 00:00:00 +00001594For more information, consult the memory and stdio source and destination
1595managers in jdatasrc.c and jdatadst.c.
Thomas G. Lane36a4ccc1994-09-24 00:00:00 +00001596
1597
1598I/O suspension
1599--------------
1600
1601Some applications need to use the JPEG library as an incremental memory-to-
1602memory filter: when the compressed data buffer is filled or emptied, they want
1603control to return to the outer loop, rather than expecting that the buffer can
Thomas G. Lanebc79e061995-08-02 00:00:00 +00001604be emptied or reloaded within the data source/destination manager subroutine.
Thomas G. Lane36a4ccc1994-09-24 00:00:00 +00001605The library supports this need by providing an "I/O suspension" mode, which we
1606describe in this section.
1607
Thomas G. Lanebc79e061995-08-02 00:00:00 +00001608The I/O suspension mode is not a panacea: nothing is guaranteed about the
1609maximum amount of time spent in any one call to the library, so it will not
1610eliminate response-time problems in single-threaded applications. If you
1611need guaranteed response time, we suggest you "bite the bullet" and implement
1612a real multi-tasking capability.
Thomas G. Lane36a4ccc1994-09-24 00:00:00 +00001613
1614To use I/O suspension, cooperation is needed between the calling application
1615and the data source or destination manager; you will always need a custom
1616source/destination manager. (Please read the previous section if you haven't
1617already.) The basic idea is that the empty_output_buffer() or
1618fill_input_buffer() routine is a no-op, merely returning FALSE to indicate
1619that it has done nothing. Upon seeing this, the JPEG library suspends
1620operation and returns to its caller. The surrounding application is
Thomas G. Lanebc79e061995-08-02 00:00:00 +00001621responsible for emptying or refilling the work buffer before calling the
1622JPEG library again.
Thomas G. Lane36a4ccc1994-09-24 00:00:00 +00001623
1624Compression suspension:
1625
Thomas G. Lanebc79e061995-08-02 00:00:00 +00001626For compression suspension, use an empty_output_buffer() routine that returns
1627FALSE; typically it will not do anything else. This will cause the
1628compressor to return to the caller of jpeg_write_scanlines(), with the return
1629value indicating that not all the supplied scanlines have been accepted.
1630The application must make more room in the output buffer, adjust the output
1631buffer pointer/count appropriately, and then call jpeg_write_scanlines()
Thomas G. Lane36a4ccc1994-09-24 00:00:00 +00001632again, pointing to the first unconsumed scanline.
1633
1634When forced to suspend, the compressor will backtrack to a convenient stopping
1635point (usually the start of the current MCU); it will regenerate some output
Thomas G. Lanebc79e061995-08-02 00:00:00 +00001636data when restarted. Therefore, although empty_output_buffer() is only
1637called when the buffer is filled, you should NOT write out the entire buffer
1638after a suspension. Write only the data up to the current position of
1639next_output_byte/free_in_buffer. The data beyond that point will be
1640regenerated after resumption.
Thomas G. Lane36a4ccc1994-09-24 00:00:00 +00001641
1642Because of the backtracking behavior, a good-size output buffer is essential
1643for efficiency; you don't want the compressor to suspend often. (In fact, an
1644overly small buffer could lead to infinite looping, if a single MCU required
1645more data than would fit in the buffer.) We recommend a buffer of at least
1646several Kbytes. You may want to insert explicit code to ensure that you don't
1647call jpeg_write_scanlines() unless there is a reasonable amount of space in
1648the output buffer; in other words, flush the buffer before trying to compress
1649more data.
1650
Thomas G. Lanebc79e061995-08-02 00:00:00 +00001651The compressor does not allow suspension while it is trying to write JPEG
1652markers at the beginning and end of the file. This means that:
Thomas G. Lane36a4ccc1994-09-24 00:00:00 +00001653 * At the beginning of a compression operation, there must be enough free
1654 space in the output buffer to hold the header markers (typically 600 or
1655 so bytes). The recommended buffer size is bigger than this anyway, so
1656 this is not a problem as long as you start with an empty buffer. However,
1657 this restriction might catch you if you insert large special markers, such
Thomas G. Lanebc79e061995-08-02 00:00:00 +00001658 as a JFIF thumbnail image, without flushing the buffer afterwards.
Thomas G. Lane36a4ccc1994-09-24 00:00:00 +00001659 * When you call jpeg_finish_compress(), there must be enough space in the
1660 output buffer to emit any buffered data and the final EOI marker. In the
1661 current implementation, half a dozen bytes should suffice for this, but
1662 for safety's sake we recommend ensuring that at least 100 bytes are free
1663 before calling jpeg_finish_compress().
Thomas G. Lanebc79e061995-08-02 00:00:00 +00001664
1665A more significant restriction is that jpeg_finish_compress() cannot suspend.
1666This means you cannot use suspension with multi-pass operating modes, namely
1667Huffman code optimization and multiple-scan output. Those modes write the
1668whole file during jpeg_finish_compress(), which will certainly result in
1669buffer overrun. (Note that this restriction applies only to compression,
1670not decompression. The decompressor supports input suspension in all of its
1671operating modes.)
Thomas G. Lane36a4ccc1994-09-24 00:00:00 +00001672
1673Decompression suspension:
1674
1675For decompression suspension, use a fill_input_buffer() routine that simply
1676returns FALSE (except perhaps during error recovery, as discussed below).
1677This will cause the decompressor to return to its caller with an indication
Thomas G. Lanebc79e061995-08-02 00:00:00 +00001678that suspension has occurred. This can happen at four places:
Thomas G. Lane36a4ccc1994-09-24 00:00:00 +00001679 * jpeg_read_header(): will return JPEG_SUSPENDED.
Thomas G. Lanebc79e061995-08-02 00:00:00 +00001680 * jpeg_start_decompress(): will return FALSE, rather than its usual TRUE.
Thomas G. Lane36a4ccc1994-09-24 00:00:00 +00001681 * jpeg_read_scanlines(): will return the number of scanlines already
DRCb7753512014-05-11 09:36:25 +00001682 completed (possibly 0).
Thomas G. Lane36a4ccc1994-09-24 00:00:00 +00001683 * jpeg_finish_decompress(): will return FALSE, rather than its usual TRUE.
1684The surrounding application must recognize these cases, load more data into
1685the input buffer, and repeat the call. In the case of jpeg_read_scanlines(),
Thomas G. Lanebc79e061995-08-02 00:00:00 +00001686increment the passed pointers past any scanlines successfully read.
Thomas G. Lane36a4ccc1994-09-24 00:00:00 +00001687
1688Just as with compression, the decompressor will typically backtrack to a
Thomas G. Lanebc79e061995-08-02 00:00:00 +00001689convenient restart point before suspending. When fill_input_buffer() is
1690called, next_input_byte/bytes_in_buffer point to the current restart point,
1691which is where the decompressor will backtrack to if FALSE is returned.
1692The data beyond that position must NOT be discarded if you suspend; it needs
1693to be re-read upon resumption. In most implementations, you'll need to shift
1694this data down to the start of your work buffer and then load more data after
1695it. Again, this behavior means that a several-Kbyte work buffer is essential
1696for decent performance; furthermore, you should load a reasonable amount of
1697new data before resuming decompression. (If you loaded, say, only one new
1698byte each time around, you could waste a LOT of cycles.)
Thomas G. Lane36a4ccc1994-09-24 00:00:00 +00001699
1700The skip_input_data() source manager routine requires special care in a
1701suspension scenario. This routine is NOT granted the ability to suspend the
1702decompressor; it can decrement bytes_in_buffer to zero, but no more. If the
1703requested skip distance exceeds the amount of data currently in the input
1704buffer, then skip_input_data() must set bytes_in_buffer to zero and record the
1705additional skip distance somewhere else. The decompressor will immediately
Thomas G. Lanebc79e061995-08-02 00:00:00 +00001706call fill_input_buffer(), which should return FALSE, which will cause a
Thomas G. Lane36a4ccc1994-09-24 00:00:00 +00001707suspension return. The surrounding application must then arrange to discard
Thomas G. Lanebc79e061995-08-02 00:00:00 +00001708the recorded number of bytes before it resumes loading the input buffer.
1709(Yes, this design is rather baroque, but it avoids complexity in the far more
1710common case where a non-suspending source manager is used.)
Thomas G. Lane36a4ccc1994-09-24 00:00:00 +00001711
1712If the input data has been exhausted, we recommend that you emit a warning
1713and insert dummy EOI markers just as a non-suspending data source manager
1714would do. This can be handled either in the surrounding application logic or
1715within fill_input_buffer(); the latter is probably more efficient. If
1716fill_input_buffer() knows that no more data is available, it can set the
1717pointer/count to point to a dummy EOI marker and then return TRUE just as
1718though it had read more data in a non-suspending situation.
1719
Thomas G. Lane5ead57a1998-03-27 00:00:00 +00001720The decompressor does not attempt to suspend within standard JPEG markers;
1721instead it will backtrack to the start of the marker and reprocess the whole
1722marker next time. Hence the input buffer must be large enough to hold the
1723longest standard marker in the file. Standard JPEG markers should normally
1724not exceed a few hundred bytes each (DHT tables are typically the longest).
1725We recommend at least a 2K buffer for performance reasons, which is much
1726larger than any correct marker is likely to be. For robustness against
1727damaged marker length counts, you may wish to insert a test in your
1728application for the case that the input buffer is completely full and yet
1729the decoder has suspended without consuming any data --- otherwise, if this
1730situation did occur, it would lead to an endless loop. (The library can't
1731provide this test since it has no idea whether "the buffer is full", or
1732even whether there is a fixed-size input buffer.)
1733
1734The input buffer would need to be 64K to allow for arbitrary COM or APPn
1735markers, but these are handled specially: they are either saved into allocated
1736memory, or skipped over by calling skip_input_data(). In the former case,
1737suspension is handled correctly, and in the latter case, the problem of
1738buffer overrun is placed on skip_input_data's shoulders, as explained above.
1739Note that if you provide your own marker handling routine for large markers,
1740you should consider how to deal with buffer overflow.
Thomas G. Lane36a4ccc1994-09-24 00:00:00 +00001741
Thomas G. Lane9ba2f5e1994-12-07 00:00:00 +00001742Multiple-buffer management:
1743
1744In some applications it is desirable to store the compressed data in a linked
1745list of buffer areas, so as to avoid data copying. This can be handled by
1746having empty_output_buffer() or fill_input_buffer() set the pointer and count
1747to reference the next available buffer; FALSE is returned only if no more
1748buffers are available. Although seemingly straightforward, there is a
1749pitfall in this approach: the backtrack that occurs when FALSE is returned
Thomas G. Lanebc79e061995-08-02 00:00:00 +00001750could back up into an earlier buffer. For example, when fill_input_buffer()
1751is called, the current pointer & count indicate the backtrack restart point.
1752Since fill_input_buffer() will set the pointer and count to refer to a new
1753buffer, the restart position must be saved somewhere else. Suppose a second
1754call to fill_input_buffer() occurs in the same library call, and no
1755additional input data is available, so fill_input_buffer must return FALSE.
1756If the JPEG library has not moved the pointer/count forward in the current
1757buffer, then *the correct restart point is the saved position in the prior
1758buffer*. Prior buffers may be discarded only after the library establishes
1759a restart point within a later buffer. Similar remarks apply for output into
1760a chain of buffers.
1761
1762The library will never attempt to backtrack over a skip_input_data() call,
1763so any skipped data can be permanently discarded. You still have to deal
1764with the case of skipping not-yet-received data, however.
1765
1766It's much simpler to use only a single buffer; when fill_input_buffer() is
1767called, move any unconsumed data (beyond the current pointer/count) down to
1768the beginning of this buffer and then load new data into the remaining buffer
1769space. This approach requires a little more data copying but is far easier
1770to get right.
1771
1772
1773Progressive JPEG support
1774------------------------
1775
1776Progressive JPEG rearranges the stored data into a series of scans of
1777increasing quality. In situations where a JPEG file is transmitted across a
1778slow communications link, a decoder can generate a low-quality image very
1779quickly from the first scan, then gradually improve the displayed quality as
1780more scans are received. The final image after all scans are complete is
1781identical to that of a regular (sequential) JPEG file of the same quality
1782setting. Progressive JPEG files are often slightly smaller than equivalent
1783sequential JPEG files, but the possibility of incremental display is the main
1784reason for using progressive JPEG.
1785
1786The IJG encoder library generates progressive JPEG files when given a
1787suitable "scan script" defining how to divide the data into scans.
1788Creation of progressive JPEG files is otherwise transparent to the encoder.
1789Progressive JPEG files can also be read transparently by the decoder library.
1790If the decoding application simply uses the library as defined above, it
1791will receive a final decoded image without any indication that the file was
1792progressive. Of course, this approach does not allow incremental display.
1793To perform incremental display, an application needs to use the decoder
1794library's "buffered-image" mode, in which it receives a decoded image
1795multiple times.
1796
1797Each displayed scan requires about as much work to decode as a full JPEG
1798image of the same size, so the decoder must be fairly fast in relation to the
1799data transmission rate in order to make incremental display useful. However,
1800it is possible to skip displaying the image and simply add the incoming bits
1801to the decoder's coefficient buffer. This is fast because only Huffman
1802decoding need be done, not IDCT, upsampling, colorspace conversion, etc.
1803The IJG decoder library allows the application to switch dynamically between
1804displaying the image and simply absorbing the incoming bits. A properly
1805coded application can automatically adapt the number of display passes to
1806suit the time available as the image is received. Also, a final
1807higher-quality display cycle can be performed from the buffered data after
1808the end of the file is reached.
1809
1810Progressive compression:
1811
1812To create a progressive JPEG file (or a multiple-scan sequential JPEG file),
1813set the scan_info cinfo field to point to an array of scan descriptors, and
1814perform compression as usual. Instead of constructing your own scan list,
1815you can call the jpeg_simple_progression() helper routine to create a
1816recommended progression sequence; this method should be used by all
1817applications that don't want to get involved in the nitty-gritty of
1818progressive scan sequence design. (If you want to provide user control of
1819scan sequences, you may wish to borrow the scan script reading code found
1820in rdswitch.c, so that you can read scan script files just like cjpeg's.)
1821When scan_info is not NULL, the compression library will store DCT'd data
1822into a buffer array as jpeg_write_scanlines() is called, and will emit all
1823the requested scans during jpeg_finish_compress(). This implies that
1824multiple-scan output cannot be created with a suspending data destination
1825manager, since jpeg_finish_compress() does not support suspension. We
1826should also note that the compressor currently forces Huffman optimization
1827mode when creating a progressive JPEG file, because the default Huffman
1828tables are unsuitable for progressive files.
1829
1830Progressive decompression:
1831
1832When buffered-image mode is not used, the decoder library will read all of
1833a multi-scan file during jpeg_start_decompress(), so that it can provide a
1834final decoded image. (Here "multi-scan" means either progressive or
1835multi-scan sequential.) This makes multi-scan files transparent to the
1836decoding application. However, existing applications that used suspending
1837input with version 5 of the IJG library will need to be modified to check
1838for a suspension return from jpeg_start_decompress().
1839
1840To perform incremental display, an application must use the library's
1841buffered-image mode. This is described in the next section.
1842
1843
1844Buffered-image mode
1845-------------------
1846
1847In buffered-image mode, the library stores the partially decoded image in a
1848coefficient buffer, from which it can be read out as many times as desired.
1849This mode is typically used for incremental display of progressive JPEG files,
1850but it can be used with any JPEG file. Each scan of a progressive JPEG file
1851adds more data (more detail) to the buffered image. The application can
1852display in lockstep with the source file (one display pass per input scan),
1853or it can allow input processing to outrun display processing. By making
1854input and display processing run independently, it is possible for the
1855application to adapt progressive display to a wide range of data transmission
1856rates.
1857
1858The basic control flow for buffered-image decoding is
1859
DRCb7753512014-05-11 09:36:25 +00001860 jpeg_create_decompress()
1861 set data source
1862 jpeg_read_header()
1863 set overall decompression parameters
1864 cinfo.buffered_image = TRUE; /* select buffered-image mode */
1865 jpeg_start_decompress()
1866 for (each output pass) {
1867 adjust output decompression parameters if required
1868 jpeg_start_output() /* start a new output pass */
1869 for (all scanlines in image) {
1870 jpeg_read_scanlines()
1871 display scanlines
1872 }
1873 jpeg_finish_output() /* terminate output pass */
1874 }
1875 jpeg_finish_decompress()
1876 jpeg_destroy_decompress()
Thomas G. Lanebc79e061995-08-02 00:00:00 +00001877
1878This differs from ordinary unbuffered decoding in that there is an additional
1879level of looping. The application can choose how many output passes to make
1880and how to display each pass.
1881
1882The simplest approach to displaying progressive images is to do one display
1883pass for each scan appearing in the input file. In this case the outer loop
1884condition is typically
DRCb7753512014-05-11 09:36:25 +00001885 while (! jpeg_input_complete(&cinfo))
Thomas G. Lanebc79e061995-08-02 00:00:00 +00001886and the start-output call should read
DRCb7753512014-05-11 09:36:25 +00001887 jpeg_start_output(&cinfo, cinfo.input_scan_number);
Thomas G. Lanebc79e061995-08-02 00:00:00 +00001888The second parameter to jpeg_start_output() indicates which scan of the input
1889file is to be displayed; the scans are numbered starting at 1 for this
1890purpose. (You can use a loop counter starting at 1 if you like, but using
1891the library's input scan counter is easier.) The library automatically reads
1892data as necessary to complete each requested scan, and jpeg_finish_output()
1893advances to the next scan or end-of-image marker (hence input_scan_number
1894will be incremented by the time control arrives back at jpeg_start_output()).
1895With this technique, data is read from the input file only as needed, and
1896input and output processing run in lockstep.
1897
1898After reading the final scan and reaching the end of the input file, the
1899buffered image remains available; it can be read additional times by
1900repeating the jpeg_start_output()/jpeg_read_scanlines()/jpeg_finish_output()
1901sequence. For example, a useful technique is to use fast one-pass color
1902quantization for display passes made while the image is arriving, followed by
1903a final display pass using two-pass quantization for highest quality. This
1904is done by changing the library parameters before the final output pass.
1905Changing parameters between passes is discussed in detail below.
1906
1907In general the last scan of a progressive file cannot be recognized as such
1908until after it is read, so a post-input display pass is the best approach if
1909you want special processing in the final pass.
1910
1911When done with the image, be sure to call jpeg_finish_decompress() to release
1912the buffered image (or just use jpeg_destroy_decompress()).
1913
1914If input data arrives faster than it can be displayed, the application can
1915cause the library to decode input data in advance of what's needed to produce
1916output. This is done by calling the routine jpeg_consume_input().
1917The return value is one of the following:
DRCb7753512014-05-11 09:36:25 +00001918 JPEG_REACHED_SOS: reached an SOS marker (the start of a new scan)
1919 JPEG_REACHED_EOI: reached the EOI marker (end of image)
1920 JPEG_ROW_COMPLETED: completed reading one MCU row of compressed data
1921 JPEG_SCAN_COMPLETED: completed reading last MCU row of current scan
1922 JPEG_SUSPENDED: suspended before completing any of the above
Thomas G. Lanebc79e061995-08-02 00:00:00 +00001923(JPEG_SUSPENDED can occur only if a suspending data source is used.) This
1924routine can be called at any time after initializing the JPEG object. It
1925reads some additional data and returns when one of the indicated significant
1926events occurs. (If called after the EOI marker is reached, it will
1927immediately return JPEG_REACHED_EOI without attempting to read more data.)
1928
1929The library's output processing will automatically call jpeg_consume_input()
1930whenever the output processing overtakes the input; thus, simple lockstep
1931display requires no direct calls to jpeg_consume_input(). But by adding
1932calls to jpeg_consume_input(), you can absorb data in advance of what is
1933being displayed. This has two benefits:
1934 * You can limit buildup of unprocessed data in your input buffer.
1935 * You can eliminate extra display passes by paying attention to the
1936 state of the library's input processing.
1937
1938The first of these benefits only requires interspersing calls to
1939jpeg_consume_input() with your display operations and any other processing
1940you may be doing. To avoid wasting cycles due to backtracking, it's best to
1941call jpeg_consume_input() only after a hundred or so new bytes have arrived.
1942This is discussed further under "I/O suspension", above. (Note: the JPEG
1943library currently is not thread-safe. You must not call jpeg_consume_input()
1944from one thread of control if a different library routine is working on the
1945same JPEG object in another thread.)
1946
1947When input arrives fast enough that more than one new scan is available
1948before you start a new output pass, you may as well skip the output pass
1949corresponding to the completed scan. This occurs for free if you pass
1950cinfo.input_scan_number as the target scan number to jpeg_start_output().
1951The input_scan_number field is simply the index of the scan currently being
1952consumed by the input processor. You can ensure that this is up-to-date by
1953emptying the input buffer just before calling jpeg_start_output(): call
1954jpeg_consume_input() repeatedly until it returns JPEG_SUSPENDED or
1955JPEG_REACHED_EOI.
1956
1957The target scan number passed to jpeg_start_output() is saved in the
1958cinfo.output_scan_number field. The library's output processing calls
1959jpeg_consume_input() whenever the current input scan number and row within
Thomas G. Lane489583f1996-02-07 00:00:00 +00001960that scan is less than or equal to the current output scan number and row.
Thomas G. Lanebc79e061995-08-02 00:00:00 +00001961Thus, input processing can "get ahead" of the output processing but is not
1962allowed to "fall behind". You can achieve several different effects by
1963manipulating this interlock rule. For example, if you pass a target scan
1964number greater than the current input scan number, the output processor will
1965wait until that scan starts to arrive before producing any output. (To avoid
1966an infinite loop, the target scan number is automatically reset to the last
1967scan number when the end of image is reached. Thus, if you specify a large
1968target scan number, the library will just absorb the entire input file and
1969then perform an output pass. This is effectively the same as what
1970jpeg_start_decompress() does when you don't select buffered-image mode.)
1971When you pass a target scan number equal to the current input scan number,
1972the image is displayed no faster than the current input scan arrives. The
1973final possibility is to pass a target scan number less than the current input
1974scan number; this disables the input/output interlock and causes the output
1975processor to simply display whatever it finds in the image buffer, without
1976waiting for input. (However, the library will not accept a target scan
1977number less than one, so you can't avoid waiting for the first scan.)
1978
Thomas G. Lane489583f1996-02-07 00:00:00 +00001979When data is arriving faster than the output display processing can advance
1980through the image, jpeg_consume_input() will store data into the buffered
1981image beyond the point at which the output processing is reading data out
1982again. If the input arrives fast enough, it may "wrap around" the buffer to
1983the point where the input is more than one whole scan ahead of the output.
1984If the output processing simply proceeds through its display pass without
1985paying attention to the input, the effect seen on-screen is that the lower
1986part of the image is one or more scans better in quality than the upper part.
1987Then, when the next output scan is started, you have a choice of what target
1988scan number to use. The recommended choice is to use the current input scan
1989number at that time, which implies that you've skipped the output scans
1990corresponding to the input scans that were completed while you processed the
1991previous output scan. In this way, the decoder automatically adapts its
1992speed to the arriving data, by skipping output scans as necessary to keep up
1993with the arriving data.
1994
1995When using this strategy, you'll want to be sure that you perform a final
1996output pass after receiving all the data; otherwise your last display may not
1997be full quality across the whole screen. So the right outer loop logic is
1998something like this:
DRCb7753512014-05-11 09:36:25 +00001999 do {
2000 absorb any waiting input by calling jpeg_consume_input()
2001 final_pass = jpeg_input_complete(&cinfo);
2002 adjust output decompression parameters if required
2003 jpeg_start_output(&cinfo, cinfo.input_scan_number);
2004 ...
2005 jpeg_finish_output()
2006 } while (! final_pass);
Thomas G. Lanebc79e061995-08-02 00:00:00 +00002007rather than quitting as soon as jpeg_input_complete() returns TRUE. This
2008arrangement makes it simple to use higher-quality decoding parameters
2009for the final pass. But if you don't want to use special parameters for
2010the final pass, the right loop logic is like this:
DRCb7753512014-05-11 09:36:25 +00002011 for (;;) {
2012 absorb any waiting input by calling jpeg_consume_input()
2013 jpeg_start_output(&cinfo, cinfo.input_scan_number);
2014 ...
2015 jpeg_finish_output()
2016 if (jpeg_input_complete(&cinfo) &&
2017 cinfo.input_scan_number == cinfo.output_scan_number)
2018 break;
2019 }
Thomas G. Lane489583f1996-02-07 00:00:00 +00002020In this case you don't need to know in advance whether an output pass is to
2021be the last one, so it's not necessary to have reached EOF before starting
2022the final output pass; rather, what you want to test is whether the output
2023pass was performed in sync with the final input scan. This form of the loop
2024will avoid an extra output pass whenever the decoder is able (or nearly able)
2025to keep up with the incoming data.
Thomas G. Lanebc79e061995-08-02 00:00:00 +00002026
2027When the data transmission speed is high, you might begin a display pass,
Thomas G. Lane489583f1996-02-07 00:00:00 +00002028then find that much or all of the file has arrived before you can complete
Thomas G. Lanebc79e061995-08-02 00:00:00 +00002029the pass. (You can detect this by noting the JPEG_REACHED_EOI return code
2030from jpeg_consume_input(), or equivalently by testing jpeg_input_complete().)
2031In this situation you may wish to abort the current display pass and start a
2032new one using the newly arrived information. To do so, just call
2033jpeg_finish_output() and then start a new pass with jpeg_start_output().
2034
2035A variant strategy is to abort and restart display if more than one complete
2036scan arrives during an output pass; this can be detected by noting
2037JPEG_REACHED_SOS returns and/or examining cinfo.input_scan_number. This
2038idea should be employed with caution, however, since the display process
2039might never get to the bottom of the image before being aborted, resulting
2040in the lower part of the screen being several passes worse than the upper.
2041In most cases it's probably best to abort an output pass only if the whole
2042file has arrived and you want to begin the final output pass immediately.
2043
2044When receiving data across a communication link, we recommend always using
2045the current input scan number for the output target scan number; if a
2046higher-quality final pass is to be done, it should be started (aborting any
2047incomplete output pass) as soon as the end of file is received. However,
2048many other strategies are possible. For example, the application can examine
2049the parameters of the current input scan and decide whether to display it or
2050not. If the scan contains only chroma data, one might choose not to use it
2051as the target scan, expecting that the scan will be small and will arrive
2052quickly. To skip to the next scan, call jpeg_consume_input() until it
2053returns JPEG_REACHED_SOS or JPEG_REACHED_EOI. Or just use the next higher
2054number as the target scan for jpeg_start_output(); but that method doesn't
2055let you inspect the next scan's parameters before deciding to display it.
2056
2057
2058In buffered-image mode, jpeg_start_decompress() never performs input and
2059thus never suspends. An application that uses input suspension with
2060buffered-image mode must be prepared for suspension returns from these
2061routines:
2062* jpeg_start_output() performs input only if you request 2-pass quantization
2063 and the target scan isn't fully read yet. (This is discussed below.)
2064* jpeg_read_scanlines(), as always, returns the number of scanlines that it
2065 was able to produce before suspending.
2066* jpeg_finish_output() will read any markers following the target scan,
Thomas G. Lane489583f1996-02-07 00:00:00 +00002067 up to the end of the file or the SOS marker that begins another scan.
Thomas G. Lanebc79e061995-08-02 00:00:00 +00002068 (But it reads no input if jpeg_consume_input() has already reached the
Thomas G. Lane489583f1996-02-07 00:00:00 +00002069 end of the file or a SOS marker beyond the target output scan.)
2070* jpeg_finish_decompress() will read until the end of file, and thus can
Thomas G. Lanebc79e061995-08-02 00:00:00 +00002071 suspend if the end hasn't already been reached (as can be tested by
2072 calling jpeg_input_complete()).
2073jpeg_start_output(), jpeg_finish_output(), and jpeg_finish_decompress()
2074all return TRUE if they completed their tasks, FALSE if they had to suspend.
2075In the event of a FALSE return, the application must load more input data
2076and repeat the call. Applications that use non-suspending data sources need
2077not check the return values of these three routines.
2078
2079
2080It is possible to change decoding parameters between output passes in the
2081buffered-image mode. The decoder library currently supports only very
2082limited changes of parameters. ONLY THE FOLLOWING parameter changes are
2083allowed after jpeg_start_decompress() is called:
2084* dct_method can be changed before each call to jpeg_start_output().
2085 For example, one could use a fast DCT method for early scans, changing
2086 to a higher quality method for the final scan.
2087* dither_mode can be changed before each call to jpeg_start_output();
2088 of course this has no impact if not using color quantization. Typically
2089 one would use ordered dither for initial passes, then switch to
2090 Floyd-Steinberg dither for the final pass. Caution: changing dither mode
2091 can cause more memory to be allocated by the library. Although the amount
2092 of memory involved is not large (a scanline or so), it may cause the
2093 initial max_memory_to_use specification to be exceeded, which in the worst
2094 case would result in an out-of-memory failure.
2095* do_block_smoothing can be changed before each call to jpeg_start_output().
2096 This setting is relevant only when decoding a progressive JPEG image.
2097 During the first DC-only scan, block smoothing provides a very "fuzzy" look
2098 instead of the very "blocky" look seen without it; which is better seems a
2099 matter of personal taste. But block smoothing is nearly always a win
2100 during later stages, especially when decoding a successive-approximation
2101 image: smoothing helps to hide the slight blockiness that otherwise shows
2102 up on smooth gradients until the lowest coefficient bits are sent.
2103* Color quantization mode can be changed under the rules described below.
2104 You *cannot* change between full-color and quantized output (because that
2105 would alter the required I/O buffer sizes), but you can change which
2106 quantization method is used.
2107
2108When generating color-quantized output, changing quantization method is a
2109very useful way of switching between high-speed and high-quality display.
2110The library allows you to change among its three quantization methods:
21111. Single-pass quantization to a fixed color cube.
2112 Selected by cinfo.two_pass_quantize = FALSE and cinfo.colormap = NULL.
21132. Single-pass quantization to an application-supplied colormap.
2114 Selected by setting cinfo.colormap to point to the colormap (the value of
2115 two_pass_quantize is ignored); also set cinfo.actual_number_of_colors.
21163. Two-pass quantization to a colormap chosen specifically for the image.
2117 Selected by cinfo.two_pass_quantize = TRUE and cinfo.colormap = NULL.
2118 (This is the default setting selected by jpeg_read_header, but it is
2119 probably NOT what you want for the first pass of progressive display!)
2120These methods offer successively better quality and lesser speed. However,
2121only the first method is available for quantizing in non-RGB color spaces.
2122
2123IMPORTANT: because the different quantizer methods have very different
2124working-storage requirements, the library requires you to indicate which
2125one(s) you intend to use before you call jpeg_start_decompress(). (If we did
2126not require this, the max_memory_to_use setting would be a complete fiction.)
2127You do this by setting one or more of these three cinfo fields to TRUE:
DRCb7753512014-05-11 09:36:25 +00002128 enable_1pass_quant Fixed color cube colormap
2129 enable_external_quant Externally-supplied colormap
2130 enable_2pass_quant Two-pass custom colormap
Thomas G. Lanebc79e061995-08-02 00:00:00 +00002131All three are initialized FALSE by jpeg_read_header(). But
2132jpeg_start_decompress() automatically sets TRUE the one selected by the
2133current two_pass_quantize and colormap settings, so you only need to set the
2134enable flags for any other quantization methods you plan to change to later.
2135
2136After setting the enable flags correctly at jpeg_start_decompress() time, you
2137can change to any enabled quantization method by setting two_pass_quantize
2138and colormap properly just before calling jpeg_start_output(). The following
2139special rules apply:
21401. You must explicitly set cinfo.colormap to NULL when switching to 1-pass
2141 or 2-pass mode from a different mode, or when you want the 2-pass
2142 quantizer to be re-run to generate a new colormap.
21432. To switch to an external colormap, or to change to a different external
2144 colormap than was used on the prior pass, you must call
2145 jpeg_new_colormap() after setting cinfo.colormap.
2146NOTE: if you want to use the same colormap as was used in the prior pass,
2147you should not do either of these things. This will save some nontrivial
2148switchover costs.
2149(These requirements exist because cinfo.colormap will always be non-NULL
2150after completing a prior output pass, since both the 1-pass and 2-pass
2151quantizers set it to point to their output colormaps. Thus you have to
2152do one of these two things to notify the library that something has changed.
2153Yup, it's a bit klugy, but it's necessary to do it this way for backwards
2154compatibility.)
2155
2156Note that in buffered-image mode, the library generates any requested colormap
2157during jpeg_start_output(), not during jpeg_start_decompress().
2158
2159When using two-pass quantization, jpeg_start_output() makes a pass over the
2160buffered image to determine the optimum color map; it therefore may take a
2161significant amount of time, whereas ordinarily it does little work. The
2162progress monitor hook is called during this pass, if defined. It is also
2163important to realize that if the specified target scan number is greater than
2164or equal to the current input scan number, jpeg_start_output() will attempt
2165to consume input as it makes this pass. If you use a suspending data source,
2166you need to check for a FALSE return from jpeg_start_output() under these
2167conditions. The combination of 2-pass quantization and a not-yet-fully-read
2168target scan is the only case in which jpeg_start_output() will consume input.
2169
2170
2171Application authors who support buffered-image mode may be tempted to use it
2172for all JPEG images, even single-scan ones. This will work, but it is
2173inefficient: there is no need to create an image-sized coefficient buffer for
2174single-scan images. Requesting buffered-image mode for such an image wastes
2175memory. Worse, it can cost time on large images, since the buffered data has
2176to be swapped out or written to a temporary file. If you are concerned about
2177maximum performance on baseline JPEG files, you should use buffered-image
2178mode only when the incoming file actually has multiple scans. This can be
2179tested by calling jpeg_has_multiple_scans(), which will return a correct
2180result at any time after jpeg_read_header() completes.
2181
2182It is also worth noting that when you use jpeg_consume_input() to let input
2183processing get ahead of output processing, the resulting pattern of access to
2184the coefficient buffer is quite nonsequential. It's best to use the memory
2185manager jmemnobs.c if you can (ie, if you have enough real or virtual main
2186memory). If not, at least make sure that max_memory_to_use is set as high as
2187possible. If the JPEG memory manager has to use a temporary file, you will
2188probably see a lot of disk traffic and poor performance. (This could be
2189improved with additional work on the memory manager, but we haven't gotten
2190around to it yet.)
2191
2192In some applications it may be convenient to use jpeg_consume_input() for all
2193input processing, including reading the initial markers; that is, you may
2194wish to call jpeg_consume_input() instead of jpeg_read_header() during
2195startup. This works, but note that you must check for JPEG_REACHED_SOS and
2196JPEG_REACHED_EOI return codes as the equivalent of jpeg_read_header's codes.
2197Once the first SOS marker has been reached, you must call
2198jpeg_start_decompress() before jpeg_consume_input() will consume more input;
2199it'll just keep returning JPEG_REACHED_SOS until you do. If you read a
2200tables-only file this way, jpeg_consume_input() will return JPEG_REACHED_EOI
2201without ever returning JPEG_REACHED_SOS; be sure to check for this case.
2202If this happens, the decompressor will not read any more input until you call
2203jpeg_abort() to reset it. It is OK to call jpeg_consume_input() even when not
2204using buffered-image mode, but in that case it's basically a no-op after the
2205initial markers have been read: it will just return JPEG_SUSPENDED.
Thomas G. Lane9ba2f5e1994-12-07 00:00:00 +00002206
Thomas G. Lane36a4ccc1994-09-24 00:00:00 +00002207
2208Abbreviated datastreams and multiple images
2209-------------------------------------------
2210
2211A JPEG compression or decompression object can be reused to process multiple
2212images. This saves a small amount of time per image by eliminating the
2213"create" and "destroy" operations, but that isn't the real purpose of the
2214feature. Rather, reuse of an object provides support for abbreviated JPEG
2215datastreams. Object reuse can also simplify processing a series of images in
2216a single input or output file. This section explains these features.
2217
2218A JPEG file normally contains several hundred bytes worth of quantization
2219and Huffman tables. In a situation where many images will be stored or
2220transmitted with identical tables, this may represent an annoying overhead.
2221The JPEG standard therefore permits tables to be omitted. The standard
2222defines three classes of JPEG datastreams:
2223 * "Interchange" datastreams contain an image and all tables needed to decode
2224 the image. These are the usual kind of JPEG file.
2225 * "Abbreviated image" datastreams contain an image, but are missing some or
2226 all of the tables needed to decode that image.
2227 * "Abbreviated table specification" (henceforth "tables-only") datastreams
2228 contain only table specifications.
2229To decode an abbreviated image, it is necessary to load the missing table(s)
2230into the decoder beforehand. This can be accomplished by reading a separate
2231tables-only file. A variant scheme uses a series of images in which the first
2232image is an interchange (complete) datastream, while subsequent ones are
2233abbreviated and rely on the tables loaded by the first image. It is assumed
2234that once the decoder has read a table, it will remember that table until a
2235new definition for the same table number is encountered.
2236
2237It is the application designer's responsibility to figure out how to associate
2238the correct tables with an abbreviated image. While abbreviated datastreams
2239can be useful in a closed environment, their use is strongly discouraged in
2240any situation where data exchange with other applications might be needed.
2241Caveat designer.
2242
2243The JPEG library provides support for reading and writing any combination of
2244tables-only datastreams and abbreviated images. In both compression and
2245decompression objects, a quantization or Huffman table will be retained for
2246the lifetime of the object, unless it is overwritten by a new table definition.
2247
2248
2249To create abbreviated image datastreams, it is only necessary to tell the
2250compressor not to emit some or all of the tables it is using. Each
2251quantization and Huffman table struct contains a boolean field "sent_table",
2252which normally is initialized to FALSE. For each table used by the image, the
2253header-writing process emits the table and sets sent_table = TRUE unless it is
2254already TRUE. (In normal usage, this prevents outputting the same table
2255definition multiple times, as would otherwise occur because the chroma
2256components typically share tables.) Thus, setting this field to TRUE before
2257calling jpeg_start_compress() will prevent the table from being written at
2258all.
2259
2260If you want to create a "pure" abbreviated image file containing no tables,
2261just call "jpeg_suppress_tables(&cinfo, TRUE)" after constructing all the
2262tables. If you want to emit some but not all tables, you'll need to set the
2263individual sent_table fields directly.
2264
2265To create an abbreviated image, you must also call jpeg_start_compress()
2266with a second parameter of FALSE, not TRUE. Otherwise jpeg_start_compress()
2267will force all the sent_table fields to FALSE. (This is a safety feature to
2268prevent abbreviated images from being created accidentally.)
2269
2270To create a tables-only file, perform the same parameter setup that you
2271normally would, but instead of calling jpeg_start_compress() and so on, call
2272jpeg_write_tables(&cinfo). This will write an abbreviated datastream
2273containing only SOI, DQT and/or DHT markers, and EOI. All the quantization
2274and Huffman tables that are currently defined in the compression object will
2275be emitted unless their sent_tables flag is already TRUE, and then all the
2276sent_tables flags will be set TRUE.
2277
2278A sure-fire way to create matching tables-only and abbreviated image files
2279is to proceed as follows:
2280
DRCb7753512014-05-11 09:36:25 +00002281 create JPEG compression object
2282 set JPEG parameters
2283 set destination to tables-only file
2284 jpeg_write_tables(&cinfo);
2285 set destination to image file
2286 jpeg_start_compress(&cinfo, FALSE);
2287 write data...
2288 jpeg_finish_compress(&cinfo);
Thomas G. Lane36a4ccc1994-09-24 00:00:00 +00002289
2290Since the JPEG parameters are not altered between writing the table file and
2291the abbreviated image file, the same tables are sure to be used. Of course,
2292you can repeat the jpeg_start_compress() ... jpeg_finish_compress() sequence
2293many times to produce many abbreviated image files matching the table file.
2294
2295You cannot suppress output of the computed Huffman tables when Huffman
2296optimization is selected. (If you could, there'd be no way to decode the
2297image...) Generally, you don't want to set optimize_coding = TRUE when
2298you are trying to produce abbreviated files.
2299
2300In some cases you might want to compress an image using tables which are
2301not stored in the application, but are defined in an interchange or
2302tables-only file readable by the application. This can be done by setting up
2303a JPEG decompression object to read the specification file, then copying the
Thomas G. Lane489583f1996-02-07 00:00:00 +00002304tables into your compression object. See jpeg_copy_critical_parameters()
2305for an example of copying quantization tables.
Thomas G. Lane36a4ccc1994-09-24 00:00:00 +00002306
2307
2308To read abbreviated image files, you simply need to load the proper tables
2309into the decompression object before trying to read the abbreviated image.
2310If the proper tables are stored in the application program, you can just
Thomas G. Lane5ead57a1998-03-27 00:00:00 +00002311allocate the table structs and fill in their contents directly. For example,
2312to load a fixed quantization table into table slot "n":
2313
2314 if (cinfo.quant_tbl_ptrs[n] == NULL)
2315 cinfo.quant_tbl_ptrs[n] = jpeg_alloc_quant_table((j_common_ptr) &cinfo);
DRCb7753512014-05-11 09:36:25 +00002316 quant_ptr = cinfo.quant_tbl_ptrs[n]; /* quant_ptr is JQUANT_TBL* */
Thomas G. Lane5ead57a1998-03-27 00:00:00 +00002317 for (i = 0; i < 64; i++) {
2318 /* Qtable[] is desired quantization table, in natural array order */
2319 quant_ptr->quantval[i] = Qtable[i];
2320 }
2321
2322Code to load a fixed Huffman table is typically (for AC table "n"):
2323
2324 if (cinfo.ac_huff_tbl_ptrs[n] == NULL)
2325 cinfo.ac_huff_tbl_ptrs[n] = jpeg_alloc_huff_table((j_common_ptr) &cinfo);
DRCb7753512014-05-11 09:36:25 +00002326 huff_ptr = cinfo.ac_huff_tbl_ptrs[n]; /* huff_ptr is JHUFF_TBL* */
Thomas G. Lane5ead57a1998-03-27 00:00:00 +00002327 for (i = 1; i <= 16; i++) {
2328 /* counts[i] is number of Huffman codes of length i bits, i=1..16 */
2329 huff_ptr->bits[i] = counts[i];
2330 }
2331 for (i = 0; i < 256; i++) {
2332 /* symbols[] is the list of Huffman symbols, in code-length order */
2333 huff_ptr->huffval[i] = symbols[i];
2334 }
2335
2336(Note that trying to set cinfo.quant_tbl_ptrs[n] to point directly at a
2337constant JQUANT_TBL object is not safe. If the incoming file happened to
2338contain a quantization table definition, your master table would get
2339overwritten! Instead allocate a working table copy and copy the master table
2340into it, as illustrated above. Ditto for Huffman tables, of course.)
2341
2342You might want to read the tables from a tables-only file, rather than
2343hard-wiring them into your application. The jpeg_read_header() call is
2344sufficient to read a tables-only file. You must pass a second parameter of
2345FALSE to indicate that you do not require an image to be present. Thus, the
2346typical scenario is
Thomas G. Lane36a4ccc1994-09-24 00:00:00 +00002347
DRCb7753512014-05-11 09:36:25 +00002348 create JPEG decompression object
2349 set source to tables-only file
2350 jpeg_read_header(&cinfo, FALSE);
2351 set source to abbreviated image file
2352 jpeg_read_header(&cinfo, TRUE);
2353 set decompression parameters
2354 jpeg_start_decompress(&cinfo);
2355 read data...
2356 jpeg_finish_decompress(&cinfo);
Thomas G. Lane36a4ccc1994-09-24 00:00:00 +00002357
2358In some cases, you may want to read a file without knowing whether it contains
2359an image or just tables. In that case, pass FALSE and check the return value
2360from jpeg_read_header(): it will be JPEG_HEADER_OK if an image was found,
2361JPEG_HEADER_TABLES_ONLY if only tables were found. (A third return value,
2362JPEG_SUSPENDED, is possible when using a suspending data source manager.)
2363Note that jpeg_read_header() will not complain if you read an abbreviated
2364image for which you haven't loaded the missing tables; the missing-table check
Thomas G. Lanebc79e061995-08-02 00:00:00 +00002365occurs later, in jpeg_start_decompress().
Thomas G. Lane36a4ccc1994-09-24 00:00:00 +00002366
2367
2368It is possible to read a series of images from a single source file by
2369repeating the jpeg_read_header() ... jpeg_finish_decompress() sequence,
2370without releasing/recreating the JPEG object or the data source module.
2371(If you did reinitialize, any partial bufferload left in the data source
2372buffer at the end of one image would be discarded, causing you to lose the
2373start of the next image.) When you use this method, stored tables are
2374automatically carried forward, so some of the images can be abbreviated images
2375that depend on tables from earlier images.
2376
2377If you intend to write a series of images into a single destination file,
2378you might want to make a specialized data destination module that doesn't
2379flush the output buffer at term_destination() time. This would speed things
2380up by some trifling amount. Of course, you'd need to remember to flush the
2381buffer after the last image. You can make the later images be abbreviated
2382ones by passing FALSE to jpeg_start_compress().
2383
2384
2385Special markers
2386---------------
2387
2388Some applications may need to insert or extract special data in the JPEG
2389datastream. The JPEG standard provides marker types "COM" (comment) and
2390"APP0" through "APP15" (application) to hold application-specific data.
2391Unfortunately, the use of these markers is not specified by the standard.
2392COM markers are fairly widely used to hold user-supplied text. The JFIF file
2393format spec uses APP0 markers with specified initial strings to hold certain
2394data. Adobe applications use APP14 markers beginning with the string "Adobe"
2395for miscellaneous data. Other APPn markers are rarely seen, but might
2396contain almost anything.
2397
2398If you wish to store user-supplied text, we recommend you use COM markers
2399and place readable 7-bit ASCII text in them. Newline conventions are not
2400standardized --- expect to find LF (Unix style), CR/LF (DOS style), or CR
2401(Mac style). A robust COM reader should be able to cope with random binary
2402garbage, including nulls, since some applications generate COM markers
2403containing non-ASCII junk. (But yours should not be one of them.)
2404
2405For program-supplied data, use an APPn marker, and be sure to begin it with an
2406identifying string so that you can tell whether the marker is actually yours.
2407It's probably best to avoid using APP0 or APP14 for any private markers.
Thomas G. Lanebc79e061995-08-02 00:00:00 +00002408(NOTE: the upcoming SPIFF standard will use APP8 markers; we recommend you
2409not use APP8 markers for any private purposes, either.)
Thomas G. Lane36a4ccc1994-09-24 00:00:00 +00002410
2411Keep in mind that at most 65533 bytes can be put into one marker, but you
2412can have as many markers as you like.
2413
Thomas G. Lanebc79e061995-08-02 00:00:00 +00002414By default, the IJG compression library will write a JFIF APP0 marker if the
Thomas G. Lane36a4ccc1994-09-24 00:00:00 +00002415selected JPEG colorspace is grayscale or YCbCr, or an Adobe APP14 marker if
2416the selected colorspace is RGB, CMYK, or YCCK. You can disable this, but
2417we don't recommend it. The decompression library will recognize JFIF and
2418Adobe markers and will set the JPEG colorspace properly when one is found.
2419
Thomas G. Lane5ead57a1998-03-27 00:00:00 +00002420
Thomas G. Lane36a4ccc1994-09-24 00:00:00 +00002421You can write special markers immediately following the datastream header by
2422calling jpeg_write_marker() after jpeg_start_compress() and before the first
2423call to jpeg_write_scanlines(). When you do this, the markers appear after
2424the SOI and the JFIF APP0 and Adobe APP14 markers (if written), but before
Thomas G. Lanebc79e061995-08-02 00:00:00 +00002425all else. Specify the marker type parameter as "JPEG_COM" for COM or
Thomas G. Lane36a4ccc1994-09-24 00:00:00 +00002426"JPEG_APP0 + n" for APPn. (Actually, jpeg_write_marker will let you write
2427any marker type, but we don't recommend writing any other kinds of marker.)
2428For example, to write a user comment string pointed to by comment_text:
DRCb7753512014-05-11 09:36:25 +00002429 jpeg_write_marker(cinfo, JPEG_COM, comment_text, strlen(comment_text));
Thomas G. Lane36a4ccc1994-09-24 00:00:00 +00002430
Thomas G. Lane5ead57a1998-03-27 00:00:00 +00002431If it's not convenient to store all the marker data in memory at once,
2432you can instead call jpeg_write_m_header() followed by multiple calls to
2433jpeg_write_m_byte(). If you do it this way, it's your responsibility to
2434call jpeg_write_m_byte() exactly the number of times given in the length
2435parameter to jpeg_write_m_header(). (This method lets you empty the
2436output buffer partway through a marker, which might be important when
2437using a suspending data destination module. In any case, if you are using
2438a suspending destination, you should flush its buffer after inserting
2439any special markers. See "I/O suspension".)
Thomas G. Lane36a4ccc1994-09-24 00:00:00 +00002440
Thomas G. Lane5ead57a1998-03-27 00:00:00 +00002441Or, if you prefer to synthesize the marker byte sequence yourself,
2442you can just cram it straight into the data destination module.
2443
2444If you are writing JFIF 1.02 extension markers (thumbnail images), don't
2445forget to set cinfo.JFIF_minor_version = 2 so that the encoder will write the
2446correct JFIF version number in the JFIF header marker. The library's default
2447is to write version 1.01, but that's wrong if you insert any 1.02 extension
2448markers. (We could probably get away with just defaulting to 1.02, but there
2449used to be broken decoders that would complain about unknown minor version
2450numbers. To reduce compatibility risks it's safest not to write 1.02 unless
2451you are actually using 1.02 extensions.)
2452
2453
2454When reading, two methods of handling special markers are available:
24551. You can ask the library to save the contents of COM and/or APPn markers
2456into memory, and then examine them at your leisure afterwards.
24572. You can supply your own routine to process COM and/or APPn markers
2458on-the-fly as they are read.
2459The first method is simpler to use, especially if you are using a suspending
2460data source; writing a marker processor that copes with input suspension is
2461not easy (consider what happens if the marker is longer than your available
2462input buffer). However, the second method conserves memory since the marker
2463data need not be kept around after it's been processed.
2464
2465For either method, you'd normally set up marker handling after creating a
2466decompression object and before calling jpeg_read_header(), because the
2467markers of interest will typically be near the head of the file and so will
2468be scanned by jpeg_read_header. Once you've established a marker handling
2469method, it will be used for the life of that decompression object
2470(potentially many datastreams), unless you change it. Marker handling is
2471determined separately for COM markers and for each APPn marker code.
2472
2473
2474To save the contents of special markers in memory, call
DRCb7753512014-05-11 09:36:25 +00002475 jpeg_save_markers(cinfo, marker_code, length_limit)
Thomas G. Lane5ead57a1998-03-27 00:00:00 +00002476where marker_code is the marker type to save, JPEG_COM or JPEG_APP0+n.
2477(To arrange to save all the special marker types, you need to call this
2478routine 17 times, for COM and APP0-APP15.) If the incoming marker is longer
2479than length_limit data bytes, only length_limit bytes will be saved; this
2480parameter allows you to avoid chewing up memory when you only need to see the
2481first few bytes of a potentially large marker. If you want to save all the
2482data, set length_limit to 0xFFFF; that is enough since marker lengths are only
248316 bits. As a special case, setting length_limit to 0 prevents that marker
2484type from being saved at all. (That is the default behavior, in fact.)
2485
2486After jpeg_read_header() completes, you can examine the special markers by
2487following the cinfo->marker_list pointer chain. All the special markers in
2488the file appear in this list, in order of their occurrence in the file (but
2489omitting any markers of types you didn't ask for). Both the original data
2490length and the saved data length are recorded for each list entry; the latter
2491will not exceed length_limit for the particular marker type. Note that these
2492lengths exclude the marker length word, whereas the stored representation
2493within the JPEG file includes it. (Hence the maximum data length is really
2494only 65533.)
2495
2496It is possible that additional special markers appear in the file beyond the
2497SOS marker at which jpeg_read_header stops; if so, the marker list will be
2498extended during reading of the rest of the file. This is not expected to be
2499common, however. If you are short on memory you may want to reset the length
2500limit to zero for all marker types after finishing jpeg_read_header, to
2501ensure that the max_memory_to_use setting cannot be exceeded due to addition
2502of later markers.
2503
2504The marker list remains stored until you call jpeg_finish_decompress or
2505jpeg_abort, at which point the memory is freed and the list is set to empty.
2506(jpeg_destroy also releases the storage, of course.)
2507
2508Note that the library is internally interested in APP0 and APP14 markers;
2509if you try to set a small nonzero length limit on these types, the library
2510will silently force the length up to the minimum it wants. (But you can set
2511a zero length limit to prevent them from being saved at all.) Also, in a
251216-bit environment, the maximum length limit may be constrained to less than
251365533 by malloc() limitations. It is therefore best not to assume that the
2514effective length limit is exactly what you set it to be.
2515
2516
2517If you want to supply your own marker-reading routine, you do it by calling
2518jpeg_set_marker_processor(). A marker processor routine must have the
2519signature
DRCb7753512014-05-11 09:36:25 +00002520 boolean jpeg_marker_parser_method (j_decompress_ptr cinfo)
Thomas G. Lane36a4ccc1994-09-24 00:00:00 +00002521Although the marker code is not explicitly passed, the routine can find it
2522in cinfo->unread_marker. At the time of call, the marker proper has been
2523read from the data source module. The processor routine is responsible for
2524reading the marker length word and the remaining parameter bytes, if any.
2525Return TRUE to indicate success. (FALSE should be returned only if you are
2526using a suspending data source and it tells you to suspend. See the standard
2527marker processors in jdmarker.c for appropriate coding methods if you need to
2528use a suspending data source.)
2529
2530If you override the default APP0 or APP14 processors, it is up to you to
2531recognize JFIF and Adobe markers if you want colorspace recognition to occur
2532properly. We recommend copying and extending the default processors if you
Thomas G. Lane5ead57a1998-03-27 00:00:00 +00002533want to do that. (A better idea is to save these marker types for later
2534examination by calling jpeg_save_markers(); that method doesn't interfere
2535with the library's own processing of these markers.)
2536
2537jpeg_set_marker_processor() and jpeg_save_markers() are mutually exclusive
2538--- if you call one it overrides any previous call to the other, for the
2539particular marker type specified.
Thomas G. Lane36a4ccc1994-09-24 00:00:00 +00002540
2541A simple example of an external COM processor can be found in djpeg.c.
Thomas G. Lane5ead57a1998-03-27 00:00:00 +00002542Also, see jpegtran.c for an example of using jpeg_save_markers.
Thomas G. Lane36a4ccc1994-09-24 00:00:00 +00002543
2544
2545Raw (downsampled) image data
2546----------------------------
2547
2548Some applications need to supply already-downsampled image data to the JPEG
2549compressor, or to receive raw downsampled data from the decompressor. The
2550library supports this requirement by allowing the application to write or
2551read raw data, bypassing the normal preprocessing or postprocessing steps.
2552The interface is different from the standard one and is somewhat harder to
2553use. If your interest is merely in bypassing color conversion, we recommend
2554that you use the standard interface and simply set jpeg_color_space =
2555in_color_space (or jpeg_color_space = out_color_space for decompression).
2556The mechanism described in this section is necessary only to supply or
2557receive downsampled image data, in which not all components have the same
2558dimensions.
2559
2560
2561To compress raw data, you must supply the data in the colorspace to be used
2562in the JPEG file (please read the earlier section on Special color spaces)
2563and downsampled to the sampling factors specified in the JPEG parameters.
2564You must supply the data in the format used internally by the JPEG library,
2565namely a JSAMPIMAGE array. This is an array of pointers to two-dimensional
2566arrays, each of type JSAMPARRAY. Each 2-D array holds the values for one
2567color component. This structure is necessary since the components are of
2568different sizes. If the image dimensions are not a multiple of the MCU size,
2569you must also pad the data correctly (usually, this is done by replicating
2570the last column and/or row). The data must be padded to a multiple of a DCT
2571block in each component: that is, each downsampled row must contain a
2572multiple of 8 valid samples, and there must be a multiple of 8 sample rows
2573for each component. (For applications such as conversion of digital TV
2574images, the standard image size is usually a multiple of the DCT block size,
2575so that no padding need actually be done.)
2576
2577The procedure for compression of raw data is basically the same as normal
2578compression, except that you call jpeg_write_raw_data() in place of
2579jpeg_write_scanlines(). Before calling jpeg_start_compress(), you must do
2580the following:
2581 * Set cinfo->raw_data_in to TRUE. (It is set FALSE by jpeg_set_defaults().)
2582 This notifies the library that you will be supplying raw data.
2583 * Ensure jpeg_color_space is correct --- an explicit jpeg_set_colorspace()
2584 call is a good idea. Note that since color conversion is bypassed,
2585 in_color_space is ignored, except that jpeg_set_defaults() uses it to
2586 choose the default jpeg_color_space setting.
2587 * Ensure the sampling factors, cinfo->comp_info[i].h_samp_factor and
2588 cinfo->comp_info[i].v_samp_factor, are correct. Since these indicate the
2589 dimensions of the data you are supplying, it's wise to set them
2590 explicitly, rather than assuming the library's defaults are what you want.
2591
2592To pass raw data to the library, call jpeg_write_raw_data() in place of
2593jpeg_write_scanlines(). The two routines work similarly except that
2594jpeg_write_raw_data takes a JSAMPIMAGE data array rather than JSAMPARRAY.
2595The scanlines count passed to and returned from jpeg_write_raw_data is
2596measured in terms of the component with the largest v_samp_factor.
2597
2598jpeg_write_raw_data() processes one MCU row per call, which is to say
2599v_samp_factor*DCTSIZE sample rows of each component. The passed num_lines
2600value must be at least max_v_samp_factor*DCTSIZE, and the return value will
2601be exactly that amount (or possibly some multiple of that amount, in future
2602library versions). This is true even on the last call at the bottom of the
2603image; don't forget to pad your data as necessary.
2604
2605The required dimensions of the supplied data can be computed for each
2606component as
DRCb7753512014-05-11 09:36:25 +00002607 cinfo->comp_info[i].width_in_blocks*DCTSIZE samples per row
2608 cinfo->comp_info[i].height_in_blocks*DCTSIZE rows in image
Thomas G. Lane36a4ccc1994-09-24 00:00:00 +00002609after jpeg_start_compress() has initialized those fields. If the valid data
2610is smaller than this, it must be padded appropriately. For some sampling
2611factors and image sizes, additional dummy DCT blocks are inserted to make
2612the image a multiple of the MCU dimensions. The library creates such dummy
2613blocks itself; it does not read them from your supplied data. Therefore you
2614need never pad by more than DCTSIZE samples. An example may help here.
2615Assume 2h2v downsampling of YCbCr data, that is
DRCb7753512014-05-11 09:36:25 +00002616 cinfo->comp_info[0].h_samp_factor = 2 for Y
2617 cinfo->comp_info[0].v_samp_factor = 2
2618 cinfo->comp_info[1].h_samp_factor = 1 for Cb
2619 cinfo->comp_info[1].v_samp_factor = 1
2620 cinfo->comp_info[2].h_samp_factor = 1 for Cr
2621 cinfo->comp_info[2].v_samp_factor = 1
Thomas G. Lane36a4ccc1994-09-24 00:00:00 +00002622and suppose that the nominal image dimensions (cinfo->image_width and
2623cinfo->image_height) are 101x101 pixels. Then jpeg_start_compress() will
2624compute downsampled_width = 101 and width_in_blocks = 13 for Y,
2625downsampled_width = 51 and width_in_blocks = 7 for Cb and Cr (and the same
2626for the height fields). You must pad the Y data to at least 13*8 = 104
2627columns and rows, the Cb/Cr data to at least 7*8 = 56 columns and rows. The
2628MCU height is max_v_samp_factor = 2 DCT rows so you must pass at least 16
2629scanlines on each call to jpeg_write_raw_data(), which is to say 16 actual
2630sample rows of Y and 8 each of Cb and Cr. A total of 7 MCU rows are needed,
2631so you must pass a total of 7*16 = 112 "scanlines". The last DCT block row
2632of Y data is dummy, so it doesn't matter what you pass for it in the data
2633arrays, but the scanlines count must total up to 112 so that all of the Cb
2634and Cr data gets passed.
2635
Thomas G. Lanea8b67c41995-03-15 00:00:00 +00002636Output suspension is supported with raw-data compression: if the data
2637destination module suspends, jpeg_write_raw_data() will return 0.
2638In this case the same data rows must be passed again on the next call.
Thomas G. Lane36a4ccc1994-09-24 00:00:00 +00002639
2640
2641Decompression with raw data output implies bypassing all postprocessing:
Thomas G. Lanebc79e061995-08-02 00:00:00 +00002642you cannot ask for rescaling or color quantization, for instance. More
2643seriously, you must deal with the color space and sampling factors present in
2644the incoming file. If your application only handles, say, 2h1v YCbCr data,
2645you must check for and fail on other color spaces or other sampling factors.
2646The library will not convert to a different color space for you.
Thomas G. Lane36a4ccc1994-09-24 00:00:00 +00002647
2648To obtain raw data output, set cinfo->raw_data_out = TRUE before
2649jpeg_start_decompress() (it is set FALSE by jpeg_read_header()). Be sure to
2650verify that the color space and sampling factors are ones you can handle.
2651Then call jpeg_read_raw_data() in place of jpeg_read_scanlines(). The
2652decompression process is otherwise the same as usual.
2653
2654jpeg_read_raw_data() returns one MCU row per call, and thus you must pass a
2655buffer of at least max_v_samp_factor*DCTSIZE scanlines (scanline counting is
2656the same as for raw-data compression). The buffer you pass must be large
2657enough to hold the actual data plus padding to DCT-block boundaries. As with
2658compression, any entirely dummy DCT blocks are not processed so you need not
2659allocate space for them, but the total scanline count includes them. The
2660above example of computing buffer dimensions for raw-data compression is
2661equally valid for decompression.
2662
2663Input suspension is supported with raw-data decompression: if the data source
Thomas G. Lanebc79e061995-08-02 00:00:00 +00002664module suspends, jpeg_read_raw_data() will return 0. You can also use
2665buffered-image mode to read raw data in multiple passes.
2666
2667
2668Really raw data: DCT coefficients
2669---------------------------------
2670
2671It is possible to read or write the contents of a JPEG file as raw DCT
2672coefficients. This facility is mainly intended for use in lossless
2673transcoding between different JPEG file formats. Other possible applications
2674include lossless cropping of a JPEG image, lossless reassembly of a
2675multi-strip or multi-tile TIFF/JPEG file into a single JPEG datastream, etc.
2676
2677To read the contents of a JPEG file as DCT coefficients, open the file and do
2678jpeg_read_header() as usual. But instead of calling jpeg_start_decompress()
2679and jpeg_read_scanlines(), call jpeg_read_coefficients(). This will read the
2680entire image into a set of virtual coefficient-block arrays, one array per
2681component. The return value is a pointer to an array of virtual-array
2682descriptors. Each virtual array can be accessed directly using the JPEG
2683memory manager's access_virt_barray method (see Memory management, below,
Guido Vollbeding5996a252009-06-27 00:00:00 +00002684and also read structure.txt's discussion of virtual array handling). Or,
Thomas G. Lanebc79e061995-08-02 00:00:00 +00002685for simple transcoding to a different JPEG file format, the array list can
2686just be handed directly to jpeg_write_coefficients().
2687
Thomas G. Lane5ead57a1998-03-27 00:00:00 +00002688Each block in the block arrays contains quantized coefficient values in
2689normal array order (not JPEG zigzag order). The block arrays contain only
2690DCT blocks containing real data; any entirely-dummy blocks added to fill out
2691interleaved MCUs at the right or bottom edges of the image are discarded
2692during reading and are not stored in the block arrays. (The size of each
2693block array can be determined from the width_in_blocks and height_in_blocks
2694fields of the component's comp_info entry.) This is also the data format
2695expected by jpeg_write_coefficients().
2696
Thomas G. Lanebc79e061995-08-02 00:00:00 +00002697When you are done using the virtual arrays, call jpeg_finish_decompress()
2698to release the array storage and return the decompression object to an idle
2699state; or just call jpeg_destroy() if you don't need to reuse the object.
2700
2701If you use a suspending data source, jpeg_read_coefficients() will return
2702NULL if it is forced to suspend; a non-NULL return value indicates successful
2703completion. You need not test for a NULL return value when using a
2704non-suspending data source.
2705
Thomas G. Lane5ead57a1998-03-27 00:00:00 +00002706It is also possible to call jpeg_read_coefficients() to obtain access to the
2707decoder's coefficient arrays during a normal decode cycle in buffered-image
2708mode. This frammish might be useful for progressively displaying an incoming
2709image and then re-encoding it without loss. To do this, decode in buffered-
2710image mode as discussed previously, then call jpeg_read_coefficients() after
2711the last jpeg_finish_output() call. The arrays will be available for your use
2712until you call jpeg_finish_decompress().
2713
Thomas G. Lanebc79e061995-08-02 00:00:00 +00002714
2715To write the contents of a JPEG file as DCT coefficients, you must provide
2716the DCT coefficients stored in virtual block arrays. You can either pass
2717block arrays read from an input JPEG file by jpeg_read_coefficients(), or
2718allocate virtual arrays from the JPEG compression object and fill them
2719yourself. In either case, jpeg_write_coefficients() is substituted for
2720jpeg_start_compress() and jpeg_write_scanlines(). Thus the sequence is
2721 * Create compression object
2722 * Set all compression parameters as necessary
2723 * Request virtual arrays if needed
2724 * jpeg_write_coefficients()
2725 * jpeg_finish_compress()
2726 * Destroy or re-use compression object
2727jpeg_write_coefficients() is passed a pointer to an array of virtual block
2728array descriptors; the number of arrays is equal to cinfo.num_components.
2729
2730The virtual arrays need only have been requested, not realized, before
2731jpeg_write_coefficients() is called. A side-effect of
2732jpeg_write_coefficients() is to realize any virtual arrays that have been
2733requested from the compression object's memory manager. Thus, when obtaining
2734the virtual arrays from the compression object, you should fill the arrays
2735after calling jpeg_write_coefficients(). The data is actually written out
2736when you call jpeg_finish_compress(); jpeg_write_coefficients() only writes
2737the file header.
2738
2739When writing raw DCT coefficients, it is crucial that the JPEG quantization
2740tables and sampling factors match the way the data was encoded, or the
2741resulting file will be invalid. For transcoding from an existing JPEG file,
2742we recommend using jpeg_copy_critical_parameters(). This routine initializes
2743all the compression parameters to default values (like jpeg_set_defaults()),
2744then copies the critical information from a source decompression object.
2745The decompression object should have just been used to read the entire
2746JPEG input file --- that is, it should be awaiting jpeg_finish_decompress().
2747
2748jpeg_write_coefficients() marks all tables stored in the compression object
2749as needing to be written to the output file (thus, it acts like
2750jpeg_start_compress(cinfo, TRUE)). This is for safety's sake, to avoid
2751emitting abbreviated JPEG files by accident. If you really want to emit an
2752abbreviated JPEG file, call jpeg_suppress_tables(), or set the tables'
2753individual sent_table flags, between calling jpeg_write_coefficients() and
2754jpeg_finish_compress().
Thomas G. Lane36a4ccc1994-09-24 00:00:00 +00002755
2756
2757Progress monitoring
2758-------------------
2759
2760Some applications may need to regain control from the JPEG library every so
2761often. The typical use of this feature is to produce a percent-done bar or
2762other progress display. (For a simple example, see cjpeg.c or djpeg.c.)
2763Although you do get control back frequently during the data-transferring pass
2764(the jpeg_read_scanlines or jpeg_write_scanlines loop), any additional passes
2765will occur inside jpeg_finish_compress or jpeg_start_decompress; those
2766routines may take a long time to execute, and you don't get control back
2767until they are done.
2768
2769You can define a progress-monitor routine which will be called periodically
2770by the library. No guarantees are made about how often this call will occur,
2771so we don't recommend you use it for mouse tracking or anything like that.
2772At present, a call will occur once per MCU row, scanline, or sample row
2773group, whichever unit is convenient for the current processing mode; so the
Thomas G. Lanebc79e061995-08-02 00:00:00 +00002774wider the image, the longer the time between calls. During the data
Thomas G. Lane36a4ccc1994-09-24 00:00:00 +00002775transferring pass, only one call occurs per call of jpeg_read_scanlines or
2776jpeg_write_scanlines, so don't pass a large number of scanlines at once if
Thomas G. Lanebc79e061995-08-02 00:00:00 +00002777you want fine resolution in the progress count. (If you really need to use
2778the callback mechanism for time-critical tasks like mouse tracking, you could
2779insert additional calls inside some of the library's inner loops.)
Thomas G. Lane36a4ccc1994-09-24 00:00:00 +00002780
2781To establish a progress-monitor callback, create a struct jpeg_progress_mgr,
2782fill in its progress_monitor field with a pointer to your callback routine,
2783and set cinfo->progress to point to the struct. The callback will be called
2784whenever cinfo->progress is non-NULL. (This pointer is set to NULL by
2785jpeg_create_compress or jpeg_create_decompress; the library will not change
2786it thereafter. So if you allocate dynamic storage for the progress struct,
2787make sure it will live as long as the JPEG object does. Allocating from the
2788JPEG memory manager with lifetime JPOOL_PERMANENT will work nicely.) You
2789can use the same callback routine for both compression and decompression.
2790
2791The jpeg_progress_mgr struct contains four fields which are set by the library:
DRCb7753512014-05-11 09:36:25 +00002792 long pass_counter; /* work units completed in this pass */
2793 long pass_limit; /* total number of work units in this pass */
2794 int completed_passes; /* passes completed so far */
2795 int total_passes; /* total number of passes expected */
Thomas G. Lane36a4ccc1994-09-24 00:00:00 +00002796During any one pass, pass_counter increases from 0 up to (not including)
Thomas G. Lanebc79e061995-08-02 00:00:00 +00002797pass_limit; the step size is usually but not necessarily 1. The pass_limit
2798value may change from one pass to another. The expected total number of
Thomas G. Lane36a4ccc1994-09-24 00:00:00 +00002799passes is in total_passes, and the number of passes already completed is in
2800completed_passes. Thus the fraction of work completed may be estimated as
DRCb7753512014-05-11 09:36:25 +00002801 completed_passes + (pass_counter/pass_limit)
2802 --------------------------------------------
2803 total_passes
Thomas G. Lane36a4ccc1994-09-24 00:00:00 +00002804ignoring the fact that the passes may not be equal amounts of work.
2805
Thomas G. Lanebc79e061995-08-02 00:00:00 +00002806When decompressing, pass_limit can even change within a pass, because it
Thomas G. Lane36a4ccc1994-09-24 00:00:00 +00002807depends on the number of scans in the JPEG file, which isn't always known in
Thomas G. Lanebc79e061995-08-02 00:00:00 +00002808advance. The computed fraction-of-work-done may jump suddenly (if the library
2809discovers it has overestimated the number of scans) or even decrease (in the
2810opposite case). It is not wise to put great faith in the work estimate.
Thomas G. Lane36a4ccc1994-09-24 00:00:00 +00002811
Thomas G. Lanebc79e061995-08-02 00:00:00 +00002812When using the decompressor's buffered-image mode, the progress monitor work
2813estimate is likely to be completely unhelpful, because the library has no way
2814to know how many output passes will be demanded of it. Currently, the library
2815sets total_passes based on the assumption that there will be one more output
2816pass if the input file end hasn't yet been read (jpeg_input_complete() isn't
2817TRUE), but no more output passes if the file end has been reached when the
2818output pass is started. This means that total_passes will rise as additional
2819output passes are requested. If you have a way of determining the input file
2820size, estimating progress based on the fraction of the file that's been read
2821will probably be more useful than using the library's value.
Thomas G. Lane36a4ccc1994-09-24 00:00:00 +00002822
2823
2824Memory management
2825-----------------
2826
2827This section covers some key facts about the JPEG library's built-in memory
Guido Vollbeding5996a252009-06-27 00:00:00 +00002828manager. For more info, please read structure.txt's section about the memory
Thomas G. Lane36a4ccc1994-09-24 00:00:00 +00002829manager, and consult the source code if necessary.
2830
2831All memory and temporary file allocation within the library is done via the
2832memory manager. If necessary, you can replace the "back end" of the memory
2833manager to control allocation yourself (for example, if you don't want the
2834library to use malloc() and free() for some reason).
2835
2836Some data is allocated "permanently" and will not be freed until the JPEG
2837object is destroyed. Most data is allocated "per image" and is freed by
2838jpeg_finish_compress, jpeg_finish_decompress, or jpeg_abort. You can call the
2839memory manager yourself to allocate structures that will automatically be
2840freed at these times. Typical code for this is
2841 ptr = (*cinfo->mem->alloc_small) ((j_common_ptr) cinfo, JPOOL_IMAGE, size);
2842Use JPOOL_PERMANENT to get storage that lasts as long as the JPEG object.
2843Use alloc_large instead of alloc_small for anything bigger than a few Kbytes.
2844There are also alloc_sarray and alloc_barray routines that automatically
2845build 2-D sample or block arrays.
2846
2847The library's minimum space requirements to process an image depend on the
2848image's width, but not on its height, because the library ordinarily works
2849with "strip" buffers that are as wide as the image but just a few rows high.
2850Some operating modes (eg, two-pass color quantization) require full-image
2851buffers. Such buffers are treated as "virtual arrays": only the current strip
2852need be in memory, and the rest can be swapped out to a temporary file.
2853
2854If you use the simplest memory manager back end (jmemnobs.c), then no
2855temporary files are used; virtual arrays are simply malloc()'d. Images bigger
2856than memory can be processed only if your system supports virtual memory.
2857The other memory manager back ends support temporary files of various flavors
2858and thus work in machines without virtual memory. They may also be useful on
2859Unix machines if you need to process images that exceed available swap space.
2860
2861When using temporary files, the library will make the in-memory buffers for
2862its virtual arrays just big enough to stay within a "maximum memory" setting.
2863Your application can set this limit by setting cinfo->mem->max_memory_to_use
2864after creating the JPEG object. (Of course, there is still a minimum size for
2865the buffers, so the max-memory setting is effective only if it is bigger than
2866the minimum space needed.) If you allocate any large structures yourself, you
2867must allocate them before jpeg_start_compress() or jpeg_start_decompress() in
2868order to have them counted against the max memory limit. Also keep in mind
2869that space allocated with alloc_small() is ignored, on the assumption that
Thomas G. Lanebc79e061995-08-02 00:00:00 +00002870it's too small to be worth worrying about; so a reasonable safety margin
2871should be left when setting max_memory_to_use.
Thomas G. Lane36a4ccc1994-09-24 00:00:00 +00002872
Thomas G. Lane36a4ccc1994-09-24 00:00:00 +00002873
Thomas G. Lane5ead57a1998-03-27 00:00:00 +00002874Memory usage
2875------------
2876
2877Working memory requirements while performing compression or decompression
2878depend on image dimensions, image characteristics (such as colorspace and
2879JPEG process), and operating mode (application-selected options).
2880
2881As of v6b, the decompressor requires:
2882 1. About 24K in more-or-less-fixed-size data. This varies a bit depending
2883 on operating mode and image characteristics (particularly color vs.
2884 grayscale), but it doesn't depend on image dimensions.
2885 2. Strip buffers (of size proportional to the image width) for IDCT and
2886 upsampling results. The worst case for commonly used sampling factors
2887 is about 34 bytes * width in pixels for a color image. A grayscale image
2888 only needs about 8 bytes per pixel column.
2889 3. A full-image DCT coefficient buffer is needed to decode a multi-scan JPEG
2890 file (including progressive JPEGs), or whenever you select buffered-image
2891 mode. This takes 2 bytes/coefficient. At typical 2x2 sampling, that's
2892 3 bytes per pixel for a color image. Worst case (1x1 sampling) requires
2893 6 bytes/pixel. For grayscale, figure 2 bytes/pixel.
2894 4. To perform 2-pass color quantization, the decompressor also needs a
2895 128K color lookup table and a full-image pixel buffer (3 bytes/pixel).
2896This does not count any memory allocated by the application, such as a
2897buffer to hold the final output image.
2898
2899The above figures are valid for 8-bit JPEG data precision and a machine with
290032-bit ints. For 12-bit JPEG data, double the size of the strip buffers and
2901quantization pixel buffer. The "fixed-size" data will be somewhat smaller
2902with 16-bit ints, larger with 64-bit ints. Also, CMYK or other unusual
2903color spaces will require different amounts of space.
2904
2905The full-image coefficient and pixel buffers, if needed at all, do not
2906have to be fully RAM resident; you can have the library use temporary
2907files instead when the total memory usage would exceed a limit you set.
2908(But if your OS supports virtual memory, it's probably better to just use
2909jmemnobs and let the OS do the swapping.)
2910
2911The compressor's memory requirements are similar, except that it has no need
2912for color quantization. Also, it needs a full-image DCT coefficient buffer
2913if Huffman-table optimization is asked for, even if progressive mode is not
2914requested.
2915
2916If you need more detailed information about memory usage in a particular
2917situation, you can enable the MEM_STATS code in jmemmgr.c.
2918
2919
Thomas G. Lane36a4ccc1994-09-24 00:00:00 +00002920Library compile-time options
2921----------------------------
2922
2923A number of compile-time options are available by modifying jmorecfg.h.
2924
2925The JPEG standard provides for both the baseline 8-bit DCT process and
Thomas G. Lane5ead57a1998-03-27 00:00:00 +00002926a 12-bit DCT process. The IJG code supports 12-bit lossy JPEG if you define
Thomas G. Lane36a4ccc1994-09-24 00:00:00 +00002927BITS_IN_JSAMPLE as 12 rather than 8. Note that this causes JSAMPLE to be
2928larger than a char, so it affects the surrounding application's image data.
Thomas G. Lane9ba2f5e1994-12-07 00:00:00 +00002929The sample applications cjpeg and djpeg can support 12-bit mode only for PPM
2930and GIF file formats; you must disable the other file formats to compile a
Guido Vollbeding5996a252009-06-27 00:00:00 +0000293112-bit cjpeg or djpeg. (install.txt has more information about that.)
Thomas G. Lanea8b67c41995-03-15 00:00:00 +00002932At present, a 12-bit library can handle *only* 12-bit images, not both
DRC52ded872014-05-15 20:30:16 +00002933precisions.
Thomas G. Lanea8b67c41995-03-15 00:00:00 +00002934
2935Note that a 12-bit library always compresses in Huffman optimization mode,
2936in order to generate valid Huffman tables. This is necessary because our
2937default Huffman tables only cover 8-bit data. If you need to output 12-bit
2938files in one pass, you'll have to supply suitable default Huffman tables.
Thomas G. Lane5ead57a1998-03-27 00:00:00 +00002939You may also want to supply your own DCT quantization tables; the existing
2940quality-scaling code has been developed for 8-bit use, and probably doesn't
2941generate especially good tables for 12-bit.
Thomas G. Lane36a4ccc1994-09-24 00:00:00 +00002942
2943The maximum number of components (color channels) in the image is determined
2944by MAX_COMPONENTS. The JPEG standard allows up to 255 components, but we
2945expect that few applications will need more than four or so.
2946
2947On machines with unusual data type sizes, you may be able to improve
2948performance or reduce memory space by tweaking the various typedefs in
2949jmorecfg.h. In particular, on some RISC CPUs, access to arrays of "short"s
2950is quite slow; consider trading memory for speed by making JCOEF, INT16, and
2951UINT16 be "int" or "unsigned int". UINT8 is also a candidate to become int.
2952You probably don't want to make JSAMPLE be int unless you have lots of memory
2953to burn.
2954
2955You can reduce the size of the library by compiling out various optional
2956functions. To do this, undefine xxx_SUPPORTED symbols as necessary.
2957
Thomas G. Lane5ead57a1998-03-27 00:00:00 +00002958You can also save a few K by not having text error messages in the library;
2959the standard error message table occupies about 5Kb. This is particularly
DRCb7753512014-05-11 09:36:25 +00002960reasonable for embedded applications where there's no good way to display
Thomas G. Lane5ead57a1998-03-27 00:00:00 +00002961a message anyway. To do this, remove the creation of the message table
2962(jpeg_std_message_table[]) from jerror.c, and alter format_message to do
2963something reasonable without it. You could output the numeric value of the
2964message code number, for example. If you do this, you can also save a couple
2965more K by modifying the TRACEMSn() macros in jerror.h to expand to nothing;
2966you don't need trace capability anyway, right?
2967
Thomas G. Lane36a4ccc1994-09-24 00:00:00 +00002968
2969Portability considerations
2970--------------------------
2971
2972The JPEG library has been written to be extremely portable; the sample
2973applications cjpeg and djpeg are slightly less so. This section summarizes
2974the design goals in this area. (If you encounter any bugs that cause the
2975library to be less portable than is claimed here, we'd appreciate hearing
2976about them.)
2977
DRCfced14c2014-05-21 04:13:09 +00002978The code works fine on ANSI C and C++ compilers, using any of the popular
2979system include file setups, and some not-so-popular ones too.
Thomas G. Lane36a4ccc1994-09-24 00:00:00 +00002980
2981The code is not dependent on the exact sizes of the C data types. As
2982distributed, we make the assumptions that
DRCb7753512014-05-11 09:36:25 +00002983 char is at least 8 bits wide
2984 short is at least 16 bits wide
2985 int is at least 16 bits wide
2986 long is at least 32 bits wide
Thomas G. Lane36a4ccc1994-09-24 00:00:00 +00002987(These are the minimum requirements of the ANSI C standard.) Wider types will
2988work fine, although memory may be used inefficiently if char is much larger
2989than 8 bits or short is much bigger than 16 bits. The code should work
2990equally well with 16- or 32-bit ints.
2991
2992In a system where these assumptions are not met, you may be able to make the
2993code work by modifying the typedefs in jmorecfg.h. However, you will probably
2994have difficulty if int is less than 16 bits wide, since references to plain
2995int abound in the code.
2996
2997char can be either signed or unsigned, although the code runs faster if an
2998unsigned char type is available. If char is wider than 8 bits, you will need
2999to redefine JOCTET and/or provide custom data source/destination managers so
3000that JOCTET represents exactly 8 bits of data on external storage.
3001
3002The JPEG library proper does not assume ASCII representation of characters.
3003But some of the image file I/O modules in cjpeg/djpeg do have ASCII
3004dependencies in file-header manipulation; so does cjpeg's select_file_type()
3005routine.
3006
3007The JPEG library does not rely heavily on the C library. In particular, C
3008stdio is used only by the data source/destination modules and the error
3009handler, all of which are application-replaceable. (cjpeg/djpeg are more
3010heavily dependent on stdio.) malloc and free are called only from the memory
3011manager "back end" module, so you can use a different memory allocator by
3012replacing that one file.
3013
Guido Vollbeding5996a252009-06-27 00:00:00 +00003014More info about porting the code may be gleaned by reading jconfig.txt,
Thomas G. Lane36a4ccc1994-09-24 00:00:00 +00003015jmorecfg.h, and jinclude.h.