blob: b3333a8ef8fe476bd8a68d6f02214ff7adf282a9 [file] [log] [blame]
Thomas G. Lane36a4ccc1994-09-24 00:00:00 +00001USING THE IJG JPEG LIBRARY
2
Thomas G. Lane489583f1996-02-07 00:00:00 +00003Copyright (C) 1994-1996, Thomas G. Lane.
Thomas G. Lane36a4ccc1994-09-24 00:00:00 +00004This file is part of the Independent JPEG Group's software.
5For conditions of distribution and use, see the accompanying README file.
6
7
8This file describes how to use the IJG JPEG library within an application
9program. Read it if you want to write a program that uses the library.
10
11The file example.c provides heavily commented skeleton code for calling the
12JPEG library. Also see jpeglib.h (the include file to be used by application
13programs) for full details about data structures and function parameter lists.
14The library source code, of course, is the ultimate reference.
15
16Note that there have been *major* changes from the application interface
17presented by IJG version 4 and earlier versions. The old design had several
18inherent limitations, and it had accumulated a lot of cruft as we added
19features while trying to minimize application-interface changes. We have
20sacrificed backward compatibility in the version 5 rewrite, but we think the
21improvements justify this.
22
23
24TABLE OF CONTENTS
25-----------------
26
27Overview:
28 Functions provided by the library
29 Outline of typical usage
30Basic library usage:
31 Data formats
32 Compression details
33 Decompression details
34 Mechanics of usage: include files, linking, etc
35Advanced features:
36 Compression parameter selection
37 Decompression parameter selection
38 Special color spaces
39 Error handling
40 Compressed data handling (source and destination managers)
41 I/O suspension
Thomas G. Lanebc79e061995-08-02 00:00:00 +000042 Progressive JPEG support
43 Buffered-image mode
Thomas G. Lane36a4ccc1994-09-24 00:00:00 +000044 Abbreviated datastreams and multiple images
45 Special markers
46 Raw (downsampled) image data
Thomas G. Lanebc79e061995-08-02 00:00:00 +000047 Really raw data: DCT coefficients
Thomas G. Lane36a4ccc1994-09-24 00:00:00 +000048 Progress monitoring
49 Memory management
50 Library compile-time options
51 Portability considerations
52 Notes for MS-DOS implementors
53
54You should read at least the overview and basic usage sections before trying
55to program with the library. The sections on advanced features can be read
56if and when you need them.
57
58
59OVERVIEW
60========
61
62Functions provided by the library
63---------------------------------
64
65The IJG JPEG library provides C code to read and write JPEG-compressed image
66files. The surrounding application program receives or supplies image data a
67scanline at a time, using a straightforward uncompressed image format. All
68details of color conversion and other preprocessing/postprocessing can be
69handled by the library.
70
71The library includes a substantial amount of code that is not covered by the
72JPEG standard but is necessary for typical applications of JPEG. These
73functions preprocess the image before JPEG compression or postprocess it after
74decompression. They include colorspace conversion, downsampling/upsampling,
75and color quantization. The application indirectly selects use of this code
76by specifying the format in which it wishes to supply or receive image data.
77For example, if colormapped output is requested, then the decompression
78library automatically invokes color quantization.
79
80A wide range of quality vs. speed tradeoffs are possible in JPEG processing,
81and even more so in decompression postprocessing. The decompression library
82provides multiple implementations that cover most of the useful tradeoffs,
83ranging from very-high-quality down to fast-preview operation. On the
84compression side we have generally not provided low-quality choices, since
85compression is normally less time-critical. It should be understood that the
86low-quality modes may not meet the JPEG standard's accuracy requirements;
87nonetheless, they are useful for viewers.
88
89A word about functions *not* provided by the library. We handle a subset of
Thomas G. Lanebc79e061995-08-02 00:00:00 +000090the ISO JPEG standard; most baseline, extended-sequential, and progressive
91JPEG processes are supported. (Our subset includes all features now in common
92use.) Unsupported ISO options include:
Thomas G. Lane36a4ccc1994-09-24 00:00:00 +000093 * Hierarchical storage
94 * Lossless JPEG
95 * Arithmetic entropy coding (unsupported for legal reasons)
96 * DNL marker
97 * Nonintegral subsampling ratios
98We support both 8- and 12-bit data precision, but this is a compile-time
99choice rather than a run-time choice; hence it is difficult to use both
100precisions in a single application.
101
102By itself, the library handles only interchange JPEG datastreams --- in
103particular the widely used JFIF file format. The library can be used by
104surrounding code to process interchange or abbreviated JPEG datastreams that
Thomas G. Lanebc79e061995-08-02 00:00:00 +0000105are embedded in more complex file formats. (For example, this library is
106used by the free LIBTIFF library to support JPEG compression in TIFF.)
Thomas G. Lane36a4ccc1994-09-24 00:00:00 +0000107
108
109Outline of typical usage
110------------------------
111
112The rough outline of a JPEG compression operation is:
113
114 Allocate and initialize a JPEG compression object
115 Specify the destination for the compressed data (eg, a file)
116 Set parameters for compression, including image size & colorspace
117 jpeg_start_compress(...);
118 while (scan lines remain to be written)
119 jpeg_write_scanlines(...);
120 jpeg_finish_compress(...);
121 Release the JPEG compression object
122
123A JPEG compression object holds parameters and working state for the JPEG
124library. We make creation/destruction of the object separate from starting
125or finishing compression of an image; the same object can be re-used for a
126series of image compression operations. This makes it easy to re-use the
127same parameter settings for a sequence of images. Re-use of a JPEG object
128also has important implications for processing abbreviated JPEG datastreams,
129as discussed later.
130
131The image data to be compressed is supplied to jpeg_write_scanlines() from
132in-memory buffers. If the application is doing file-to-file compression,
133reading image data from the source file is the application's responsibility.
134The library emits compressed data by calling a "data destination manager",
135which typically will write the data into a file; but the application can
136provide its own destination manager to do something else.
137
138Similarly, the rough outline of a JPEG decompression operation is:
139
140 Allocate and initialize a JPEG decompression object
141 Specify the source of the compressed data (eg, a file)
142 Call jpeg_read_header() to obtain image info
143 Set parameters for decompression
144 jpeg_start_decompress(...);
145 while (scan lines remain to be read)
146 jpeg_read_scanlines(...);
147 jpeg_finish_decompress(...);
148 Release the JPEG decompression object
149
150This is comparable to the compression outline except that reading the
151datastream header is a separate step. This is helpful because information
152about the image's size, colorspace, etc is available when the application
153selects decompression parameters. For example, the application can choose an
154output scaling ratio that will fit the image into the available screen size.
155
156The decompression library obtains compressed data by calling a data source
157manager, which typically will read the data from a file; but other behaviors
158can be obtained with a custom source manager. Decompressed data is delivered
159into in-memory buffers passed to jpeg_read_scanlines().
160
161It is possible to abort an incomplete compression or decompression operation
162by calling jpeg_abort(); or, if you do not need to retain the JPEG object,
163simply release it by calling jpeg_destroy().
164
165JPEG compression and decompression objects are two separate struct types.
166However, they share some common fields, and certain routines such as
167jpeg_destroy() can work on either type of object.
168
169The JPEG library has no static variables: all state is in the compression
170or decompression object. Therefore it is possible to process multiple
171compression and decompression operations concurrently, using multiple JPEG
172objects.
173
174Both compression and decompression can be done in an incremental memory-to-
Thomas G. Lanebc79e061995-08-02 00:00:00 +0000175memory fashion, if suitable source/destination managers are used. See the
176section on "I/O suspension" for more details.
Thomas G. Lane36a4ccc1994-09-24 00:00:00 +0000177
178
179BASIC LIBRARY USAGE
180===================
181
182Data formats
183------------
184
185Before diving into procedural details, it is helpful to understand the
186image data format that the JPEG library expects or returns.
187
188The standard input image format is a rectangular array of pixels, with each
Thomas G. Lane489583f1996-02-07 00:00:00 +0000189pixel having the same number of "component" or "sample" values (color
190channels). You must specify how many components there are and the colorspace
191interpretation of the components. Most applications will use RGB data
192(three components per pixel) or grayscale data (one component per pixel).
193PLEASE NOTE THAT RGB DATA IS THREE SAMPLES PER PIXEL, GRAYSCALE ONLY ONE.
194A remarkable number of people manage to miss this, only to find that their
195programs don't work with grayscale JPEG files.
Thomas G. Lane36a4ccc1994-09-24 00:00:00 +0000196
Thomas G. Lane489583f1996-02-07 00:00:00 +0000197There is no provision for colormapped input. JPEG files are always full-color
198or full grayscale (or sometimes another colorspace such as CMYK). You can
199feed in a colormapped image by expanding it to full-color format. However
200JPEG often doesn't work very well with source data that has been colormapped,
201because of dithering noise. This is discussed in more detail in the JPEG FAQ
202and the other references mentioned in the README file.
Thomas G. Lane36a4ccc1994-09-24 00:00:00 +0000203
204Pixels are stored by scanlines, with each scanline running from left to
205right. The component values for each pixel are adjacent in the row; for
206example, R,G,B,R,G,B,R,G,B,... for 24-bit RGB color. Each scanline is an
207array of data type JSAMPLE --- which is typically "unsigned char", unless
208you've changed jmorecfg.h. (You can also change the RGB pixel layout, say
209to B,G,R order, by modifying jmorecfg.h. But see the restrictions listed in
210that file before doing so.)
211
212A 2-D array of pixels is formed by making a list of pointers to the starts of
213scanlines; so the scanlines need not be physically adjacent in memory. Even
214if you process just one scanline at a time, you must make a one-element
Thomas G. Lanebc79e061995-08-02 00:00:00 +0000215pointer array to conform to this structure. Pointers to JSAMPLE rows are of
216type JSAMPROW, and the pointer to the pointer array is of type JSAMPARRAY.
Thomas G. Lane36a4ccc1994-09-24 00:00:00 +0000217
218The library accepts or supplies one or more complete scanlines per call.
219It is not possible to process part of a row at a time. Scanlines are always
220processed top-to-bottom. You can process an entire image in one call if you
221have it all in memory, but usually it's simplest to process one scanline at
222a time.
223
224For best results, source data values should have the precision specified by
225BITS_IN_JSAMPLE (normally 8 bits). For instance, if you choose to compress
226data that's only 6 bits/channel, you should left-justify each value in a
227byte before passing it to the compressor. If you need to compress data
228that has more than 8 bits/channel, compile with BITS_IN_JSAMPLE = 12.
229(See "Library compile-time options", later.)
230
Thomas G. Lane489583f1996-02-07 00:00:00 +0000231
Thomas G. Lane36a4ccc1994-09-24 00:00:00 +0000232The data format returned by the decompressor is the same in all details,
Thomas G. Lane489583f1996-02-07 00:00:00 +0000233except that colormapped output is supported. (Again, a JPEG file is never
234colormapped. But you can ask the decompressor to perform on-the-fly color
235quantization to deliver colormapped output.) If you request colormapped
Thomas G. Lane36a4ccc1994-09-24 00:00:00 +0000236output then the returned data array contains a single JSAMPLE per pixel;
237its value is an index into a color map. The color map is represented as
238a 2-D JSAMPARRAY in which each row holds the values of one color component,
239that is, colormap[i][j] is the value of the i'th color component for pixel
240value (map index) j. Note that since the colormap indexes are stored in
241JSAMPLEs, the maximum number of colors is limited by the size of JSAMPLE
242(ie, at most 256 colors for an 8-bit JPEG library).
243
244
245Compression details
246-------------------
247
248Here we revisit the JPEG compression outline given in the overview.
249
2501. Allocate and initialize a JPEG compression object.
251
Thomas G. Lanebc79e061995-08-02 00:00:00 +0000252A JPEG compression object is a "struct jpeg_compress_struct". (It also has
253a bunch of subsidiary structures which are allocated via malloc(), but the
254application doesn't control those directly.) This struct can be just a local
255variable in the calling routine, if a single routine is going to execute the
256whole JPEG compression sequence. Otherwise it can be static or allocated
257from malloc().
Thomas G. Lane36a4ccc1994-09-24 00:00:00 +0000258
259You will also need a structure representing a JPEG error handler. The part
260of this that the library cares about is a "struct jpeg_error_mgr". If you
261are providing your own error handler, you'll typically want to embed the
262jpeg_error_mgr struct in a larger structure; this is discussed later under
263"Error handling". For now we'll assume you are just using the default error
264handler. The default error handler will print JPEG error/warning messages
265on stderr, and it will call exit() if a fatal error occurs.
266
267You must initialize the error handler structure, store a pointer to it into
268the JPEG object's "err" field, and then call jpeg_create_compress() to
269initialize the rest of the JPEG object.
270
271Typical code for this step, if you are using the default error handler, is
272
273 struct jpeg_compress_struct cinfo;
274 struct jpeg_error_mgr jerr;
275 ...
276 cinfo.err = jpeg_std_error(&jerr);
277 jpeg_create_compress(&cinfo);
278
279jpeg_create_compress allocates a small amount of memory, so it could fail
280if you are out of memory. In that case it will exit via the error handler;
281that's why the error handler must be initialized first.
282
283
2842. Specify the destination for the compressed data (eg, a file).
285
286As previously mentioned, the JPEG library delivers compressed data to a
287"data destination" module. The library includes one data destination
288module which knows how to write to a stdio stream. You can use your own
289destination module if you want to do something else, as discussed later.
290
291If you use the standard destination module, you must open the target stdio
292stream beforehand. Typical code for this step looks like:
293
294 FILE * outfile;
295 ...
296 if ((outfile = fopen(filename, "wb")) == NULL) {
297 fprintf(stderr, "can't open %s\n", filename);
298 exit(1);
299 }
300 jpeg_stdio_dest(&cinfo, outfile);
301
302where the last line invokes the standard destination module.
303
304WARNING: it is critical that the binary compressed data be delivered to the
305output file unchanged. On non-Unix systems the stdio library may perform
306newline translation or otherwise corrupt binary data. To suppress this
307behavior, you may need to use a "b" option to fopen (as shown above), or use
308setmode() or another routine to put the stdio stream in binary mode. See
309cjpeg.c and djpeg.c for code that has been found to work on many systems.
310
311You can select the data destination after setting other parameters (step 3),
312if that's more convenient. You may not change the destination between
313calling jpeg_start_compress() and jpeg_finish_compress().
314
315
3163. Set parameters for compression, including image size & colorspace.
317
318You must supply information about the source image by setting the following
319fields in the JPEG object (cinfo structure):
320
321 image_width Width of image, in pixels
322 image_height Height of image, in pixels
323 input_components Number of color channels (samples per pixel)
324 in_color_space Color space of source image
325
326The image dimensions are, hopefully, obvious. JPEG supports image dimensions
327of 1 to 64K pixels in either direction. The input color space is typically
328RGB or grayscale, and input_components is 3 or 1 accordingly. (See "Special
329color spaces", later, for more info.) The in_color_space field must be
330assigned one of the J_COLOR_SPACE enum constants, typically JCS_RGB or
331JCS_GRAYSCALE.
332
333JPEG has a large number of compression parameters that determine how the
334image is encoded. Most applications don't need or want to know about all
335these parameters. You can set all the parameters to reasonable defaults by
336calling jpeg_set_defaults(); then, if there are particular values you want
337to change, you can do so after that. The "Compression parameter selection"
338section tells about all the parameters.
339
340You must set in_color_space correctly before calling jpeg_set_defaults(),
341because the defaults depend on the source image colorspace. However the
342other three source image parameters need not be valid until you call
343jpeg_start_compress(). There's no harm in calling jpeg_set_defaults() more
344than once, if that happens to be convenient.
345
346Typical code for a 24-bit RGB source image is
347
348 cinfo.image_width = Width; /* image width and height, in pixels */
349 cinfo.image_height = Height;
350 cinfo.input_components = 3; /* # of color components per pixel */
351 cinfo.in_color_space = JCS_RGB; /* colorspace of input image */
352
353 jpeg_set_defaults(&cinfo);
354 /* Make optional parameter settings here */
355
356
3574. jpeg_start_compress(...);
358
359After you have established the data destination and set all the necessary
360source image info and other parameters, call jpeg_start_compress() to begin
361a compression cycle. This will initialize internal state, allocate working
362storage, and emit the first few bytes of the JPEG datastream header.
363
364Typical code:
365
366 jpeg_start_compress(&cinfo, TRUE);
367
368The "TRUE" parameter ensures that a complete JPEG interchange datastream
369will be written. This is appropriate in most cases. If you think you might
370want to use an abbreviated datastream, read the section on abbreviated
371datastreams, below.
372
373Once you have called jpeg_start_compress(), you may not alter any JPEG
374parameters or other fields of the JPEG object until you have completed
375the compression cycle.
376
377
3785. while (scan lines remain to be written)
379 jpeg_write_scanlines(...);
380
381Now write all the required image data by calling jpeg_write_scanlines()
382one or more times. You can pass one or more scanlines in each call, up
383to the total image height. In most applications it is convenient to pass
384just one or a few scanlines at a time. The expected format for the passed
385data is discussed under "Data formats", above.
386
387Image data should be written in top-to-bottom scanline order. The JPEG spec
388contains some weasel wording about how top and bottom are application-defined
389terms (a curious interpretation of the English language...) but if you want
390your files to be compatible with everyone else's, you WILL use top-to-bottom
391order. If the source data must be read in bottom-to-top order, you can use
392the JPEG library's virtual array mechanism to invert the data efficiently.
393Examples of this can be found in the sample application cjpeg.
394
395The library maintains a count of the number of scanlines written so far
396in the next_scanline field of the JPEG object. Usually you can just use
397this variable as the loop counter, so that the loop test looks like
398"while (cinfo.next_scanline < cinfo.image_height)".
399
400Code for this step depends heavily on the way that you store the source data.
401example.c shows the following code for the case of a full-size 2-D source
402array containing 3-byte RGB pixels:
403
404 JSAMPROW row_pointer[1]; /* pointer to a single row */
405 int row_stride; /* physical row width in buffer */
406
407 row_stride = image_width * 3; /* JSAMPLEs per row in image_buffer */
408
409 while (cinfo.next_scanline < cinfo.image_height) {
410 row_pointer[0] = & image_buffer[cinfo.next_scanline * row_stride];
411 jpeg_write_scanlines(&cinfo, row_pointer, 1);
412 }
413
414jpeg_write_scanlines() returns the number of scanlines actually written.
415This will normally be equal to the number passed in, so you can usually
416ignore the return value. It is different in just two cases:
417 * If you try to write more scanlines than the declared image height,
418 the additional scanlines are ignored.
419 * If you use a suspending data destination manager, output buffer overrun
420 will cause the compressor to return before accepting all the passed lines.
421 This feature is discussed under "I/O suspension", below. The normal
422 stdio destination manager will NOT cause this to happen.
423In any case, the return value is the same as the change in the value of
424next_scanline.
425
426
4276. jpeg_finish_compress(...);
428
429After all the image data has been written, call jpeg_finish_compress() to
430complete the compression cycle. This step is ESSENTIAL to ensure that the
431last bufferload of data is written to the data destination.
432jpeg_finish_compress() also releases working memory associated with the JPEG
433object.
434
435Typical code:
436
437 jpeg_finish_compress(&cinfo);
438
439If using the stdio destination manager, don't forget to close the output
440stdio stream if necessary.
441
442If you have requested a multi-pass operating mode, such as Huffman code
443optimization, jpeg_finish_compress() will perform the additional passes using
444data buffered by the first pass. In this case jpeg_finish_compress() may take
445quite a while to complete. With the default compression parameters, this will
446not happen.
447
448It is an error to call jpeg_finish_compress() before writing the necessary
449total number of scanlines. If you wish to abort compression, call
450jpeg_abort() as discussed below.
451
452After completing a compression cycle, you may dispose of the JPEG object
453as discussed next, or you may use it to compress another image. In that case
454return to step 2, 3, or 4 as appropriate. If you do not change the
455destination manager, the new datastream will be written to the same target.
456If you do not change any JPEG parameters, the new datastream will be written
457with the same parameters as before. Note that you can change the input image
458dimensions freely between cycles, but if you change the input colorspace, you
459should call jpeg_set_defaults() to adjust for the new colorspace; and then
460you'll need to repeat all of step 3.
461
462
4637. Release the JPEG compression object.
464
465When you are done with a JPEG compression object, destroy it by calling
466jpeg_destroy_compress(). This will free all subsidiary memory. Or you can
467call jpeg_destroy() which works for either compression or decompression
468objects --- this may be more convenient if you are sharing code between
469compression and decompression cases. (Actually, these routines are equivalent
470except for the declared type of the passed pointer. To avoid gripes from
Thomas G. Lanebc79e061995-08-02 00:00:00 +0000471ANSI C compilers, jpeg_destroy() should be passed a j_common_ptr.)
Thomas G. Lane36a4ccc1994-09-24 00:00:00 +0000472
473If you allocated the jpeg_compress_struct structure from malloc(), freeing
474it is your responsibility --- jpeg_destroy() won't. Ditto for the error
475handler structure.
476
477Typical code:
478
479 jpeg_destroy_compress(&cinfo);
480
481
4828. Aborting.
483
484If you decide to abort a compression cycle before finishing, you can clean up
485in either of two ways:
486
487* If you don't need the JPEG object any more, just call
488 jpeg_destroy_compress() or jpeg_destroy() to release memory. This is
489 legitimate at any point after calling jpeg_create_compress() --- in fact,
490 it's safe even if jpeg_create_compress() fails.
491
492* If you want to re-use the JPEG object, call jpeg_abort_compress(), or
493 jpeg_abort() which works on both compression and decompression objects.
494 This will return the object to an idle state, releasing any working memory.
495 jpeg_abort() is allowed at any time after successful object creation.
496
497Note that cleaning up the data destination, if required, is your
498responsibility.
499
500
501Decompression details
502---------------------
503
504Here we revisit the JPEG decompression outline given in the overview.
505
5061. Allocate and initialize a JPEG decompression object.
507
508This is just like initialization for compression, as discussed above,
509except that the object is a "struct jpeg_decompress_struct" and you
510call jpeg_create_decompress(). Error handling is exactly the same.
511
512Typical code:
513
514 struct jpeg_decompress_struct cinfo;
515 struct jpeg_error_mgr jerr;
516 ...
517 cinfo.err = jpeg_std_error(&jerr);
518 jpeg_create_decompress(&cinfo);
519
520(Both here and in the IJG code, we usually use variable name "cinfo" for
521both compression and decompression objects.)
522
523
5242. Specify the source of the compressed data (eg, a file).
525
526As previously mentioned, the JPEG library reads compressed data from a "data
527source" module. The library includes one data source module which knows how
528to read from a stdio stream. You can use your own source module if you want
529to do something else, as discussed later.
530
531If you use the standard source module, you must open the source stdio stream
532beforehand. Typical code for this step looks like:
533
534 FILE * infile;
535 ...
536 if ((infile = fopen(filename, "rb")) == NULL) {
537 fprintf(stderr, "can't open %s\n", filename);
538 exit(1);
539 }
540 jpeg_stdio_src(&cinfo, infile);
541
542where the last line invokes the standard source module.
543
544WARNING: it is critical that the binary compressed data be read unchanged.
545On non-Unix systems the stdio library may perform newline translation or
546otherwise corrupt binary data. To suppress this behavior, you may need to use
547a "b" option to fopen (as shown above), or use setmode() or another routine to
548put the stdio stream in binary mode. See cjpeg.c and djpeg.c for code that
549has been found to work on many systems.
550
551You may not change the data source between calling jpeg_read_header() and
552jpeg_finish_decompress(). If you wish to read a series of JPEG images from
553a single source file, you should repeat the jpeg_read_header() to
554jpeg_finish_decompress() sequence without reinitializing either the JPEG
555object or the data source module; this prevents buffered input data from
556being discarded.
557
558
5593. Call jpeg_read_header() to obtain image info.
560
561Typical code for this step is just
562
563 jpeg_read_header(&cinfo, TRUE);
564
565This will read the source datastream header markers, up to the beginning
566of the compressed data proper. On return, the image dimensions and other
567info have been stored in the JPEG object. The application may wish to
568consult this information before selecting decompression parameters.
569
570More complex code is necessary if
571 * A suspending data source is used --- in that case jpeg_read_header()
572 may return before it has read all the header data. See "I/O suspension",
573 below. The normal stdio source manager will NOT cause this to happen.
574 * Abbreviated JPEG files are to be processed --- see the section on
575 abbreviated datastreams. Standard applications that deal only in
576 interchange JPEG files need not be concerned with this case either.
577
578It is permissible to stop at this point if you just wanted to find out the
579image dimensions and other header info for a JPEG file. In that case,
580call jpeg_destroy() when you are done with the JPEG object, or call
581jpeg_abort() to return it to an idle state before selecting a new data
582source and reading another header.
583
584
5854. Set parameters for decompression.
586
587jpeg_read_header() sets appropriate default decompression parameters based on
588the properties of the image (in particular, its colorspace). However, you
589may well want to alter these defaults before beginning the decompression.
590For example, the default is to produce full color output from a color file.
591If you want colormapped output you must ask for it. Other options allow the
592returned image to be scaled and allow various speed/quality tradeoffs to be
593selected. "Decompression parameter selection", below, gives details.
594
595If the defaults are appropriate, nothing need be done at this step.
596
597Note that all default values are set by each call to jpeg_read_header().
598If you reuse a decompression object, you cannot expect your parameter
599settings to be preserved across cycles, as you can for compression.
Thomas G. Lanebc79e061995-08-02 00:00:00 +0000600You must set desired parameter values each time.
Thomas G. Lane36a4ccc1994-09-24 00:00:00 +0000601
602
6035. jpeg_start_decompress(...);
604
605Once the parameter values are satisfactory, call jpeg_start_decompress() to
606begin decompression. This will initialize internal state, allocate working
607memory, and prepare for returning data.
608
609Typical code is just
610
611 jpeg_start_decompress(&cinfo);
612
613If you have requested a multi-pass operating mode, such as 2-pass color
614quantization, jpeg_start_decompress() will do everything needed before data
615output can begin. In this case jpeg_start_decompress() may take quite a while
Thomas G. Lanebc79e061995-08-02 00:00:00 +0000616to complete. With a single-scan (non progressive) JPEG file and default
Thomas G. Lane36a4ccc1994-09-24 00:00:00 +0000617decompression parameters, this will not happen; jpeg_start_decompress() will
618return quickly.
619
620After this call, the final output image dimensions, including any requested
621scaling, are available in the JPEG object; so is the selected colormap, if
622colormapped output has been requested. Useful fields include
623
624 output_width image width and height, as scaled
625 output_height
626 out_color_components # of color components in out_color_space
627 output_components # of color components returned per pixel
628 colormap the selected colormap, if any
629 actual_number_of_colors number of entries in colormap
630
631output_components is 1 (a colormap index) when quantizing colors; otherwise it
632equals out_color_components. It is the number of JSAMPLE values that will be
633emitted per pixel in the output arrays.
634
635Typically you will need to allocate data buffers to hold the incoming image.
636You will need output_width * output_components JSAMPLEs per scanline in your
637output buffer, and a total of output_height scanlines will be returned.
638
639Note: if you are using the JPEG library's internal memory manager to allocate
640data buffers (as djpeg does), then the manager's protocol requires that you
641request large buffers *before* calling jpeg_start_decompress(). This is a
642little tricky since the output_XXX fields are not normally valid then. You
643can make them valid by calling jpeg_calc_output_dimensions() after setting the
644relevant parameters (scaling, output color space, and quantization flag).
645
646
6476. while (scan lines remain to be read)
648 jpeg_read_scanlines(...);
649
650Now you can read the decompressed image data by calling jpeg_read_scanlines()
651one or more times. At each call, you pass in the maximum number of scanlines
652to be read (ie, the height of your working buffer); jpeg_read_scanlines()
653will return up to that many lines. The return value is the number of lines
654actually read. The format of the returned data is discussed under "Data
Thomas G. Lanea8b67c41995-03-15 00:00:00 +0000655formats", above. Don't forget that grayscale and color JPEGs will return
656different data formats!
Thomas G. Lane36a4ccc1994-09-24 00:00:00 +0000657
658Image data is returned in top-to-bottom scanline order. If you must write
659out the image in bottom-to-top order, you can use the JPEG library's virtual
660array mechanism to invert the data efficiently. Examples of this can be
661found in the sample application djpeg.
662
663The library maintains a count of the number of scanlines returned so far
664in the output_scanline field of the JPEG object. Usually you can just use
665this variable as the loop counter, so that the loop test looks like
666"while (cinfo.output_scanline < cinfo.output_height)". (Note that the test
667should NOT be against image_height, unless you never use scaling. The
668image_height field is the height of the original unscaled image.)
Thomas G. Lane9ba2f5e1994-12-07 00:00:00 +0000669The return value always equals the change in the value of output_scanline.
Thomas G. Lane36a4ccc1994-09-24 00:00:00 +0000670
671If you don't use a suspending data source, it is safe to assume that
672jpeg_read_scanlines() reads at least one scanline per call, until the
Thomas G. Lane489583f1996-02-07 00:00:00 +0000673bottom of the image has been reached.
674
675If you use a buffer larger than one scanline, it is NOT safe to assume that
676jpeg_read_scanlines() fills it. (The current implementation won't return
677more than cinfo.rec_outbuf_height scanlines per call, no matter how large
678a buffer you pass.) So you must always provide a loop that calls
679jpeg_read_scanlines() repeatedly until the whole image has been read.
Thomas G. Lane36a4ccc1994-09-24 00:00:00 +0000680
681
6827. jpeg_finish_decompress(...);
683
684After all the image data has been read, call jpeg_finish_decompress() to
685complete the decompression cycle. This causes working memory associated
686with the JPEG object to be released.
687
688Typical code:
689
690 jpeg_finish_decompress(&cinfo);
691
692If using the stdio source manager, don't forget to close the source stdio
693stream if necessary.
694
695It is an error to call jpeg_finish_decompress() before reading the correct
696total number of scanlines. If you wish to abort compression, call
697jpeg_abort() as discussed below.
698
699After completing a decompression cycle, you may dispose of the JPEG object as
700discussed next, or you may use it to decompress another image. In that case
701return to step 2 or 3 as appropriate. If you do not change the source
702manager, the next image will be read from the same source.
703
704
7058. Release the JPEG decompression object.
706
707When you are done with a JPEG decompression object, destroy it by calling
708jpeg_destroy_decompress() or jpeg_destroy(). The previous discussion of
709destroying compression objects applies here too.
710
711Typical code:
712
713 jpeg_destroy_decompress(&cinfo);
714
715
7169. Aborting.
717
718You can abort a decompression cycle by calling jpeg_destroy_decompress() or
719jpeg_destroy() if you don't need the JPEG object any more, or
720jpeg_abort_decompress() or jpeg_abort() if you want to reuse the object.
721The previous discussion of aborting compression cycles applies here too.
722
723
724Mechanics of usage: include files, linking, etc
725-----------------------------------------------
726
727Applications using the JPEG library should include the header file jpeglib.h
728to obtain declarations of data types and routines. Before including
729jpeglib.h, include system headers that define at least the typedefs FILE and
730size_t. On ANSI-conforming systems, including <stdio.h> is sufficient; on
731older Unix systems, you may need <sys/types.h> to define size_t.
732
733If the application needs to refer to individual JPEG library error codes, also
734include jerror.h to define those symbols.
735
736jpeglib.h indirectly includes the files jconfig.h and jmorecfg.h. If you are
737installing the JPEG header files in a system directory, you will want to
738install all four files: jpeglib.h, jerror.h, jconfig.h, jmorecfg.h.
739
740The most convenient way to include the JPEG code into your executable program
741is to prepare a library file ("libjpeg.a", or a corresponding name on non-Unix
742machines) and reference it at your link step. If you use only half of the
743library (only compression or only decompression), only that much code will be
744included from the library, unless your linker is hopelessly brain-damaged.
745The supplied makefiles build libjpeg.a automatically (see install.doc).
746
747On some systems your application may need to set up a signal handler to ensure
748that temporary files are deleted if the program is interrupted. This is most
749critical if you are on MS-DOS and use the jmemdos.c memory manager back end;
750it will try to grab extended memory for temp files, and that space will NOT be
751freed automatically. See cjpeg.c or djpeg.c for an example signal handler.
752
753It may be worth pointing out that the core JPEG library does not actually
754require the stdio library: only the default source/destination managers and
755error handler need it. You can use the library in a stdio-less environment
756if you replace those modules and use jmemnobs.c (or another memory manager of
757your own devising). More info about the minimum system library requirements
758may be found in jinclude.h.
759
760
761ADVANCED FEATURES
762=================
763
764Compression parameter selection
765-------------------------------
766
767This section describes all the optional parameters you can set for JPEG
768compression, as well as the "helper" routines provided to assist in this
769task. Proper setting of some parameters requires detailed understanding
770of the JPEG standard; if you don't know what a parameter is for, it's best
771not to mess with it! See REFERENCES in the README file for pointers to
772more info about JPEG.
773
774It's a good idea to call jpeg_set_defaults() first, even if you plan to set
775all the parameters; that way your code is more likely to work with future JPEG
776libraries that have additional parameters. For the same reason, we recommend
777you use a helper routine where one is provided, in preference to twiddling
778cinfo fields directly.
779
780The helper routines are:
781
782jpeg_set_defaults (j_compress_ptr cinfo)
783 This routine sets all JPEG parameters to reasonable defaults, using
784 only the input image's color space (field in_color_space, which must
785 already be set in cinfo). Many applications will only need to use
786 this routine and perhaps jpeg_set_quality().
787
788jpeg_set_colorspace (j_compress_ptr cinfo, J_COLOR_SPACE colorspace)
789 Sets the JPEG file's colorspace (field jpeg_color_space) as specified,
790 and sets other color-space-dependent parameters appropriately. See
791 "Special color spaces", below, before using this. A large number of
792 parameters, including all per-component parameters, are set by this
793 routine; if you want to twiddle individual parameters you should call
794 jpeg_set_colorspace() before rather than after.
795
796jpeg_default_colorspace (j_compress_ptr cinfo)
797 Selects an appropriate JPEG colorspace based on cinfo->in_color_space,
798 and calls jpeg_set_colorspace(). This is actually a subroutine of
799 jpeg_set_defaults(). It's broken out in case you want to change
800 just the colorspace-dependent JPEG parameters.
801
802jpeg_set_quality (j_compress_ptr cinfo, int quality, boolean force_baseline)
803 Constructs JPEG quantization tables appropriate for the indicated
804 quality setting. The quality value is expressed on the 0..100 scale
805 recommended by IJG (cjpeg's "-quality" switch uses this routine).
806 Note that the exact mapping from quality values to tables may change
807 in future IJG releases as more is learned about DCT quantization.
808 If the force_baseline parameter is TRUE, then the quantization table
809 entries are constrained to the range 1..255 for full JPEG baseline
810 compatibility. In the current implementation, this only makes a
811 difference for quality settings below 25, and it effectively prevents
812 very small/low quality files from being generated. The IJG decoder
813 is capable of reading the non-baseline files generated at low quality
814 settings when force_baseline is FALSE, but other decoders may not be.
815
816jpeg_set_linear_quality (j_compress_ptr cinfo, int scale_factor,
817 boolean force_baseline)
818 Same as jpeg_set_quality() except that the generated tables are the
819 sample tables given in the JPEC spec section K.1, multiplied by the
820 specified scale factor (which is expressed as a percentage; thus
821 scale_factor = 100 reproduces the spec's tables). Note that larger
822 scale factors give lower quality. This entry point is useful for
823 conforming to the Adobe PostScript DCT conventions, but we do not
824 recommend linear scaling as a user-visible quality scale otherwise.
825 force_baseline again constrains the computed table entries to 1..255.
826
827int jpeg_quality_scaling (int quality)
828 Converts a value on the IJG-recommended quality scale to a linear
829 scaling percentage. Note that this routine may change or go away
830 in future releases --- IJG may choose to adopt a scaling method that
831 can't be expressed as a simple scalar multiplier, in which case the
832 premise of this routine collapses. Caveat user.
833
834jpeg_add_quant_table (j_compress_ptr cinfo, int which_tbl,
835 const unsigned int *basic_table,
Thomas G. Lanebc79e061995-08-02 00:00:00 +0000836 int scale_factor, boolean force_baseline)
Thomas G. Lane36a4ccc1994-09-24 00:00:00 +0000837 Allows an arbitrary quantization table to be created. which_tbl
838 indicates which table slot to fill. basic_table points to an array
Thomas G. Lane489583f1996-02-07 00:00:00 +0000839 of 64 unsigned ints given in normal array order. These values are
Thomas G. Lane36a4ccc1994-09-24 00:00:00 +0000840 multiplied by scale_factor/100 and then clamped to the range 1..65535
841 (or to 1..255 if force_baseline is TRUE).
Thomas G. Lane489583f1996-02-07 00:00:00 +0000842 CAUTION: prior to library version 6a, jpeg_add_quant_table expected
843 the basic table to be given in JPEG zigzag order. If you need to
844 write code that works with either older or newer versions of this
845 routine, you must check the library version number. Something like
846 "#if JPEG_LIB_VERSION >= 61" is the right test.
Thomas G. Lane36a4ccc1994-09-24 00:00:00 +0000847
Thomas G. Lanebc79e061995-08-02 00:00:00 +0000848jpeg_simple_progression (j_compress_ptr cinfo)
849 Generates a default scan script for writing a progressive-JPEG file.
850 This is the recommended method of creating a progressive file,
851 unless you want to make a custom scan sequence. You must ensure that
852 the JPEG color space is set correctly before calling this routine.
853
Thomas G. Lane36a4ccc1994-09-24 00:00:00 +0000854
855Compression parameters (cinfo fields) include:
856
Thomas G. Lane36a4ccc1994-09-24 00:00:00 +0000857J_DCT_METHOD dct_method
858 Selects the algorithm used for the DCT step. Choices are:
859 JDCT_ISLOW: slow but accurate integer algorithm
860 JDCT_IFAST: faster, less accurate integer method
861 JDCT_FLOAT: floating-point method
862 JDCT_DEFAULT: default method (normally JDCT_ISLOW)
863 JDCT_FASTEST: fastest method (normally JDCT_IFAST)
Thomas G. Lanea8b67c41995-03-15 00:00:00 +0000864 The FLOAT method is very slightly more accurate than the ISLOW method,
865 but may give different results on different machines due to varying
866 roundoff behavior. The integer methods should give the same results
867 on all machines. On machines with sufficiently fast FP hardware, the
Thomas G. Lane36a4ccc1994-09-24 00:00:00 +0000868 floating-point method may also be the fastest. The IFAST method is
869 considerably less accurate than the other two; its use is not
870 recommended if high quality is a concern. JDCT_DEFAULT and
871 JDCT_FASTEST are macros configurable by each installation.
872
Thomas G. Lanebc79e061995-08-02 00:00:00 +0000873J_COLOR_SPACE jpeg_color_space
874int num_components
875 The JPEG color space and corresponding number of components; see
876 "Special color spaces", below, for more info. We recommend using
877 jpeg_set_color_space() if you want to change these.
878
879boolean optimize_coding
880 TRUE causes the compressor to compute optimal Huffman coding tables
881 for the image. This requires an extra pass over the data and
882 therefore costs a good deal of space and time. The default is
883 FALSE, which tells the compressor to use the supplied or default
884 Huffman tables. In most cases optimal tables save only a few percent
885 of file size compared to the default tables. Note that when this is
886 TRUE, you need not supply Huffman tables at all, and any you do
887 supply will be overwritten.
888
Thomas G. Lane36a4ccc1994-09-24 00:00:00 +0000889unsigned int restart_interval
890int restart_in_rows
891 To emit restart markers in the JPEG file, set one of these nonzero.
892 Set restart_interval to specify the exact interval in MCU blocks.
893 Set restart_in_rows to specify the interval in MCU rows. (If
894 restart_in_rows is not 0, then restart_interval is set after the
895 image width in MCUs is computed.) Defaults are zero (no restarts).
896
Thomas G. Lanebc79e061995-08-02 00:00:00 +0000897const jpeg_scan_info * scan_info
898int num_scans
899 By default, scan_info is NULL; this causes the compressor to write a
900 single-scan sequential JPEG file. If not NULL, scan_info points to
901 an array of scan definition records of length num_scans. The
902 compressor will then write a JPEG file having one scan for each scan
903 definition record. This is used to generate noninterleaved or
904 progressive JPEG files. The library checks that the scan array
905 defines a valid JPEG scan sequence. (jpeg_simple_progression creates
906 a suitable scan definition array for progressive JPEG.) This is
907 discussed further under "Progressive JPEG support".
908
909int smoothing_factor
910 If non-zero, the input image is smoothed; the value should be 1 for
911 minimal smoothing to 100 for maximum smoothing. Consult jcsample.c
912 for details of the smoothing algorithm. The default is zero.
Thomas G. Lane36a4ccc1994-09-24 00:00:00 +0000913
914boolean write_JFIF_header
915 If TRUE, a JFIF APP0 marker is emitted. jpeg_set_defaults() and
916 jpeg_set_colorspace() set this TRUE if a JFIF-legal JPEG color space
917 (ie, YCbCr or grayscale) is selected, otherwise FALSE.
918
919UINT8 density_unit
920UINT16 X_density
921UINT16 Y_density
922 The resolution information to be written into the JFIF marker;
923 not used otherwise. density_unit may be 0 for unknown,
924 1 for dots/inch, or 2 for dots/cm. The default values are 0,1,1
925 indicating square pixels of unknown size.
926
927boolean write_Adobe_marker
928 If TRUE, an Adobe APP14 marker is emitted. jpeg_set_defaults() and
929 jpeg_set_colorspace() set this TRUE if JPEG color space RGB, CMYK,
930 or YCCK is selected, otherwise FALSE. It is generally a bad idea
931 to set both write_JFIF_header and write_Adobe_marker. In fact,
932 you probably shouldn't change the default settings at all --- the
933 default behavior ensures that the JPEG file's color space can be
934 recognized by the decoder.
935
936JQUANT_TBL * quant_tbl_ptrs[NUM_QUANT_TBLS]
937 Pointers to coefficient quantization tables, one per table slot,
938 or NULL if no table is defined for a slot. Usually these should
939 be set via one of the above helper routines; jpeg_add_quant_table()
940 is general enough to define any quantization table. The other
941 routines will set up table slot 0 for luminance quality and table
942 slot 1 for chrominance.
943
944JHUFF_TBL * dc_huff_tbl_ptrs[NUM_HUFF_TBLS]
945JHUFF_TBL * ac_huff_tbl_ptrs[NUM_HUFF_TBLS]
946 Pointers to Huffman coding tables, one per table slot, or NULL if
947 no table is defined for a slot. Slots 0 and 1 are filled with the
948 JPEG sample tables by jpeg_set_defaults(). If you need to allocate
949 more table structures, jpeg_alloc_huff_table() may be used.
950 Note that optimal Huffman tables can be computed for an image
951 by setting optimize_coding, as discussed above; there's seldom
952 any need to mess with providing your own Huffman tables.
953
954There are some additional cinfo fields which are not documented here
955because you currently can't change them; for example, you can't set
956arith_code TRUE because arithmetic coding is unsupported.
957
958
959Per-component parameters are stored in the struct cinfo.comp_info[i] for
960component number i. Note that components here refer to components of the
961JPEG color space, *not* the source image color space. A suitably large
962comp_info[] array is allocated by jpeg_set_defaults(); if you choose not
963to use that routine, it's up to you to allocate the array.
964
965int component_id
966 The one-byte identifier code to be recorded in the JPEG file for
967 this component. For the standard color spaces, we recommend you
968 leave the default values alone.
969
970int h_samp_factor
971int v_samp_factor
972 Horizontal and vertical sampling factors for the component; must
973 be 1..4 according to the JPEG standard. Note that larger sampling
974 factors indicate a higher-resolution component; many people find
975 this behavior quite unintuitive. The default values are 2,2 for
976 luminance components and 1,1 for chrominance components, except
977 for grayscale where 1,1 is used.
978
979int quant_tbl_no
980 Quantization table number for component. The default value is
981 0 for luminance components and 1 for chrominance components.
982
983int dc_tbl_no
984int ac_tbl_no
985 DC and AC entropy coding table numbers. The default values are
986 0 for luminance components and 1 for chrominance components.
987
988int component_index
Thomas G. Lanebc79e061995-08-02 00:00:00 +0000989 Must equal the component's index in comp_info[]. (Beginning in
990 release v6, the compressor library will fill this in automatically;
991 you don't have to.)
Thomas G. Lane36a4ccc1994-09-24 00:00:00 +0000992
993
994Decompression parameter selection
995---------------------------------
996
997Decompression parameter selection is somewhat simpler than compression
998parameter selection, since all of the JPEG internal parameters are
999recorded in the source file and need not be supplied by the application.
1000(Unless you are working with abbreviated files, in which case see
1001"Abbreviated datastreams", below.) Decompression parameters control
1002the postprocessing done on the image to deliver it in a format suitable
1003for the application's use. Many of the parameters control speed/quality
1004tradeoffs, in which faster decompression may be obtained at the price of
1005a poorer-quality image. The defaults select the highest quality (slowest)
1006processing.
1007
1008The following fields in the JPEG object are set by jpeg_read_header() and
1009may be useful to the application in choosing decompression parameters:
1010
1011JDIMENSION image_width Width and height of image
1012JDIMENSION image_height
1013int num_components Number of color components
1014J_COLOR_SPACE jpeg_color_space Colorspace of image
1015boolean saw_JFIF_marker TRUE if a JFIF APP0 marker was seen
1016 UINT8 density_unit Resolution data from JFIF marker
1017 UINT16 X_density
1018 UINT16 Y_density
1019boolean saw_Adobe_marker TRUE if an Adobe APP14 marker was seen
1020 UINT8 Adobe_transform Color transform code from Adobe marker
1021
1022The JPEG color space, unfortunately, is something of a guess since the JPEG
1023standard proper does not provide a way to record it. In practice most files
1024adhere to the JFIF or Adobe conventions, and the decoder will recognize these
1025correctly. See "Special color spaces", below, for more info.
1026
1027
1028The decompression parameters that determine the basic properties of the
1029returned image are:
1030
1031J_COLOR_SPACE out_color_space
1032 Output color space. jpeg_read_header() sets an appropriate default
1033 based on jpeg_color_space; typically it will be RGB or grayscale.
1034 The application can change this field to request output in a different
1035 colorspace. For example, set it to JCS_GRAYSCALE to get grayscale
1036 output from a color file. (This is useful for previewing: grayscale
1037 output is faster than full color since the color components need not
1038 be processed.) Note that not all possible color space transforms are
1039 currently implemented; you may need to extend jdcolor.c if you want an
1040 unusual conversion.
1041
1042unsigned int scale_num, scale_denom
1043 Scale the image by the fraction scale_num/scale_denom. Default is
1044 1/1, or no scaling. Currently, the only supported scaling ratios
1045 are 1/1, 1/2, 1/4, and 1/8. (The library design allows for arbitrary
1046 scaling ratios but this is not likely to be implemented any time soon.)
1047 Smaller scaling ratios permit significantly faster decoding since
1048 fewer pixels need be processed and a simpler IDCT method can be used.
1049
1050boolean quantize_colors
1051 If set TRUE, colormapped output will be delivered. Default is FALSE,
1052 meaning that full-color output will be delivered.
1053
1054The next three parameters are relevant only if quantize_colors is TRUE.
1055
1056int desired_number_of_colors
1057 Maximum number of colors to use in generating a library-supplied color
1058 map (the actual number of colors is returned in a different field).
1059 Default 256. Ignored when the application supplies its own color map.
1060
1061boolean two_pass_quantize
1062 If TRUE, an extra pass over the image is made to select a custom color
1063 map for the image. This usually looks a lot better than the one-size-
1064 fits-all colormap that is used otherwise. Default is TRUE. Ignored
1065 when the application supplies its own color map.
1066
1067J_DITHER_MODE dither_mode
1068 Selects color dithering method. Supported values are:
1069 JDITHER_NONE no dithering: fast, very low quality
1070 JDITHER_ORDERED ordered dither: moderate speed and quality
1071 JDITHER_FS Floyd-Steinberg dither: slow, high quality
1072 Default is JDITHER_FS. (At present, ordered dither is implemented
1073 only in the single-pass, standard-colormap case. If you ask for
1074 ordered dither when two_pass_quantize is TRUE or when you supply
1075 an external color map, you'll get F-S dithering.)
1076
1077When quantize_colors is TRUE, the target color map is described by the next
1078two fields. colormap is set to NULL by jpeg_read_header(). The application
1079can supply a color map by setting colormap non-NULL and setting
1080actual_number_of_colors to the map size. Otherwise, jpeg_start_decompress()
1081selects a suitable color map and sets these two fields itself.
1082[Implementation restriction: at present, an externally supplied colormap is
1083only accepted for 3-component output color spaces.]
1084
1085JSAMPARRAY colormap
1086 The color map, represented as a 2-D pixel array of out_color_components
1087 rows and actual_number_of_colors columns. Ignored if not quantizing.
Thomas G. Lanea8b67c41995-03-15 00:00:00 +00001088 CAUTION: if the JPEG library creates its own colormap, the storage
1089 pointed to by this field is released by jpeg_finish_decompress().
1090 Copy the colormap somewhere else first, if you want to save it.
Thomas G. Lane36a4ccc1994-09-24 00:00:00 +00001091
1092int actual_number_of_colors
1093 The number of colors in the color map.
1094
1095Additional decompression parameters that the application may set include:
1096
1097J_DCT_METHOD dct_method
1098 Selects the algorithm used for the DCT step. Choices are the same
1099 as described above for compression.
1100
1101boolean do_fancy_upsampling
1102 If TRUE, do careful upsampling of chroma components. If FALSE,
1103 a faster but sloppier method is used. Default is TRUE. The visual
1104 impact of the sloppier method is often very small.
1105
Thomas G. Lanebc79e061995-08-02 00:00:00 +00001106boolean do_block_smoothing
1107 If TRUE, interblock smoothing is applied in early stages of decoding
1108 progressive JPEG files; if FALSE, not. Default is TRUE. Early
1109 progression stages look "fuzzy" with smoothing, "blocky" without.
1110 In any case, block smoothing ceases to be applied after the first few
1111 AC coefficients are known to full accuracy, so it is relevant only
1112 when using buffered-image mode for progressive images.
1113
1114boolean enable_1pass_quant
1115boolean enable_external_quant
1116boolean enable_2pass_quant
1117 These are significant only in buffered-image mode, which is
1118 described in its own section below.
1119
Thomas G. Lane36a4ccc1994-09-24 00:00:00 +00001120
1121The output image dimensions are given by the following fields. These are
1122computed from the source image dimensions and the decompression parameters
1123by jpeg_start_decompress(). You can also call jpeg_calc_output_dimensions()
1124to obtain the values that will result from the current parameter settings.
1125This can be useful if you are trying to pick a scaling ratio that will get
1126close to a desired target size. It's also important if you are using the
1127JPEG library's memory manager to allocate output buffer space, because you
1128are supposed to request such buffers *before* jpeg_start_decompress().
1129
1130JDIMENSION output_width Actual dimensions of output image.
1131JDIMENSION output_height
1132int out_color_components Number of color components in out_color_space.
1133int output_components Number of color components returned.
1134int rec_outbuf_height Recommended height of scanline buffer.
1135
1136When quantizing colors, output_components is 1, indicating a single color map
1137index per pixel. Otherwise it equals out_color_components. The output arrays
1138are required to be output_width * output_components JSAMPLEs wide.
1139
1140rec_outbuf_height is the recommended minimum height (in scanlines) of the
1141buffer passed to jpeg_read_scanlines(). If the buffer is smaller, the
1142library will still work, but time will be wasted due to unnecessary data
1143copying. In high-quality modes, rec_outbuf_height is always 1, but some
1144faster, lower-quality modes set it to larger values (typically 2 to 4).
1145If you are going to ask for a high-speed processing mode, you may as well
1146go to the trouble of honoring rec_outbuf_height so as to avoid data copying.
1147
1148
1149Special color spaces
1150--------------------
1151
1152The JPEG standard itself is "color blind" and doesn't specify any particular
1153color space. It is customary to convert color data to a luminance/chrominance
1154color space before compressing, since this permits greater compression. The
1155existing de-facto JPEG file format standards specify YCbCr or grayscale data
1156(JFIF), or grayscale, RGB, YCbCr, CMYK, or YCCK (Adobe). For special
1157applications such as multispectral images, other color spaces can be used,
1158but it must be understood that such files will be unportable.
1159
1160The JPEG library can handle the most common colorspace conversions (namely
1161RGB <=> YCbCr and CMYK <=> YCCK). It can also deal with data of an unknown
1162color space, passing it through without conversion. If you deal extensively
1163with an unusual color space, you can easily extend the library to understand
1164additional color spaces and perform appropriate conversions.
1165
1166For compression, the source data's color space is specified by field
1167in_color_space. This is transformed to the JPEG file's color space given
1168by jpeg_color_space. jpeg_set_defaults() chooses a reasonable JPEG color
1169space depending on in_color_space, but you can override this by calling
1170jpeg_set_colorspace(). Of course you must select a supported transformation.
1171jccolor.c currently supports the following transformations:
1172 RGB => YCbCr
1173 RGB => GRAYSCALE
1174 YCbCr => GRAYSCALE
1175 CMYK => YCCK
1176plus the null transforms: GRAYSCALE => GRAYSCALE, RGB => RGB,
1177YCbCr => YCbCr, CMYK => CMYK, YCCK => YCCK, and UNKNOWN => UNKNOWN.
1178
1179The de-facto file format standards (JFIF and Adobe) specify APPn markers that
1180indicate the color space of the JPEG file. It is important to ensure that
1181these are written correctly, or omitted if the JPEG file's color space is not
1182one of the ones supported by the de-facto standards. jpeg_set_colorspace()
1183will set the compression parameters to include or omit the APPn markers
1184properly, so long as it is told the truth about the JPEG color space.
1185For example, if you are writing some random 3-component color space without
1186conversion, don't try to fake out the library by setting in_color_space and
1187jpeg_color_space to JCS_YCbCr; use JCS_UNKNOWN. You may want to write an
1188APPn marker of your own devising to identify the colorspace --- see "Special
1189markers", below.
1190
1191When told that the color space is UNKNOWN, the library will default to using
1192luminance-quality compression parameters for all color components. You may
1193well want to change these parameters. See the source code for
1194jpeg_set_colorspace(), in jcparam.c, for details.
1195
1196For decompression, the JPEG file's color space is given in jpeg_color_space,
1197and this is transformed to the output color space out_color_space.
1198jpeg_read_header's setting of jpeg_color_space can be relied on if the file
1199conforms to JFIF or Adobe conventions, but otherwise it is no better than a
1200guess. If you know the JPEG file's color space for certain, you can override
1201jpeg_read_header's guess by setting jpeg_color_space. jpeg_read_header also
1202selects a default output color space based on (its guess of) jpeg_color_space;
1203set out_color_space to override this. Again, you must select a supported
1204transformation. jdcolor.c currently supports
1205 YCbCr => GRAYSCALE
1206 YCbCr => RGB
1207 YCCK => CMYK
1208as well as the null transforms.
1209
1210The two-pass color quantizer, jquant2.c, is specialized to handle RGB data
1211(it weights distances appropriately for RGB colors). You'll need to modify
1212the code if you want to use it for non-RGB output color spaces. Note that
1213jquant2.c is used to map to an application-supplied colormap as well as for
1214the normal two-pass colormap selection process.
1215
1216CAUTION: it appears that Adobe Photoshop writes inverted data in CMYK JPEG
1217files: 0 represents 100% ink coverage, rather than 0% ink as you'd expect.
1218This is arguably a bug in Photoshop, but if you need to work with Photoshop
1219CMYK files, you will have to deal with it in your application. We cannot
1220"fix" this in the library by inverting the data during the CMYK<=>YCCK
1221transform, because that would break other applications, notably Ghostscript.
1222Photoshop versions prior to 3.0 write EPS files containing JPEG-encoded CMYK
1223data in the same inverted-YCCK representation used in bare JPEG files, but
1224the surrounding PostScript code performs an inversion using the PS image
1225operator. I am told that Photoshop 3.0 will write uninverted YCCK in
1226EPS/JPEG files, and will omit the PS-level inversion. (But the data
1227polarity used in bare JPEG files will not change in 3.0.) In either case,
1228the JPEG library must not invert the data itself, or else Ghostscript would
1229read these EPS files incorrectly.
1230
1231
1232Error handling
1233--------------
1234
1235When the default error handler is used, any error detected inside the JPEG
1236routines will cause a message to be printed on stderr, followed by exit().
1237You can supply your own error handling routines to override this behavior
1238and to control the treatment of nonfatal warnings and trace/debug messages.
1239The file example.c illustrates the most common case, which is to have the
1240application regain control after an error rather than exiting.
1241
1242The JPEG library never writes any message directly; it always goes through
1243the error handling routines. Three classes of messages are recognized:
1244 * Fatal errors: the library cannot continue.
1245 * Warnings: the library can continue, but the data is corrupt, and a
1246 damaged output image is likely to result.
1247 * Trace/informational messages. These come with a trace level indicating
1248 the importance of the message; you can control the verbosity of the
1249 program by adjusting the maximum trace level that will be displayed.
1250
1251You may, if you wish, simply replace the entire JPEG error handling module
1252(jerror.c) with your own code. However, you can avoid code duplication by
1253only replacing some of the routines depending on the behavior you need.
1254This is accomplished by calling jpeg_std_error() as usual, but then overriding
1255some of the method pointers in the jpeg_error_mgr struct, as illustrated by
1256example.c.
1257
1258All of the error handling routines will receive a pointer to the JPEG object
1259(a j_common_ptr which points to either a jpeg_compress_struct or a
1260jpeg_decompress_struct; if you need to tell which, test the is_decompressor
1261field). This struct includes a pointer to the error manager struct in its
1262"err" field. Frequently, custom error handler routines will need to access
1263additional data which is not known to the JPEG library or the standard error
1264handler. The most convenient way to do this is to embed either the JPEG
1265object or the jpeg_error_mgr struct in a larger structure that contains
1266additional fields; then casting the passed pointer provides access to the
1267additional fields. Again, see example.c for one way to do it.
1268
1269The individual methods that you might wish to override are:
1270
1271error_exit (j_common_ptr cinfo)
1272 Receives control for a fatal error. Information sufficient to
1273 generate the error message has been stored in cinfo->err; call
1274 output_message to display it. Control must NOT return to the caller;
1275 generally this routine will exit() or longjmp() somewhere.
1276 Typically you would override this routine to get rid of the exit()
1277 default behavior. Note that if you continue processing, you should
1278 clean up the JPEG object with jpeg_abort() or jpeg_destroy().
1279
1280output_message (j_common_ptr cinfo)
1281 Actual output of any JPEG message. Override this to send messages
1282 somewhere other than stderr. Note that this method does not know
1283 how to generate a message, only where to send it.
1284
1285format_message (j_common_ptr cinfo, char * buffer)
1286 Constructs a readable error message string based on the error info
1287 stored in cinfo->err. This method is called by output_message. Few
1288 applications should need to override this method. One possible
1289 reason for doing so is to implement dynamic switching of error message
1290 language.
1291
1292emit_message (j_common_ptr cinfo, int msg_level)
1293 Decide whether or not to emit a warning or trace message; if so,
1294 calls output_message. The main reason for overriding this method
1295 would be to abort on warnings. msg_level is -1 for warnings,
1296 0 and up for trace messages.
1297
1298Only error_exit() and emit_message() are called from the rest of the JPEG
1299library; the other two are internal to the error handler.
1300
1301The actual message texts are stored in an array of strings which is pointed to
1302by the field err->jpeg_message_table. The messages are numbered from 0 to
1303err->last_jpeg_message, and it is these code numbers that are used in the
1304JPEG library code. You could replace the message texts (for instance, with
1305messages in French or German) by changing the message table pointer. See
1306jerror.h for the default texts. CAUTION: this table will almost certainly
1307change or grow from one library version to the next.
1308
1309It may be useful for an application to add its own message texts that are
1310handled by the same mechanism. The error handler supports a second "add-on"
1311message table for this purpose. To define an addon table, set the pointer
1312err->addon_message_table and the message numbers err->first_addon_message and
1313err->last_addon_message. If you number the addon messages beginning at 1000
1314or so, you won't have to worry about conflicts with the library's built-in
1315messages. See the sample applications cjpeg/djpeg for an example of using
1316addon messages (the addon messages are defined in cderror.h).
1317
1318Actual invocation of the error handler is done via macros defined in jerror.h:
1319 ERREXITn(...) for fatal errors
1320 WARNMSn(...) for corrupt-data warnings
1321 TRACEMSn(...) for trace and informational messages.
1322These macros store the message code and any additional parameters into the
1323error handler struct, then invoke the error_exit() or emit_message() method.
1324The variants of each macro are for varying numbers of additional parameters.
1325The additional parameters are inserted into the generated message using
1326standard printf() format codes.
1327
1328See jerror.h and jerror.c for further details.
1329
1330
1331Compressed data handling (source and destination managers)
1332----------------------------------------------------------
1333
1334The JPEG compression library sends its compressed data to a "destination
1335manager" module. The default destination manager just writes the data to a
1336stdio stream, but you can provide your own manager to do something else.
1337Similarly, the decompression library calls a "source manager" to obtain the
1338compressed data; you can provide your own source manager if you want the data
1339to come from somewhere other than a stdio stream.
1340
1341In both cases, compressed data is processed a bufferload at a time: the
1342destination or source manager provides a work buffer, and the library invokes
1343the manager only when the buffer is filled or emptied. (You could define a
1344one-character buffer to force the manager to be invoked for each byte, but
1345that would be rather inefficient.) The buffer's size and location are
1346controlled by the manager, not by the library. For example, if you desired to
1347decompress a JPEG datastream that was all in memory, you could just make the
1348buffer pointer and length point to the original data in memory. Then the
1349buffer-reload procedure would be invoked only if the decompressor ran off the
1350end of the datastream, which would indicate an erroneous datastream.
1351
1352The work buffer is defined as an array of datatype JOCTET, which is generally
1353"char" or "unsigned char". On a machine where char is not exactly 8 bits
1354wide, you must define JOCTET as a wider data type and then modify the data
1355source and destination modules to transcribe the work arrays into 8-bit units
1356on external storage.
1357
1358A data destination manager struct contains a pointer and count defining the
1359next byte to write in the work buffer and the remaining free space:
1360
1361 JOCTET * next_output_byte; /* => next byte to write in buffer */
1362 size_t free_in_buffer; /* # of byte spaces remaining in buffer */
1363
1364The library increments the pointer and decrements the count until the buffer
1365is filled. The manager's empty_output_buffer method must reset the pointer
1366and count. The manager is expected to remember the buffer's starting address
1367and total size in private fields not visible to the library.
1368
1369A data destination manager provides three methods:
1370
1371init_destination (j_compress_ptr cinfo)
1372 Initialize destination. This is called by jpeg_start_compress()
1373 before any data is actually written. It must initialize
1374 next_output_byte and free_in_buffer. free_in_buffer must be
1375 initialized to a positive value.
1376
1377empty_output_buffer (j_compress_ptr cinfo)
1378 This is called whenever the buffer has filled (free_in_buffer
1379 reaches zero). In typical applications, it should write out the
1380 *entire* buffer (use the saved start address and buffer length;
1381 ignore the current state of next_output_byte and free_in_buffer).
1382 Then reset the pointer & count to the start of the buffer, and
1383 return TRUE indicating that the buffer has been dumped.
1384 free_in_buffer must be set to a positive value when TRUE is
1385 returned. A FALSE return should only be used when I/O suspension is
1386 desired (this operating mode is discussed in the next section).
1387
1388term_destination (j_compress_ptr cinfo)
1389 Terminate destination --- called by jpeg_finish_compress() after all
1390 data has been written. In most applications, this must flush any
1391 data remaining in the buffer. Use either next_output_byte or
1392 free_in_buffer to determine how much data is in the buffer.
1393
1394term_destination() is NOT called by jpeg_abort() or jpeg_destroy(). If you
1395want the destination manager to be cleaned up during an abort, you must do it
1396yourself.
1397
1398You will also need code to create a jpeg_destination_mgr struct, fill in its
1399method pointers, and insert a pointer to the struct into the "dest" field of
1400the JPEG compression object. This can be done in-line in your setup code if
1401you like, but it's probably cleaner to provide a separate routine similar to
1402the jpeg_stdio_dest() routine of the supplied destination manager.
1403
1404Decompression source managers follow a parallel design, but with some
1405additional frammishes. The source manager struct contains a pointer and count
1406defining the next byte to read from the work buffer and the number of bytes
1407remaining:
1408
1409 const JOCTET * next_input_byte; /* => next byte to read from buffer */
1410 size_t bytes_in_buffer; /* # of bytes remaining in buffer */
1411
1412The library increments the pointer and decrements the count until the buffer
1413is emptied. The manager's fill_input_buffer method must reset the pointer and
1414count. In most applications, the manager must remember the buffer's starting
1415address and total size in private fields not visible to the library.
1416
1417A data source manager provides five methods:
1418
1419init_source (j_decompress_ptr cinfo)
1420 Initialize source. This is called by jpeg_read_header() before any
1421 data is actually read. Unlike init_destination(), it may leave
1422 bytes_in_buffer set to 0 (in which case a fill_input_buffer() call
1423 will occur immediately).
1424
1425fill_input_buffer (j_decompress_ptr cinfo)
1426 This is called whenever bytes_in_buffer has reached zero and more
1427 data is wanted. In typical applications, it should read fresh data
1428 into the buffer (ignoring the current state of next_input_byte and
1429 bytes_in_buffer), reset the pointer & count to the start of the
1430 buffer, and return TRUE indicating that the buffer has been reloaded.
1431 It is not necessary to fill the buffer entirely, only to obtain at
1432 least one more byte. bytes_in_buffer MUST be set to a positive value
1433 if TRUE is returned. A FALSE return should only be used when I/O
1434 suspension is desired (this mode is discussed in the next section).
1435
1436skip_input_data (j_decompress_ptr cinfo, long num_bytes)
1437 Skip num_bytes worth of data. The buffer pointer and count should
1438 be advanced over num_bytes input bytes, refilling the buffer as
1439 needed. This is used to skip over a potentially large amount of
1440 uninteresting data (such as an APPn marker). In some applications
1441 it may be possible to optimize away the reading of the skipped data,
1442 but it's not clear that being smart is worth much trouble; large
1443 skips are uncommon. bytes_in_buffer may be zero on return.
1444 A zero or negative skip count should be treated as a no-op.
1445
Thomas G. Lanebc79e061995-08-02 00:00:00 +00001446resync_to_restart (j_decompress_ptr cinfo, int desired)
Thomas G. Lane36a4ccc1994-09-24 00:00:00 +00001447 This routine is called only when the decompressor has failed to find
1448 a restart (RSTn) marker where one is expected. Its mission is to
1449 find a suitable point for resuming decompression. For most
1450 applications, we recommend that you just use the default resync
1451 procedure, jpeg_resync_to_restart(). However, if you are able to back
1452 up in the input data stream, or if you have a-priori knowledge about
1453 the likely location of restart markers, you may be able to do better.
1454 Read the read_restart_marker() and jpeg_resync_to_restart() routines
1455 in jdmarker.c if you think you'd like to implement your own resync
1456 procedure.
1457
1458term_source (j_decompress_ptr cinfo)
1459 Terminate source --- called by jpeg_finish_decompress() after all
1460 data has been read. Often a no-op.
1461
1462For both fill_input_buffer() and skip_input_data(), there is no such thing
1463as an EOF return. If the end of the file has been reached, the routine has
1464a choice of exiting via ERREXIT() or inserting fake data into the buffer.
1465In most cases, generating a warning message and inserting a fake EOI marker
1466is the best course of action --- this will allow the decompressor to output
1467however much of the image is there. In pathological cases, the decompressor
1468may swallow the EOI and again demand data ... just keep feeding it fake EOIs.
1469jdatasrc.c illustrates the recommended error recovery behavior.
1470
1471term_source() is NOT called by jpeg_abort() or jpeg_destroy(). If you want
1472the source manager to be cleaned up during an abort, you must do it yourself.
1473
1474You will also need code to create a jpeg_source_mgr struct, fill in its method
1475pointers, and insert a pointer to the struct into the "src" field of the JPEG
1476decompression object. This can be done in-line in your setup code if you
1477like, but it's probably cleaner to provide a separate routine similar to the
1478jpeg_stdio_src() routine of the supplied source manager.
1479
1480For more information, consult the stdio source and destination managers
1481in jdatasrc.c and jdatadst.c.
1482
1483
1484I/O suspension
1485--------------
1486
1487Some applications need to use the JPEG library as an incremental memory-to-
1488memory filter: when the compressed data buffer is filled or emptied, they want
1489control to return to the outer loop, rather than expecting that the buffer can
Thomas G. Lanebc79e061995-08-02 00:00:00 +00001490be emptied or reloaded within the data source/destination manager subroutine.
Thomas G. Lane36a4ccc1994-09-24 00:00:00 +00001491The library supports this need by providing an "I/O suspension" mode, which we
1492describe in this section.
1493
Thomas G. Lanebc79e061995-08-02 00:00:00 +00001494The I/O suspension mode is not a panacea: nothing is guaranteed about the
1495maximum amount of time spent in any one call to the library, so it will not
1496eliminate response-time problems in single-threaded applications. If you
1497need guaranteed response time, we suggest you "bite the bullet" and implement
1498a real multi-tasking capability.
Thomas G. Lane36a4ccc1994-09-24 00:00:00 +00001499
1500To use I/O suspension, cooperation is needed between the calling application
1501and the data source or destination manager; you will always need a custom
1502source/destination manager. (Please read the previous section if you haven't
1503already.) The basic idea is that the empty_output_buffer() or
1504fill_input_buffer() routine is a no-op, merely returning FALSE to indicate
1505that it has done nothing. Upon seeing this, the JPEG library suspends
1506operation and returns to its caller. The surrounding application is
Thomas G. Lanebc79e061995-08-02 00:00:00 +00001507responsible for emptying or refilling the work buffer before calling the
1508JPEG library again.
Thomas G. Lane36a4ccc1994-09-24 00:00:00 +00001509
1510Compression suspension:
1511
Thomas G. Lanebc79e061995-08-02 00:00:00 +00001512For compression suspension, use an empty_output_buffer() routine that returns
1513FALSE; typically it will not do anything else. This will cause the
1514compressor to return to the caller of jpeg_write_scanlines(), with the return
1515value indicating that not all the supplied scanlines have been accepted.
1516The application must make more room in the output buffer, adjust the output
1517buffer pointer/count appropriately, and then call jpeg_write_scanlines()
Thomas G. Lane36a4ccc1994-09-24 00:00:00 +00001518again, pointing to the first unconsumed scanline.
1519
1520When forced to suspend, the compressor will backtrack to a convenient stopping
1521point (usually the start of the current MCU); it will regenerate some output
Thomas G. Lanebc79e061995-08-02 00:00:00 +00001522data when restarted. Therefore, although empty_output_buffer() is only
1523called when the buffer is filled, you should NOT write out the entire buffer
1524after a suspension. Write only the data up to the current position of
1525next_output_byte/free_in_buffer. The data beyond that point will be
1526regenerated after resumption.
Thomas G. Lane36a4ccc1994-09-24 00:00:00 +00001527
1528Because of the backtracking behavior, a good-size output buffer is essential
1529for efficiency; you don't want the compressor to suspend often. (In fact, an
1530overly small buffer could lead to infinite looping, if a single MCU required
1531more data than would fit in the buffer.) We recommend a buffer of at least
1532several Kbytes. You may want to insert explicit code to ensure that you don't
1533call jpeg_write_scanlines() unless there is a reasonable amount of space in
1534the output buffer; in other words, flush the buffer before trying to compress
1535more data.
1536
Thomas G. Lanebc79e061995-08-02 00:00:00 +00001537The compressor does not allow suspension while it is trying to write JPEG
1538markers at the beginning and end of the file. This means that:
Thomas G. Lane36a4ccc1994-09-24 00:00:00 +00001539 * At the beginning of a compression operation, there must be enough free
1540 space in the output buffer to hold the header markers (typically 600 or
1541 so bytes). The recommended buffer size is bigger than this anyway, so
1542 this is not a problem as long as you start with an empty buffer. However,
1543 this restriction might catch you if you insert large special markers, such
Thomas G. Lanebc79e061995-08-02 00:00:00 +00001544 as a JFIF thumbnail image, without flushing the buffer afterwards.
Thomas G. Lane36a4ccc1994-09-24 00:00:00 +00001545 * When you call jpeg_finish_compress(), there must be enough space in the
1546 output buffer to emit any buffered data and the final EOI marker. In the
1547 current implementation, half a dozen bytes should suffice for this, but
1548 for safety's sake we recommend ensuring that at least 100 bytes are free
1549 before calling jpeg_finish_compress().
Thomas G. Lanebc79e061995-08-02 00:00:00 +00001550
1551A more significant restriction is that jpeg_finish_compress() cannot suspend.
1552This means you cannot use suspension with multi-pass operating modes, namely
1553Huffman code optimization and multiple-scan output. Those modes write the
1554whole file during jpeg_finish_compress(), which will certainly result in
1555buffer overrun. (Note that this restriction applies only to compression,
1556not decompression. The decompressor supports input suspension in all of its
1557operating modes.)
Thomas G. Lane36a4ccc1994-09-24 00:00:00 +00001558
1559Decompression suspension:
1560
1561For decompression suspension, use a fill_input_buffer() routine that simply
1562returns FALSE (except perhaps during error recovery, as discussed below).
1563This will cause the decompressor to return to its caller with an indication
Thomas G. Lanebc79e061995-08-02 00:00:00 +00001564that suspension has occurred. This can happen at four places:
Thomas G. Lane36a4ccc1994-09-24 00:00:00 +00001565 * jpeg_read_header(): will return JPEG_SUSPENDED.
Thomas G. Lanebc79e061995-08-02 00:00:00 +00001566 * jpeg_start_decompress(): will return FALSE, rather than its usual TRUE.
Thomas G. Lane36a4ccc1994-09-24 00:00:00 +00001567 * jpeg_read_scanlines(): will return the number of scanlines already
1568 completed (possibly 0).
1569 * jpeg_finish_decompress(): will return FALSE, rather than its usual TRUE.
1570The surrounding application must recognize these cases, load more data into
1571the input buffer, and repeat the call. In the case of jpeg_read_scanlines(),
Thomas G. Lanebc79e061995-08-02 00:00:00 +00001572increment the passed pointers past any scanlines successfully read.
Thomas G. Lane36a4ccc1994-09-24 00:00:00 +00001573
1574Just as with compression, the decompressor will typically backtrack to a
Thomas G. Lanebc79e061995-08-02 00:00:00 +00001575convenient restart point before suspending. When fill_input_buffer() is
1576called, next_input_byte/bytes_in_buffer point to the current restart point,
1577which is where the decompressor will backtrack to if FALSE is returned.
1578The data beyond that position must NOT be discarded if you suspend; it needs
1579to be re-read upon resumption. In most implementations, you'll need to shift
1580this data down to the start of your work buffer and then load more data after
1581it. Again, this behavior means that a several-Kbyte work buffer is essential
1582for decent performance; furthermore, you should load a reasonable amount of
1583new data before resuming decompression. (If you loaded, say, only one new
1584byte each time around, you could waste a LOT of cycles.)
Thomas G. Lane36a4ccc1994-09-24 00:00:00 +00001585
1586The skip_input_data() source manager routine requires special care in a
1587suspension scenario. This routine is NOT granted the ability to suspend the
1588decompressor; it can decrement bytes_in_buffer to zero, but no more. If the
1589requested skip distance exceeds the amount of data currently in the input
1590buffer, then skip_input_data() must set bytes_in_buffer to zero and record the
1591additional skip distance somewhere else. The decompressor will immediately
Thomas G. Lanebc79e061995-08-02 00:00:00 +00001592call fill_input_buffer(), which should return FALSE, which will cause a
Thomas G. Lane36a4ccc1994-09-24 00:00:00 +00001593suspension return. The surrounding application must then arrange to discard
Thomas G. Lanebc79e061995-08-02 00:00:00 +00001594the recorded number of bytes before it resumes loading the input buffer.
1595(Yes, this design is rather baroque, but it avoids complexity in the far more
1596common case where a non-suspending source manager is used.)
Thomas G. Lane36a4ccc1994-09-24 00:00:00 +00001597
1598If the input data has been exhausted, we recommend that you emit a warning
1599and insert dummy EOI markers just as a non-suspending data source manager
1600would do. This can be handled either in the surrounding application logic or
1601within fill_input_buffer(); the latter is probably more efficient. If
1602fill_input_buffer() knows that no more data is available, it can set the
1603pointer/count to point to a dummy EOI marker and then return TRUE just as
1604though it had read more data in a non-suspending situation.
1605
Thomas G. Lane36a4ccc1994-09-24 00:00:00 +00001606The decompressor does not attempt to suspend within any JPEG marker; it will
1607backtrack to the start of the marker. Hence the input buffer must be large
1608enough to hold the longest marker in the file. We recommend at least a 2K
1609buffer. The buffer would need to be 64K to allow for arbitrary COM or APPn
1610markers, but the decompressor does not actually try to read these; it just
1611skips them by calling skip_input_data(). If you provide a special marker
1612handling routine that does look at such markers, coping with buffer overflow
1613is your problem. Ordinary JPEG markers should normally not exceed a few
1614hundred bytes each (DHT tables are typically the longest). For robustness
1615against damaged marker length counts, you may wish to insert a test in your
1616application for the case that the input buffer is completely full and yet the
1617decoder has suspended without consuming any data --- otherwise, if this
1618situation did occur, it would lead to an endless loop.
1619
Thomas G. Lane9ba2f5e1994-12-07 00:00:00 +00001620Multiple-buffer management:
1621
1622In some applications it is desirable to store the compressed data in a linked
1623list of buffer areas, so as to avoid data copying. This can be handled by
1624having empty_output_buffer() or fill_input_buffer() set the pointer and count
1625to reference the next available buffer; FALSE is returned only if no more
1626buffers are available. Although seemingly straightforward, there is a
1627pitfall in this approach: the backtrack that occurs when FALSE is returned
Thomas G. Lanebc79e061995-08-02 00:00:00 +00001628could back up into an earlier buffer. For example, when fill_input_buffer()
1629is called, the current pointer & count indicate the backtrack restart point.
1630Since fill_input_buffer() will set the pointer and count to refer to a new
1631buffer, the restart position must be saved somewhere else. Suppose a second
1632call to fill_input_buffer() occurs in the same library call, and no
1633additional input data is available, so fill_input_buffer must return FALSE.
1634If the JPEG library has not moved the pointer/count forward in the current
1635buffer, then *the correct restart point is the saved position in the prior
1636buffer*. Prior buffers may be discarded only after the library establishes
1637a restart point within a later buffer. Similar remarks apply for output into
1638a chain of buffers.
1639
1640The library will never attempt to backtrack over a skip_input_data() call,
1641so any skipped data can be permanently discarded. You still have to deal
1642with the case of skipping not-yet-received data, however.
1643
1644It's much simpler to use only a single buffer; when fill_input_buffer() is
1645called, move any unconsumed data (beyond the current pointer/count) down to
1646the beginning of this buffer and then load new data into the remaining buffer
1647space. This approach requires a little more data copying but is far easier
1648to get right.
1649
1650
1651Progressive JPEG support
1652------------------------
1653
1654Progressive JPEG rearranges the stored data into a series of scans of
1655increasing quality. In situations where a JPEG file is transmitted across a
1656slow communications link, a decoder can generate a low-quality image very
1657quickly from the first scan, then gradually improve the displayed quality as
1658more scans are received. The final image after all scans are complete is
1659identical to that of a regular (sequential) JPEG file of the same quality
1660setting. Progressive JPEG files are often slightly smaller than equivalent
1661sequential JPEG files, but the possibility of incremental display is the main
1662reason for using progressive JPEG.
1663
1664The IJG encoder library generates progressive JPEG files when given a
1665suitable "scan script" defining how to divide the data into scans.
1666Creation of progressive JPEG files is otherwise transparent to the encoder.
1667Progressive JPEG files can also be read transparently by the decoder library.
1668If the decoding application simply uses the library as defined above, it
1669will receive a final decoded image without any indication that the file was
1670progressive. Of course, this approach does not allow incremental display.
1671To perform incremental display, an application needs to use the decoder
1672library's "buffered-image" mode, in which it receives a decoded image
1673multiple times.
1674
1675Each displayed scan requires about as much work to decode as a full JPEG
1676image of the same size, so the decoder must be fairly fast in relation to the
1677data transmission rate in order to make incremental display useful. However,
1678it is possible to skip displaying the image and simply add the incoming bits
1679to the decoder's coefficient buffer. This is fast because only Huffman
1680decoding need be done, not IDCT, upsampling, colorspace conversion, etc.
1681The IJG decoder library allows the application to switch dynamically between
1682displaying the image and simply absorbing the incoming bits. A properly
1683coded application can automatically adapt the number of display passes to
1684suit the time available as the image is received. Also, a final
1685higher-quality display cycle can be performed from the buffered data after
1686the end of the file is reached.
1687
1688Progressive compression:
1689
1690To create a progressive JPEG file (or a multiple-scan sequential JPEG file),
1691set the scan_info cinfo field to point to an array of scan descriptors, and
1692perform compression as usual. Instead of constructing your own scan list,
1693you can call the jpeg_simple_progression() helper routine to create a
1694recommended progression sequence; this method should be used by all
1695applications that don't want to get involved in the nitty-gritty of
1696progressive scan sequence design. (If you want to provide user control of
1697scan sequences, you may wish to borrow the scan script reading code found
1698in rdswitch.c, so that you can read scan script files just like cjpeg's.)
1699When scan_info is not NULL, the compression library will store DCT'd data
1700into a buffer array as jpeg_write_scanlines() is called, and will emit all
1701the requested scans during jpeg_finish_compress(). This implies that
1702multiple-scan output cannot be created with a suspending data destination
1703manager, since jpeg_finish_compress() does not support suspension. We
1704should also note that the compressor currently forces Huffman optimization
1705mode when creating a progressive JPEG file, because the default Huffman
1706tables are unsuitable for progressive files.
1707
1708Progressive decompression:
1709
1710When buffered-image mode is not used, the decoder library will read all of
1711a multi-scan file during jpeg_start_decompress(), so that it can provide a
1712final decoded image. (Here "multi-scan" means either progressive or
1713multi-scan sequential.) This makes multi-scan files transparent to the
1714decoding application. However, existing applications that used suspending
1715input with version 5 of the IJG library will need to be modified to check
1716for a suspension return from jpeg_start_decompress().
1717
1718To perform incremental display, an application must use the library's
1719buffered-image mode. This is described in the next section.
1720
1721
1722Buffered-image mode
1723-------------------
1724
1725In buffered-image mode, the library stores the partially decoded image in a
1726coefficient buffer, from which it can be read out as many times as desired.
1727This mode is typically used for incremental display of progressive JPEG files,
1728but it can be used with any JPEG file. Each scan of a progressive JPEG file
1729adds more data (more detail) to the buffered image. The application can
1730display in lockstep with the source file (one display pass per input scan),
1731or it can allow input processing to outrun display processing. By making
1732input and display processing run independently, it is possible for the
1733application to adapt progressive display to a wide range of data transmission
1734rates.
1735
1736The basic control flow for buffered-image decoding is
1737
1738 jpeg_create_decompress()
1739 set data source
1740 jpeg_read_header()
1741 set overall decompression parameters
1742 cinfo.buffered_image = TRUE; /* select buffered-image mode */
1743 jpeg_start_decompress()
1744 for (each output pass) {
1745 adjust output decompression parameters if required
1746 jpeg_start_output() /* start a new output pass */
1747 for (all scanlines in image) {
1748 jpeg_read_scanlines()
1749 display scanlines
1750 }
1751 jpeg_finish_output() /* terminate output pass */
1752 }
1753 jpeg_finish_decompress()
1754 jpeg_destroy_decompress()
1755
1756This differs from ordinary unbuffered decoding in that there is an additional
1757level of looping. The application can choose how many output passes to make
1758and how to display each pass.
1759
1760The simplest approach to displaying progressive images is to do one display
1761pass for each scan appearing in the input file. In this case the outer loop
1762condition is typically
1763 while (! jpeg_input_complete(&cinfo))
1764and the start-output call should read
1765 jpeg_start_output(&cinfo, cinfo.input_scan_number);
1766The second parameter to jpeg_start_output() indicates which scan of the input
1767file is to be displayed; the scans are numbered starting at 1 for this
1768purpose. (You can use a loop counter starting at 1 if you like, but using
1769the library's input scan counter is easier.) The library automatically reads
1770data as necessary to complete each requested scan, and jpeg_finish_output()
1771advances to the next scan or end-of-image marker (hence input_scan_number
1772will be incremented by the time control arrives back at jpeg_start_output()).
1773With this technique, data is read from the input file only as needed, and
1774input and output processing run in lockstep.
1775
1776After reading the final scan and reaching the end of the input file, the
1777buffered image remains available; it can be read additional times by
1778repeating the jpeg_start_output()/jpeg_read_scanlines()/jpeg_finish_output()
1779sequence. For example, a useful technique is to use fast one-pass color
1780quantization for display passes made while the image is arriving, followed by
1781a final display pass using two-pass quantization for highest quality. This
1782is done by changing the library parameters before the final output pass.
1783Changing parameters between passes is discussed in detail below.
1784
1785In general the last scan of a progressive file cannot be recognized as such
1786until after it is read, so a post-input display pass is the best approach if
1787you want special processing in the final pass.
1788
1789When done with the image, be sure to call jpeg_finish_decompress() to release
1790the buffered image (or just use jpeg_destroy_decompress()).
1791
1792If input data arrives faster than it can be displayed, the application can
1793cause the library to decode input data in advance of what's needed to produce
1794output. This is done by calling the routine jpeg_consume_input().
1795The return value is one of the following:
1796 JPEG_REACHED_SOS: reached an SOS marker (the start of a new scan)
1797 JPEG_REACHED_EOI: reached the EOI marker (end of image)
1798 JPEG_ROW_COMPLETED: completed reading one MCU row of compressed data
1799 JPEG_SCAN_COMPLETED: completed reading last MCU row of current scan
1800 JPEG_SUSPENDED: suspended before completing any of the above
1801(JPEG_SUSPENDED can occur only if a suspending data source is used.) This
1802routine can be called at any time after initializing the JPEG object. It
1803reads some additional data and returns when one of the indicated significant
1804events occurs. (If called after the EOI marker is reached, it will
1805immediately return JPEG_REACHED_EOI without attempting to read more data.)
1806
1807The library's output processing will automatically call jpeg_consume_input()
1808whenever the output processing overtakes the input; thus, simple lockstep
1809display requires no direct calls to jpeg_consume_input(). But by adding
1810calls to jpeg_consume_input(), you can absorb data in advance of what is
1811being displayed. This has two benefits:
1812 * You can limit buildup of unprocessed data in your input buffer.
1813 * You can eliminate extra display passes by paying attention to the
1814 state of the library's input processing.
1815
1816The first of these benefits only requires interspersing calls to
1817jpeg_consume_input() with your display operations and any other processing
1818you may be doing. To avoid wasting cycles due to backtracking, it's best to
1819call jpeg_consume_input() only after a hundred or so new bytes have arrived.
1820This is discussed further under "I/O suspension", above. (Note: the JPEG
1821library currently is not thread-safe. You must not call jpeg_consume_input()
1822from one thread of control if a different library routine is working on the
1823same JPEG object in another thread.)
1824
1825When input arrives fast enough that more than one new scan is available
1826before you start a new output pass, you may as well skip the output pass
1827corresponding to the completed scan. This occurs for free if you pass
1828cinfo.input_scan_number as the target scan number to jpeg_start_output().
1829The input_scan_number field is simply the index of the scan currently being
1830consumed by the input processor. You can ensure that this is up-to-date by
1831emptying the input buffer just before calling jpeg_start_output(): call
1832jpeg_consume_input() repeatedly until it returns JPEG_SUSPENDED or
1833JPEG_REACHED_EOI.
1834
1835The target scan number passed to jpeg_start_output() is saved in the
1836cinfo.output_scan_number field. The library's output processing calls
1837jpeg_consume_input() whenever the current input scan number and row within
Thomas G. Lane489583f1996-02-07 00:00:00 +00001838that scan is less than or equal to the current output scan number and row.
Thomas G. Lanebc79e061995-08-02 00:00:00 +00001839Thus, input processing can "get ahead" of the output processing but is not
1840allowed to "fall behind". You can achieve several different effects by
1841manipulating this interlock rule. For example, if you pass a target scan
1842number greater than the current input scan number, the output processor will
1843wait until that scan starts to arrive before producing any output. (To avoid
1844an infinite loop, the target scan number is automatically reset to the last
1845scan number when the end of image is reached. Thus, if you specify a large
1846target scan number, the library will just absorb the entire input file and
1847then perform an output pass. This is effectively the same as what
1848jpeg_start_decompress() does when you don't select buffered-image mode.)
1849When you pass a target scan number equal to the current input scan number,
1850the image is displayed no faster than the current input scan arrives. The
1851final possibility is to pass a target scan number less than the current input
1852scan number; this disables the input/output interlock and causes the output
1853processor to simply display whatever it finds in the image buffer, without
1854waiting for input. (However, the library will not accept a target scan
1855number less than one, so you can't avoid waiting for the first scan.)
1856
Thomas G. Lane489583f1996-02-07 00:00:00 +00001857When data is arriving faster than the output display processing can advance
1858through the image, jpeg_consume_input() will store data into the buffered
1859image beyond the point at which the output processing is reading data out
1860again. If the input arrives fast enough, it may "wrap around" the buffer to
1861the point where the input is more than one whole scan ahead of the output.
1862If the output processing simply proceeds through its display pass without
1863paying attention to the input, the effect seen on-screen is that the lower
1864part of the image is one or more scans better in quality than the upper part.
1865Then, when the next output scan is started, you have a choice of what target
1866scan number to use. The recommended choice is to use the current input scan
1867number at that time, which implies that you've skipped the output scans
1868corresponding to the input scans that were completed while you processed the
1869previous output scan. In this way, the decoder automatically adapts its
1870speed to the arriving data, by skipping output scans as necessary to keep up
1871with the arriving data.
1872
1873When using this strategy, you'll want to be sure that you perform a final
1874output pass after receiving all the data; otherwise your last display may not
1875be full quality across the whole screen. So the right outer loop logic is
1876something like this:
Thomas G. Lanebc79e061995-08-02 00:00:00 +00001877 do {
1878 absorb any waiting input by calling jpeg_consume_input()
1879 final_pass = jpeg_input_complete(&cinfo);
1880 adjust output decompression parameters if required
1881 jpeg_start_output(&cinfo, cinfo.input_scan_number);
1882 ...
1883 jpeg_finish_output()
1884 } while (! final_pass);
1885rather than quitting as soon as jpeg_input_complete() returns TRUE. This
1886arrangement makes it simple to use higher-quality decoding parameters
1887for the final pass. But if you don't want to use special parameters for
1888the final pass, the right loop logic is like this:
1889 for (;;) {
1890 absorb any waiting input by calling jpeg_consume_input()
1891 jpeg_start_output(&cinfo, cinfo.input_scan_number);
1892 ...
1893 jpeg_finish_output()
1894 if (jpeg_input_complete(&cinfo) &&
1895 cinfo.input_scan_number == cinfo.output_scan_number)
1896 break;
1897 }
Thomas G. Lane489583f1996-02-07 00:00:00 +00001898In this case you don't need to know in advance whether an output pass is to
1899be the last one, so it's not necessary to have reached EOF before starting
1900the final output pass; rather, what you want to test is whether the output
1901pass was performed in sync with the final input scan. This form of the loop
1902will avoid an extra output pass whenever the decoder is able (or nearly able)
1903to keep up with the incoming data.
Thomas G. Lanebc79e061995-08-02 00:00:00 +00001904
1905When the data transmission speed is high, you might begin a display pass,
Thomas G. Lane489583f1996-02-07 00:00:00 +00001906then find that much or all of the file has arrived before you can complete
Thomas G. Lanebc79e061995-08-02 00:00:00 +00001907the pass. (You can detect this by noting the JPEG_REACHED_EOI return code
1908from jpeg_consume_input(), or equivalently by testing jpeg_input_complete().)
1909In this situation you may wish to abort the current display pass and start a
1910new one using the newly arrived information. To do so, just call
1911jpeg_finish_output() and then start a new pass with jpeg_start_output().
1912
1913A variant strategy is to abort and restart display if more than one complete
1914scan arrives during an output pass; this can be detected by noting
1915JPEG_REACHED_SOS returns and/or examining cinfo.input_scan_number. This
1916idea should be employed with caution, however, since the display process
1917might never get to the bottom of the image before being aborted, resulting
1918in the lower part of the screen being several passes worse than the upper.
1919In most cases it's probably best to abort an output pass only if the whole
1920file has arrived and you want to begin the final output pass immediately.
1921
1922When receiving data across a communication link, we recommend always using
1923the current input scan number for the output target scan number; if a
1924higher-quality final pass is to be done, it should be started (aborting any
1925incomplete output pass) as soon as the end of file is received. However,
1926many other strategies are possible. For example, the application can examine
1927the parameters of the current input scan and decide whether to display it or
1928not. If the scan contains only chroma data, one might choose not to use it
1929as the target scan, expecting that the scan will be small and will arrive
1930quickly. To skip to the next scan, call jpeg_consume_input() until it
1931returns JPEG_REACHED_SOS or JPEG_REACHED_EOI. Or just use the next higher
1932number as the target scan for jpeg_start_output(); but that method doesn't
1933let you inspect the next scan's parameters before deciding to display it.
1934
1935
1936In buffered-image mode, jpeg_start_decompress() never performs input and
1937thus never suspends. An application that uses input suspension with
1938buffered-image mode must be prepared for suspension returns from these
1939routines:
1940* jpeg_start_output() performs input only if you request 2-pass quantization
1941 and the target scan isn't fully read yet. (This is discussed below.)
1942* jpeg_read_scanlines(), as always, returns the number of scanlines that it
1943 was able to produce before suspending.
1944* jpeg_finish_output() will read any markers following the target scan,
Thomas G. Lane489583f1996-02-07 00:00:00 +00001945 up to the end of the file or the SOS marker that begins another scan.
Thomas G. Lanebc79e061995-08-02 00:00:00 +00001946 (But it reads no input if jpeg_consume_input() has already reached the
Thomas G. Lane489583f1996-02-07 00:00:00 +00001947 end of the file or a SOS marker beyond the target output scan.)
1948* jpeg_finish_decompress() will read until the end of file, and thus can
Thomas G. Lanebc79e061995-08-02 00:00:00 +00001949 suspend if the end hasn't already been reached (as can be tested by
1950 calling jpeg_input_complete()).
1951jpeg_start_output(), jpeg_finish_output(), and jpeg_finish_decompress()
1952all return TRUE if they completed their tasks, FALSE if they had to suspend.
1953In the event of a FALSE return, the application must load more input data
1954and repeat the call. Applications that use non-suspending data sources need
1955not check the return values of these three routines.
1956
1957
1958It is possible to change decoding parameters between output passes in the
1959buffered-image mode. The decoder library currently supports only very
1960limited changes of parameters. ONLY THE FOLLOWING parameter changes are
1961allowed after jpeg_start_decompress() is called:
1962* dct_method can be changed before each call to jpeg_start_output().
1963 For example, one could use a fast DCT method for early scans, changing
1964 to a higher quality method for the final scan.
1965* dither_mode can be changed before each call to jpeg_start_output();
1966 of course this has no impact if not using color quantization. Typically
1967 one would use ordered dither for initial passes, then switch to
1968 Floyd-Steinberg dither for the final pass. Caution: changing dither mode
1969 can cause more memory to be allocated by the library. Although the amount
1970 of memory involved is not large (a scanline or so), it may cause the
1971 initial max_memory_to_use specification to be exceeded, which in the worst
1972 case would result in an out-of-memory failure.
1973* do_block_smoothing can be changed before each call to jpeg_start_output().
1974 This setting is relevant only when decoding a progressive JPEG image.
1975 During the first DC-only scan, block smoothing provides a very "fuzzy" look
1976 instead of the very "blocky" look seen without it; which is better seems a
1977 matter of personal taste. But block smoothing is nearly always a win
1978 during later stages, especially when decoding a successive-approximation
1979 image: smoothing helps to hide the slight blockiness that otherwise shows
1980 up on smooth gradients until the lowest coefficient bits are sent.
1981* Color quantization mode can be changed under the rules described below.
1982 You *cannot* change between full-color and quantized output (because that
1983 would alter the required I/O buffer sizes), but you can change which
1984 quantization method is used.
1985
1986When generating color-quantized output, changing quantization method is a
1987very useful way of switching between high-speed and high-quality display.
1988The library allows you to change among its three quantization methods:
19891. Single-pass quantization to a fixed color cube.
1990 Selected by cinfo.two_pass_quantize = FALSE and cinfo.colormap = NULL.
19912. Single-pass quantization to an application-supplied colormap.
1992 Selected by setting cinfo.colormap to point to the colormap (the value of
1993 two_pass_quantize is ignored); also set cinfo.actual_number_of_colors.
19943. Two-pass quantization to a colormap chosen specifically for the image.
1995 Selected by cinfo.two_pass_quantize = TRUE and cinfo.colormap = NULL.
1996 (This is the default setting selected by jpeg_read_header, but it is
1997 probably NOT what you want for the first pass of progressive display!)
1998These methods offer successively better quality and lesser speed. However,
1999only the first method is available for quantizing in non-RGB color spaces.
2000
2001IMPORTANT: because the different quantizer methods have very different
2002working-storage requirements, the library requires you to indicate which
2003one(s) you intend to use before you call jpeg_start_decompress(). (If we did
2004not require this, the max_memory_to_use setting would be a complete fiction.)
2005You do this by setting one or more of these three cinfo fields to TRUE:
2006 enable_1pass_quant Fixed color cube colormap
2007 enable_external_quant Externally-supplied colormap
2008 enable_2pass_quant Two-pass custom colormap
2009All three are initialized FALSE by jpeg_read_header(). But
2010jpeg_start_decompress() automatically sets TRUE the one selected by the
2011current two_pass_quantize and colormap settings, so you only need to set the
2012enable flags for any other quantization methods you plan to change to later.
2013
2014After setting the enable flags correctly at jpeg_start_decompress() time, you
2015can change to any enabled quantization method by setting two_pass_quantize
2016and colormap properly just before calling jpeg_start_output(). The following
2017special rules apply:
20181. You must explicitly set cinfo.colormap to NULL when switching to 1-pass
2019 or 2-pass mode from a different mode, or when you want the 2-pass
2020 quantizer to be re-run to generate a new colormap.
20212. To switch to an external colormap, or to change to a different external
2022 colormap than was used on the prior pass, you must call
2023 jpeg_new_colormap() after setting cinfo.colormap.
2024NOTE: if you want to use the same colormap as was used in the prior pass,
2025you should not do either of these things. This will save some nontrivial
2026switchover costs.
2027(These requirements exist because cinfo.colormap will always be non-NULL
2028after completing a prior output pass, since both the 1-pass and 2-pass
2029quantizers set it to point to their output colormaps. Thus you have to
2030do one of these two things to notify the library that something has changed.
2031Yup, it's a bit klugy, but it's necessary to do it this way for backwards
2032compatibility.)
2033
2034Note that in buffered-image mode, the library generates any requested colormap
2035during jpeg_start_output(), not during jpeg_start_decompress().
2036
2037When using two-pass quantization, jpeg_start_output() makes a pass over the
2038buffered image to determine the optimum color map; it therefore may take a
2039significant amount of time, whereas ordinarily it does little work. The
2040progress monitor hook is called during this pass, if defined. It is also
2041important to realize that if the specified target scan number is greater than
2042or equal to the current input scan number, jpeg_start_output() will attempt
2043to consume input as it makes this pass. If you use a suspending data source,
2044you need to check for a FALSE return from jpeg_start_output() under these
2045conditions. The combination of 2-pass quantization and a not-yet-fully-read
2046target scan is the only case in which jpeg_start_output() will consume input.
2047
2048
2049Application authors who support buffered-image mode may be tempted to use it
2050for all JPEG images, even single-scan ones. This will work, but it is
2051inefficient: there is no need to create an image-sized coefficient buffer for
2052single-scan images. Requesting buffered-image mode for such an image wastes
2053memory. Worse, it can cost time on large images, since the buffered data has
2054to be swapped out or written to a temporary file. If you are concerned about
2055maximum performance on baseline JPEG files, you should use buffered-image
2056mode only when the incoming file actually has multiple scans. This can be
2057tested by calling jpeg_has_multiple_scans(), which will return a correct
2058result at any time after jpeg_read_header() completes.
2059
2060It is also worth noting that when you use jpeg_consume_input() to let input
2061processing get ahead of output processing, the resulting pattern of access to
2062the coefficient buffer is quite nonsequential. It's best to use the memory
2063manager jmemnobs.c if you can (ie, if you have enough real or virtual main
2064memory). If not, at least make sure that max_memory_to_use is set as high as
2065possible. If the JPEG memory manager has to use a temporary file, you will
2066probably see a lot of disk traffic and poor performance. (This could be
2067improved with additional work on the memory manager, but we haven't gotten
2068around to it yet.)
2069
2070In some applications it may be convenient to use jpeg_consume_input() for all
2071input processing, including reading the initial markers; that is, you may
2072wish to call jpeg_consume_input() instead of jpeg_read_header() during
2073startup. This works, but note that you must check for JPEG_REACHED_SOS and
2074JPEG_REACHED_EOI return codes as the equivalent of jpeg_read_header's codes.
2075Once the first SOS marker has been reached, you must call
2076jpeg_start_decompress() before jpeg_consume_input() will consume more input;
2077it'll just keep returning JPEG_REACHED_SOS until you do. If you read a
2078tables-only file this way, jpeg_consume_input() will return JPEG_REACHED_EOI
2079without ever returning JPEG_REACHED_SOS; be sure to check for this case.
2080If this happens, the decompressor will not read any more input until you call
2081jpeg_abort() to reset it. It is OK to call jpeg_consume_input() even when not
2082using buffered-image mode, but in that case it's basically a no-op after the
2083initial markers have been read: it will just return JPEG_SUSPENDED.
Thomas G. Lane9ba2f5e1994-12-07 00:00:00 +00002084
Thomas G. Lane36a4ccc1994-09-24 00:00:00 +00002085
2086Abbreviated datastreams and multiple images
2087-------------------------------------------
2088
2089A JPEG compression or decompression object can be reused to process multiple
2090images. This saves a small amount of time per image by eliminating the
2091"create" and "destroy" operations, but that isn't the real purpose of the
2092feature. Rather, reuse of an object provides support for abbreviated JPEG
2093datastreams. Object reuse can also simplify processing a series of images in
2094a single input or output file. This section explains these features.
2095
2096A JPEG file normally contains several hundred bytes worth of quantization
2097and Huffman tables. In a situation where many images will be stored or
2098transmitted with identical tables, this may represent an annoying overhead.
2099The JPEG standard therefore permits tables to be omitted. The standard
2100defines three classes of JPEG datastreams:
2101 * "Interchange" datastreams contain an image and all tables needed to decode
2102 the image. These are the usual kind of JPEG file.
2103 * "Abbreviated image" datastreams contain an image, but are missing some or
2104 all of the tables needed to decode that image.
2105 * "Abbreviated table specification" (henceforth "tables-only") datastreams
2106 contain only table specifications.
2107To decode an abbreviated image, it is necessary to load the missing table(s)
2108into the decoder beforehand. This can be accomplished by reading a separate
2109tables-only file. A variant scheme uses a series of images in which the first
2110image is an interchange (complete) datastream, while subsequent ones are
2111abbreviated and rely on the tables loaded by the first image. It is assumed
2112that once the decoder has read a table, it will remember that table until a
2113new definition for the same table number is encountered.
2114
2115It is the application designer's responsibility to figure out how to associate
2116the correct tables with an abbreviated image. While abbreviated datastreams
2117can be useful in a closed environment, their use is strongly discouraged in
2118any situation where data exchange with other applications might be needed.
2119Caveat designer.
2120
2121The JPEG library provides support for reading and writing any combination of
2122tables-only datastreams and abbreviated images. In both compression and
2123decompression objects, a quantization or Huffman table will be retained for
2124the lifetime of the object, unless it is overwritten by a new table definition.
2125
2126
2127To create abbreviated image datastreams, it is only necessary to tell the
2128compressor not to emit some or all of the tables it is using. Each
2129quantization and Huffman table struct contains a boolean field "sent_table",
2130which normally is initialized to FALSE. For each table used by the image, the
2131header-writing process emits the table and sets sent_table = TRUE unless it is
2132already TRUE. (In normal usage, this prevents outputting the same table
2133definition multiple times, as would otherwise occur because the chroma
2134components typically share tables.) Thus, setting this field to TRUE before
2135calling jpeg_start_compress() will prevent the table from being written at
2136all.
2137
2138If you want to create a "pure" abbreviated image file containing no tables,
2139just call "jpeg_suppress_tables(&cinfo, TRUE)" after constructing all the
2140tables. If you want to emit some but not all tables, you'll need to set the
2141individual sent_table fields directly.
2142
2143To create an abbreviated image, you must also call jpeg_start_compress()
2144with a second parameter of FALSE, not TRUE. Otherwise jpeg_start_compress()
2145will force all the sent_table fields to FALSE. (This is a safety feature to
2146prevent abbreviated images from being created accidentally.)
2147
2148To create a tables-only file, perform the same parameter setup that you
2149normally would, but instead of calling jpeg_start_compress() and so on, call
2150jpeg_write_tables(&cinfo). This will write an abbreviated datastream
2151containing only SOI, DQT and/or DHT markers, and EOI. All the quantization
2152and Huffman tables that are currently defined in the compression object will
2153be emitted unless their sent_tables flag is already TRUE, and then all the
2154sent_tables flags will be set TRUE.
2155
2156A sure-fire way to create matching tables-only and abbreviated image files
2157is to proceed as follows:
2158
2159 create JPEG compression object
2160 set JPEG parameters
2161 set destination to tables-only file
2162 jpeg_write_tables(&cinfo);
2163 set destination to image file
2164 jpeg_start_compress(&cinfo, FALSE);
2165 write data...
2166 jpeg_finish_compress(&cinfo);
2167
2168Since the JPEG parameters are not altered between writing the table file and
2169the abbreviated image file, the same tables are sure to be used. Of course,
2170you can repeat the jpeg_start_compress() ... jpeg_finish_compress() sequence
2171many times to produce many abbreviated image files matching the table file.
2172
2173You cannot suppress output of the computed Huffman tables when Huffman
2174optimization is selected. (If you could, there'd be no way to decode the
2175image...) Generally, you don't want to set optimize_coding = TRUE when
2176you are trying to produce abbreviated files.
2177
2178In some cases you might want to compress an image using tables which are
2179not stored in the application, but are defined in an interchange or
2180tables-only file readable by the application. This can be done by setting up
2181a JPEG decompression object to read the specification file, then copying the
Thomas G. Lane489583f1996-02-07 00:00:00 +00002182tables into your compression object. See jpeg_copy_critical_parameters()
2183for an example of copying quantization tables.
Thomas G. Lane36a4ccc1994-09-24 00:00:00 +00002184
2185
2186To read abbreviated image files, you simply need to load the proper tables
2187into the decompression object before trying to read the abbreviated image.
2188If the proper tables are stored in the application program, you can just
2189allocate the table structs and fill in their contents directly. More commonly
2190you'd want to read the tables from a tables-only file. The jpeg_read_header()
2191call is sufficient to read a tables-only file. You must pass a second
2192parameter of FALSE to indicate that you do not require an image to be present.
2193Thus, the typical scenario is
2194
2195 create JPEG decompression object
2196 set source to tables-only file
2197 jpeg_read_header(&cinfo, FALSE);
2198 set source to abbreviated image file
2199 jpeg_read_header(&cinfo, TRUE);
2200 set decompression parameters
2201 jpeg_start_decompress(&cinfo);
2202 read data...
2203 jpeg_finish_decompress(&cinfo);
2204
2205In some cases, you may want to read a file without knowing whether it contains
2206an image or just tables. In that case, pass FALSE and check the return value
2207from jpeg_read_header(): it will be JPEG_HEADER_OK if an image was found,
2208JPEG_HEADER_TABLES_ONLY if only tables were found. (A third return value,
2209JPEG_SUSPENDED, is possible when using a suspending data source manager.)
2210Note that jpeg_read_header() will not complain if you read an abbreviated
2211image for which you haven't loaded the missing tables; the missing-table check
Thomas G. Lanebc79e061995-08-02 00:00:00 +00002212occurs later, in jpeg_start_decompress().
Thomas G. Lane36a4ccc1994-09-24 00:00:00 +00002213
2214
2215It is possible to read a series of images from a single source file by
2216repeating the jpeg_read_header() ... jpeg_finish_decompress() sequence,
2217without releasing/recreating the JPEG object or the data source module.
2218(If you did reinitialize, any partial bufferload left in the data source
2219buffer at the end of one image would be discarded, causing you to lose the
2220start of the next image.) When you use this method, stored tables are
2221automatically carried forward, so some of the images can be abbreviated images
2222that depend on tables from earlier images.
2223
2224If you intend to write a series of images into a single destination file,
2225you might want to make a specialized data destination module that doesn't
2226flush the output buffer at term_destination() time. This would speed things
2227up by some trifling amount. Of course, you'd need to remember to flush the
2228buffer after the last image. You can make the later images be abbreviated
2229ones by passing FALSE to jpeg_start_compress().
2230
2231
2232Special markers
2233---------------
2234
2235Some applications may need to insert or extract special data in the JPEG
2236datastream. The JPEG standard provides marker types "COM" (comment) and
2237"APP0" through "APP15" (application) to hold application-specific data.
2238Unfortunately, the use of these markers is not specified by the standard.
2239COM markers are fairly widely used to hold user-supplied text. The JFIF file
2240format spec uses APP0 markers with specified initial strings to hold certain
2241data. Adobe applications use APP14 markers beginning with the string "Adobe"
2242for miscellaneous data. Other APPn markers are rarely seen, but might
2243contain almost anything.
2244
2245If you wish to store user-supplied text, we recommend you use COM markers
2246and place readable 7-bit ASCII text in them. Newline conventions are not
2247standardized --- expect to find LF (Unix style), CR/LF (DOS style), or CR
2248(Mac style). A robust COM reader should be able to cope with random binary
2249garbage, including nulls, since some applications generate COM markers
2250containing non-ASCII junk. (But yours should not be one of them.)
2251
2252For program-supplied data, use an APPn marker, and be sure to begin it with an
2253identifying string so that you can tell whether the marker is actually yours.
2254It's probably best to avoid using APP0 or APP14 for any private markers.
Thomas G. Lanebc79e061995-08-02 00:00:00 +00002255(NOTE: the upcoming SPIFF standard will use APP8 markers; we recommend you
2256not use APP8 markers for any private purposes, either.)
Thomas G. Lane36a4ccc1994-09-24 00:00:00 +00002257
2258Keep in mind that at most 65533 bytes can be put into one marker, but you
2259can have as many markers as you like.
2260
Thomas G. Lanebc79e061995-08-02 00:00:00 +00002261By default, the IJG compression library will write a JFIF APP0 marker if the
Thomas G. Lane36a4ccc1994-09-24 00:00:00 +00002262selected JPEG colorspace is grayscale or YCbCr, or an Adobe APP14 marker if
2263the selected colorspace is RGB, CMYK, or YCCK. You can disable this, but
2264we don't recommend it. The decompression library will recognize JFIF and
2265Adobe markers and will set the JPEG colorspace properly when one is found.
2266
2267You can write special markers immediately following the datastream header by
2268calling jpeg_write_marker() after jpeg_start_compress() and before the first
2269call to jpeg_write_scanlines(). When you do this, the markers appear after
2270the SOI and the JFIF APP0 and Adobe APP14 markers (if written), but before
Thomas G. Lanebc79e061995-08-02 00:00:00 +00002271all else. Specify the marker type parameter as "JPEG_COM" for COM or
Thomas G. Lane36a4ccc1994-09-24 00:00:00 +00002272"JPEG_APP0 + n" for APPn. (Actually, jpeg_write_marker will let you write
2273any marker type, but we don't recommend writing any other kinds of marker.)
2274For example, to write a user comment string pointed to by comment_text:
2275 jpeg_write_marker(cinfo, JPEG_COM, comment_text, strlen(comment_text));
2276Or if you prefer to synthesize the marker byte sequence yourself, you can
2277just cram it straight into the data destination module.
2278
2279For decompression, you can supply your own routine to process COM or APPn
2280markers by calling jpeg_set_marker_processor(). Usually you'd call this
2281after creating a decompression object and before calling jpeg_read_header(),
2282because the markers of interest will normally be scanned by jpeg_read_header.
2283Once you've supplied a routine, it will be used for the life of that
2284decompression object. A separate routine may be registered for COM and for
2285each APPn marker code.
2286
2287A marker processor routine must have the signature
2288 boolean jpeg_marker_parser_method (j_decompress_ptr cinfo)
2289Although the marker code is not explicitly passed, the routine can find it
2290in cinfo->unread_marker. At the time of call, the marker proper has been
2291read from the data source module. The processor routine is responsible for
2292reading the marker length word and the remaining parameter bytes, if any.
2293Return TRUE to indicate success. (FALSE should be returned only if you are
2294using a suspending data source and it tells you to suspend. See the standard
2295marker processors in jdmarker.c for appropriate coding methods if you need to
2296use a suspending data source.)
2297
2298If you override the default APP0 or APP14 processors, it is up to you to
2299recognize JFIF and Adobe markers if you want colorspace recognition to occur
2300properly. We recommend copying and extending the default processors if you
2301want to do that.
2302
2303A simple example of an external COM processor can be found in djpeg.c.
2304
2305
2306Raw (downsampled) image data
2307----------------------------
2308
2309Some applications need to supply already-downsampled image data to the JPEG
2310compressor, or to receive raw downsampled data from the decompressor. The
2311library supports this requirement by allowing the application to write or
2312read raw data, bypassing the normal preprocessing or postprocessing steps.
2313The interface is different from the standard one and is somewhat harder to
2314use. If your interest is merely in bypassing color conversion, we recommend
2315that you use the standard interface and simply set jpeg_color_space =
2316in_color_space (or jpeg_color_space = out_color_space for decompression).
2317The mechanism described in this section is necessary only to supply or
2318receive downsampled image data, in which not all components have the same
2319dimensions.
2320
2321
2322To compress raw data, you must supply the data in the colorspace to be used
2323in the JPEG file (please read the earlier section on Special color spaces)
2324and downsampled to the sampling factors specified in the JPEG parameters.
2325You must supply the data in the format used internally by the JPEG library,
2326namely a JSAMPIMAGE array. This is an array of pointers to two-dimensional
2327arrays, each of type JSAMPARRAY. Each 2-D array holds the values for one
2328color component. This structure is necessary since the components are of
2329different sizes. If the image dimensions are not a multiple of the MCU size,
2330you must also pad the data correctly (usually, this is done by replicating
2331the last column and/or row). The data must be padded to a multiple of a DCT
2332block in each component: that is, each downsampled row must contain a
2333multiple of 8 valid samples, and there must be a multiple of 8 sample rows
2334for each component. (For applications such as conversion of digital TV
2335images, the standard image size is usually a multiple of the DCT block size,
2336so that no padding need actually be done.)
2337
2338The procedure for compression of raw data is basically the same as normal
2339compression, except that you call jpeg_write_raw_data() in place of
2340jpeg_write_scanlines(). Before calling jpeg_start_compress(), you must do
2341the following:
2342 * Set cinfo->raw_data_in to TRUE. (It is set FALSE by jpeg_set_defaults().)
2343 This notifies the library that you will be supplying raw data.
2344 * Ensure jpeg_color_space is correct --- an explicit jpeg_set_colorspace()
2345 call is a good idea. Note that since color conversion is bypassed,
2346 in_color_space is ignored, except that jpeg_set_defaults() uses it to
2347 choose the default jpeg_color_space setting.
2348 * Ensure the sampling factors, cinfo->comp_info[i].h_samp_factor and
2349 cinfo->comp_info[i].v_samp_factor, are correct. Since these indicate the
2350 dimensions of the data you are supplying, it's wise to set them
2351 explicitly, rather than assuming the library's defaults are what you want.
2352
2353To pass raw data to the library, call jpeg_write_raw_data() in place of
2354jpeg_write_scanlines(). The two routines work similarly except that
2355jpeg_write_raw_data takes a JSAMPIMAGE data array rather than JSAMPARRAY.
2356The scanlines count passed to and returned from jpeg_write_raw_data is
2357measured in terms of the component with the largest v_samp_factor.
2358
2359jpeg_write_raw_data() processes one MCU row per call, which is to say
2360v_samp_factor*DCTSIZE sample rows of each component. The passed num_lines
2361value must be at least max_v_samp_factor*DCTSIZE, and the return value will
2362be exactly that amount (or possibly some multiple of that amount, in future
2363library versions). This is true even on the last call at the bottom of the
2364image; don't forget to pad your data as necessary.
2365
2366The required dimensions of the supplied data can be computed for each
2367component as
2368 cinfo->comp_info[i].width_in_blocks*DCTSIZE samples per row
2369 cinfo->comp_info[i].height_in_blocks*DCTSIZE rows in image
2370after jpeg_start_compress() has initialized those fields. If the valid data
2371is smaller than this, it must be padded appropriately. For some sampling
2372factors and image sizes, additional dummy DCT blocks are inserted to make
2373the image a multiple of the MCU dimensions. The library creates such dummy
2374blocks itself; it does not read them from your supplied data. Therefore you
2375need never pad by more than DCTSIZE samples. An example may help here.
2376Assume 2h2v downsampling of YCbCr data, that is
2377 cinfo->comp_info[0].h_samp_factor = 2 for Y
2378 cinfo->comp_info[0].v_samp_factor = 2
2379 cinfo->comp_info[1].h_samp_factor = 1 for Cb
2380 cinfo->comp_info[1].v_samp_factor = 1
2381 cinfo->comp_info[2].h_samp_factor = 1 for Cr
2382 cinfo->comp_info[2].v_samp_factor = 1
2383and suppose that the nominal image dimensions (cinfo->image_width and
2384cinfo->image_height) are 101x101 pixels. Then jpeg_start_compress() will
2385compute downsampled_width = 101 and width_in_blocks = 13 for Y,
2386downsampled_width = 51 and width_in_blocks = 7 for Cb and Cr (and the same
2387for the height fields). You must pad the Y data to at least 13*8 = 104
2388columns and rows, the Cb/Cr data to at least 7*8 = 56 columns and rows. The
2389MCU height is max_v_samp_factor = 2 DCT rows so you must pass at least 16
2390scanlines on each call to jpeg_write_raw_data(), which is to say 16 actual
2391sample rows of Y and 8 each of Cb and Cr. A total of 7 MCU rows are needed,
2392so you must pass a total of 7*16 = 112 "scanlines". The last DCT block row
2393of Y data is dummy, so it doesn't matter what you pass for it in the data
2394arrays, but the scanlines count must total up to 112 so that all of the Cb
2395and Cr data gets passed.
2396
Thomas G. Lanea8b67c41995-03-15 00:00:00 +00002397Output suspension is supported with raw-data compression: if the data
2398destination module suspends, jpeg_write_raw_data() will return 0.
2399In this case the same data rows must be passed again on the next call.
Thomas G. Lane36a4ccc1994-09-24 00:00:00 +00002400
2401
2402Decompression with raw data output implies bypassing all postprocessing:
Thomas G. Lanebc79e061995-08-02 00:00:00 +00002403you cannot ask for rescaling or color quantization, for instance. More
2404seriously, you must deal with the color space and sampling factors present in
2405the incoming file. If your application only handles, say, 2h1v YCbCr data,
2406you must check for and fail on other color spaces or other sampling factors.
2407The library will not convert to a different color space for you.
Thomas G. Lane36a4ccc1994-09-24 00:00:00 +00002408
2409To obtain raw data output, set cinfo->raw_data_out = TRUE before
2410jpeg_start_decompress() (it is set FALSE by jpeg_read_header()). Be sure to
2411verify that the color space and sampling factors are ones you can handle.
2412Then call jpeg_read_raw_data() in place of jpeg_read_scanlines(). The
2413decompression process is otherwise the same as usual.
2414
2415jpeg_read_raw_data() returns one MCU row per call, and thus you must pass a
2416buffer of at least max_v_samp_factor*DCTSIZE scanlines (scanline counting is
2417the same as for raw-data compression). The buffer you pass must be large
2418enough to hold the actual data plus padding to DCT-block boundaries. As with
2419compression, any entirely dummy DCT blocks are not processed so you need not
2420allocate space for them, but the total scanline count includes them. The
2421above example of computing buffer dimensions for raw-data compression is
2422equally valid for decompression.
2423
2424Input suspension is supported with raw-data decompression: if the data source
Thomas G. Lanebc79e061995-08-02 00:00:00 +00002425module suspends, jpeg_read_raw_data() will return 0. You can also use
2426buffered-image mode to read raw data in multiple passes.
2427
2428
2429Really raw data: DCT coefficients
2430---------------------------------
2431
2432It is possible to read or write the contents of a JPEG file as raw DCT
2433coefficients. This facility is mainly intended for use in lossless
2434transcoding between different JPEG file formats. Other possible applications
2435include lossless cropping of a JPEG image, lossless reassembly of a
2436multi-strip or multi-tile TIFF/JPEG file into a single JPEG datastream, etc.
2437
2438To read the contents of a JPEG file as DCT coefficients, open the file and do
2439jpeg_read_header() as usual. But instead of calling jpeg_start_decompress()
2440and jpeg_read_scanlines(), call jpeg_read_coefficients(). This will read the
2441entire image into a set of virtual coefficient-block arrays, one array per
2442component. The return value is a pointer to an array of virtual-array
2443descriptors. Each virtual array can be accessed directly using the JPEG
2444memory manager's access_virt_barray method (see Memory management, below,
2445and also read structure.doc's discussion of virtual array handling). Or,
2446for simple transcoding to a different JPEG file format, the array list can
2447just be handed directly to jpeg_write_coefficients().
2448
2449When you are done using the virtual arrays, call jpeg_finish_decompress()
2450to release the array storage and return the decompression object to an idle
2451state; or just call jpeg_destroy() if you don't need to reuse the object.
2452
2453If you use a suspending data source, jpeg_read_coefficients() will return
2454NULL if it is forced to suspend; a non-NULL return value indicates successful
2455completion. You need not test for a NULL return value when using a
2456non-suspending data source.
2457
2458Each block in the block arrays contains quantized coefficient values in
2459normal array order (not JPEG zigzag order). The block arrays contain only
2460DCT blocks containing real data; any entirely-dummy blocks added to fill out
2461interleaved MCUs at the right or bottom edges of the image are discarded
2462during reading and are not stored in the block arrays. (The size of each
2463block array can be determined from the width_in_blocks and height_in_blocks
2464fields of the component's comp_info entry.) This is also the data format
2465expected by jpeg_write_coefficients().
2466
2467To write the contents of a JPEG file as DCT coefficients, you must provide
2468the DCT coefficients stored in virtual block arrays. You can either pass
2469block arrays read from an input JPEG file by jpeg_read_coefficients(), or
2470allocate virtual arrays from the JPEG compression object and fill them
2471yourself. In either case, jpeg_write_coefficients() is substituted for
2472jpeg_start_compress() and jpeg_write_scanlines(). Thus the sequence is
2473 * Create compression object
2474 * Set all compression parameters as necessary
2475 * Request virtual arrays if needed
2476 * jpeg_write_coefficients()
2477 * jpeg_finish_compress()
2478 * Destroy or re-use compression object
2479jpeg_write_coefficients() is passed a pointer to an array of virtual block
2480array descriptors; the number of arrays is equal to cinfo.num_components.
2481
2482The virtual arrays need only have been requested, not realized, before
2483jpeg_write_coefficients() is called. A side-effect of
2484jpeg_write_coefficients() is to realize any virtual arrays that have been
2485requested from the compression object's memory manager. Thus, when obtaining
2486the virtual arrays from the compression object, you should fill the arrays
2487after calling jpeg_write_coefficients(). The data is actually written out
2488when you call jpeg_finish_compress(); jpeg_write_coefficients() only writes
2489the file header.
2490
2491When writing raw DCT coefficients, it is crucial that the JPEG quantization
2492tables and sampling factors match the way the data was encoded, or the
2493resulting file will be invalid. For transcoding from an existing JPEG file,
2494we recommend using jpeg_copy_critical_parameters(). This routine initializes
2495all the compression parameters to default values (like jpeg_set_defaults()),
2496then copies the critical information from a source decompression object.
2497The decompression object should have just been used to read the entire
2498JPEG input file --- that is, it should be awaiting jpeg_finish_decompress().
2499
2500jpeg_write_coefficients() marks all tables stored in the compression object
2501as needing to be written to the output file (thus, it acts like
2502jpeg_start_compress(cinfo, TRUE)). This is for safety's sake, to avoid
2503emitting abbreviated JPEG files by accident. If you really want to emit an
2504abbreviated JPEG file, call jpeg_suppress_tables(), or set the tables'
2505individual sent_table flags, between calling jpeg_write_coefficients() and
2506jpeg_finish_compress().
Thomas G. Lane36a4ccc1994-09-24 00:00:00 +00002507
2508
2509Progress monitoring
2510-------------------
2511
2512Some applications may need to regain control from the JPEG library every so
2513often. The typical use of this feature is to produce a percent-done bar or
2514other progress display. (For a simple example, see cjpeg.c or djpeg.c.)
2515Although you do get control back frequently during the data-transferring pass
2516(the jpeg_read_scanlines or jpeg_write_scanlines loop), any additional passes
2517will occur inside jpeg_finish_compress or jpeg_start_decompress; those
2518routines may take a long time to execute, and you don't get control back
2519until they are done.
2520
2521You can define a progress-monitor routine which will be called periodically
2522by the library. No guarantees are made about how often this call will occur,
2523so we don't recommend you use it for mouse tracking or anything like that.
2524At present, a call will occur once per MCU row, scanline, or sample row
2525group, whichever unit is convenient for the current processing mode; so the
Thomas G. Lanebc79e061995-08-02 00:00:00 +00002526wider the image, the longer the time between calls. During the data
Thomas G. Lane36a4ccc1994-09-24 00:00:00 +00002527transferring pass, only one call occurs per call of jpeg_read_scanlines or
2528jpeg_write_scanlines, so don't pass a large number of scanlines at once if
Thomas G. Lanebc79e061995-08-02 00:00:00 +00002529you want fine resolution in the progress count. (If you really need to use
2530the callback mechanism for time-critical tasks like mouse tracking, you could
2531insert additional calls inside some of the library's inner loops.)
Thomas G. Lane36a4ccc1994-09-24 00:00:00 +00002532
2533To establish a progress-monitor callback, create a struct jpeg_progress_mgr,
2534fill in its progress_monitor field with a pointer to your callback routine,
2535and set cinfo->progress to point to the struct. The callback will be called
2536whenever cinfo->progress is non-NULL. (This pointer is set to NULL by
2537jpeg_create_compress or jpeg_create_decompress; the library will not change
2538it thereafter. So if you allocate dynamic storage for the progress struct,
2539make sure it will live as long as the JPEG object does. Allocating from the
2540JPEG memory manager with lifetime JPOOL_PERMANENT will work nicely.) You
2541can use the same callback routine for both compression and decompression.
2542
2543The jpeg_progress_mgr struct contains four fields which are set by the library:
2544 long pass_counter; /* work units completed in this pass */
2545 long pass_limit; /* total number of work units in this pass */
2546 int completed_passes; /* passes completed so far */
2547 int total_passes; /* total number of passes expected */
2548During any one pass, pass_counter increases from 0 up to (not including)
Thomas G. Lanebc79e061995-08-02 00:00:00 +00002549pass_limit; the step size is usually but not necessarily 1. The pass_limit
2550value may change from one pass to another. The expected total number of
Thomas G. Lane36a4ccc1994-09-24 00:00:00 +00002551passes is in total_passes, and the number of passes already completed is in
2552completed_passes. Thus the fraction of work completed may be estimated as
2553 completed_passes + (pass_counter/pass_limit)
2554 --------------------------------------------
2555 total_passes
2556ignoring the fact that the passes may not be equal amounts of work.
2557
Thomas G. Lanebc79e061995-08-02 00:00:00 +00002558When decompressing, pass_limit can even change within a pass, because it
Thomas G. Lane36a4ccc1994-09-24 00:00:00 +00002559depends on the number of scans in the JPEG file, which isn't always known in
Thomas G. Lanebc79e061995-08-02 00:00:00 +00002560advance. The computed fraction-of-work-done may jump suddenly (if the library
2561discovers it has overestimated the number of scans) or even decrease (in the
2562opposite case). It is not wise to put great faith in the work estimate.
Thomas G. Lane36a4ccc1994-09-24 00:00:00 +00002563
Thomas G. Lanebc79e061995-08-02 00:00:00 +00002564When using the decompressor's buffered-image mode, the progress monitor work
2565estimate is likely to be completely unhelpful, because the library has no way
2566to know how many output passes will be demanded of it. Currently, the library
2567sets total_passes based on the assumption that there will be one more output
2568pass if the input file end hasn't yet been read (jpeg_input_complete() isn't
2569TRUE), but no more output passes if the file end has been reached when the
2570output pass is started. This means that total_passes will rise as additional
2571output passes are requested. If you have a way of determining the input file
2572size, estimating progress based on the fraction of the file that's been read
2573will probably be more useful than using the library's value.
Thomas G. Lane36a4ccc1994-09-24 00:00:00 +00002574
2575
2576Memory management
2577-----------------
2578
2579This section covers some key facts about the JPEG library's built-in memory
2580manager. For more info, please read structure.doc's section about the memory
2581manager, and consult the source code if necessary.
2582
2583All memory and temporary file allocation within the library is done via the
2584memory manager. If necessary, you can replace the "back end" of the memory
2585manager to control allocation yourself (for example, if you don't want the
2586library to use malloc() and free() for some reason).
2587
2588Some data is allocated "permanently" and will not be freed until the JPEG
2589object is destroyed. Most data is allocated "per image" and is freed by
2590jpeg_finish_compress, jpeg_finish_decompress, or jpeg_abort. You can call the
2591memory manager yourself to allocate structures that will automatically be
2592freed at these times. Typical code for this is
2593 ptr = (*cinfo->mem->alloc_small) ((j_common_ptr) cinfo, JPOOL_IMAGE, size);
2594Use JPOOL_PERMANENT to get storage that lasts as long as the JPEG object.
2595Use alloc_large instead of alloc_small for anything bigger than a few Kbytes.
2596There are also alloc_sarray and alloc_barray routines that automatically
2597build 2-D sample or block arrays.
2598
2599The library's minimum space requirements to process an image depend on the
2600image's width, but not on its height, because the library ordinarily works
2601with "strip" buffers that are as wide as the image but just a few rows high.
2602Some operating modes (eg, two-pass color quantization) require full-image
2603buffers. Such buffers are treated as "virtual arrays": only the current strip
2604need be in memory, and the rest can be swapped out to a temporary file.
2605
2606If you use the simplest memory manager back end (jmemnobs.c), then no
2607temporary files are used; virtual arrays are simply malloc()'d. Images bigger
2608than memory can be processed only if your system supports virtual memory.
2609The other memory manager back ends support temporary files of various flavors
2610and thus work in machines without virtual memory. They may also be useful on
2611Unix machines if you need to process images that exceed available swap space.
2612
2613When using temporary files, the library will make the in-memory buffers for
2614its virtual arrays just big enough to stay within a "maximum memory" setting.
2615Your application can set this limit by setting cinfo->mem->max_memory_to_use
2616after creating the JPEG object. (Of course, there is still a minimum size for
2617the buffers, so the max-memory setting is effective only if it is bigger than
2618the minimum space needed.) If you allocate any large structures yourself, you
2619must allocate them before jpeg_start_compress() or jpeg_start_decompress() in
2620order to have them counted against the max memory limit. Also keep in mind
2621that space allocated with alloc_small() is ignored, on the assumption that
Thomas G. Lanebc79e061995-08-02 00:00:00 +00002622it's too small to be worth worrying about; so a reasonable safety margin
2623should be left when setting max_memory_to_use.
Thomas G. Lane36a4ccc1994-09-24 00:00:00 +00002624
2625If you use the jmemname.c or jmemdos.c memory manager back end, it is
2626important to clean up the JPEG object properly to ensure that the temporary
2627files get deleted. (This is especially crucial with jmemdos.c, where the
2628"temporary files" may be extended-memory segments; if they are not freed,
2629DOS will require a reboot to recover the memory.) Thus, with these memory
2630managers, it's a good idea to provide a signal handler that will trap any
2631early exit from your program. The handler should call either jpeg_abort()
2632or jpeg_destroy() for any active JPEG objects. A handler is not needed with
Thomas G. Lane489583f1996-02-07 00:00:00 +00002633jmemnobs.c, and shouldn't be necessary with jmemansi.c or jmemmac.c either,
2634since the C library is supposed to take care of deleting files made with
2635tmpfile().
Thomas G. Lane36a4ccc1994-09-24 00:00:00 +00002636
2637
2638Library compile-time options
2639----------------------------
2640
2641A number of compile-time options are available by modifying jmorecfg.h.
2642
2643The JPEG standard provides for both the baseline 8-bit DCT process and
2644a 12-bit DCT process. 12-bit lossy JPEG is supported if you define
2645BITS_IN_JSAMPLE as 12 rather than 8. Note that this causes JSAMPLE to be
2646larger than a char, so it affects the surrounding application's image data.
Thomas G. Lane9ba2f5e1994-12-07 00:00:00 +00002647The sample applications cjpeg and djpeg can support 12-bit mode only for PPM
2648and GIF file formats; you must disable the other file formats to compile a
Thomas G. Lanea8b67c41995-03-15 00:00:00 +0000264912-bit cjpeg or djpeg. (install.doc has more information about that.)
2650At present, a 12-bit library can handle *only* 12-bit images, not both
2651precisions. (If you need to include both 8- and 12-bit libraries in a single
2652application, you could probably do it by defining NEED_SHORT_EXTERNAL_NAMES
2653for just one of the copies. You'd have to access the 8-bit and 12-bit copies
2654from separate application source files. This is untested ... if you try it,
2655we'd like to hear whether it works!)
2656
2657Note that a 12-bit library always compresses in Huffman optimization mode,
2658in order to generate valid Huffman tables. This is necessary because our
2659default Huffman tables only cover 8-bit data. If you need to output 12-bit
2660files in one pass, you'll have to supply suitable default Huffman tables.
Thomas G. Lane36a4ccc1994-09-24 00:00:00 +00002661
2662The maximum number of components (color channels) in the image is determined
2663by MAX_COMPONENTS. The JPEG standard allows up to 255 components, but we
2664expect that few applications will need more than four or so.
2665
2666On machines with unusual data type sizes, you may be able to improve
2667performance or reduce memory space by tweaking the various typedefs in
2668jmorecfg.h. In particular, on some RISC CPUs, access to arrays of "short"s
2669is quite slow; consider trading memory for speed by making JCOEF, INT16, and
2670UINT16 be "int" or "unsigned int". UINT8 is also a candidate to become int.
2671You probably don't want to make JSAMPLE be int unless you have lots of memory
2672to burn.
2673
2674You can reduce the size of the library by compiling out various optional
2675functions. To do this, undefine xxx_SUPPORTED symbols as necessary.
2676
2677
2678Portability considerations
2679--------------------------
2680
2681The JPEG library has been written to be extremely portable; the sample
2682applications cjpeg and djpeg are slightly less so. This section summarizes
2683the design goals in this area. (If you encounter any bugs that cause the
2684library to be less portable than is claimed here, we'd appreciate hearing
2685about them.)
2686
2687The code works fine on both ANSI and pre-ANSI C compilers, using any of the
2688popular system include file setups, and some not-so-popular ones too. See
2689install.doc for configuration procedures.
2690
2691The code is not dependent on the exact sizes of the C data types. As
2692distributed, we make the assumptions that
2693 char is at least 8 bits wide
2694 short is at least 16 bits wide
2695 int is at least 16 bits wide
2696 long is at least 32 bits wide
2697(These are the minimum requirements of the ANSI C standard.) Wider types will
2698work fine, although memory may be used inefficiently if char is much larger
2699than 8 bits or short is much bigger than 16 bits. The code should work
2700equally well with 16- or 32-bit ints.
2701
2702In a system where these assumptions are not met, you may be able to make the
2703code work by modifying the typedefs in jmorecfg.h. However, you will probably
2704have difficulty if int is less than 16 bits wide, since references to plain
2705int abound in the code.
2706
2707char can be either signed or unsigned, although the code runs faster if an
2708unsigned char type is available. If char is wider than 8 bits, you will need
2709to redefine JOCTET and/or provide custom data source/destination managers so
2710that JOCTET represents exactly 8 bits of data on external storage.
2711
2712The JPEG library proper does not assume ASCII representation of characters.
2713But some of the image file I/O modules in cjpeg/djpeg do have ASCII
2714dependencies in file-header manipulation; so does cjpeg's select_file_type()
2715routine.
2716
2717The JPEG library does not rely heavily on the C library. In particular, C
2718stdio is used only by the data source/destination modules and the error
2719handler, all of which are application-replaceable. (cjpeg/djpeg are more
2720heavily dependent on stdio.) malloc and free are called only from the memory
2721manager "back end" module, so you can use a different memory allocator by
2722replacing that one file.
2723
2724The code generally assumes that C names must be unique in the first 15
2725characters. However, global function names can be made unique in the
2726first 6 characters by defining NEED_SHORT_EXTERNAL_NAMES.
2727
2728More info about porting the code may be gleaned by reading jconfig.doc,
2729jmorecfg.h, and jinclude.h.
2730
2731
2732Notes for MS-DOS implementors
2733-----------------------------
2734
2735The IJG code is designed to work efficiently in 80x86 "small" or "medium"
2736memory models (i.e., data pointers are 16 bits unless explicitly declared
2737"far"; code pointers can be either size). You may be able to use small
2738model to compile cjpeg or djpeg by itself, but you will probably have to use
2739medium model for any larger application. This won't make much difference in
2740performance. You *will* take a noticeable performance hit if you use a
2741large-data memory model (perhaps 10%-25%), and you should avoid "huge" model
2742if at all possible.
2743
2744The JPEG library typically needs 2Kb-3Kb of stack space. It will also
2745malloc about 20K-30K of near heap space while executing (and lots of far
2746heap, but that doesn't count in this calculation). This figure will vary
2747depending on selected operating mode, and to a lesser extent on image size.
Thomas G. Lane9ba2f5e1994-12-07 00:00:00 +00002748There is also about 5Kb-6Kb of constant data which will be allocated in the
2749near data segment (about 4Kb of this is the error message table).
2750Thus you have perhaps 20K available for other modules' static data and near
Thomas G. Lane36a4ccc1994-09-24 00:00:00 +00002751heap space before you need to go to a larger memory model. The C library's
2752static data will account for several K of this, but that still leaves a good
2753deal for your needs. (If you are tight on space, you could reduce the sizes
2754of the I/O buffers allocated by jdatasrc.c and jdatadst.c, say from 4K to
Thomas G. Lanebc79e061995-08-02 00:00:00 +000027551K. Another possibility is to move the error message table to far memory;
2756this should be doable with only localized hacking on jerror.c.)
Thomas G. Lane36a4ccc1994-09-24 00:00:00 +00002757
2758About 2K of the near heap space is "permanent" memory that will not be
2759released until you destroy the JPEG object. This is only an issue if you
2760save a JPEG object between compression or decompression operations.
2761
2762Far data space may also be a tight resource when you are dealing with large
2763images. The most memory-intensive case is decompression with two-pass color
2764quantization, or single-pass quantization to an externally supplied color
2765map. This requires a 128Kb color lookup table plus strip buffers amounting
2766to about 50 bytes per column for typical sampling ratios (eg, about 32000
2767bytes for a 640-pixel-wide image). You may not be able to process wide
2768images if you have large data structures of your own.
2769
2770Of course, all of these concerns vanish if you use a 32-bit flat-memory-model
2771compiler, such as DJGPP or Watcom C. We highly recommend flat model if you
2772can use it; the JPEG library is significantly faster in flat model.