Demo/sgi/video/cmif-film.ms - platform/external/python/cpython3 - Gitiles

 .de PP
 .LP
 ..
 .de pT
 .IP \fB\\$1\fP
 ..
 .TL
 CMIF video file format
 .AU
 Jack Jansen
 (Version of 27-Feb-92)
 .SH
 Introduction
 .PP
 The CMIF video format was invented to allow various applications
 to exchange video data. The format consists of
 a header containing global information (like data format)
 followed by a sequence of frames, each consisting of a header
 followed by the actual frame data.
 All information except pixel data is
 encoded in ASCII. Pixel data is \fIalways\fP encoded in Silicon Graphics
 order, which means that the first pixel in the frame is the lower left
 pixel on the screen.
 .PP
 All ASCII data except the first line of the file
 is in python format. This means that
 outer parentheses can be ommitted, and parentheses around a tuple with
 one element can also be omitted. So, the lines
 .IP
 .ft C
 .nf
 ('grey',(4))
 ('grey',4)
 'grey',4
 .LP
 have the same meaning.
 To ease parsing in C programs, however, it is advised that there are
 no parenteses around single items, and that there are parentheses around
 lists. So, the second format above is preferred.
 .PP
 The current version is version 3, but this document will also explain
 shortly what the previous formats looked like.
 .SH
 Header.
 .PP
 The header consists of three lines. The first line identifies the file
 as a CMIF video file, and gives the version number.
 It looks as follows:
 .IP
 .ft C
 CMIF video 3.0
 .LP
 All programs expect the layout to be exactly like this, so no
 extra spaces, etc. should be added.
 .PP
 The second line specifies the data format. Its format is a python
 tuple with two members. The first member is a string giving the format
 type and the second is a tuple containing type-specific information.
 The following formats are currently understood:
 .pT rgb
 The video data is 24 bit RGB packed into 32 bit words.
 R is the least significant byte, then G and then B. The top byte is
 unused.
 .IP
 There is no type-specific information, so the complete data format
 line is
 .IP
 .ft C
 ('rgb',())
 .pT grey
 The video data is greyscale, at most 8 bits. Data is packed into
 8 bit bytes (in the low-order bits). The extra information is the
 number of significant bits, so an example data format line is
 .IP
 .ft C
 ('grey',(6))
 .pT yiq
 The video data is in YIQ format. This is a format that has one luminance
 component, Y, and two chrominance components, I and Q. The luminance and
 chrominance components are encoded in \fItwo\fP pixel arrays: first an
 array of 8-bit luminance values followed by a array of 16 bit chrominance
 values. See the section on chrominance coding for details.
 .IP
 The type specific part contains the number of bits for Y, I and Q,
 the chrominance packfactor and the colormap offset. So, a sample format
 information line of
 .IP
 .ft C
 ('yiq',(5,3,3,2,1024))
 .IP
 means that the pictures have 5 bit Y values (in the luminance array),
 3 bits of I and Q each (in the chrominance array), chrominance data
 is packed for 2x2 pixels, and the first colormap index used is 1024.
 .pT hls
 The video data is in HLS format. L is the luminance component, H and S
 are the chrominance components. The data format and type specific information
 are the same as for the yiq format.
 .pT hsv
 The video data is in HSV format. V is the luminance component, H and S
 are the chrominance components. Again, data format and type specific
 information are the same as for the yiq format.
 .pT rgb8
 The video data is in 8 bit dithered rgb format. This is the format
 used internally by the Indigo. bit 0-2 are green, bit 3-4 are blue and
 bit 5-7 are red. Because rgb8 is treated more-or-less like yiq format
 internally the type-specific information is the same, with zeroes for
 the (unused) chrominance sizes:
 .IP
 .ft C
 ('rgb8',(8,0,0,0,0))
 .PP
 The third header line contains width and height of the video image,
 in pixels, and the pack factor of the picture. For compatability, RGB
 images must have a pack factor of 0 (zero), and non-RGB images must
 have a pack factor of at least 1.
 The packfactor is the amount of compression done on the original video
 signal to obtain pictures. In other words, if only one out of three pixels
 and lines is stored (so every 9 original pixels have one pixel in the
 data) the packfactor is three. Width and height are the size of the
 \fIoriginal\fP picture.
 Viewers are expected to enlarge the picture so it is shown in the
 original size. RGB videos cannot be packed.
 So, a size line like
 .IP
 .ft C
 200,200,2
 .LP
 means that this was a 200x200 picture that is stored as 100x100 pixels.
 .SH
 Frame header
 .PP
 Each frame is preceded by a single header line. This line contains timing information
 and optional size information. The time information is mandatory, and
 contains the time this frame should be displayed, in milliseconds since
 the start of the film. Frames should be stored in chronological order.
 .PP
 An optional second number is interpreted as the size of the luminance
 data in bytes. Currently this number, if present, should always be the
 same as \fCwidth*height/(packfactor*packfactor)\fP (times 4 for RGB
 data), but this might change if we come up with variable-length encoding
 for frame data.
 .PP
 An optional third number is the size of the chrominance data
 in bytes. If present, the number should be equal to
 .ft C
 luminance_size2*/(chrompack*chrompack).
 .SH
 Frame data
 .PP
 For RGB films, the frame data is an array of 32 bit pixels containing
 RGB data in the lower 24 bits. For greyscale films, the frame data
 is an array of 8 bit pixels. For split luminance/chrominance films the
 data consists of two parts: first an array of 8 bit luminance values
 followed by an  array of 16 bit chrominance values.
 .PP
 For all data formats, the data is stored left-to-right, bottom-to-top.
 .SH
 Chrominance coding
 .PP
 Since the human eye is apparently more sensitive to luminance changes
 than to chrominance changes we support a coding where we split the luminance
 and chrominance components of the video image. The main point of this
 is that it allows us to transmit chrominance data in a coarser granularity
 than luminance data, for instance one chrominance pixel for every
 2x2 luminance pixels. According to the theory this should result in an
 acceptable picture while reducing the data by a fair amount.
 .PP
 The coding of split chrominance/luminance data is a bit tricky, to
 make maximum use of the graphics hardware on the Personal Iris. Therefore,
 there are the following constraints on the number of bits used:
 .IP -
 No more than 8 luminance bits,
 .IP -
 No more than 11 bits total,
 .IP -
 The luminance bits are in the low-end of the data word, and are stored
 as 8 bit bytes,
 .IP -
 The two sets of chrominance bits are stored in 16 bit words, correctly
 aligned,
 .IP -
 The color map offset is added to the chrominance data. The offset should
 be at most 4096-256-2**(total number of bits). To reduce interference with
 other applications the offset should be at least 1024.
 .LP
 So, as an example, an HLS video with 5 bits L, 4 bits H, 2 bits S and an
 offset of 1024 will look as follows in-core and in-file:
 .IP
 .nf
 .ft C
 	  31         15    11 10 9 8  5 4   0
          +-----------------------------------+
 incore   +               0+ 1+  S +  H +   L +
          +-----------------------------------+
                                   +----------+
 L-array                           +  0 +   L +
                                   +----------+
                      +-----------------------+
 C-array              +   0+ 1+  S +  H +   0 +
                      +-----------------------+
	.de PP
	.LP
	..
	.de pT
	.IP \fB\\$1\fP
	..
	.TL
	CMIF video file format
	.AU
	Jack Jansen
	(Version of 27-Feb-92)
	.SH
	Introduction
	.PP
	The CMIF video format was invented to allow various applications
	to exchange video data. The format consists of
	a header containing global information (like data format)
	followed by a sequence of frames, each consisting of a header
	followed by the actual frame data.
	All information except pixel data is
	encoded in ASCII. Pixel data is \fIalways\fP encoded in Silicon Graphics
	order, which means that the first pixel in the frame is the lower left
	pixel on the screen.
	.PP
	All ASCII data except the first line of the file
	is in python format. This means that
	outer parentheses can be ommitted, and parentheses around a tuple with
	one element can also be omitted. So, the lines
	.IP
	.ft C
	.nf
	('grey',(4))
	('grey',4)
	'grey',4
	.LP
	have the same meaning.
	To ease parsing in C programs, however, it is advised that there are
	no parenteses around single items, and that there are parentheses around
	lists. So, the second format above is preferred.
	.PP
	The current version is version 3, but this document will also explain
	shortly what the previous formats looked like.
	.SH
	Header.
	.PP
	The header consists of three lines. The first line identifies the file
	as a CMIF video file, and gives the version number.
	It looks as follows:
	.IP
	.ft C
	CMIF video 3.0
	.LP
	All programs expect the layout to be exactly like this, so no
	extra spaces, etc. should be added.
	.PP
	The second line specifies the data format. Its format is a python
	tuple with two members. The first member is a string giving the format
	type and the second is a tuple containing type-specific information.
	The following formats are currently understood:
	.pT rgb
	The video data is 24 bit RGB packed into 32 bit words.
	R is the least significant byte, then G and then B. The top byte is
	unused.
	.IP
	There is no type-specific information, so the complete data format
	line is
	.IP
	.ft C
	('rgb',())
	.pT grey
	The video data is greyscale, at most 8 bits. Data is packed into
	8 bit bytes (in the low-order bits). The extra information is the
	number of significant bits, so an example data format line is
	.IP
	.ft C
	('grey',(6))
	.pT yiq
	The video data is in YIQ format. This is a format that has one luminance
	component, Y, and two chrominance components, I and Q. The luminance and
	chrominance components are encoded in \fItwo\fP pixel arrays: first an
	array of 8-bit luminance values followed by a array of 16 bit chrominance
	values. See the section on chrominance coding for details.
	.IP
	The type specific part contains the number of bits for Y, I and Q,
	the chrominance packfactor and the colormap offset. So, a sample format
	information line of
	.IP
	.ft C
	('yiq',(5,3,3,2,1024))
	.IP
	means that the pictures have 5 bit Y values (in the luminance array),
	3 bits of I and Q each (in the chrominance array), chrominance data
	is packed for 2x2 pixels, and the first colormap index used is 1024.
	.pT hls
	The video data is in HLS format. L is the luminance component, H and S
	are the chrominance components. The data format and type specific information
	are the same as for the yiq format.
	.pT hsv
	The video data is in HSV format. V is the luminance component, H and S
	are the chrominance components. Again, data format and type specific
	information are the same as for the yiq format.
	.pT rgb8
	The video data is in 8 bit dithered rgb format. This is the format
	used internally by the Indigo. bit 0-2 are green, bit 3-4 are blue and
	bit 5-7 are red. Because rgb8 is treated more-or-less like yiq format
	internally the type-specific information is the same, with zeroes for
	the (unused) chrominance sizes:
	.IP
	.ft C
	('rgb8',(8,0,0,0,0))
	.PP
	The third header line contains width and height of the video image,
	in pixels, and the pack factor of the picture. For compatability, RGB
	images must have a pack factor of 0 (zero), and non-RGB images must
	have a pack factor of at least 1.
	The packfactor is the amount of compression done on the original video
	signal to obtain pictures. In other words, if only one out of three pixels
	and lines is stored (so every 9 original pixels have one pixel in the
	data) the packfactor is three. Width and height are the size of the
	\fIoriginal\fP picture.
	Viewers are expected to enlarge the picture so it is shown in the
	original size. RGB videos cannot be packed.
	So, a size line like
	.IP
	.ft C
	200,200,2
	.LP
	means that this was a 200x200 picture that is stored as 100x100 pixels.
	.SH
	Frame header
	.PP
	Each frame is preceded by a single header line. This line contains timing information
	and optional size information. The time information is mandatory, and
	contains the time this frame should be displayed, in milliseconds since
	the start of the film. Frames should be stored in chronological order.
	.PP
	An optional second number is interpreted as the size of the luminance
	data in bytes. Currently this number, if present, should always be the
	same as \fCwidthheight/(packfactorpackfactor)\fP (times 4 for RGB
	data), but this might change if we come up with variable-length encoding
	for frame data.
	.PP
	An optional third number is the size of the chrominance data
	in bytes. If present, the number should be equal to
	.ft C
	luminance_size2/(chrompackchrompack).
	.SH
	Frame data
	.PP
	For RGB films, the frame data is an array of 32 bit pixels containing
	RGB data in the lower 24 bits. For greyscale films, the frame data
	is an array of 8 bit pixels. For split luminance/chrominance films the
	data consists of two parts: first an array of 8 bit luminance values
	followed by an array of 16 bit chrominance values.
	.PP
	For all data formats, the data is stored left-to-right, bottom-to-top.
	.SH
	Chrominance coding
	.PP
	Since the human eye is apparently more sensitive to luminance changes
	than to chrominance changes we support a coding where we split the luminance
	and chrominance components of the video image. The main point of this
	is that it allows us to transmit chrominance data in a coarser granularity
	than luminance data, for instance one chrominance pixel for every
	2x2 luminance pixels. According to the theory this should result in an
	acceptable picture while reducing the data by a fair amount.
	.PP
	The coding of split chrominance/luminance data is a bit tricky, to
	make maximum use of the graphics hardware on the Personal Iris. Therefore,
	there are the following constraints on the number of bits used:
	.IP -
	No more than 8 luminance bits,
	.IP -
	No more than 11 bits total,
	.IP -
	The luminance bits are in the low-end of the data word, and are stored
	as 8 bit bytes,
	.IP -
	The two sets of chrominance bits are stored in 16 bit words, correctly
	aligned,
	.IP -
	The color map offset is added to the chrominance data. The offset should
	be at most 4096-256-2**(total number of bits). To reduce interference with
	other applications the offset should be at least 1024.
	.LP
	So, as an example, an HLS video with 5 bits L, 4 bits H, 2 bits S and an
	offset of 1024 will look as follows in-core and in-file:
	.IP
	.nf
	.ft C
	31 15 11 10 9 8 5 4 0
	+-----------------------------------+
	incore + 0+ 1+ S + H + L +
	+-----------------------------------+
	+----------+
	L-array + 0 + L +
	+----------+
	+-----------------------+
	C-array + 0+ 1+ S + H + 0 +
	+-----------------------+