| Yann Collet | 4856a00 | 2015-01-24 01:58:16 +0100 | [diff] [blame] | 1 | /* ****************************************************************** |
| 2 | FSE : Finite State Entropy coder |
| 3 | header file |
| 4 | Copyright (C) 2013-2015, Yann Collet. |
| 5 | |
| 6 | BSD 2-Clause License (http://www.opensource.org/licenses/bsd-license.php) |
| 7 | |
| 8 | Redistribution and use in source and binary forms, with or without |
| 9 | modification, are permitted provided that the following conditions are |
| 10 | met: |
| 11 | |
| 12 | * Redistributions of source code must retain the above copyright |
| 13 | notice, this list of conditions and the following disclaimer. |
| 14 | * Redistributions in binary form must reproduce the above |
| 15 | copyright notice, this list of conditions and the following disclaimer |
| 16 | in the documentation and/or other materials provided with the |
| 17 | distribution. |
| 18 | |
| 19 | THIS SOFTWARE IS PROVIDED BY THE COPYRIGHT HOLDERS AND CONTRIBUTORS |
| 20 | "AS IS" AND ANY EXPRESS OR IMPLIED WARRANTIES, INCLUDING, BUT NOT |
| 21 | LIMITED TO, THE IMPLIED WARRANTIES OF MERCHANTABILITY AND FITNESS FOR |
| 22 | A PARTICULAR PURPOSE ARE DISCLAIMED. IN NO EVENT SHALL THE COPYRIGHT |
| 23 | OWNER OR CONTRIBUTORS BE LIABLE FOR ANY DIRECT, INDIRECT, INCIDENTAL, |
| 24 | SPECIAL, EXEMPLARY, OR CONSEQUENTIAL DAMAGES (INCLUDING, BUT NOT |
| 25 | LIMITED TO, PROCUREMENT OF SUBSTITUTE GOODS OR SERVICES; LOSS OF USE, |
| 26 | DATA, OR PROFITS; OR BUSINESS INTERRUPTION) HOWEVER CAUSED AND ON ANY |
| 27 | THEORY OF LIABILITY, WHETHER IN CONTRACT, STRICT LIABILITY, OR TORT |
| 28 | (INCLUDING NEGLIGENCE OR OTHERWISE) ARISING IN ANY WAY OUT OF THE USE |
| 29 | OF THIS SOFTWARE, EVEN IF ADVISED OF THE POSSIBILITY OF SUCH DAMAGE. |
| 30 | |
| 31 | You can contact the author at : |
| 32 | - Source repository : https://github.com/Cyan4973/FiniteStateEntropy |
| 33 | - Public forum : https://groups.google.com/forum/#!forum/lz4c |
| 34 | ****************************************************************** */ |
| Yann Collet | aa07405 | 2015-10-30 11:21:50 +0100 | [diff] [blame] | 35 | #ifndef FSE_H |
| 36 | #define FSE_H |
| Yann Collet | 4856a00 | 2015-01-24 01:58:16 +0100 | [diff] [blame] | 37 | |
| 38 | #if defined (__cplusplus) |
| 39 | extern "C" { |
| 40 | #endif |
| 41 | |
| 42 | |
| Yann Collet | 3b994cb | 2016-01-06 01:58:37 +0100 | [diff] [blame] | 43 | /* ***************************************** |
| Yann Collet | 4856a00 | 2015-01-24 01:58:16 +0100 | [diff] [blame] | 44 | * Includes |
| 45 | ******************************************/ |
| Yann Collet | 1efa31f | 2015-07-04 15:56:41 -0800 | [diff] [blame] | 46 | #include <stddef.h> /* size_t, ptrdiff_t */ |
| Yann Collet | 4856a00 | 2015-01-24 01:58:16 +0100 | [diff] [blame] | 47 | |
| 48 | |
| Yann Collet | fb810d6 | 2016-01-28 00:18:06 +0100 | [diff] [blame] | 49 | /*-**************************************** |
| Yann Collet | 4856a00 | 2015-01-24 01:58:16 +0100 | [diff] [blame] | 50 | * FSE simple functions |
| 51 | ******************************************/ |
| 52 | size_t FSE_compress(void* dst, size_t maxDstSize, |
| 53 | const void* src, size_t srcSize); |
| Yann Collet | 1efa31f | 2015-07-04 15:56:41 -0800 | [diff] [blame] | 54 | size_t FSE_decompress(void* dst, size_t maxDstSize, |
| Yann Collet | 4856a00 | 2015-01-24 01:58:16 +0100 | [diff] [blame] | 55 | const void* cSrc, size_t cSrcSize); |
| Yann Collet | 3b994cb | 2016-01-06 01:58:37 +0100 | [diff] [blame] | 56 | /*! |
| Yann Collet | 4856a00 | 2015-01-24 01:58:16 +0100 | [diff] [blame] | 57 | FSE_compress(): |
| 58 | Compress content of buffer 'src', of size 'srcSize', into destination buffer 'dst'. |
| Yann Collet | a787550 | 2015-08-07 15:21:00 +0100 | [diff] [blame] | 59 | 'dst' buffer must be already allocated. Compression runs faster if maxDstSize >= FSE_compressBound(srcSize). |
| 60 | return : size of compressed data (<= maxDstSize) |
| 61 | Special values : if return == 0, srcData is not compressible => Nothing is stored within dst !!! |
| 62 | if return == 1, srcData is a single byte symbol * srcSize times. Use RLE compression instead. |
| 63 | if FSE_isError(return), compression failed (more details using FSE_getErrorName()) |
| Yann Collet | 4856a00 | 2015-01-24 01:58:16 +0100 | [diff] [blame] | 64 | |
| 65 | FSE_decompress(): |
| 66 | Decompress FSE data from buffer 'cSrc', of size 'cSrcSize', |
| 67 | into already allocated destination buffer 'dst', of size 'maxDstSize'. |
| Yann Collet | 4856a00 | 2015-01-24 01:58:16 +0100 | [diff] [blame] | 68 | return : size of regenerated data (<= maxDstSize) |
| 69 | or an error code, which can be tested using FSE_isError() |
| Yann Collet | 4856a00 | 2015-01-24 01:58:16 +0100 | [diff] [blame] | 70 | |
| Yann Collet | 1efa31f | 2015-07-04 15:56:41 -0800 | [diff] [blame] | 71 | ** Important ** : FSE_decompress() handles neither non-compressible nor RLE data !!! |
| 72 | Why ? : making this distinction requires a header. |
| Yann Collet | e8c6bb1 | 2015-07-26 00:23:57 +0100 | [diff] [blame] | 73 | Header management is intentionally delegated to the user layer, which can better manage special cases. |
| 74 | */ |
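Since header management is left to the user layer, the following is a minimal sketch of what such a layer could look like on the compression side. The one-byte tag, the `block_type` enum and the helpers `block_type_from_result()` and `write_block()` are hypothetical names invented for this illustration and are not part of FSE. The sketch assumes the caller has already ruled out errors with FSE_isError(), and that the original size of each block is transmitted by some other means.

```c
#include <assert.h>
#include <stddef.h>
#include <string.h>

/* Hypothetical block tags for a user-defined 1-byte header (not part of FSE). */
typedef enum { BLOCK_RAW = 0, BLOCK_RLE = 1, BLOCK_FSE = 2 } block_type;

/* Maps a non-error return value of FSE_compress() to a block tag,
   following the special values documented above. */
block_type block_type_from_result(size_t cSize)
{
    if (cSize == 0) return BLOCK_RAW;   /* not compressible: nothing was stored in dst */
    if (cSize == 1) return BLOCK_RLE;   /* a single symbol repeated srcSize times */
    return BLOCK_FSE;
}

/* Writes [tag byte][payload] into 'out'.
   'fsePayload' is the buffer the compressor wrote into, 'cSize' its return value.
   For the RLE case, srcSize is assumed to be >= 1.
   Returns the number of bytes written, or 0 if 'out' is too small. */
size_t write_block(unsigned char* out, size_t outCapacity,
                   const unsigned char* src, size_t srcSize,
                   const unsigned char* fsePayload, size_t cSize)
{
    block_type const type = block_type_from_result(cSize);
    const unsigned char* payload;
    size_t payloadSize;

    assert(out != NULL);
    switch (type) {
        case BLOCK_RAW: payload = src;        payloadSize = srcSize; break;  /* store as-is */
        case BLOCK_RLE: payload = src;        payloadSize = 1;       break;  /* store the byte once */
        default:        payload = fsePayload; payloadSize = cSize;   break;  /* FSE bitstream */
    }
    if (outCapacity < 1 + payloadSize) return 0;
    out[0] = (unsigned char)type;
    memcpy(out + 1, payload, payloadSize);
    return 1 + payloadSize;
}
```

A caller would invoke FSE_compress() into a scratch buffer, test the result with FSE_isError(), and then pass that buffer and the return value to `write_block()`.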
| 75 | |
| 76 | |
| Yann Collet | 3b994cb | 2016-01-06 01:58:37 +0100 | [diff] [blame] | 77 | /* ***************************************** |
| Yann Collet | 4856a00 | 2015-01-24 01:58:16 +0100 | [diff] [blame] | 78 | * Tool functions |
| 79 | ******************************************/ |
| 80 | size_t FSE_compressBound(size_t size); /* maximum compressed size */ |
| 81 | |
| 82 | /* Error Management */ |
| 83 | unsigned FSE_isError(size_t code); /* tells if a return value is an error code */ |
| 84 | const char* FSE_getErrorName(size_t code); /* provides error code string (useful for debugging) */ |
| 85 | |
| 86 | |
| Yann Collet | 3b994cb | 2016-01-06 01:58:37 +0100 | [diff] [blame] | 87 | /* ***************************************** |
| Yann Collet | 4856a00 | 2015-01-24 01:58:16 +0100 | [diff] [blame] | 88 | * FSE advanced functions |
| 89 | ******************************************/ |
| Yann Collet | 3b994cb | 2016-01-06 01:58:37 +0100 | [diff] [blame] | 90 | /*! |
| Yann Collet | 4856a00 | 2015-01-24 01:58:16 +0100 | [diff] [blame] | 91 | FSE_compress2(): |
| 92 | Same as FSE_compress(), but allows the selection of 'maxSymbolValue' and 'tableLog' |
| 93 | Both parameters can be defined as '0' to mean : use default value |
| 94 | return : size of compressed data |
| Yann Collet | 1efa31f | 2015-07-04 15:56:41 -0800 | [diff] [blame] | 95 | Special values : if return == 0, srcData is not compressible => Nothing is stored within dst !!! |
| 96 | if return == 1, srcData is a single byte symbol * srcSize times. Use RLE compression. |
| 97 | if FSE_isError(return), it's an error code. |
| Yann Collet | 4856a00 | 2015-01-24 01:58:16 +0100 | [diff] [blame] | 98 | */ |
| 99 | size_t FSE_compress2 (void* dst, size_t dstSize, const void* src, size_t srcSize, unsigned maxSymbolValue, unsigned tableLog); |
| 100 | |
| 101 | |
| Yann Collet | 3b994cb | 2016-01-06 01:58:37 +0100 | [diff] [blame] | 102 | /* ***************************************** |
| Yann Collet | 1efa31f | 2015-07-04 15:56:41 -0800 | [diff] [blame] | 103 | * FSE detailed API |
| Yann Collet | 4856a00 | 2015-01-24 01:58:16 +0100 | [diff] [blame] | 104 | ******************************************/ |
| Yann Collet | 3b994cb | 2016-01-06 01:58:37 +0100 | [diff] [blame] | 105 | /*! |
| Yann Collet | 1efa31f | 2015-07-04 15:56:41 -0800 | [diff] [blame] | 106 | FSE_compress() does the following: |
| Yann Collet | 4856a00 | 2015-01-24 01:58:16 +0100 | [diff] [blame] | 107 | 1. count symbol occurrence from source[] into table count[] |
| 108 | 2. normalize counters so that sum(count[]) == Power_of_2 (2^tableLog) |
| Yann Collet | a787550 | 2015-08-07 15:21:00 +0100 | [diff] [blame] | 109 | 3. save normalized counters to memory buffer using FSE_writeNCount() |
| Yann Collet | 4856a00 | 2015-01-24 01:58:16 +0100 | [diff] [blame] | 110 | 4. build encoding table 'CTable' from normalized counters |
| Yann Collet | 1efa31f | 2015-07-04 15:56:41 -0800 | [diff] [blame] | 111 | 5. encode the data stream using encoding table 'CTable' |
| Yann Collet | 4856a00 | 2015-01-24 01:58:16 +0100 | [diff] [blame] | 112 | |
| Yann Collet | 1efa31f | 2015-07-04 15:56:41 -0800 | [diff] [blame] | 113 | FSE_decompress() does the following: |
| Yann Collet | a787550 | 2015-08-07 15:21:00 +0100 | [diff] [blame] | 114 | 1. read normalized counters with FSE_readNCount() |
| Yann Collet | 4856a00 | 2015-01-24 01:58:16 +0100 | [diff] [blame] | 115 | 2. build decoding table 'DTable' from normalized counters |
| Yann Collet | 1efa31f | 2015-07-04 15:56:41 -0800 | [diff] [blame] | 116 | 3. decode the data stream using decoding table 'DTable' |
| Yann Collet | 4856a00 | 2015-01-24 01:58:16 +0100 | [diff] [blame] | 117 | |
| Yann Collet | a787550 | 2015-08-07 15:21:00 +0100 | [diff] [blame] | 118 | The following API allows targeting specific sub-functions for advanced tasks. |
| Yann Collet | 1efa31f | 2015-07-04 15:56:41 -0800 | [diff] [blame] | 119 | For example, it's possible to compress several blocks using the same 'CTable', |
| Yann Collet | a787550 | 2015-08-07 15:21:00 +0100 | [diff] [blame] | 120 | or to save and provide the normalized distribution using an external method. |
| Yann Collet | 4856a00 | 2015-01-24 01:58:16 +0100 | [diff] [blame] | 121 | */ |
| 122 | |
| 123 | /* *** COMPRESSION *** */ |
| 124 | |
| Yann Collet | 3b994cb | 2016-01-06 01:58:37 +0100 | [diff] [blame] | 125 | /*! |
| Yann Collet | 1efa31f | 2015-07-04 15:56:41 -0800 | [diff] [blame] | 126 | FSE_count(): |
| Yann Collet | 4ddb1f5 | 2016-01-28 03:24:53 +0100 | [diff] [blame] | 127 | Provides the precise count of each byte within a table 'count' |
| 128 | 'count' is a table of unsigned int, of minimum size (*maxSymbolValuePtr+1). |
| 129 | *maxSymbolValuePtr will be updated if detected smaller than initial value. |
| 130 | @return : the count of the most frequent symbol (which is not identified) |
| 131 | if return == srcSize, there is only one symbol. |
| 132 | Can also return an error code, which can be tested with FSE_isError() */ |
| 133 | size_t FSE_count(unsigned* count, unsigned* maxSymbolValuePtr, const void* src, size_t srcSize); |
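To make the contract of FSE_count() concrete, here is a naive stand-in that follows the behavior documented above. It is not the library's implementation (which is written for speed); `naive_count()` and its `(size_t)-1` error sentinel are assumptions made for this sketch, whereas the real function reports errors through codes tested with FSE_isError().

```c
#include <assert.h>
#include <stddef.h>
#include <string.h>

/* Naive illustration of the documented FSE_count() contract.
   'count' must hold at least *maxSymbolValuePtr+1 cells.
   Returns the count of the most frequent symbol,
   or (size_t)-1 (hypothetical sentinel) if a byte exceeds *maxSymbolValuePtr. */
size_t naive_count(unsigned* count, unsigned* maxSymbolValuePtr,
                   const void* src, size_t srcSize)
{
    const unsigned char* const ip = (const unsigned char*)src;
    unsigned maxSymbolValue = *maxSymbolValuePtr;
    unsigned max = 0;
    unsigned s;
    size_t i;

    assert(count != NULL);
    memset(count, 0, ((size_t)maxSymbolValue + 1) * sizeof(*count));

    for (i = 0; i < srcSize; i++) {
        if (ip[i] > maxSymbolValue) return (size_t)-1;   /* value out of declared range */
        count[ip[i]]++;
    }

    /* shrink maxSymbolValue down to the largest symbol actually present */
    while (maxSymbolValue > 0 && count[maxSymbolValue] == 0) maxSymbolValue--;
    *maxSymbolValuePtr = maxSymbolValue;

    for (s = 0; s <= maxSymbolValue; s++)
        if (count[s] > max) max = count[s];
    return max;   /* equals srcSize when 'src' contains a single symbol */
}
```

Comparing the return value with srcSize is what lets a caller detect the single-symbol case early and fall back to RLE.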
| Yann Collet | 4856a00 | 2015-01-24 01:58:16 +0100 | [diff] [blame] | 134 | |
| Yann Collet | 3b994cb | 2016-01-06 01:58:37 +0100 | [diff] [blame] | 135 | /*! |
| Yann Collet | 1efa31f | 2015-07-04 15:56:41 -0800 | [diff] [blame] | 136 | FSE_optimalTableLog(): |
| 137 | dynamically downsize 'tableLog' when conditions are met. |
| 138 | It saves CPU time, by using smaller tables, while preserving or even improving compression ratio. |
| 139 | return : recommended tableLog (necessarily <= initial 'tableLog') */ |
| 140 | unsigned FSE_optimalTableLog(unsigned tableLog, size_t srcSize, unsigned maxSymbolValue); |
| 141 | |
| Yann Collet | 3b994cb | 2016-01-06 01:58:37 +0100 | [diff] [blame] | 142 | /*! |
| Yann Collet | 1efa31f | 2015-07-04 15:56:41 -0800 | [diff] [blame] | 143 | FSE_normalizeCount(): |
| 144 | normalize counters so that sum(count[]) == Power_of_2 (2^tableLog) |
| 145 | 'normalizedCounter' is a table of short, of minimum size (maxSymbolValue+1). |
| 146 | return : tableLog, |
| 147 | or an errorCode, which can be tested using FSE_isError() */ |
| 148 | size_t FSE_normalizeCount(short* normalizedCounter, unsigned tableLog, const unsigned* count, size_t srcSize, unsigned maxSymbolValue); |
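The idea of normalization can be illustrated with a deliberately simplified sketch: scale each count proportionally to 2^tableLog, give every present symbol at least 1, and let the most frequent symbol absorb the rounding difference. This is not the algorithm used by FSE_normalizeCount(); `naive_normalize()`, its limit of 14 on tableLog (chosen so values fit in a short) and its use of 0 as a failure value are assumptions made for the illustration.

```c
#include <assert.h>
#include <stddef.h>

/* Simplified normalization sketch (not the library's algorithm).
   'total' is assumed to equal the sum of count[0..maxSymbolValue].
   On success, sum(norm[]) == 1 << tableLog and every symbol with count > 0 gets >= 1.
   Returns tableLog on success, 0 (hypothetical failure value) otherwise. */
unsigned naive_normalize(short* norm, unsigned tableLog,
                         const unsigned* count, size_t total,
                         unsigned maxSymbolValue)
{
    long tableSize, remaining;
    unsigned largest = 0;
    unsigned s;

    assert(norm != NULL && count != NULL);
    if (total == 0 || tableLog == 0 || tableLog > 14) return 0;
    tableSize = 1L << tableLog;
    remaining = tableSize;

    for (s = 0; s <= maxSymbolValue; s++) {
        long share = 0;
        if (count[s] > 0) {
            /* proportional share, rounded down, but never below 1 */
            share = (long)(((unsigned long long)count[s] * (unsigned long long)tableSize) / total);
            if (share < 1) share = 1;
        }
        norm[s] = (short)share;
        remaining -= share;
        if (norm[s] > norm[largest]) largest = s;
    }

    /* the most frequent symbol absorbs the rounding difference (positive or negative) */
    if ((long)norm[largest] + remaining < 1) return 0;
    norm[largest] = (short)(norm[largest] + remaining);
    return tableLog;
}
```

For counts {5, 2, 2, 1, 1} and tableLog 5, the proportional shares are 14, 5, 5, 2, 2 (sum 28), and the remaining 4 slots go to the first symbol, giving 18, 5, 5, 2, 2.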
| 149 | |
| Yann Collet | 3b994cb | 2016-01-06 01:58:37 +0100 | [diff] [blame] | 150 | /*! |
| Yann Collet | 1efa31f | 2015-07-04 15:56:41 -0800 | [diff] [blame] | 151 | FSE_NCountWriteBound(): |
| 152 | Provides the maximum possible size of an FSE normalized table, given 'maxSymbolValue' and 'tableLog' |
| 153 | Typically useful for allocation purpose. */ |
| 154 | size_t FSE_NCountWriteBound(unsigned maxSymbolValue, unsigned tableLog); |
| 155 | |
| Yann Collet | 3b994cb | 2016-01-06 01:58:37 +0100 | [diff] [blame] | 156 | /*! |
| Yann Collet | 1efa31f | 2015-07-04 15:56:41 -0800 | [diff] [blame] | 157 | FSE_writeNCount(): |
| 158 | Compactly save 'normalizedCounter' into 'buffer'. |
| 159 | return : size of the compressed table |
| 160 | or an errorCode, which can be tested using FSE_isError() */ |
| 161 | size_t FSE_writeNCount (void* buffer, size_t bufferSize, const short* normalizedCounter, unsigned maxSymbolValue, unsigned tableLog); |
| 162 | |
| 163 | |
| Yann Collet | 3b994cb | 2016-01-06 01:58:37 +0100 | [diff] [blame] | 164 | /*! |
| Yann Collet | 1efa31f | 2015-07-04 15:56:41 -0800 | [diff] [blame] | 165 | Constructor and Destructor of type FSE_CTable |
| Yann Collet | a787550 | 2015-08-07 15:21:00 +0100 | [diff] [blame] | 166 | Note that its size depends on 'tableLog' and 'maxSymbolValue' */ |
| Yann Collet | 3b994cb | 2016-01-06 01:58:37 +0100 | [diff] [blame] | 167 | typedef unsigned FSE_CTable; /* don't allocate that. It's only meant to be more restrictive than void* */ |
| Yann Collet | 1efa31f | 2015-07-04 15:56:41 -0800 | [diff] [blame] | 168 | FSE_CTable* FSE_createCTable (unsigned tableLog, unsigned maxSymbolValue); |
| 169 | void FSE_freeCTable (FSE_CTable* ct); |
| 170 | |
| Yann Collet | 3b994cb | 2016-01-06 01:58:37 +0100 | [diff] [blame] | 171 | /*! |
| Yann Collet | 1efa31f | 2015-07-04 15:56:41 -0800 | [diff] [blame] | 172 | FSE_buildCTable(): |
| Yann Collet | ae7aa06 | 2016-02-03 02:46:46 +0100 | [diff] [blame^] | 173 | Builds @ct, which must be already allocated, using FSE_createCTable() |
| Yann Collet | 1efa31f | 2015-07-04 15:56:41 -0800 | [diff] [blame] | 174 | return : 0 |
| 175 | or an errorCode, which can be tested using FSE_isError() */ |
| Yann Collet | a787550 | 2015-08-07 15:21:00 +0100 | [diff] [blame] | 176 | size_t FSE_buildCTable(FSE_CTable* ct, const short* normalizedCounter, unsigned maxSymbolValue, unsigned tableLog); |
| Yann Collet | 1efa31f | 2015-07-04 15:56:41 -0800 | [diff] [blame] | 177 | |
| Yann Collet | 3b994cb | 2016-01-06 01:58:37 +0100 | [diff] [blame] | 178 | /*! |
| Yann Collet | 1efa31f | 2015-07-04 15:56:41 -0800 | [diff] [blame] | 179 | FSE_compress_usingCTable(): |
| Yann Collet | ae7aa06 | 2016-02-03 02:46:46 +0100 | [diff] [blame^] | 180 | Compress @src using @ct into @dst which must be already allocated |
| 181 | return : size of compressed data (<= @dstCapacity) |
| 182 | or 0 if compressed data could not fit into @dst |
| Yann Collet | 1efa31f | 2015-07-04 15:56:41 -0800 | [diff] [blame] | 183 | or an errorCode, which can be tested using FSE_isError() */ |
| Yann Collet | ae7aa06 | 2016-02-03 02:46:46 +0100 | [diff] [blame^] | 184 | size_t FSE_compress_usingCTable (void* dst, size_t dstCapacity, const void* src, size_t srcSize, const FSE_CTable* ct); |
| Yann Collet | 1efa31f | 2015-07-04 15:56:41 -0800 | [diff] [blame] | 185 | |
| Yann Collet | 3b994cb | 2016-01-06 01:58:37 +0100 | [diff] [blame] | 186 | /*! |
| Yann Collet | 1efa31f | 2015-07-04 15:56:41 -0800 | [diff] [blame] | 187 | Tutorial : |
| 188 | ---------- |
| Yann Collet | a787550 | 2015-08-07 15:21:00 +0100 | [diff] [blame] | 189 | The first step is to count all symbols. FSE_count() does this job very fast. |
| Yann Collet | 1efa31f | 2015-07-04 15:56:41 -0800 | [diff] [blame] | 190 | Result will be saved into 'count', a table of unsigned int, which must be already allocated, and have 'maxSymbolValuePtr[0]+1' cells. |
| 191 | 'src' is a table of bytes of size 'srcSize'. All values within 'src' MUST be <= maxSymbolValuePtr[0] |
| 192 | maxSymbolValuePtr[0] will be updated, with its real value (necessarily <= original value) |
| Yann Collet | 4856a00 | 2015-01-24 01:58:16 +0100 | [diff] [blame] | 193 | FSE_count() will return the number of occurrences of the most frequent symbol. |
| Yann Collet | a787550 | 2015-08-07 15:21:00 +0100 | [diff] [blame] | 194 | This can be used to know if there is a single symbol within 'src', and to quickly evaluate its compressibility. |
| Yann Collet | 4856a00 | 2015-01-24 01:58:16 +0100 | [diff] [blame] | 195 | If there is an error, the function will return an ErrorCode (which can be tested using FSE_isError()). |
| 196 | |
| 197 | The next step is to normalize the frequencies. |
| 198 | FSE_normalizeCount() will ensure that sum of frequencies is == 2 ^'tableLog'. |
| Yann Collet | a787550 | 2015-08-07 15:21:00 +0100 | [diff] [blame] | 199 | It also guarantees a minimum of 1 to any Symbol with frequency >= 1. |
| 200 | You can use 'tableLog'==0 to mean "use default tableLog value". |
| 201 | If you are unsure of which tableLog value to use, you can ask FSE_optimalTableLog(), |
| Yann Collet | 4856a00 | 2015-01-24 01:58:16 +0100 | [diff] [blame] | 202 | which will provide the optimal valid tableLog given sourceSize, maxSymbolValue, and a user-defined maximum (0 means "default"). |
| 203 | |
| 204 | The result of FSE_normalizeCount() will be saved into a table, |
| 205 | called 'normalizedCounter', which is a table of signed short. |
| 206 | 'normalizedCounter' must be already allocated, and have at least 'maxSymbolValue+1' cells. |
| 207 | The return value is tableLog if everything proceeded as expected. |
| 208 | It is 0 if there is a single symbol within distribution. |
| Yann Collet | a787550 | 2015-08-07 15:21:00 +0100 | [diff] [blame] | 209 | If there is an error (ex: invalid tableLog value), the function will return an ErrorCode (which can be tested using FSE_isError()). |
| Yann Collet | 4856a00 | 2015-01-24 01:58:16 +0100 | [diff] [blame] | 210 | |
| Yann Collet | a787550 | 2015-08-07 15:21:00 +0100 | [diff] [blame] | 211 | 'normalizedCounter' can be saved in a compact manner to a memory area using FSE_writeNCount(). |
| 212 | 'buffer' must be already allocated. |
| Yann Collet | 4856a00 | 2015-01-24 01:58:16 +0100 | [diff] [blame] | 213 | For guaranteed success, buffer size must be at least FSE_NCountWriteBound(). |
| Yann Collet | a787550 | 2015-08-07 15:21:00 +0100 | [diff] [blame] | 214 | The result of the function is the number of bytes written into 'buffer'. |
| 215 | If there is an error, the function will return an ErrorCode (which can be tested using FSE_isError(); ex : buffer size too small). |
| Yann Collet | 4856a00 | 2015-01-24 01:58:16 +0100 | [diff] [blame] | 216 | |
| Yann Collet | 1efa31f | 2015-07-04 15:56:41 -0800 | [diff] [blame] | 217 | 'normalizedCounter' can then be used to create the compression table 'CTable'. |
| Yann Collet | a787550 | 2015-08-07 15:21:00 +0100 | [diff] [blame] | 218 | The space required by 'CTable' must be already allocated, using FSE_createCTable(). |
| Yann Collet | 4856a00 | 2015-01-24 01:58:16 +0100 | [diff] [blame] | 219 | You can then use FSE_buildCTable() to fill 'CTable'. |
| Yann Collet | a787550 | 2015-08-07 15:21:00 +0100 | [diff] [blame] | 220 | If there is an error, FSE_buildCTable() will return an ErrorCode (which can be tested using FSE_isError()). |
| Yann Collet | 4856a00 | 2015-01-24 01:58:16 +0100 | [diff] [blame] | 221 | |
| Yann Collet | 1efa31f | 2015-07-04 15:56:41 -0800 | [diff] [blame] | 222 | 'CTable' can then be used to compress 'src', with FSE_compress_usingCTable(). |
| 223 | Similar to FSE_count(), the convention is that 'src' is assumed to be a table of char of size 'srcSize' |
| Yann Collet | ae7aa06 | 2016-02-03 02:46:46 +0100 | [diff] [blame^] | 224 | The function returns the size of compressed data (without header), necessarily <= @dstCapacity. |
| Yann Collet | a787550 | 2015-08-07 15:21:00 +0100 | [diff] [blame] | 225 | If it returns '0', compressed data could not fit into 'dst'. |
| Yann Collet | 1efa31f | 2015-07-04 15:56:41 -0800 | [diff] [blame] | 226 | If there is an error, the function will return an ErrorCode (which can be tested using FSE_isError()). |
| Yann Collet | 4856a00 | 2015-01-24 01:58:16 +0100 | [diff] [blame] | 227 | */ |
| 228 | |
| 229 | |
| 230 | /* *** DECOMPRESSION *** */ |
| 231 | |
| Yann Collet | 3b994cb | 2016-01-06 01:58:37 +0100 | [diff] [blame] | 232 | /*! |
| Yann Collet | 1efa31f | 2015-07-04 15:56:41 -0800 | [diff] [blame] | 233 | FSE_readNCount(): |
| 234 | Read compactly saved 'normalizedCounter' from 'rBuffer'. |
| 235 | return : size read from 'rBuffer' |
| 236 | or an errorCode, which can be tested using FSE_isError() |
| 237 | maxSymbolValuePtr[0] and tableLogPtr[0] will also be updated with their respective values */ |
| 238 | size_t FSE_readNCount (short* normalizedCounter, unsigned* maxSymbolValuePtr, unsigned* tableLogPtr, const void* rBuffer, size_t rBuffSize); |
| Yann Collet | 4856a00 | 2015-01-24 01:58:16 +0100 | [diff] [blame] | 239 | |
| Yann Collet | 3b994cb | 2016-01-06 01:58:37 +0100 | [diff] [blame] | 240 | /*! |
| Yann Collet | 1efa31f | 2015-07-04 15:56:41 -0800 | [diff] [blame] | 241 | Constructor and Destructor of type FSE_DTable |
| Yann Collet | a787550 | 2015-08-07 15:21:00 +0100 | [diff] [blame] | 242 | Note that its size depends on 'tableLog' */ |
| 243 | typedef unsigned FSE_DTable; /* don't allocate that. It's just a way to be more restrictive than void* */ |
| Yann Collet | 1efa31f | 2015-07-04 15:56:41 -0800 | [diff] [blame] | 244 | FSE_DTable* FSE_createDTable(unsigned tableLog); |
| 245 | void FSE_freeDTable(FSE_DTable* dt); |
| 246 | |
| Yann Collet | 3b994cb | 2016-01-06 01:58:37 +0100 | [diff] [blame] | 247 | /*! |
| Yann Collet | 1efa31f | 2015-07-04 15:56:41 -0800 | [diff] [blame] | 248 | FSE_buildDTable(): |
| 249 | Builds 'dt', which must be already allocated, using FSE_createDTable() |
| Yann Collet | a787550 | 2015-08-07 15:21:00 +0100 | [diff] [blame] | 250 | return : 0, |
| Yann Collet | 1efa31f | 2015-07-04 15:56:41 -0800 | [diff] [blame] | 251 | or an errorCode, which can be tested using FSE_isError() */ |
| 252 | size_t FSE_buildDTable (FSE_DTable* dt, const short* normalizedCounter, unsigned maxSymbolValue, unsigned tableLog); |
| 253 | |
| Yann Collet | 3b994cb | 2016-01-06 01:58:37 +0100 | [diff] [blame] | 254 | /*! |
| Yann Collet | 1efa31f | 2015-07-04 15:56:41 -0800 | [diff] [blame] | 255 | FSE_decompress_usingDTable(): |
| Yann Collet | ae7aa06 | 2016-02-03 02:46:46 +0100 | [diff] [blame^] | 256 | Decompress compressed source @cSrc of size @cSrcSize using @dt |
| 257 | into @dst which must be already allocated. |
| 258 | return : size of regenerated data (necessarily <= @dstCapacity) |
| Yann Collet | 1efa31f | 2015-07-04 15:56:41 -0800 | [diff] [blame] | 259 | or an errorCode, which can be tested using FSE_isError() */ |
| Yann Collet | ae7aa06 | 2016-02-03 02:46:46 +0100 | [diff] [blame^] | 260 | size_t FSE_decompress_usingDTable(void* dst, size_t dstCapacity, const void* cSrc, size_t cSrcSize, const FSE_DTable* dt); |
| Yann Collet | 1efa31f | 2015-07-04 15:56:41 -0800 | [diff] [blame] | 261 | |
| Yann Collet | 3b994cb | 2016-01-06 01:58:37 +0100 | [diff] [blame] | 262 | /*! |
| Yann Collet | 1efa31f | 2015-07-04 15:56:41 -0800 | [diff] [blame] | 263 | Tutorial : |
| 264 | ---------- |
| 265 | (Note : these functions only decompress FSE-compressed blocks. |
| 266 | If block is uncompressed, use memcpy() instead |
| 267 | If block is a single repeated byte, use memset() instead ) |
| Yann Collet | 4856a00 | 2015-01-24 01:58:16 +0100 | [diff] [blame] | 268 | |
| 269 | The first step is to obtain the normalized frequencies of symbols. |
| Yann Collet | a787550 | 2015-08-07 15:21:00 +0100 | [diff] [blame] | 270 | This can be performed by FSE_readNCount() if it was saved using FSE_writeNCount(). |
| 271 | 'normalizedCounter' must be already allocated, and have at least 'maxSymbolValuePtr[0]+1' cells of signed short. |
| Yann Collet | 4856a00 | 2015-01-24 01:58:16 +0100 | [diff] [blame] | 272 | In practice, that means it's necessary to know 'maxSymbolValue' beforehand, |
| 273 | or size the table to handle worst case situations (typically 256). |
| Yann Collet | a787550 | 2015-08-07 15:21:00 +0100 | [diff] [blame] | 274 | FSE_readNCount() will provide 'tableLog' and 'maxSymbolValue'. |
| 275 | The result of FSE_readNCount() is the number of bytes read from 'rBuffer'. |
| 276 | Note that 'rBuffSize' must be at least 4 bytes, even if useful information is less than that. |
| Yann Collet | 4856a00 | 2015-01-24 01:58:16 +0100 | [diff] [blame] | 277 | If there is an error, the function will return an error code, which can be tested using FSE_isError(). |
| 278 | |
| Yann Collet | a787550 | 2015-08-07 15:21:00 +0100 | [diff] [blame] | 279 | The next step is to build the decompression tables 'FSE_DTable' from 'normalizedCounter'. |
| Yann Collet | 4856a00 | 2015-01-24 01:58:16 +0100 | [diff] [blame] | 280 | This is performed by the function FSE_buildDTable(). |
| Yann Collet | 1efa31f | 2015-07-04 15:56:41 -0800 | [diff] [blame] | 281 | The space required by 'FSE_DTable' must be already allocated using FSE_createDTable(). |
| Yann Collet | 4856a00 | 2015-01-24 01:58:16 +0100 | [diff] [blame] | 282 | If there is an error, the function will return an error code, which can be tested using FSE_isError(). |
| 283 | |
| Yann Collet | 1efa31f | 2015-07-04 15:56:41 -0800 | [diff] [blame] | 284 | 'FSE_DTable' can then be used to decompress 'cSrc', with FSE_decompress_usingDTable(). |
| Yann Collet | a787550 | 2015-08-07 15:21:00 +0100 | [diff] [blame] | 285 | 'cSrcSize' must be strictly correct, otherwise decompression will fail. |
| 286 | FSE_decompress_usingDTable() result will tell how many bytes were regenerated (<= dstCapacity). |
| 287 | If there is an error, the function will return an error code, which can be tested using FSE_isError(). (ex: dst buffer too small) |
| Yann Collet | 4856a00 | 2015-01-24 01:58:16 +0100 | [diff] [blame] | 288 | */ |
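The note at the top of the tutorial (memcpy() for uncompressed blocks, memset() for a single repeated byte) can be sketched as a small user-layer dispatcher. The one-byte tag values, the `fse_decoder` typedef and `decode_block()` are hypothetical names for this illustration. The callback has the same signature as FSE_decompress(), so that function could be passed directly; `regenSize` is assumed to be known from the user's own framing.

```c
#include <assert.h>
#include <stddef.h>
#include <string.h>

/* Same signature as FSE_decompress(), so it can be passed as 'decodeFse'. */
typedef size_t (*fse_decoder)(void* dst, size_t maxDstSize,
                              const void* cSrc, size_t cSrcSize);

/* Hypothetical 1-byte block tags defined by the user layer (not part of FSE). */
enum { TAG_RAW = 0, TAG_RLE = 1, TAG_FSE = 2 };

/* Decodes one tagged block of 'blockSize' bytes into 'dst'.
   Returns the regenerated size, or 0 for malformed input (hypothetical convention).
   For TAG_FSE, the decoder's return value is passed through unchanged,
   so the caller should still test it with FSE_isError(). */
size_t decode_block(void* dst, size_t dstCapacity, size_t regenSize,
                    const unsigned char* block, size_t blockSize,
                    fse_decoder decodeFse)
{
    assert(dst != NULL && block != NULL);
    if (blockSize < 1 || regenSize > dstCapacity) return 0;

    switch (block[0]) {
    case TAG_RAW:   /* uncompressed: plain copy */
        if (blockSize - 1 != regenSize) return 0;
        memcpy(dst, block + 1, regenSize);
        return regenSize;
    case TAG_RLE:   /* single repeated byte: fill */
        if (blockSize != 2) return 0;
        memset(dst, block[1], regenSize);
        return regenSize;
    case TAG_FSE:   /* FSE-compressed payload: delegate */
        if (decodeFse == NULL) return 0;
        return decodeFse(dst, dstCapacity, block + 1, blockSize - 1);
    default:
        return 0;
    }
}
```

Keeping this dispatch outside FSE matches the stated intent of delegating header management to the user layer.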
| 289 | |
| 290 | |
| Yann Collet | 4856a00 | 2015-01-24 01:58:16 +0100 | [diff] [blame] | 291 | #if defined (__cplusplus) |
| 292 | } |
| 293 | #endif |
| Yann Collet | aa07405 | 2015-10-30 11:21:50 +0100 | [diff] [blame] | 294 | |
| 295 | #endif /* FSE_H */ |