- File www/varint.wiki — part of check-in [eccdf70b22] at 2013-01-20 02:52:43 on branch trunk — Fix typos in the VARINT documentation. (user: drh size: 3167)
A variable length integer is an encoding of 64-bit unsigned integers into between 1 and 9 bytes. The encoding has the following properties:
- Smaller (and more common) values use fewer bytes and take up less space than larger (and less common) values.
- The length of any varint can be determined by looking at just the first byte of the encoding.
- Lexicographical and numeric ordering for varints are the same. Hence if a group of varints are order lexicographically (that is to say, if they are order by memcmp() with shorter varints coming first) then those varints will also be in numeric order. This property means that varints can be used as keys in the key/value backend storage and the records will occur in numerical order of the keys.
The encoding is described by algorithms to decode (convert from varint to 8-byte unsigned integer) and to encode (convert from 8-byte unsigned integer to varint). Treat each byte of the encoding as an unsigned integer between 0 and 255. Let the bytes of the encoding be called A0, A1, A2, ..., A8.
- If A0 is between 0 and 240 inclusive, then the result is the value of A0.
- If A0 is between 241 and 248 inclusive, then the result is 240+256*(A0-241)+A1.
- If A0 is 249 then the result is 2287+256*A1+A2.
- If A0 is 250 then the result is A1..A3 as a 3-byte big-ending integer.
- If A0 is 251 then the result is A1..A4 as a 4-byte big-ending integer.
- If A0 is 252 then the result is A1..A5 as a 5-byte big-ending integer.
- If A0 is 253 then the result is A1..A6 as a 6-byte big-ending integer.
- If A0 is 254 then the result is A1..A7 as a 7-byte big-ending integer.
- If A0 is 255 then the result is A1..A8 as a 8-byte big-ending integer.
Let the input value be V.
- If V<=240 then output a single by A0 equal to V.
- If V<=2287 then output A0 as (V-240)/256 + 241 and A1 as (V-240)%256.
- If V<=67823 then output A0 as 249, A1 as (V-2288)/256, and A2 as (V-2288)%256.
- If V<=16777215 then output A0 as 250 and A1 through A3 as a big-endian 3-byte integer.
- If V<=4294967295 then output A0 as 251 and A1..A4 as a big-ending 4-byte integer.
- If V<=1099511627775 then output A0 as 252 and A1..A5 as a big-ending 5-byte integer.
- If V<=281474976710655 then output A0 as 253 and A1..A6 as a big-ending 6-byte integer.
- If V<=72057594037927935 then output A0 as 254 and A1..A7 as a big-ending 7-byte integer.
- Otherwise then output A0 as 255 and A1..A8 as a big-ending 8-byte integer.
Bytes Max Value Digits 1 240 2.3 2 2287 3.3 3 67823 4.8 4 224-1 7.2 5 232-1 9.6 6 240-1 12.0 7 248-1 14.4 8 256-1 16.8 9 264-1 19.2