Values

Bytes

Bytes encode themselves.

\[\begin{split}\begin{array}{llcll@{\qquad}l} \def\mathdef1979#1{{}}\mathdef1979{byte} & \href{../binary/values.html#binary-byte}{\mathtt{byte}} &::=& \def\mathdef2018#1{\mathtt{0x#1}}\mathdef2018{00} &\Rightarrow& \def\mathdef2019#1{\mathtt{0x#1}}\mathdef2019{00} \\ &&|&& \dots \\ &&|& \def\mathdef2020#1{\mathtt{0x#1}}\mathdef2020{FF} &\Rightarrow& \def\mathdef2021#1{\mathtt{0x#1}}\mathdef2021{FF} \\ \end{array}\end{split}\]

Integers

All integers are encoded using the LEB128 variable-length integer encoding, in either unsigned or signed variant.

Unsigned integers are encoded in unsigned LEB128 format. As an additional constraint, the total number of bytes encoding a value of type \(\href{../syntax/values.html#syntax-int}{\mathit{u}N}\) must not exceed \(\mathrm{ceil}(N/7)\) bytes.

\[\begin{split}\begin{array}{llclll@{\qquad}l} \def\mathdef1979#1{{}}\mathdef1979{unsigned integer} & \href{../binary/values.html#binary-int}{\def\mathdef1980#1{{\mathtt{u}#1}}\mathdef1980{N}} &::=& n{:}\href{../binary/values.html#binary-byte}{\mathtt{byte}} &\Rightarrow& n & (\mathrel{\mbox{if}} n < 2^7 \wedge n < 2^N) \\ &&|& n{:}\href{../binary/values.html#binary-byte}{\mathtt{byte}}~~m{:}\def\mathdef2022#1{{\mathtt{u}#1}}\mathdef2022{(N\mathtt{-7})} &\Rightarrow& 2^7\cdot m + (n-2^7) & (\mathrel{\mbox{if}} n \geq 2^7 \wedge N > 7) \\ \end{array}\end{split}\]

Signed integers are encoded in signed LEB128 format, which uses a two’s complement representation. As an additional constraint, the total number of bytes encoding a value of type \(\href{../syntax/values.html#syntax-int}{\mathit{s}N}\) must not exceed \(\mathrm{ceil}(N/7)\) bytes.

\[\begin{split}\begin{array}{llclll@{\qquad}l} \def\mathdef1979#1{{}}\mathdef1979{signed integer} & \href{../binary/values.html#binary-int}{\def\mathdef1986#1{{\mathtt{s}#1}}\mathdef1986{N}} &::=& n{:}\href{../binary/values.html#binary-byte}{\mathtt{byte}} &\Rightarrow& n & (\mathrel{\mbox{if}} n < 2^6 \wedge n < 2^{N-1}) \\ &&|& n{:}\href{../binary/values.html#binary-byte}{\mathtt{byte}} &\Rightarrow& n-2^7 & (\mathrel{\mbox{if}} 2^6 \leq n < 2^7 \wedge n \geq 2^7-2^{N-1}) \\ &&|& n{:}\href{../binary/values.html#binary-byte}{\mathtt{byte}}~~m{:}\def\mathdef2023#1{{\mathtt{s}#1}}\mathdef2023{(N\mathtt{-7})} &\Rightarrow& 2^7\cdot m + (n-2^7) & (\mathrel{\mbox{if}} n \geq 2^7 \wedge N > 7) \\ \end{array}\end{split}\]

Uninterpreted integers are encoded as signed integers.

\[\begin{array}{llclll@{\qquad\qquad}l} \def\mathdef1979#1{{}}\mathdef1979{uninterpreted integer} & \href{../binary/values.html#binary-int}{\def\mathdef1991#1{{\mathtt{i}#1}}\mathdef1991{N}} &::=& n{:}\href{../binary/values.html#binary-int}{\def\mathdef1986#1{{\mathtt{s}#1}}\mathdef1986{N}} &\Rightarrow& i & (\mathrel{\mbox{if}} n = \href{../exec/numerics.html#aux-signed}{\mathrm{signed}}_N(i)) \end{array}\]

Note

The side conditions \(N > 7\) in the productions for non-terminal bytes of the \(\def\mathdef2024#1{{\mathit{u#1}}}\mathdef2024{}\) and \(\def\mathdef2025#1{{\mathit{s#1}}}\mathdef2025{}\) encodings restrict the encoding’s length. However, “trailing zeros” are still allowed within these bounds. For example, \(\def\mathdef2026#1{\mathtt{0x#1}}\mathdef2026{03}\) and \(\def\mathdef2027#1{\mathtt{0x#1}}\mathdef2027{83}~\def\mathdef2028#1{\mathtt{0x#1}}\mathdef2028{00}\) are both well-formed encodings for the value \(3\) as a \(\href{../syntax/values.html#syntax-int}{\mathit{u8}}\). Similarly, either of \(\def\mathdef2029#1{\mathtt{0x#1}}\mathdef2029{7e}\) and \(\def\mathdef2030#1{\mathtt{0x#1}}\mathdef2030{FE}~\def\mathdef2031#1{\mathtt{0x#1}}\mathdef2031{7F}\) and \(\def\mathdef2032#1{\mathtt{0x#1}}\mathdef2032{FE}~\def\mathdef2033#1{\mathtt{0x#1}}\mathdef2033{FF}~\def\mathdef2034#1{\mathtt{0x#1}}\mathdef2034{7F}\) are well-formed encodings of the value \(-2\) as a \(\href{../syntax/values.html#syntax-int}{\mathit{s16}}\).

The side conditions on the value \(n\) of terminal bytes further enforce that any unused bits in these bytes must be \(0\) for positive values and \(1\) for negative ones. For example, \(\def\mathdef2035#1{\mathtt{0x#1}}\mathdef2035{83}~\def\mathdef2036#1{\mathtt{0x#1}}\mathdef2036{10}\) is malformed as a \(\href{../syntax/values.html#syntax-int}{\mathit{u8}}\) encoding. Similarly, both \(\def\mathdef2037#1{\mathtt{0x#1}}\mathdef2037{83}~\def\mathdef2038#1{\mathtt{0x#1}}\mathdef2038{3E}\) and \(\def\mathdef2039#1{\mathtt{0x#1}}\mathdef2039{FF}~\def\mathdef2040#1{\mathtt{0x#1}}\mathdef2040{7B}\) are malformed as \(\href{../syntax/values.html#syntax-int}{\mathit{s8}}\) encodings.

Floating-Point

Floating-point values are encoded directly by their IEEE 754 (Section 3.4) bit pattern in little endian byte order:

\[\begin{split}\begin{array}{llclll@{\qquad\qquad}l} \def\mathdef1979#1{{}}\mathdef1979{floating-point value} & \href{../binary/values.html#binary-float}{\def\mathdef1994#1{{\mathtt{f}#1}}\mathdef1994{N}} &::=& b^\ast{:\,}\href{../binary/values.html#binary-byte}{\mathtt{byte}}^{N/8} &\Rightarrow& \href{../exec/numerics.html#aux-bytes}{\mathrm{bytes}}_{\href{../syntax/values.html#syntax-float}{\mathit{f}N}}^{-1}(b^\ast) \\ \end{array}\end{split}\]

Names

Names are encoded as a vector of bytes containing the Unicode (Section 3.9) UTF-8 encoding of the name’s character sequence.

\[\begin{split}\begin{array}{llclllll} \def\mathdef1979#1{{}}\mathdef1979{name} & \href{../binary/values.html#binary-name}{\mathtt{name}} &::=& b^\ast{:}\href{../binary/conventions.html#binary-vec}{\mathtt{vec}}(\href{../binary/values.html#binary-byte}{\mathtt{byte}}) &\Rightarrow& \href{../syntax/values.html#syntax-name}{\mathit{name}} && (\mathrel{\mbox{if}} \href{../binary/values.html#binary-utf8}{\mathrm{utf8}}(\href{../syntax/values.html#syntax-name}{\mathit{name}}) = b^\ast) \\ \end{array}\end{split}\]

The auxiliary \(\href{../binary/values.html#binary-utf8}{\mathrm{utf8}}\) function expressing this encoding is defined as follows:

\[\begin{split}\begin{array}{@{}l@{}} \begin{array}{@{}lcl@{\qquad}l@{}} \href{../binary/values.html#binary-utf8}{\mathrm{utf8}}(c^\ast) &=& (\href{../binary/values.html#binary-utf8}{\mathrm{utf8}}(c))^\ast \\[1ex] \href{../binary/values.html#binary-utf8}{\mathrm{utf8}}(c) &=& b & (\begin{array}[t]{@{}c@{~}l@{}} \mathrel{\mbox{if}} & c < \def\mathdef2041#1{\mathrm{U{+}#1}}\mathdef2041{80} \\ \wedge & c = b) \\ \end{array} \\ \href{../binary/values.html#binary-utf8}{\mathrm{utf8}}(c) &=& b_1~b_2 & (\begin{array}[t]{@{}c@{~}l@{}} \mathrel{\mbox{if}} & \def\mathdef2042#1{\mathrm{U{+}#1}}\mathdef2042{80} \leq c < \def\mathdef2043#1{\mathrm{U{+}#1}}\mathdef2043{800} \\ \wedge & c = 2^6(b_1-\def\mathdef2044#1{\mathtt{0x#1}}\mathdef2044{C0})+(b_2-\def\mathdef2045#1{\mathtt{0x#1}}\mathdef2045{80})) \\ \end{array} \\ \href{../binary/values.html#binary-utf8}{\mathrm{utf8}}(c) &=& b_1~b_2~b_3 & (\begin{array}[t]{@{}c@{~}l@{}} \mathrel{\mbox{if}} & \def\mathdef2046#1{\mathrm{U{+}#1}}\mathdef2046{800} \leq c < \def\mathdef2047#1{\mathrm{U{+}#1}}\mathdef2047{D800} \vee \def\mathdef2048#1{\mathrm{U{+}#1}}\mathdef2048{E000} \leq c < \def\mathdef2049#1{\mathrm{U{+}#1}}\mathdef2049{10000} \\ \wedge & c = 2^{12}(b_1-\def\mathdef2050#1{\mathtt{0x#1}}\mathdef2050{E0})+2^6(b_2-\def\mathdef2051#1{\mathtt{0x#1}}\mathdef2051{80})+(b_3-\def\mathdef2052#1{\mathtt{0x#1}}\mathdef2052{80})) \\ \end{array} \\ \href{../binary/values.html#binary-utf8}{\mathrm{utf8}}(c) &=& b_1~b_2~b_3~b_4 & (\begin{array}[t]{@{}c@{~}l@{}} \mathrel{\mbox{if}} & \def\mathdef2053#1{\mathrm{U{+}#1}}\mathdef2053{10000} \leq c < \def\mathdef2054#1{\mathrm{U{+}#1}}\mathdef2054{110000} \\ \wedge & c = 2^{18}(b_1-\def\mathdef2055#1{\mathtt{0x#1}}\mathdef2055{F0})+2^{12}(b_2-\def\mathdef2056#1{\mathtt{0x#1}}\mathdef2056{80})+2^6(b_3-\def\mathdef2057#1{\mathtt{0x#1}}\mathdef2057{80})+(b_4-\def\mathdef2058#1{\mathtt{0x#1}}\mathdef2058{80})) \\ \end{array} \\ \end{array} \\ \mathrel{\mbox{where}} b_2, b_3, b_4 < \def\mathdef2059#1{\mathtt{0x#1}}\mathdef2059{C0} \\ \end{array}\end{split}\]

Note

Unlike in some other formats, name strings are not 0-terminated.