Semantic Interpretation Revisited #420

mikeday · 2022-11-27T11:14:33Z

In issue #166 I brought up the semantic interpretation of binary formats, but should have started with some simple examples such as the many ways in which you can interpret formats that parse a single byte:

format: U8
meaning: 8-bit unsigned integer
type: {n:Int | 0 ≤ n < 256} (or possibly {n:Nat | n < 256})

format: S8
meaning: 8-bit signed integer
type: {n:Int | -128 ≤ n < 128 }

format: if flags.signed then S8 else U8
meaning: an 8-bit integer that is either signed or unsigned depending on a flag bit seen earlier
type: if flags.signed then {n:Int | -128 ≤ n < 128 } else {n:Int | 0 ≤ n < 256}

format: Byte (or Bits8 or BitVector 8 or whatever)
meaning: a vector of eight bits
type: Array 8 Bool

format: ZeroPaddingByte (just a handy typedef)
meaning: eight bits, all zero
type: Unit (conveys no information)

format: ASCIIChar
meaning: an 7-bit ASCII character (high bit must be zero)
type: ASCIIChar (Unicode subtyping questions?)

format: UTF8CodeUnit
meaning: an 8-bit UTF-8 code unit
type: ??? (string questions?)

And so on, clearly there are many formats that match a single byte and they can have many different semantic interpretations and hence types as a result! The type of values parsed by these formats then affects how they can be used, for example whether they can be array lengths or indices.

However if we consider more complex examples, like a partial map or function represented as an array of pairs:

format Map {
    len: U8,
    pairs: Array len Pair
}

format Pair {
    fst: U8,
    snd: U8
}

The interpretation of these formats is records, integers, and arrays, and treating them as a map would require another layer on top of that, like making them an instance of a type class interface with methods.

The text was updated successfully, but these errors were encountered:

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Semantic Interpretation Revisited #420

Semantic Interpretation Revisited #420

mikeday commented Nov 27, 2022

Semantic Interpretation Revisited #420

Semantic Interpretation Revisited #420

Comments

mikeday commented Nov 27, 2022