Make function types more general and symmetric #356

rossberg · 2024-05-08T01:59:54Z

Currently, the grammar for function types is as follows:

functype      ::= (func <paramlist> <resultlist>)
paramlist     ::= (param "<label>" <valtype>)*
resultlist    ::= (result "<label>" <valtype>)* | (result <valtype>)

All parameters and results must be named, except a singleton result.

There is broad variety across programming languages in whether and how they allow/require/distinguish named vs unnamed parameters, as well as unnamed vs unnamed results, at a function's def site, use site, or both. The current design is rather specific in that regard and somewhat biased. From purely an interface perspective, there is no reason to treat parameters differently from results, or allow omitting names in some cases but not others. And at least for some languages, it would be useful to make explicit which components have "proper" names and which ones haven't, since that enables more idiomatic bindings.

See here for more discussion.

There are several degrees to which the grammar could be generalised:

paramlist     ::= (param "<label>" <valtype>)* | (param <valtype>)
resultlist    ::= (result "<label>" <valtype>)* | (result <valtype>)

or

paramlist     ::= (param "<label>" <valtype>)* | (param <valtype>)*
resultlist    ::= (result "<label>" <valtype>)* | (result <valtype>)*

or

paramlist     ::= (param "<label>"? <valtype>)*
resultlist    ::= (result "<label>"? <valtype>)*

I'd suggest one of the latter two, which make names uniformly optional, and then specify a canonical scheme for synthesised names in contexts/bind-gens that need them, for example, "_1", "_2", etc. based on position. This would apply symmetrically to parameters and results.

The text was updated successfully, but these errors were encountered:

lukewagner · 2024-05-08T17:25:43Z

Personally, I like the symmetry in the abstract, and I can think of a few times where, when writing a WIT interface, I feel like I'm being forced to add a parameter name that adds no value (e.g., handle: func(request: request) -> result<response, error-code>). I do worry, though, that the additional degree of freedom invites more subjective stylistic variation (some folks are going to want to give everything a parameter name, others are going to want to leave them off by default). E.g., WASI would need to establish a style guideline on this with the criteria for named-vs-unnamed.

That being said, looking at all the WIT interfaces I'm seeing being written in practice, I basically never see use of non-empty (result "label" <valtype>)+. I expect most folks don't even know it's possible and default to defining a record that is returned, leading to the stylistic question of "when should you return a record vs. use multi-named-return?". This has made me wonder whether we should actually lean into the asymmetry harder and deprecate multi-return, so that resultlist ::= ϵ | (result <valtype>).

I can see arguments for both cases; it makes me think that perhaps the current state of the proposal is stuck in a sort of uncanny valley between fully embracing symmetry or asymmetry, and so we should shift to one side or the other. But which one I'm not sure. I'd be interested to hear more thoughts on this!

oovm · 2024-05-12T04:14:52Z

What confuses me is whether the return value with label is a tuple class or an anonymous class.

If it is a tuple class, then you can actually express the real tuple<f32, f32>

If it is an anonymous class, are (a: u32, b: f32) and (b: f32, a: u32) equivalent?

lukewagner · 2024-05-14T00:06:16Z

As it is currently, since the string names of params and results are part of the function type and thus part of function-type-equality, the latter two are distinct (when used as a function's results), and so you'd think of them more like a record type.

Mossaka · 2024-07-08T16:34:37Z

FWIW, Go is the only language that I know to support named/unnamed multi-return without treating them as a tuple, unlike languages such as Python. Here is an example of "naked" named multi-return.

func split(sum int) (x, y int) {
  x = sum * 4 / 9
  y = sum - x
  return
}

From Go's documentation, it says that "these names should be used to document the meaning of the return values."

rossberg · 2024-07-08T17:58:05Z

Another classic example is the Scheme/Lisp family of languages.

lukewagner mentioned this issue May 8, 2024

Comments/Questions #276

Closed

lukewagner mentioned this issue Jun 14, 2024

Remove named multi-return from function types (for now at least) #368

Merged

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Make function types more general and symmetric #356

Make function types more general and symmetric #356

rossberg commented May 8, 2024

lukewagner commented May 8, 2024

oovm commented May 12, 2024 •

edited

Loading

lukewagner commented May 14, 2024

Mossaka commented Jul 8, 2024

rossberg commented Jul 8, 2024

Make function types more general and symmetric #356

Make function types more general and symmetric #356

Comments

rossberg commented May 8, 2024

lukewagner commented May 8, 2024

oovm commented May 12, 2024 • edited Loading

lukewagner commented May 14, 2024

Mossaka commented Jul 8, 2024

rossberg commented Jul 8, 2024

oovm commented May 12, 2024 •

edited

Loading