How are Scheme closures formally defined?

Question

I’ve recently ventured into the awesome land of writing a Scheme interpreter, and I’ve run into a roadblock: closures. From what I understand, they encapsulate a local environment with a procedure that gets restored every time the closure is called (this may not be exactly right). The issue that I can’t seem to find anywhere online is how a closure is formally defined i.e., in an EBNF grammar. Most examples I’ve seen say that a closure is a procedure with zero arguments that has a lambda expression nested inside a let expression. Is this the only way to define a Scheme closure? More importantly, if there’s no formal way to formally define a closure, how do you actually interpret it? What happens if you translate all let expressions to lambdas? For example, if I declare a closure as such

(define (foo) (let ((y 0)) (λ (x) (…))))

Then assign it to a variable

(define bar (foo))

In what order is this evaluated? From what I’ve seen, when foo is declared, it stores a pointer to the parent environment, and declares its own environment. If I call (bar), should I substitute in the saved local environment immediately after?

Closures are a question of semantics, not syntax. A definition of the grammar will show no sign of them. — molbdnilo, Jan 20 '22 at 05:03
@molbdnilo Is there a semantic definition, then? I mainly just want to know how to tell my interpreter “okay, this is a closure.”? — TheSinisterStone, Jan 20 '22 at 05:13
There is no definition in the Scheme report, as far as I know, but there are plenty of juicy details in R. Kent Dybvig's (of Chez Scheme fame) [dissertation](https://legacy.cs.indiana.edu/~dyb/pubs/3imp.pdf). — molbdnilo, Jan 20 '22 at 08:14
The chapter [Metalinguistic Abstraction](https://mitpress.mit.edu/sites/default/files/sicp/full-text/book/book-Z-H-25.html#%_chap_4) in the [SICP](https://mitpress.mit.edu/sites/default/files/sicp/index.html) explains all you need. Take a look at `make-procedure`. — ceving, Jan 20 '22 at 08:22
see if [this](https://stackoverflow.com/a/34910496/849891) answer of mine helps in any way. it shows the original McCarthy's Lisp in 13-15 lines of pseudocode, augmented to support closures (the "funarg device"). — Will Ness, Jan 23 '22 at 07:45
R7RS defines formal semantics for closures, see 7.2. If you're not a big fan of denotational semantics and would have preferred operational, see "An Operational Semantics for Scheme", J. Matthews. — SK-logic, Jan 25 '22 at 09:55

ignis volens · Accepted Answer · 2022-01-20T15:13:13.807

I don't think it's helpful, today, to think of closures as some special magic thing: long ago in languages in the prehistory of Scheme they were, but in modern languages they are not a special thing at all: they just follow from the semantics of the language in an obvious way.

The two important things (these are both quotes from R7RS, both from section 1.1 are these:

Scheme is a statically scoped programming language. Each use of a variable is associated with a lexically apparent binding of that variable.

and

All objects created in the course of a Scheme computation, including procedures and continuations, have unlimited extent.

What this means is that Scheme is a language with lexical scope and indefinite extent: any variable binding exists for as long as there is possibility of reference. And, conveniently, you can always tell statically (ie by reading the code) what bindings a bit of code may refer to.

And the important thing here is that these rules are absurdly simple: there are no weird special cases. If a reference to a variable binding is visible in a bit of code, which you can tell by looking at the code, then it is visible. It's not visible only sometimes, or only during some interval, or only if the Moon is gibbous: it's visible.

But the implication of the rules is that procedures, somehow, need to remember all the bindings that they reference or may reference and which were in scope when they were created. Because scope is static it is always possible to determine which bindings are in scope (disclaimer: I'm not sure how this works formally for global bindings).

So then the very old-fashioned definition of a closure would be a procedure defined in a scope in which bindings to which it refers exist. This would be a closure:

(define x
  (let ((y 1))
    (λ (z)
      (set! y (+ y z))
      y)))

And this procedure would return a closure:

(define make-incrementor
  (λ (val)
    (λ ()
      (let ((v val))
        (set! val (+ val 1))
        v))))

But you can see that in both cases the behaviour of these things just follows immediately from the scope and extent rules of the language: there's no special 'this is a closure' rule.

In the first case the function which ends up as the value of x both refers to and mutates the binding of y as well as referring to the binding of z established when it was called.

In the second case, calling make-incrementor establishes a binding for val, which binding is then referred to and mutated by the function that it returns.

I'm never sure if it helps to understand things to turn all the lets into λs, but the second thing turns into

(define make-incrementor
  (λ (val)
    (λ ()
      ((λ (v)
         (set! val (+ val 1))
         v)
       val))))

And you can see now that the function returned by make-incrementor, when called, now immediately calls another function which binds v solely to its argument, which itself is the value of the binding established by make-incrementor: it's doing this simply to keep hold of the pre-increment value of that binding of course.

Again, the rules are simple: you can just look at the code and see what it does. There is no special 'closure' case.

If you actually do want the formal semantics that gives rise to this, then 7.2 of R7RS has the formal semantics of the language.

alinsoar · Answer 2 · 2022-01-23T09:34:01.333

A closure is a pair of a pointer to some code and a pointer to the environment the code should be evaluated in, which is the same as the environment the closure was created in.

The presence of closures in the language makes the environment look like a tree. Without closures the environment is like a stack. This is how the environment was in the first lisp systems. Stallman stated he chose dynamic environment in elisp because the static environment was hard to understand at the time (1986).

The closures are one of the most central concepts of computation and they allow the derivation of many other concepts like coroutines, fibers, continuations, threads, thunks to delay, etc etc.

How are Scheme closures formally defined?

2 Answers2