doc: define trust in "Validating a Lean Proof" by OrfeasLitos · Pull Request #847 · leanprover/reference-manual

OrfeasLitos · 2026-05-10T21:28:20Z

Trust was previously discussed in an informal way.
Trust is now defined along three axes: proof authors, verification software, and correctness of statement.
The trust definition is also referred to in "Elaboration and Compilation".

Trust was previously discussed in an informal way. Trust is now defined along three axes: proof authors, verification software, and correctness of statement. The trust definition is also referred to in "Elaboration and Compilation".

OrfeasLitos · 2026-05-16T17:22:59Z

Some more context can be found in this Zulip discussion.

github-actions · 2026-05-16T17:26:05Z

Preview for this PR is ready! 🎉 (also as a proofreading version). built with commit 5d349db.

nomeata · 2026-05-17T08:08:34Z

-In particular, we use {deftech}_honest_ when the goal is to create a valid proof.
-This allows for mistakes and bugs in proofs and meta-code (tactics, attributes, commands, etc.), but not for code that clearly only serves to circumvent the system (such as using the {option}`debug.skipKernelTC`).
-Note that the {keyword}`unsafe` marker on API functions is unrelated to whether this API can be used in an dishonest way.
+* Regarding the _proof author_, it matters a lot whether one is dealing with an {tech}[honest] proof attempt, and needs protection against only benign mistakes, or a possibly-{tech}[malicious] proof attempt that actively tries to mislead.


“matters a lot” is a bit informal for the reference manual.

Also, the phrasing is a bit off: the sentence beings with “proof author” but then talks about honest/malicoius proof “attempts”. And below its suddenly “code”. Not sure how to fix best, though.

nomeata · 2026-05-17T08:09:46Z


-Depending on the circumstances, additional steps may be recommended to rule out misleading proofs.
-In particular, it matters a lot whether one is dealing with an {tech}[honest] proof attempt, and needs protection against only benign mistakes, or a possibly-{tech}[malicious] proof attempt that actively tries to mislead.
+The steps needed to rule out a misleading proof depend on the author's {deftech}[trust] assumptions towards the _proof author_ (including authors that proved depended-upon theorems), the _verification software_, and the _correctness of the statement_.


This is a good start. I wonder if we should differentiate between “proof author” and “theorem statement author”, but probably confusing at this point; the difference really only matters in the comparator workflow.

nomeata · 2026-05-17T08:17:35Z

+
+* Regarding the verification software, Lean takes {ref "elaboration-compilation"}[a number of steps] to process a theorem and its proof.
+Different uses correspond to trusting different steps of this pipeline.
+At a minimum, the Lean kernel or an alternative kernel such as [`nanoda`](https://github.com/ammkrn/nanoda_lib) has to be trusted.


The middle sentence feels rather unhelpful here, not sure what it should tell the user. If this bullet point is meant to introduce the concept of the trusted code base, maybe it should say say.

Maybe it should be a third bullet and say something like

* Depending on the circumstances and trust assumptions described in the two bullets above, the validity of the result relies on different software components. This is called the “trusted computing base” (TCB), and typically comprises at least the Lean kernel. Most of Leans {ref "elaboration-compilation"}[elaboration pipeline], in particular proof tactics, are not part of the TCB.

nomeata · 2026-05-17T08:23:06Z

+* Regarding the correctness of the statement, it is important to distinguish the question “does the theorem have a valid proof” from “what does the theorem statement mean”.
+No matter what software is used and how trusted the environment is, a theorem is meaningful only if its author(s) and user(s) are certain that its statement mathematically expresses its intended informal meaning.
+
+    As written, the statement is {tech (key := "Lean elaborator")}[elaborated] before being passed to the kernel.
+    To avoid trusting the {tech (key := "Lean elaborator")}[elaborator], one has to {keywordOf Lean.Parser.Command.print}`#print` the statement to get the elaborated type and manually verify that it indeed expresses the desired statement.
+    This brings {keywordOf Lean.Parser.Command.print}`#print` into the trusted base, which is less than 1000 lines of code.


This is good in spirit, but rather flawed in content:

#print uses the delaborator, which is far more than 1000 lines of code, is known to be easily cheateable (custom delaborator, custom notation). One would rather have to use pp.raw.

Reading pp.raw output is not user friendly, once type classes come into play, even experts will have trouble with this.

Recursive definitions are elaborated into something unintelligle. There the best bet is to manually verify equational theorems that characterize the theorems.

And even then malicious code can easily compromise #print. So to be bulletproof, one would have to use a pretty-printer from an external checker, making all the usability things even worse.

TL;DR: We don't have a practical story for this problem yet, and I definitely don’t want to point out half-baked incomplete solutions here. The best we have right now is “Assume theorem statement author is honest and include the elaborator in your TCB for understanding theorem statement” followed by “Review the raw dumps of the code of the theorem statement and all involved definitions generated from the external checker and hire an expert to make sense of them”.

That's quite a big issue, given existing claims about a small trusted base. The small trusted base claim doesn't appear only in the docs "enabling the trusted kernel to be very small", but also in the Lean FPO home page. Assuming what you say is precise, then this claim is flat out incorrect.

What makes the situation even worse is that the need to trust the elaborator (or to "review raw dumps of code") is present both when we care whether "the theorem has a valid proof" or "what does the theorem statement mean", so it seems to me we can't lawyer our way into making the small trusted base partially correct.

OrfeasLitos marked this pull request as draft May 10, 2026 21:32

OrfeasLitos changed the title ~~Clarify trust requirements on kernel~~ doc: clarify trust requirements on kernel May 13, 2026

doc: define trust in "Validating a Lean Proof"

5d349db

Trust was previously discussed in an informal way. Trust is now defined along three axes: proof authors, verification software, and correctness of statement. The trust definition is also referred to in "Elaboration and Compilation".

OrfeasLitos force-pushed the trusted-base branch from 8424fbb to 5d349db Compare May 16, 2026 17:19

OrfeasLitos changed the title ~~doc: clarify trust requirements on kernel~~ doc: define trust in "Validating a Lean Proof" May 16, 2026

OrfeasLitos marked this pull request as ready for review May 16, 2026 17:20

david-christiansen added the HTML available HTML has been generated for this PR label May 16, 2026

nomeata reviewed May 17, 2026

View reviewed changes

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

doc: define trust in "Validating a Lean Proof"#847

doc: define trust in "Validating a Lean Proof"#847
OrfeasLitos wants to merge 1 commit into
leanprover:mainfrom
OrfeasLitos:trusted-base

OrfeasLitos commented May 10, 2026 •

edited

Loading

Uh oh!

OrfeasLitos commented May 16, 2026

Uh oh!

github-actions Bot commented May 16, 2026

Uh oh!

nomeata May 17, 2026

Uh oh!

nomeata May 17, 2026

Uh oh!

nomeata May 17, 2026

Uh oh!

nomeata May 17, 2026

Uh oh!

OrfeasLitos May 17, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

Conversation

OrfeasLitos commented May 10, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

OrfeasLitos commented May 16, 2026

Uh oh!

github-actions Bot commented May 16, 2026

Uh oh!

nomeata May 17, 2026

Choose a reason for hiding this comment

Uh oh!

nomeata May 17, 2026

Choose a reason for hiding this comment

Uh oh!

nomeata May 17, 2026

Choose a reason for hiding this comment

Uh oh!

nomeata May 17, 2026

Choose a reason for hiding this comment

Uh oh!

OrfeasLitos May 17, 2026

Choose a reason for hiding this comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

OrfeasLitos commented May 10, 2026 •

edited

Loading