Futarchy for Math

Nov 13, 2025

Previously, I wrote about Vadim’s work towards a decentralized marketplace for formal theorems. Now, he’s announced that the code for this site has been made open-source! 🎉

In honor of this, and in keeping with some of my recent posts on futarchy, here are some thoughts on the potential synergies of these epistemic technologies.

Theorem marketplaces and the theory-building problem

In a footnote on my previous post, I alluded to how I thought that one challenge formal theorem bounties face has to do with “theory building”, (which is a term I took from this essay by W. T. Gowers, I encourage you all to read it).

To elaborate on this with a visual example, take a look at this diagram.

This graph (zoomable/clickable version here) is the dependency chart for the “Prime Number Theorem +” project, a multi-collaborator effort to formalize various versions of the prime number theorem in Lean. As should be clear from the picture, the vast majority of effort in this project is not to do with formalization of PNT itself from its direct dependencies. Rather, most of the work has to do with the formalization of those dependencies (and of the dependencies of the dependencies, et cetera).

The larger the theorem gets, the more multifarious the dependencies become. I would like to be able to show you the same chart for Kevin Buzzard’s project to formalize Fermat’s Last Theorem, but unfortunately it’s grown so large that it seems it’s been split up into multiple pages. The FLT project is a particularly interesting example because as Kevin has described, an important part of the work of that project was making choices about what proof approach to take - As I understand it, much mathematical research has been done on the ideas behind Wiles’ original proof, and there are now multiple versions of the FLT proof to choose from. For the sake of the FLT project’s success, it’s important to choose one that will be amenable to formalization.

The problem this poses to theorem marketplaces is: Given that so much of the work of formalizing a major theorem is nonproximate to the proof of the theorem itself, how can we ensure that those who complete that work are compensated in proportion to their contribution? Put another way, how can markets do the important work of planning out something like a dependency chart in advance, so that we can tell what low level work is really needed?

Futarchy to the rescue

The framing of “markets need to formulate a plan” suggests futarchy as an answer. And closer inspection suggests this could be a promising application for other reasons: The person or team most confident in a particular plan seems likely to be both the one(s) with the most expertise in the style of approach behind that plan, as well as the most holdings in shares that the plan will succeed. Thus, there is a built-in incentive mechanism for them to contribute work towards realizing that plan when the futarchy decides to use it. Many of the markets the futarchy makes will touch on related formal questions, and so there will likely be arbitrage opportunities to support the liquidity in these markets, and the formal verification procedure provides a consistent, uncontroversial resolution mechanism for all of these.

A Proposal

Here is a proposal. Effectively, you could think of this as parametrizable pseudocode for a futarchic smart contract designed to maximize the likelihood of creating a successful formal proof of a target theorem by a target date.

We start with the theorem to be proved by year XXXX (for example “Fermat’s Last Theorem will be formally proved by 2029”).
Some percentage of the funding is set aside as a prize for the formalization itself.
The rest is used for the recursive construction of markets for intermediate results.
People can then propose candidates for what these results will be in the form of “factorizations”: Pairs or sets of subclaims that together imply the result (for example: “the Taniyama-Shimura conjecture is true” and “Taniyama-Shimura conjecture implies Fermat’s last theorem”). The important considerations for the subclaims are that they:
1. Are likely to be proved by year XXXX (as the recursively constructed futarchic contract will attempt to make true).
2. Together, they constitute formal proof of the main theorem (as guaranteed by the formal proof verification procedure).
If multiple factorizations are proposed, Futarchy decides which factorization gets funded.

You could augment this in a few ways:

Make additional markets about the length of the proofs, and motivate the futarchy to find a short one, so that the proof is more comprehensible.
If you are interested in a proof without a particular preference over time frame, you could create multiple futarchies, and set up an agent to bet NO on near-term proofs which funds those futarchies with its profits.

I’m not sure I’d really advocate that anyone put this into practice - I still think theorem marketplaces have the sociological problems I discussed before. Still, I think this is an interesting thought experiment on how mechanized mathematics can be tilted towards theory-building.

Vadim

Nov 13Edited

Omggggg thanks for this new piece of wisdom, figuring out how to make something like this happen was totally in my wish list and essential for theorem marketplaces to work (if they're ever going to work at all).

Thoughts:

1. I totally think we ("we" in general, humans) should create things like these even when we expect it won't actually become an additional infrastructure for mathematical progress or whatever. When we come up with novel and interesting things, I think it's worth creating them even when it's not obvious how they're going to be useful, because we can't possibly think of all uses people might find — indeed people themselves might not even realize they need this tool until they realize it exists and go, "hmm, what about my use case?". "Do good and throw it into the sea", as the saying goes; implement things and leave them for people to find, because these things are going to find interesting consequences for themselves anyway.

One example is that I implemented the theorem marketplace that had many problems, including the one described in this post, and it (at least partly?) inspired you to make this post. And that means it was totally worth the effort!

I would go ahead and just implement this, except I could barely stay on top of the theorem marketplace project when it was actively being developed, and I think I would need either people helping me create it open-source style or actually receive a salary or a grant for it (for I cannot afford yet to just retire and pull any epistemology-creating shit I want). You've fired my imagination, though, I don't know what I'm going to do now.

2. Not sure we need to decide on one factorization to fund; perhaps we could just distribute funds according to how promising the factorizations are? (Maybe with some cutoff value?)

3. What about frontrunners, e. g. you submit a factorization and someone submits an improved version of the same factorization and gets paid for both their improvement and your initial work, instead of doing the recursive factorization thing?

4. You mentioned markets about the length of proofs, and also that futarchy would decide which factorization to fund. This reminds me: Lean Copilot has integration with the Lean Progress (https://leandojo.org/leanprogress.html) model, which, for a given unfinished proof, predicts how many steps remain to finalize it. Neither the dataset nor the model are available in open source. For the sake of my project of using AlphaEvolve for theorem proving (https://ratandtiger.substack.com/p/alphaevolve-lean-theorem-prover), I started assembling a dataset to train a corresponding model (intending to make both open-source), but got sidetracked. If I finished this, it could be used for such use cases (to inform traders), too.

(Also, damn, I owe everyone updates about my projects...)

2 replies by Bolton and others

2 more comments...

The Quantum Milkman

Discussion about this post

Ready for more?