(Defining) zero-knowledge proofs

tl;dr: A zero-knowledge proof (ZKP) system for an NP relation $R$ allows a prover, who has a statement $\mathbf{x}$ and a witness $\mathbf{w}$ to convince a verifier, who only has the statement $\mathbf{x}$, that $R(\mathbf{x}; \mathbf{w}) = 1$. Importantly, the proof leaks nothing about the secret witness $\mathbf{w}$. (e.g., a ZKP can be used to convince a verifier that the prover knows the solution $\mathbf{w}$ to a Sudoku puzzle $\mathbf{x}$, without leaking anyhting about the solution!)

Introduction

This blog post formalizes zero-knowledge proof (ZKP) systems, mostly deferring to Groth’s formalization from [Grot16]¹.

For a more mild, high-level introduction to ZKPs via a Sudoku puzzle example, see these slides.

Preliminaries

We assume familiarity with the NP computational class and its NP relation characterization
Let $\negl(\cdot)$ denote a negligible function
We denote NP relations by $R$
We denote an NP statement by $\stmt$ and a witness by $\witn$
We often denote $R(\stmt;\witn) = 1$ as $(\stmt, \witn)\in R$.
We use $(x|| y)\gets (\Adv||\mathcal{X})(a,b,c)$ to denote that an algorithm $\Adv$ on input $(a,b,c)$ returns $x$ and another algorithm $\mathcal{X}$ on the same input $(a,b,c)$ returns $y$
We denote the set of all relations outputted by a relation generator $\relgen$ by $\allrels\bydef \mathsf{Image}(\relgen)$

Algorithms

A zero-knowledge proof (ZKP) system is a tuple of 3 algorithms:

$\mathcal{R}(1^\lambda) \rightarrow (R,\mathsf{aux})$

A relation generator that, given a security parameter $\lambda$ returns a binary relation $R(\stmt; \witn)$ decidable in polynomial time, together with some auxiliary information $\mathsf{aux}$². The set of all possible binary relations it can output (i.e., its image) is denoted by $\allrels$.

A recent impossibility result³$^,$⁴ requires that the relation generator be benign. Otherwise, knowledge-soundness cannot be achieved.

Groth points out¹ that for ZKPs built from bilinear groups (e.g., Groth16), it is natural to let the bilinear group be returned via $\mathsf{aux}$.

$\mathsf{ZKP}.\mathsf{Setup}(1^\lambda, R) \Rightarrow (\mathsf{prk},\mathsf{vk},\mathsf{td})$

Derives a proving key $\prk$ and its associated verifying key $\vk$ from the NP relation $R$. Provers will use the (large) $\prk$ to create proofs. Verifiers will use the (succinct) $\vk$ to verify proof. Also, derives a trapdoor $\td$ (a.k.a., toxic waste) which can be used to simulate (fake) zero-knowledge proofs.

$\mathsf{ZKP}.\mathsf{Prove}(\mathsf{prk}, \mathbf{x}; \mathbf{w}) \Rightarrow \pi$

Computes a zero-knowledge proof $\pi$ for $R(\stmt;\witn) = 1$ using the proving key $\prk$.

$\mathsf{ZKP}.\mathsf{Verify}(\mathsf{vk}, \mathbf{x}; \pi) \Rightarrow \{0,1\}$

Verifies a zero-knowledge proof $\pi$ against the verifying key $\vk$. The proof argues that the prover knows some witness $\witn$ such that $R(\stmt;\witn) = 1$, without leaking any information about $\witn$.

$\mathsf{ZKP}.\mathsf{Sim}(\mathsf{td}, \mathbf{x}) \Rightarrow \pi$

Creates a zero-knowledge proof $\pi$ for $\stmt$, given only the simulation trapdoor $\td$. This is referred to as simulating a proof [without access to a valid witness]. The simulated proof argues that the prover knows some witness $\witn$ such that $R(\stmt;\witn) = 1$, even though the prover does not actually know such a witness; it just has access to the trapdoor $\td$. Importantly, the distribution of simulated proofs is indistinguishable from the distribution of honestly-generated proofs via $\zkpProve$.

The existence of a $\zkpSim$ algorithm is actually used to formalize the zero-knowledge property of a ZKP system.

Perfect correctness

Correctness says that any proof $\pi$ returned by $\zkpProve$ should verify via $\zkpVerify$.

[Perfect correctness]: $\forall$ security parameters $\lambda\in \mathbb{N}$, $\forall$ relations $R\in\allrels$, $\forall (\stmt,\witn) \in R$, \begin{align} \Pr\left[\begin{array}% (\prk,\vk,\cdot)\gets\zkpSetup(1^\lambda, R),\\
\pi\gets \zkpProve(\prk, \stmt; \witn) :\\\ \zkpVerify(\vk,\stmt;\pi) = 1 \end{array}\right] = 1 \end{align}

The definition above can be relaxed by only requiring that the probability is non-negligible (i.e., $1-\negl(\lambda)$) instead of 1.

Knowledge-soundness

Intuitively, knowledge-soundness says that if a proof verifies for a statment $\stmt$, then the prover knows a witness $\witn$ such that $R(\stmt; \witn) = 1$. One way this is formalized is by saying that, for any adversary $\Adv$, there exists an extractor $\mathcal{X}_\mathcal{A}$ (potentially-specialized for $\Adv$⁵) such that if $\Adv$ produces a valid proof $\pi$ for $\stmt$ then the extractor, given $\Adv$, the statement $\stmt$ and proof $\pi$ as input, produces a valid witness $\witn$. (In fact, the opposite is what is typically formalized: that it is not possible for the adversary to output a valid proof whose witness cannot be extracted.)

[Computational knowledge-soundness]: $\forall$ security parameters $\lambda\in \mathbb{N}$, $\forall$ polynomial-time (non-uniform) adversaries $\Adv$, $\exists$ a polynomial-time (non-uniform) extractor $\mathcal{X}_\Adv$, such that: \begin{align} \Pr\left[\begin{array}% (R,\mathsf{aux}) \gets \relgen(1^\lambda),\\
(\prk,\vk,\cdot)\gets\zkpSetup(1^\lambda, R),\\
\left((\stmt,\pi)||\witn\right)\gets \left(\Adv||\mathcal{X}_\Adv\right)(\prk,\vk,R,\mathsf{aux}) :\\\ (\stmt,\witn)\notin R \wedge \zkpVerify(\vk,\stmt;\pi) = 1 \end{array}\right] = \negl(\lambda) \end{align}

Recall the $\left(\Adv||\mathcal{X}_\Adv\right)(\prk,\vk,R,\mathsf{aux})$ notation from the preliminaries.

Describe computational soundness and explain where it suffices.

Zero-knowledge

The notion of a zero-knowledge proof was first proposed by Goldwasser, Micali and Rackoff⁶ and later generalized to any NP language by Goldreich, Micali and Wigderson⁷.

The key idea around defining zero-knowledge is to show that there exists a $\zkpSim$ algorithm, similar to the $\zkpProve$ proving algorithm albeit with a bit more power, that can produce proofs with the same statistical distribution for any statement $\stmt$ but without actually knowing a witness $\witn$.

This is formalized by arguing that no adversary can distinguish between proofs produced via $\zkpProve$ and those produced via $\zkpSim$.

For the purpose of this blog post, the simulator’s extra power will be that it knows the trapdoor $\td$ behind the proving key $\prk$.

[Perfect zero-knowledge]: $\forall$ security parameters $\lambda\in \mathbb{N}$, $\forall (\stmt,\witn) \in R$, where $(R,\mathsf{aux})\gets \relgen(1^\lambda)$, $\forall$ adversaries $\Adv$, \begin{align} \Pr\left[\begin{array}% (\prk,\vk,\td)\gets\zkpSetup(1^\lambda, R),\\
\pi\gets \zkpProve(\prk, \stmt; \witn) :\\\ \Adv(\prk,\vk,\td,\mathsf{aux},\stmt;\pi) = 1 \end{array}\right] = \Pr\left[\begin{array}% (\prk,\vk,\td)\gets\zkpSetup(1^\lambda, R),\\
\pi\gets \zkpSim(\td, \stmt) :\\\ \Adv(\prk,\vk,\td,\mathsf{aux},\stmt;\pi) = 1 \end{array}\right] \end{align}

The adversary is even given the trapdoor $\td$.

The many flavors of zero-knowledge

There are many ways of defining the zero-knowledge property of a ZKP scheme.

Interactive: Some proof systems are interactive. Their zero-knowledge property is defined in terms of the existence of a simulator who, unlike a dishonest verifier, need not interact with the prover. This gives the simulator a bit more power to forge proofs.
HVZK + Fiat-Shamir: Some interactive proof systems (e.g., Bulletproofs⁸) satisfy a relaxed version of zero-knowledge, called honest-verifier zero-knowledge (HVZK), where the verifier cannot be malicious and must follow the protocol. Such schemes are then converted to non-interactive ZKP schemes via the Fiat-Shamir (FS)⁹ transform.

On the Size of Pairing-Based Non-interactive Arguments, by Groth, Jens, in Advances in Cryptology – EUROCRYPT 2016, 2016 ↩ ↩²
The auxiliary information helps model adversarial relation generators and circumvent impossibility results³$^,$⁴ around them. ↩
On the Existence of Extractable One-way Functions, by Bitansky, Nir and Canetti, Ran and Paneth, Omer and Rosen, Alon, in Proceedings of the Forty-sixth Annual ACM Symposium on Theory of Computing, 2014, [URL] ↩ ↩²
Limits of Extractability Assumptions with Distributional Auxiliary Input, by Boyle, Elette and Pass, Rafael, in Advances in Cryptology – ASIACRYPT 2015, 2015 ↩ ↩²
This order of quantifiers ($\forall \Adv$, $\exists$ an extractor) implies that the extractor gets full access to the adversary’s state and its randomly-tossed coins, if any. A stronger property would be black-box extraction which say $\exists \mathcal{X},\forall \Adv$ and the extractor $\mathcal{X}$ only gets oracle access to $\Adv$ (e.g., cannot rewind it; does not see its “code” or state). There is quite a zoo of notions here to be unpacked. ↩
The Knowledge Complexity of Interactive Proof-systems, by Goldwasser, S and Micali, S and Rackoff, C, in Proceedings of the Seventeenth Annual ACM Symposium on Theory of Computing, 1985, [URL] ↩
Proofs that yield nothing but their validity and a methodology of cryptographic protocol design, by Goldreich, Oded and Micali, Silvio and Wigderson, Avi, in 27th Annual Symposium on Foundations of Computer Science (sfcs 1986), 1986 ↩
Bulletproofs: Short Proofs for Confidential Transactions and More, by B. Bünz and J. Bootle and D. Boneh and A. Poelstra and P. Wuille and G. Maxwell, in 2018 IEEE Symposium on Security and Privacy (SP), 2018 ↩
How To Prove Yourself: Practical Solutions to Identification and Signature Problems, by Fiat, Amos and Shamir, Adi, in Advances in Cryptology — CRYPTO’ 86, 1987 ↩

PREVIOUSNP relations

NEXTQAPs & R1CS