LMSA — Research Synthesis (deep-research workflow, 101 agents, 19 primary sources)¶
Historical / iteration note (2026-06-11). This document is part of the research/design trail and reflects an earlier iteration; some counts, status labels, and construction details predate the current Construction F. The authoritative current specification is
docs/30, the verification status and tallies are indocs/31and reproducible viaformal/count-artifacts.sh(29 artifacts, 134 lemmas, 33/33 genuineness, 6 Gobra), and the cross-document reconciliation isdocs/35. Numbers below are preserved as the historical record.
Cross-check of the from-scratch derivation in 01-forward-backward-analysis.md against
the fact-checked literature sweep. Verified claims are marked with their adversarial
vote (e.g. 3-0 = all three verifiers confirmed).
A. The wall is confirmed independently (not just "literature says")¶
- True BLS-style algebraic aggregation of ML-DSA is ruled out by Fiat–Shamir-with-
Aborts: the norm-gated rejection loop (
‖z‖∞ < γ1−β) destroys the linear-homomorphic closure BLS relies on. (FIPS 204; verified 3-0.) This is exactly Track B / Wall 2 of doc 01, reached here from the scheme structure rather than the verifier algebra. - ML-DSA security = MLWE + MSIS + SelfTargetMSIS over
R_q; QROM reduction from MLWE is non-tight (Ω(ε²/Q⁴), EUROCRYPT 2024, arXiv 2312.16619; 3-0). Any aggregate's security argument must preserve these. - ML-DSA-87 = NIST L5: 2^285 classical gates / 229-bit quantum Core-SVP; pk 2592 B, sig 4627 B (the per-sig baseline to beat). (3-0.)
B. Correction to my last turn (own it)¶
I claimed a shared global A turns Path A into "clean half-aggregation" that folds the
z-bulk into one short vector. That was over-optimistic. The literature shows native
FSwA half-aggregation for the Dilithium family is non-compact:
- Boudgoust–Takahashi (ESORICS 2023, eprint 2023/159): first FSwA sequential half-
aggregate σ = (u, z₁,…,z_N) — it aggregates only the u-parts and keeps every
z_i; compression ≈ 1% over concatenation (rate → 0.9927 at N=10⁶). (3-0.)
- Boudgoust–Roux-Langlois (eprint 2021/263), GLP/FSwA, public aggregation: aggregates
come out larger than concatenation — explicitly pedagogical. (neg. result 3-0.)
Why (this is the mechanism I under-weighted): each signature's Fiat–Shamir binding
c̃_i = H(μ_i ‖ w1_i') is per-signer and nonlinear, and w1_i' = UseHint(h_i, A z_i −
c_i t1_i 2^d) needs the individual A z_i. So the verifier must keep each z_i to
re-check each hash binding; a folded Σ e_i z_i cannot be unfolded for those checks.
z is ≈5/8 of a Dilithium signature, so not compressing z ⇒ ≈no compression. A shared
A does not rescue this. → Folding/half-aggregation is a dead end for compact
distinct-message ML-DSA aggregation. Shared A still helps elsewhere (below).
C. The achievable compact path: SNARK-of-knowledge (LaBRADOR/Greyhound)¶
State of the art realizes aggregation as a succinct argument of knowledge of N
witnesses each satisfying Ver(pk_i, m_i, σ_i)=1 — not an algebraically combined sig.
- LaBRADOR (CRYPTO 2023, eprint 2022/1341): proofs for R1CS / dot-product &
quadratic constraints natively over R_q, from Module-SIS; 58 KB for 2²⁰
constraints, ~10× smaller than hash-based PCPs. NI knowledge-soundness proven CRYPTO
2024 via "predicate special soundness." (3-0.)
- Greyhound (CRYPTO 2024, eprint 2024/1293): first concretely-efficient lattice
polynomial commitment (Module-SIS), O(√N) verifier; composed with LaBRADOR →
polylog proof, sublinear verifier. (3-0.)
- GPU: ICICLE-LaBRADOR exists (ingonyama-zk/icicle-labrador).
Two of my earlier instincts are vindicated by this:
1. Lattice-native is the right call. LaBRADOR's constraint system is over R_q —
the exact ring ML-DSA verification lives in. The "no field-embedding / no NTT
impedance-mismatch" property from the original Module-STARK template is real, but the
right engine is LaBRADOR (Module-SIS), not a hash+FRI STARK — same lattice-native
benefit, ~3–4× smaller proofs, and recursive (residual is itself a Module-SIS object
that re-aggregates — the "residual aggregates further" property, rigorously).
2. Shared global A helps — but as a prover-cost optimization, not a compressor:
a global ρ makes A a circuit constant, removing per-signer in-circuit ExpandA
(SHAKE-128) expansions. Real savings; just not the folding win I claimed.
D. The genuinely novel frontier (this project)¶
Every demonstrated, benchmarked lattice SNARK-aggregate targets FALCON (GPV hash- then-sign), not ML-DSA/Dilithium: - Aardal et al. (CRYPTO 2024) + LaZer (CCS 2024, eprint 2024/1846): Falcon-512 + LaBRADOR → 73 KB @ 1024 sigs (0.6 s prover), 72 KB @ 100,000 sigs (65 s, 25.5 GB); 93/120/165 KB at N=500/1000/2000. Beats STARK-Winternitz (165 KB) and the Chipmunk lattice multisig (118 KB @ 1024). (3-0.)
No primary source benchmarks an ML-DSA-87 aggregate. Falcon arithmetizes cheaply
(one Gaussian-norm check). ML-DSA's FSwA verifier is heavier to arithmetize:
UseHint (w0/w1 decomposition), the ‖z‖∞ < γ1−β range check, the hint-weight check,
SampleInBall, and the SHAKE-256 challenge hash. So Falcon's KB/seconds do NOT
transfer — the core open work is the ML-DSA-87 verification circuit over R_q.
E. Formal-verification gap (matches the project's stated goal)¶
No machine-checked (Coq/Lean/EasyCrypt) proof exists of: LaBRADOR knowledge-soundness, the predicate-special-soundness framework, or an end-to-end ML-DSA aggregation circuit. The original brief ("provably solves … by formal verification") therefore aims at genuinely open territory — a real contribution, not a re-implementation.
F. Mapping to QRL 2.0 (updated, with numbers)¶
- Validators signing the same block (consensus) → shared-
Alattice multisignature (Path B). Reaches an ML-DSA-shaped verifier under an aggregate key; exact FIPS-204 only for cohorts inside nativeγ1(Wall 2), else adjustedγ1. Data point: Chipmunk-class = 118 KB @ 1024 (synchronized). - Arbitrary distinct-key/message transactions → LaBRADOR/Greyhound SNARK of N ML-DSA-87 sigs (Path A, corrected). Verified by a LaBRADOR verifier, not ML-DSA. Novel for ML-DSA; lattice-native; recursive. Target sizes: expect larger than Falcon's 73–165 KB pending the FSwA-circuit arithmetization, but still near-constant in N and orders below concatenation.
- Folding/half-aggregation: dropped (non-compact, per §B).
G. Open questions to drive (from the sweep, prioritized)¶
- Concrete proof size / prover time / verifier cost of LaBRADOR+Greyhound over the ML-DSA-87 verification relation (arithmetizing UseHint, norm, SampleInBall, SHAKE), vs the Falcon-512 numbers.
- Do the Module-SIS params for a sound aggregate over ML-DSA-87 statements preserve L5 (LaBRADOR uses its own MSIS instances)?
- GPU (ICICLE) + recursion ceiling for 10⁴–10⁶ ML-DSA-87 sigs.
- A machine-checked soundness path (EasyCrypt/Lean) for the chosen stack.
Sources (primary): FIPS 204; Dilithium round-3 spec; arXiv 2312.16619; eprint 2022/1341 (LaBRADOR), 2024/1293 (Greyhound), 2024/1846 (LaZer), 2024/311 = hal-04700114 (Falcon+ LaBRADOR), 2023/159 (FSwA SAS), 2021/263 (GLP half-agg); ingonyama-zk/icicle-labrador.