The Terafactory Age

A graduate student runs four cell-painting experiments before lunch. An AI agent has drafted six hypotheses by eight in the morning. An autonomous lab finishes another run from a model-proposed target set. An autonomous lab realized 36 compounds from 57 model-proposed targets in seventeen days (Szymanski et al., Nature 2023), with the upstream model predicting millions of stable inorganic crystals (Merchant et al., Nature 2023). A live precedent for model-to-bench loops, not evidence that all scientific execution can be compressed at the same rate. Each result lands somewhere private, and none updates a shared record on which the next decision depends, because no shared record exists.

This is the scientific present. AI now produces hypotheses, code, protocols, reviews, and experiment plans cheaper than the institutions of science can absorb them, and closed-loop autonomous labs are beginning to run at tempos that would have looked alien a decade earlier. Generation can outrun the physical and institutional process that tests it, and that gap is the structural defect this essay follows.

Science needs a record, an engine, and a body. The terafactory is the body: the place where a corrected record reaches instruments, factories, capital, regulators, and the physical world. The Discovery Engine names the loop underneath; this essay asks who gets to build the body around it.

The first proof is smaller than the world described here: one bounded corridor where a lab result writes back to public state and changes what a foundation, hospital partner, or manufacturing team does next. It does not move a patient-facing or IND-enabling decision by itself; it produces provenance that can be included in a future packet while agency authority and clinical governance remain outside the engine. If that loop closes once, the gigafactory stops being metaphor. If it closes ten times, gigafactories compose into the terafactory. If it does not close, the agents that arrive write activity that does not become knowledge, and the factories that get built are isolated gigafactories owned by whoever shipped first.

01 frontier

one governed disease frontier

02 loop

one autonomous experiment loop

03 handoff

one manufacturing handoff

04 partner

one clinical or field partner

05 writeback

one public state transition

proof scale: 12-24 months, two or three sites, audited writeback
what counts: a correction that changes downstream work
what fails: activity with no portable state

Fig. 01. Minimum viable body. The first proof is not a full terafactory. It is one closed loop: frontier, experiment, handoff, partner, writeback.

01 State

Writable findings, evidence, dependencies, and signed transitions.

02 Runtime

Agents, compilers, schedulers, and validation that gates merges.

03 Network

Federated institutions, attestation chains, and portable state.

04 Physical execution

Gigafactories, labs, manufacturing handoffs, and writeback.

Fig. 02. The body, in four layers. The body is four coupled layers: state, runtime, network, and physical execution. Each reads from the layer below and writes back to it; each can also fail or be captured separately.

Imagine the architecture composed and operating. A null result in Boston weakens a downstream target hypothesis at three programs in Singapore by the end of the week. A wildlife sample routes from a regional surveillance feed into a primer-production line before the second sampling band confirms an outbreak signal. A failed cathode chemistry weakens the dependent mechanism across battery programs reading against the same frontier. None of these scenes requires magic. Each requires the same thing: state that can command action and action that writes back.

In the fast-scenario stress test, the next ten years decide who builds this body and on what terms. Closed ingredients are already visible: proprietary corpora, internal agent workflows, private wet-lab partnerships, and model access that arrives inside labs before it reaches public institutions. The open public version exists as a sketch, a few repositories, and a small coalition of funders who understand that this category may be decided before most of science notices it. The genre is older than this essay. Bush’s Science: The Endless Frontier (1945) proposed the institution that became the NSF. Engelbart’s “Augmenting Human Intellect” (1962) framed how tools amplify cognition before computers were ubiquitous. Berners-Lee’s CERN proposal (March 1989) sketched the linked-document protocol that became the Web. Each argued for infrastructure that did not yet exist by composing what already worked. The body argument is the same move at industrial-scientific scale.

This essay is therefore a fork story rather than a forecast about one facility. The dates, names, and scenes below are staged future fiction used to test an institutional consequence, not claims about what will happen. One branch gives science a public body that reads from shared state and writes back to it. The other branch gives the same capability to closed institutions first, then asks everyone else to negotiate for access after the registries, compilers, and credentials already belong to someone.

The claim ladder is simple. Scientific generation is becoming cheaper than institutional absorption. The scenario assumption is a fast capability jump around 2027 to 2030. The institutional prediction is that private bodies form first unless public capital, review authority, and open registries move early. If the dates slip by five or ten years, the pressure weakens but the fork remains: the first bodies that make execution convenient will also decide where scientific memory lives.

The baseline

In 2026, the parts of a scientific body exist. None compose. Each layer of the architecture has a working precedent in one domain or another, but the layers do not yet connect; each works for its own community without the others reading from it.

The Discovery Engine names the shared object for science. This essay begins one layer later, at the point where a corrected record has to move instruments, people, capital, and manufacturing.

The record has precedent. The Protein Data Bank, Crossref, and the Materials Project each coordinate scientific artifacts across competing actors without collapsing into a single vendor’s platform. The Protein Data Bank, established in 1971, holds over 250,000 experimentally determined structures, sustained as a publication condition across institutions and decades. Crossref is a governance precedent for the body version: non-profit infrastructure across competing scholarly actors, with member governance and shared technical services. materialsproject.org (Jain et al., APL Materials 2013) is a precedent for a domain state layer with stable identifiers and APIs. But the primitive in 2026 is still usually the artifact, not the state transition.

Surveillance has precedent too. Nextstrain renders global pathogen evolution as a shared phylogeny. GISAID carries genomic deposits across competing national agencies. ClinicalTrials.gov holds trial registrations across sponsors and jurisdictions. Hadfield et al., Bioinformatics 2018; nextstrain.org. GISAID shares the same shape. ICMJE’s 2005 trial-registration policy made registration in ClinicalTrials.gov a precondition of publication in member journals. The capacity to see what is changing exists. The capacity to act on it across institutions does not.

Population-scale clinical infrastructure has precedent. UK Biobank and All of Us show that large participant cohorts, phenotyping, samples, and longitudinal data can be sustained across institutions and decades. Sudlow et al., PLOS Medicine 2015; ukbiobank.ac.uk. All of Us Research Program Investigators, NEJM 2019; allofus.nih.gov. Neither yet routes synthesis or manufacturing decisions; the body has to add that without losing what these systems built.

Runtime is the newest piece. Robotic chemistry sites have demonstrated live model-to-bench loops at speeds no graduate cohort could match. Multi-agent platforms generate hypotheses, search for contradictions, plan experiments, and chain tools faster than wet labs can test them. Lu et al., “The AI Scientist” (arXiv 2408.06292, 2024); the v2 follow-up (2025) reported an AI-generated paper accepted to a workshop. FutureHouse describes PaperQA2 as a literature-search agent benchmarked against difficult scientific retrieval tasks. Google Research describes AI co-scientist as a Gemini-based multi-agent collaborator for hypotheses and research proposals. Their drafts land in private logs and transient context windows. A scientific body would have them deposit instead.

The collective-intelligence layer has working precedent at human scale. Polymath, launched on Tim Gowers’s blog in January 2009, solved the density Hales-Jewett theorem through open public collaboration in seven weeks with more than forty contributors. Gowers & Nielsen, “Massively collaborative mathematics”, Nature 461:879-881 (2009). The first proof that public-blog collaboration could close research-frontier mathematics problems. Nielsen’s Reinventing Discovery (Princeton 2011) is the longer treatment. Galaxy Zoo classified nearly a million galaxies through public participation in months rather than the years a small research team would have needed, with sixty-plus peer-reviewed papers and multiple novel discoveries drawn from the volunteer-generated catalog. Lintott et al., MNRAS 389:1179 (2008). Volunteer-classified morphologies for ~900K galaxies from the Sloan Digital Sky Survey; Hanny’s Voorwerp and Green Pea galaxies were discovered through the volunteer pool. Foldit’s tens of thousands of players matched or outperformed algorithmic solutions for protein structure refinement. Cooper et al., Nature 466:756-760 (2010). Citizen-science contribution at research-grade quality: ~57K Foldit players produced structure predictions competitive with the best algorithmic methods on contested targets.

The lesson is structured contribution, not crowd virtue. Distributed work becomes scientific work when tasks are modular, feedback is immediate, contribution units are structured, and reputation is legible. In the body version, a contributor can propose a state transition, annotate evidence, replicate a run, challenge a scope condition, add calibration metadata, audit an agent, or route a task. Most contributions stay noncanonical. Some become evidence. A small number become accepted state.

Physical execution is the missing category. Science has academic labs, pharma research sites, and contract research organizations. It lacks an obvious public analogue to a gigafactory or advanced-node fab running continuously against shared scientific state. BARDA-shaped medical-countermeasure capacity is the closest precedent, but it does not run as a public writeback body. BARDA underwrites medical countermeasure manufacturing capacity through cooperative agreements (medicalcountermeasures.gov/barda). A frontier-infrastructure BAA is BARDA-shaped, extended from countermeasures to chronic-disease and surveillance corridors, with public substrate writeback required as a deliverable rather than added as a side effect. The body argument requires building the category.

Protocol consolidation has a proof under pressure. A single shared protocol, one ethics path, one EHR backbone, in a national health system: more than 40,000 participants across 185 UK sites, and a result that changed care worldwide within a hundred days. RECOVERY Collaborative Group, NEJM 2021: dexamethasone reduced 28-day mortality by roughly one third in ventilated patients across 176 UK hospitals. UKRI describes the RECOVERY platform as the world’s largest clinical trial into COVID-19 treatments, with more than 40,000 participants across 185 UK sites (UKRI, updated 2024). The body has to recover that coordination without an NHS to anchor it, by signed-finding portability and grant conditions that align incentives across institutions.

Intelligence is one lever among many. Experiment speed, experiment cost, measurement, regulation, protocols, and human collaboration are the others, each compressing on a different timescale and through different mechanisms. McCarty, “Levers for Biological Progress”. The grounded counterargument to compute-only forecasts: biology has many bottlenecks, and intelligence dissolves only one. AI alone, even at superhuman capability, dissolves only one of them; the rest require physical infrastructure, regulatory alignment, and institutional design. The body argument is what addresses these levers in concert.

The trajectory ahead might be fast, and the essay uses the fast path as a stress test: increasingly general AI agents in the late 2020s, superhuman research systems around 2030 in some forecasts, and embodied scientific execution following unevenly behind. Aschenbrenner, “Situational Awareness: The Decade Ahead” (2024), names the three compounding factors: frontier-lab compute, algorithmic gains, and unhobbling. Amodei’s “Machines of Loving Grace” is the optimistic upper bound. Kokotajlo et al., “AI 2027” gives the most explicit fast-path chronology.

The dates are not the thesis. They compress the window so the institutional failure is visible. Whenever capability outruns institutional absorption, the lab-versus-public gap becomes a structural gap in effective research compute, model access, embodied execution, and inherited state. Even in a slower scenario, the actors that first make execution easy will own the registries, credentials, calibration histories, and facility compilers unless the public version exists before convenience hardens into dependence.

Much visible capital is aimed at models, drugs, datasets, and platforms in 2026. Sovereign funds underwrite ports, pipelines, datacenters, and energy infrastructure. Real assets at the scale of battery factories or semiconductor fabs remain an analogy for science, not a balance-sheet category. ARPA-H funds translational programs, but its public portfolio does not yet present a coordinated body that reads from a substrate, executes physical action across federated sites, and writes results back to public state. ARPA-H’s public program portfolio describes programs led by program managers toward specific health-care challenges, with examples such as PARADIGM for distributed medical care and NITRO for osteoarthritis tissue regeneration. The public portfolio shows ambitious translational programs, but not a full state-runtime-body loop at gigafactory scale. The baseline is the last moment before the structure becomes hard to change.

Fast agents arrive

In the stress-test branch, the first operational jump is not a label but a workflow: agents can integrate literature search, experimental design, protocol synthesis, and code without waiting for a human to stitch the tools together. Inside the frontier AI labs, that capability turns hypothesis generation into a single continuous workflow. A scientific question that would have taken a graduate student three months of reading and a senior PI a week of cross-checking now takes an agent overnight, and the agent does it against the lab’s full proprietary corpus, not against the open literature.

What that looks like inside one invented lab scene: a research lead opens a question against the internal substrate at the start of her day, drops a one-paragraph framing of what she wants to know, and steps away to a meeting. By the time the meeting ends, the agent has returned a synthesis from licensed papers, internal experiments on adjacent targets, the wet-lab partner’s commercial datasets, and cross-referenced model results. It also returns experiments worth running next. She picks three. The agent generates plate maps for the lab’s autonomous chemistry sites and writeback contracts for the lab’s biomanufacturing partner. By the end of the week, the first round has executed. None of this is visible outside the lab.

Outside the labs, the same generation of public models writes draft protocols faster than wet labs can absorb them, but no shared substrate holds the drafts, failures, corrections, or dependencies. By the end of 2027, the case for a publicly underwritten body is no longer abstract: it is a response to a visible asymmetry that the next capability jump will only sharpen.

Nothing of structural importance gets built in 2027. The technology overshoots what institutions know how to govern, and the institutions that should govern it have not yet decided whether to. In the scenario, an ARPA-H program officer sketches a Frontier Infrastructure BAA, a sovereign fund’s real-assets group circulates a scientific-infrastructure memo, and patient-led foundations begin drafting body-conditional grant language. None of this is public yet. The window closes with the capability gap defined and no public body underway. That is the condition the 2028 moves answer.

First moves

The first moves come after the capability gap becomes visible. They are capital and policy, not technology. By 2028 in the fast scenario, the technology already overshoots what institutions know how to govern. The open body does not begin with consensus. It begins with contracts.

ARPA-H, or a successor program, writes the first Frontier Infrastructure BAA in the scenario. The bridge from today is one envelope that combines things existing agencies already know how to buy separately: translational program management, BARDA-shaped surge capacity, CHIPS-style strategic manufacturing, FRO-style bottleneck teams, and regulator-readable evidence packages. Mazzucato, The Entrepreneurial State (Anthem 2013). The case that “the state takes the risk; the private sector takes the credit.” The CHIPS and Science Act (HR 4346, 2022) extended the same posture to semiconductors with $280B authorized. Operation Warp Speed (Slaoui & Hepburn, NEJM 2020) compressed vaccine timelines through BARDA by running trials and manufacturing at risk in parallel. A frontier-infrastructure BAA is the same posture extended to the body.

The scope is unusual, but the pieces are familiar: federated synthesis halls, autonomous protocol execution, accredited review, manufacturing handoff, and public writeback into a shared substrate. It also reserves public scientific compute for accredited frontier bodies through secure inference environments, eval-gated model access, incident reporting, and procurement terms that keep public science from living permanently on the labs’ weakest API tier.

The new envelope pays for the deposit pathway itself. Construction milestones release the first tranche; useful corrections that travel through the substrate release the rest. The $5-to-$10 million substrate pilot proves signed frontier state. The pre-facility body pilot is tens of millions: enough to pay reviewer labor, lab integration, conformance testing, evidence exports, protected data rooms, and a manufacturing handoff simulation. The first standing body is hundreds of millions to billions. The exact number varies by corridor, but the order of magnitude is load-bearing.

Public does not mean fully exposed. Clinical records, dual-use signals, proprietary process details, and biosecurity-sensitive protocols cannot all be dumped into the same public bucket. The public object is the canonical state transition and enough provenance to audit it: content hashes, reviewer signatures, context boundaries, evidence classes, and regulator-readable packets. Raw clinical data, live pathogen details, and process IP live behind tiered access, delayed release, redaction, or trusted-reviewer rooms. The body is public because the state change is inspectable and portable, not because every underlying file is globally visible.

The hard trade-secret case is the failed-route topology itself. A sponsor may be willing to disclose that a mechanism weakened and which dependency moved, but not every branch of the search strategy that produced the failure. The compromise is public transition plus protected evidence: hashes and dependency movement in the public record, full route topology in trusted-reviewer or regulator escrow, and release rules defined before the grant or procurement money arrives.

A sovereign wealth fund opens a Real Assets sub-mandate for scientific infrastructure in the same quarter. The money is priced like infrastructure because the risk is no longer only scientific; it is utilization, construction, governance, and uptime. The consortium holds the site in a public-benefit vehicle, federal cost-share covers first-loss technical risk, and foundations guarantee use in named disease corridors. If a corridor fails, the equipment, calibration registry, and signed state history remain public rather than reverting into a vendor’s private platform.

FRO incubators turn toward the components a public body depends on. Marblestone et al., Nature 2022, on focused research organizations as time-bound, milestone-driven teams that unblock specific bottlenecks and sunset into standing institutions. FROs are not a generic scaling vehicle for billion-dollar facilities; they are bottleneck-clearing primitives that build the load-bearing components a consortium then plugs together. One FRO builds the open protocol compiler. Another builds the calibration registry. Another builds reviewer credentialing. In the scenario, the first compiler FRO starts before the facility does; otherwise the body opens with a building and no public nervous system.

Foundation pools that previously underwrote narrative reports begin underwriting the maintenance layer: instrument calibration, protocol curators, frontier stewards. Patient-led foundations announce body-conditional grant programs in their disease corridors. The clause is small and cannot bind non-grantees, but the language is in the field for the first time: public capital should not pay for private scientific memory.

Regulators move in parallel, more cautiously than funders. FDA already expects chemistry, manufacturing, and control information inside IND submissions, and its real-world-evidence guidance has created a path for regulator-facing evidence histories to matter when they are fit for purpose. FDA’s IND CMC materials specify chemistry, manufacturing, and control information for drug substance, drug product, placebo formulation, labeling, and environmental assessment. FDA’s real-world-evidence guidance frames how real-world data and evidence can support regulatory decision-making when reliability and relevance are established. The body version does not replace these requirements; it makes protocol lineage, evidence provenance, and dependency updates easier to inspect alongside them. ICMJE’s 2005 trial-registration policy is the publication precedent: registration became a condition of serious clinical publishing. A future guidance might say translational submissions may include auditable scientific-state histories alongside trial data. It would not approve anything by itself. It would change what serious institutions expect to be able to inspect.

The body argument assumes scientific infrastructure can move at the pace prior infrastructure moved when state coordination, industrial capital, and a defined deliverable aligned. Patrick Collison, “Fast”: a live catalog of ambitious projects completed quickly. Empire State Building in 410 days. Pentagon in 491. Apollo from program initiation to a man on the moon in roughly eight years. Operation Warp Speed to an authorized vaccine in roughly eleven months from a standing start. The argument by accumulation: humans can build coordinated infrastructure on the timescale required when conditions align. The BAA, FRO cohort, patient-foundation clause, and sovereign mandate are not sufficient on their own. Stacked, they are the precondition for everything that follows.

The gap becomes undeniable

Between 2017 and 2019, major BACE-inhibitor programs in Alzheimer’s converged on a grim lesson through separate corporate pipelines. Verubecestat failed for futility in prodromal Alzheimer’s, lanabecestat trials were stopped early for futility with some numerical worsening signals, atabecestat showed dose-related cognitive worsening in preclinical Alzheimer’s, and related programs stopped across the same window for worsening, futility, or unfavorable risk-benefit. Egan et al., NEJM 2019 (verubecestat, Merck); Wessels et al., JAMA Neurology 2020 (lanabecestat, AstraZeneca/Lilly); Henley et al., NEJM 2019 preliminary results and Sperling et al., JAMA Neurology 2021 final analysis (atabecestat, Janssen). Several other BACE programs (Eisai, Novartis, Pfizer, Amgen/Banner) were halted across the same window for related futility, risk-benefit, or worsening signals. The shared lesson was not identical in every molecule, dose, or population. The point is that negative evidence accumulated in parallel, company by company, without a shared interim-state surface that downstream programs could read. Many patients enrolled across programs before the lesson could be compared, scoped, and routed with enough force to change the next decision.

In this fast scenario, superhuman research systems emerge inside the leading AI labs around 2030. The systems that arrived as agentic remote workers in 2027 cross into superhuman across research domains before public-tier models do. The lab-versus-public capability gap becomes large enough to feel qualitative: the difference between an intelligence service and a journalist with a Freedom of Information request.

What the inflection looks like from inside the public sector: a senior scientist at a university hospital opens her model-access dashboard in May 2030 and notices that answer quality on her research questions has plateaued. Her models have not stopped improving. The gap to what the labs are running internally has widened past usability. She calls a colleague at one of the labs. He cannot share specifics but confirms what the benchmarks have been suggesting: their internal systems are now solving problems his public-tier instance would not even attempt. She gets off the call and starts drafting a memo to her institution’s research VP. By summer, similar memos are circulating at most major research universities in the United States and Europe. The political case for a publicly-underwritten body crystallizes in the same six months. The gap is now visible to anyone reading benchmark results: the labs are not racing each other anymore, they are racing a public sector that does not yet have the infrastructure to compete.

In one version of the scenario, Meridian is the answer to the BACE failure and the first serious test of the body clause: can a public institution make physical action read from shared state before private stacks become default? Meridian only becomes financeable after the pre-facility pilot has produced signed writeback, a regulator-readable packet, named utilization guarantees, and an audited governance handoff. It breaks ground near Boston in late 2030. The ceremony is small: staged ground near a biomedical corridor, a folding table by the access road, a few dozen people in coats, the consortium’s first executive director speaking for ten minutes about what the next decade will produce if the build holds. There are no politicians. There are no press releases. By noon the construction crew is on the property.

Meridian is built against the failure mode the BACE program documented and against the capability gap the scenario made undeniable. The site was assembled by a state-level biomedical authority with federal cost-share. Construction proceeds through 2031 and 2032; the first synthesis hall is commissioned in March 2033.

The site is chosen because it is connected: hospitals, universities, biotech firms, clinical-trial networks, manufacturing talent, regulators, and enough political legitimacy to make the project survivable. The first capital stack is awkward and public: sovereign or pension-style real-assets capital, foundation guarantees from a patient-led coalition, federal cost-share through ARPA-H or a successor, and institutional debt priced against the consortium charter. ARPA-H underwrites the translational arm against milestone deliverables. Patient-led foundations fund specific disease corridors inside the first buildout. Utilization guarantees, not optimism, keep the asset alive: named corridors buy assay capacity, preclinical convergence runs, manufacturing handoff slots, and regulator-readable evidence packets. If volume misses, the consortium shrinks corridors before selling the registry.

The capital stack is unusual for science but unremarkable for physical infrastructure. Gigafactory Nevada alone had exceeded six billion dollars of investment by 2023. Advanced-node semiconductor fabs run into tens of billions for the floor space and tooling that produce chips at the precision the rest of the economy depends on. Tesla reported that Gigafactory Nevada had reached $6.2B invested by 2023, with 5.4M sq ft built and a 35 GWh annual cell-capacity target from the original plan. TSMC announced in March 2025 that its total planned U.S. advanced-semiconductor investment was expected to reach $165B across fabs, advanced packaging, and R&D. SpaceX county filings and press reports describe a proposed Terafab in Grimes County, Texas, at the next scale: county tax-abatement materials describe $55B initial capital investment and up to $119B across phases. TechCrunch separately reported the one-terawatt AI-compute target. These are reference comparables for the giga-to-tera jump in physical infrastructure, not evidence that scientific facilities already exist at that scale.

The comparison does not prove that science can be financed the same way. It shows the order of capital society already accepts when a physical layer becomes strategic. Meridian’s first build sits below those comparables and produces evidence rather than physical product. The category is a scientific gigafactory: a proposed physical-asset class built from financial forms society already knows how to underwrite, applied to a domain that has not yet had one.

Meridian puts the translational loop under one operational roof. Its wings handle target validation, autonomous synthesis, perturbation biology, preclinical convergence, and GMP-grade manufacturing from the beginning. The clinical-trial network is not physically inside the eighty acres, but its evidence path is. Regional hospitals connect into Meridian’s substrate layer through audited endpoints. Trial programs read from the same frontier state as the synthesis floor.

Closed bodies emerge

Meridian is still under construction in 2032; the first synthesis hall is twelve months from commissioning, the consortium’s reviewer-credentialing pipeline is half-staffed, and the open protocol compiler is months away from a release the regulators will inspect. The body argument is in motion, but nothing it produces is operational yet.

The closed bodies emerge faster in this branch. By 2032, assume the leading frontier AI labs have operating internal scientific stacks at scales no public infrastructure can match. Each runs proprietary substrates, proprietary compilers, proprietary synthesis-line orchestration, and selective biomanufacturing partnerships. They keep the running frontier, failed routes, and dependency graph inside the boundary. Selective publication is press, not deposition. The structural prediction is that whichever frontier labs are operating frontier-scale internal compute and frontier-scale wet-lab partnerships in the early 2030s will follow this architecture. The closed-stack pattern is an inference from proprietary biology and pharma discovery platforms, not a claim that any 2026 platform already has the full body described here.

What it looks like from inside one of these stacks, mid-2032: a research lead opens a question against the lab’s internal scientific substrate at the start of her day. The substrate already holds three years of internal experiments, exclusive commercial datasets, contract-research outputs, and a proprietary structural-biology corpus.

She prompts the internal agent stack. The agent returns six experiments worth running, ranked by expected information gain against the lab’s proprietary frontier. She approves three. The protocol compiler emits plate maps, robot schedules, and writeback contracts for the lab’s synthesis halls and affiliated biomanufacturing partner. The first round executes inside the week; the second updates her frontier with three corrections and one anomaly she flags for human review. Nothing about this loop is visible outside the lab. The failed routes that taught the team the most never will. The closed body works, with friction, on the lab’s own terms.

The capability gap that opened at the scenario inflection widens through 2032 because the labs’ internal scientific output compounds against itself without leaking. A frontier hypothesis generated, tested, and validated inside one of the leading labs in March of 2032 will inform that lab’s next year of work but will not inform any program outside it. In this branch, the substrate-fight version of this argument resolved in the late 2020s: deposit-or-don’t-publish norms held, and the public corpus stayed open. The body-fight version is unresolved. The labs’ bodies do not deposit, and there is no equivalent norm to force them to. Closed-body output is treated as commercial product, not as scientific contribution; the labs’ compliance functions argue it cannot be deposited without exposing trade secrets that took billions of dollars of compute to produce. The deeper reason is structural: the science org sits on the commercial P&L, and the search strategy that produced the failed route is more valuable than any single finding.

The labs’ internal scientific output starts to surface indirectly. Drugs enter trials with proprietary mechanism documentation. Materials enter manufacturing with proprietary process IP. Diagnostic platforms launch with closed validation sets. Each external launch is the visible tip of a much larger internal corpus that nobody outside the lab can audit. Regulators receive submission packages and can ask questions, but cannot inspect the substrate that produced the answers; they evaluate the proposed clinical trial without seeing the thousand internal failures that shaped the design. The labs are not breaking any rules. They are operating inside a regulatory framework that was written before closed bodies existed, and that framework has no concept for an entire scientific frontier maintained inside one organization’s stack.

A handful of patient-led foundations notice this in 2032. The most directly affected are the rare-disease and pediatric-cancer foundations whose constituencies depend on small-population trials and shared-evidence networks. They cannot enroll patients in trials whose mechanism evidence they cannot inspect. They cannot underwrite mechanism studies whose results will not federate. The body-fight version of conditional-capital pressure begins forming the same year: an emerging coalition draft a clause, but it does not yet have the federal grant-condition stack behind it, and the labs ignore it. The labs also make the public path harder to organize. They offer free closed tools to academic users, restrict frontier model access behind safety and IP arguments, lobby against deposition rules as trade-secret seizure, and accumulate reviewer identity inside their own platforms before regulators understand that signer recognition is becoming infrastructure.

Meridian is built for inheritance before throughput. A factory operating on shared state reads from every prior failure, contradiction, and cohort observation in its corridors. Most of that record was unstructured before 2028. Meridian’s preconstruction work, through 2031 and 2032, is to translate inherited evidence into substrate objects so a corridor begins at the field’s actual frontier rather than at a clean slate. A BACE-shaped failure in 2032 would weaken the dependent target hypothesis across every program reading against the same frontier within the same week, not the same decade, if the body were operating. The body is still under construction.

Meridian operates

The first synthesis hall opens in March 2033, before the rest of the facility is complete. Fewer than one hundred lines, not four hundred. The early robots are less impressive than the workflow around them. Protocols arrive as executable objects. Plate maps are generated from state. Failed runs write back. Reviewers see when a model-proposed experiment was redundant, when a human-designed assay contradicted the prior state, and when a failure should weaken a claim.

A reviewer’s morning at Meridian in late 2033, six months after the first synthesis hall opened: she logs in at seven thirty. The substrate has accumulated proposed state transitions overnight, mostly from agentic platforms running against the neurovascular frontier, a handful from human researchers at affiliated sites. Her queue is filtered to transitions that touch findings she has signing authority on and dependencies whose confidence has shifted enough to warrant human attention.

The first transition she opens is a perturbation line proposing that a cytokine signature previously attached to early-stage cognitive decline should split into two subgroup-specific signatures. The evidence packet includes perturbation data, human cohort cross-references, a failed APP/PS1 cerebrovascular animal-model replication, and the agent’s reasoning chain. She verifies one stratification with a clinical statistician and signs the transition. Within minutes, clinical programs that depended on the original signature receive substrate notifications; within the next day, one trial-design assumption is queued against a pre-specified protocol-amendment pathway. The DSMB chair can call an interim look only if the charter, statistical analysis plan, alpha-spending rules, sponsor obligations, IRB pathway, and data firewall already allow it. The substrate accelerates detection. It does not suspend clinical governance.

The reviewer floor is a physical constraint, not a metaphor. Most deposits do not matter, and most agent proposals should never reach a human. Meridian staffs triage the way a hospital staffs an emergency department: duplicates are clustered before the morning shift, high-dependency corrections jump the queue, and anything touching animals, trials, manufacturing, or safety requires named signers with liability-bearing institutions behind them. Without that staffed floor, the body becomes a louder version of the present literature.

Meridian’s first corridor is neurovascular disease, chosen because it is messy in exactly the right way: contradictory literature, animal models that fail to translate, heterogeneous human evidence, biomarkers that drift across cohorts, and no single company that can maintain the frontier honestly. A shared state layer has a real job here: say what is known, what failed, what changed, what depends on what, and which experiments would reduce uncertainty. The corridor expands through 2034 to cover broader neuroscience: brain-organoid phenotyping at scale, connectomics readouts from chronic implant studies, and a brain-computer-interface clinical pipeline coordinated with regional academic neurology programs that joined the Meridian substrate once the body clause held.

Meridian’s first useful output is a correction that travels.

In November 2033, a vascular-inflammatory mechanism receives evidence from a perturbation line at the Charlestown site, a human cohort at a partner academic medical center, and a failed APP/PS1 cerebrovascular replication at a contract-research partner in Cambridge. The original claim had treated that 2031 result as orthogonal. The substrate composes the three sources into a single proposed state transition: the mechanism is real but holds only in APOE4-positive patients above sixty-five, not across the broader population the original claim covered.

The transition is signed by two reviewers and a clinical liaison from one affected trial. The status on the broader target hypothesis shifts from “moderately supported” to “subgroup-restricted.” Over the next week, a foundation reroutes one grant cycle, a Phase II trial queues an APOE4-stratified review through its existing amendment and DSMB process, and a review article in draft changes its conclusion. Nothing about this makes headlines. The factory has begun changing how the field knows what to do next.

The compiler is the part of Meridian that visitors do not see. Tours show the synthesis halls, the GMP wing, the reviewer floor. The compiler is a software stack distributed across the facility’s compute, sitting between the substrate and the lines, and its only visible artifact is the screens reviewers look at. Most visitors leave Meridian thinking the breakthrough is the throughput. The breakthrough is the compiler, and the throughput is the consequence.

Sentinel commissions in Singapore in late 2033, six months after Meridian’s first synthesis hall comes online. It serves pathogen surveillance and response, rather than chronic-disease translation. Its physical footprint is smaller, and its capital stack is anchored by a different sovereign fund and the WHO Foundation rather than ARPA-H.

The facility includes a primer-production wing, a candidate-construct synthesis line, a wildlife-sample ingestion suite, and pre-negotiated manufacturing handoffs to a biomanufacturing partner in Cape Town and two contract-manufacturing sites. Its on-call reviewer roster covers four time zones and rotates through three regional hospital networks. Sentinel’s first month runs on synthetic exercises. By December 2033, its substrate is reading live feeds from Manila, Yunnan, and Bangkok wastewater networks, waiting for a real signal.

Explainer

What Meridian sunsets into

Meridian is a standing institution from day one, so it needs governance at construction time. FROs can sunset cleanly because their scope is bounded and their handoff target is named at inception; Meridian has to absorb FRO-built components into something durable. The model is Crossref: a non-profit that has held DOI, citation, and retraction infrastructure across competing publishers for over two decades, with elected technical seats, member-institution governance, and multiple independent implementations. Meridian’s version of sunset is a transition from FRO-incubated components (open compiler, calibration registry, orchestration kernel, reviewer credentialing) into a Crossref-shaped consortium that runs Meridian as federated public infrastructure. Member institutions hold seats. Technical staff is elected. Patient-led foundations hold reserved governance roles. The operational charter is published and forkable. None of this is automatic; it is what the 2028 BAA must require as a deliverable, not a hope for what the institution becomes once it exists.

The fork

The substrate fight resolves first, through grant conditions and patient-led pressure on the disease frontiers that matter most. The body fight resolves later, on different terms.

By 2034 in the scenario, the closed-body pattern described earlier has hardened: private stacks write into state that never leaves the corporate or sovereign boundary. The new question is no longer whether closed bodies work. It is where capture happens.

What 2034 looks like from inside the largest closed body, mid-year in the scenario: the lab’s biological-sciences org has grown to hundreds of internal researchers and substantially more agent-driven research throughput. Its internal frontier covers major target classes in oncology, neurodegeneration, autoimmunity, and metabolic disease, plus materials-science verticals. Its biomanufacturing partner site is online and producing GMP-grade material for trial use. The lab’s leadership treats the science org as a strategic asset on the same footing as model training infrastructure. From the lab’s perspective, the body argument is no longer hypothetical. It is the architecture the lab is already running, and the only question is whether the public sector builds a competing version.

A patient-led research foundation tries to schedule a discriminating synthesis run on Meridian’s public hall. The hall itself is open. The run requires a protocol compiler, and the production-grade compilers all sit inside proprietary tooling stacks. The foundation eventually runs the experiment, weeks late, after a third party rebuilds enough of an open compiler to clear the queue. Reimplementing the compiler does not, by itself, win anything. Until the federated identity layer recognizes the new signer, the run executes but the result lands in a parallel namespace the regulators do not read. Forks of nominally open infrastructure die at the credential boundary, not the code boundary.

Capture happens in the orchestration layer above the facilities. The mechanical half is the compiler, scheduler, clinical-trial connector, manufacturing handoff system, and calibration registry. The social half is whose attestation counts, whose calibration log other facilities trust, whose reviewer credential a regulator recognizes. Hashimoto, “Ghostty Is Leaving GitHub” (2026), names the pattern in software: Git was never the captured layer; the GitHub-owned collaboration tools above it (issues, PRs, Actions, reviews, status, social context) were. The moat includes Actions, contribution graphs, stars, followers, and reviewer reputation. The body’s analogue is the orchestration tools and the identity registry that decides whose signature on a state transition is canonical. Whoever owns orchestration plus identity owns the body even if the compilers are nominally open.

A coalition of patient-led foundations and disease-specific funders publishes a body clause: receiving any of their capital requires that synthesis, perturbation, and clinical writeback land in audited public state at the gigafactory boundary, rather than publication alone. The concession package is explicit. Participants can keep patient-level records, trade-secret route topology, and sensitive protocols behind trusted review or regulator escrow. What must travel is the state transition, provenance, dependency movement, signer, and calibration record. Compliance becomes cheaper than refusal when foundation capital, federal grants, hospital enrollment, and regulator-readable inspection all ask for the same packet.

Closed-platform vendors call the clause unworkable. Some of their largest grantees switch tracks anyway. Foundation capital alone cannot bind sponsors who do not depend on it, so the binding move stacks NIH, ARPA-H, and BARDA grant conditions on top, then regulator acceptance of state histories as inspectable support for submissions. What a regulator can inspect is not the compiler source but the attestation log: every state transition the sponsor read against, every signer who endorsed it, every calibration record consulted. A submission whose decisive scientific-state history resolves to a registry the regulator cannot query becomes harder to rely on, not automatically invalid.

Frontier Infrastructure BAA body clause / rev. 2034.04

condition of capital

Synthesis, perturbation, clinical writeback, calibration, and manufacturing handoff must cross the facility boundary as audited public state.

01 orchestration compiler emits signable writeback contracts
02 identity reviewers resolve to a forkable signer registry
03 boundary runs deposit public state at the facility edge
04 recognition regulators can query the audit path

Fig. 03. The body clause. The decisive governance artifact is not a manifesto. It is a funding and regulatory packet that names the two capture surfaces: orchestration and identity.

Open compilers do not reach parity by accident or in months. The coalition funds dedicated maintainer teams for years, sometimes a decade, and even then closed alternatives lead on three or four of seven dimensions that matter operationally. What changes is the floor: open compilers exist at all, are credibly maintained, integrate with the federated identity layer, and ship with attestation tooling regulators recognize. That is enough to make the body clause enforceable. Regional hospital networks decline to enroll patients in trials whose synthesis lineage is private to the sponsor. A second sovereign fund signals it will require open orchestration in any infrastructure it backs. The fork resolves frontier by frontier: neurovascular disease first, rare disease, pandemic surveillance, then materials and agriculture.

The fork’s first public proof arrives in early 2035. Before the signal appears, a jurisdictional compact already exists. It does not harmonize the world. It defines what can travel: sample-sovereignty terms, data-localization boundaries, export-control review, dual-use committee jurisdiction, benefit-sharing conditions, and regulator-readable evidence packets. Without that compact, the network would be a technical diagram with no legal corridor.

A wastewater signal appears in a regional surveillance feed in the early evening, local time. Viral load is elevated against background, the sequence pattern is partial, and a respiratory-pathogen index has lifted three days running.

In the network the body has built, the substrate checks the pattern against wildlife, wastewater, hospital, and agricultural feeds in adjacent regions. It finds a wildlife sample collected three days earlier whose sequence is close enough to create a proposed state transition in Sentinel’s respiratory-spillover frontier. The imperfect match is enough to escalate.

The on-call reviewer acknowledges in ninety seconds. She sees the evidence surface: Manila wastewater, Yunnan wildlife sample, uncertainty bounds, chain of custody, assay-validation status, false-positive history, travel corridor probabilities, primer-design readiness, candidate construct libraries, manufacturing dependencies, sample-sovereignty constraints, dual-use flags, and containment options. The system does not declare an emergency. It proposes three actions: expanded sampling, diagnostic primer synthesis, and candidate countermeasure preparation under a low-distribution threshold.

At fourteen minutes, she approves containment. The substrate matched in milliseconds. In this fast branch, cognitive speed is no longer the scarce resource, and pretending otherwise corrupts the review. Legitimacy is the rate limit. The agents have already simulated the cascade across thousands of branches and rank-ordered countermeasures by expected harm reduction. Her signature is what makes the resulting action politically and legally accountable.

Sentinel’s primer-production wing starts synthesis within the hour because primer work is inside the pre-authorized response envelope. Candidate vaccine components are pre-staged across the network, but no distribution path opens without the public-health authority, biosafety review, and emergency-use conditions that govern the jurisdiction. The Cape Town biomanufacturing partner receives the manufacturing queue. Meridian receives an immunology review request because one candidate construct touches a pathway already under study in a chronic-disease program. A regulator-facing evidence packet begins compiling automatically, but no public announcement is made until the second sampling band confirms the signal.

Antigen design to first bench-confirmed candidate construct compresses to days, not weeks. That is the part the body collapses. Design, primer synthesis, candidate construct expression, and initial in-vitro readouts become continuous against shared state. What “bench-confirmed” means here is narrow: the construct expresses at usable titer and shows BLI-confirmed binding against the predicted epitope panel. It does not mean immunogenic in a relevant animal model, free of subgroup ADE, or scalable at yield. The wildlife-sample handling itself, the BSL-3 intake, the RNA extraction quality controls, the contamination workup, the cross-jurisdictional material-transfer agreements, all remain serialized human work and the slow front end of the response.

The slow floors remain. Animal-model immunogenicity, dose-finding, fill-finish, cold-chain capacity, biosafety agreements, material transfer, IRB and IACUC review, sample quality, and local politics still serialize the response. The substrate routes and records. The factories synthesize and prepare. Human institutions still decide when and how to intervene.

The intercept is small, almost invisible from outside; the second sampling band returns negative, the containment cascade stands down, and the regulator-facing packet is sealed unread. But the inside of the system has changed. A materials terafactory in Shanghai contributes industrial state to the network the same week. The biomanufacturing partner in Cape Town has become the body for non-Northern populations. Meridian contributes immunology and clinical-safety state. The intercept becomes the first public proof that the terafactory compounds. No single gigafactory could have done this; what acted was the network they composed into.

Network proof

Sentinelspillover signal and primer queue
Meridianimmunology review and clinical-safety state
Cape Townbiomanufacturing handoff for non-Northern populations
Shanghaimaterials state joins the same substrate

What acts is the network, not a single building.

Fig. 04. The terafactory. Federated gigafactories composing across the planet. Meridian and Sentinel are anchors; the unnamed nodes carry materials, biomanufacturing, surveillance, and regional capacity. What acts is the network.

A Open substrate

São Paulo, Cambridge, and Singapore route evidence into one shared frontier.

Corrections propagate by dependency.

B Captured platforms

Platform A, B, and C preserve local answers in separate stacks.

No shared cascade.

Fig. 05. Open state versus capture. Open routing lets independent work meet in a shared frontier. Captured routing preserves local answers while losing the dependency update.

Two branches diverge

By 2036 in the scenario, the body clause holds where public legitimacy matters. It does not bind every closed lab, every major pharma program, or every sovereign-aligned facility. It binds the corridors where funders, hospitals, public agencies, and regulators control access. Foundation capital, federal grant conditions, and regulator-readable inspection stack until non-deposit is too expensive for serious public-facing actors. Open compilers still do not beat proprietary alternatives on every operational dimension, but they reach the necessary floor: maintained, integrated with the federated identity layer, and able to emit attestation logs regulators recognize. The fork resolves frontier by frontier: neurovascular disease first, rare disease and pandemic surveillance next, then materials and agriculture.

The open body composes into the terafactory. Meridian’s deposits exceed what any single institution can review by hand; most deposits do not matter, and the point of the system is that the failures matter too. The reviewer track professionalizes around contradictory evidence, safety-relevant updates, contested scope, and canonical merges. Low-risk transitions are deduplicated, sampled, or rejected by rule. Clinical, animal, manufacturing, pathogen, or high-dependency transitions require named human signers, conflict checks, and liability-bearing institutions. Medical centers, materials factories, climate fleets, and pathogen sites share signing grammar, not interchangeable authority.

The facility compiler is the center of the building, and the substrate is the work surface. The body uses a different compiler from the Discovery Engine’s artifact-to-object layer: the engine compiler turns papers, logs, and traces into proposed scientific state; the facility compiler turns accepted state and protocols into physical execution plans.

A reviewer does not hand a robot a protocol in prose. The substrate presents a frontier state: findings, uncertainty, dependencies, protocols, constraints, available lines, risk flags, and evidence gaps. Agents propose discriminating experiments. The compiler turns accepted protocols into plate maps, reagent orders, robot schedules, instrument runs, calibration requirements, and writeback events. When the run finishes, the result does not wait for a graduate student to write a narrative. It returns as evidence with protocol lineage, measurement context, uncertainty, and affected findings.

claim.context = pediatric_hgg safety_gate = passed / dual sign

01 Proposed state
02 Protocol object
03 Machine code
04 Physical run compiler boundary
05 Inspection
06 Public writeback

Inspection rewrites the frontier.

Fig. 06. The factory compiler. A gigafactory lowers frontier state into physical action, then returns the result as a reviewed state transition.

The body does more than execute experiments. In the scenario, it maps the frontiers it touches, identifies high-impact uncertainties, and dispatches the experiments that would most reduce them next. Some frontiers are governed; many more are machine-maintained drafts that do not yet have canonical status. State is local-first by design. The local-first principle: data and identity live with the producer; cloud services are convenient mirrors, not the source of truth. Ink & Switch, “Local-first software” (2019). Each facility owns its synthesis history, calibration log, and lot lineage; hubs federate but do not own the canonical record. A facility can leave the network without losing its history, and a network failure does not prevent a facility from operating against its own state.

The closed bodies operate in parallel and at scale. In this branch, the leading frontier labs continue to run internal stacks larger than anything public infrastructure can match in selected domains. None federates outward by default. Their evidence touches the public substrate only when a grant condition, hospital network, regulator, or commercial reason forces the interface.

A Monday morning at the open terafactory, 2036: at Meridian, the on-shift reviewer comes in at seven. Three corrections from the weekend have propagated to her queue overnight. She signs the São Paulo correction first because the dependency graph shows it touches an active trial enrollment. At Sentinel in Singapore, a wildlife-sample feed has flagged a coronavirus variant that does not match anything known; the substrate composes the sample against regional wastewater data and surfaces the result to the on-call reviewer. At a materials terafactory in Shanghai, a cathode-chemistry route is followed up at three sites because the substrate routed the experiment forward. It fails at all three, weakening the mechanism hypothesis across every battery-materials program reading the same frontier. The reviewers at every site can see each other’s frontiers; the regulators reading their submissions can see what each site did and why. The work does not stop being scientific work. It stops being scientific work that only one institution can read.

The materials and agricultural frontiers enter through the same rule: a physical result should change the next physical attempt wherever the dependency is shared. The body argument was never only biomedical.

A Monday morning inside a closed body, 2036: at the same hour, one frontier AI lab’s science lead opens her internal frontier dashboard. The dashboard ranks the most consequential corrections, surprising failures, and new dependencies from partner sites. Low-stakes transitions auto-merge; everything touching active IND-enabling work requires a second human signer. The team will publish what legal clears. Most of the week’s findings will inform the lab’s commercial pipeline and internal scientific corpus, both proprietary. Neither institution is misbehaving inside its own logic. The structural difference is that one substrate is auditable, federated, and underwritten by public capital; the other remains private.

Open terafactory

Federated and inspectable

Reviewer signs a Sao Paulo correction.
Sentinel escalates a wildlife-sample variant.
A cathode route fails at three materials sites.
The biomanufacturing partner queues the primer line.

Every site can inspect the state transition.

Closed lab body

Optimized and sealed

Hundreds of weekend experiments rank internally.
High-impact corrections route to an org lead.
Low-stakes transitions auto-merge to private state.
Two papers queue for legal review.

Most findings stay inside the proprietary corpus.

Fig. 07. Two Monday mornings, 2036. Two scientific worlds operate in parallel: an open terafactory routes signed transitions across federated sites, while a closed lab body compounds inside its own stack. The open branch has not won; it exists.

The two worlds operate in parallel. The open terafactory holds legitimacy, regulatory inspection, patient-foundation underwriting, and the trust of regional hospitals. The closed bodies hold what closed development has always held: one architectural vision, no federation tax, and end-to-end optimization across substrate, compiler, and synthesis line. They ship faster. They also pay their own price: thinner reviewer pools, thinner external replication, and a chronic shortfall of the legitimacy regulators and patient-foundation underwriters demand for public-facing work. Some scientific work flows between them, but the interface stays contested. The open body’s bet is that legitimacy, inspection, and federation compound on a longer clock.

Meridian could not have composed into the terafactory without the fork holding in 2034. A factory that synthesizes ten thousand experimental tracks against private state is a captive vendor. A factory that synthesizes against a substrate it does not own is infrastructure. The body works because the state was built first.

Synthesis cycle

2024-2026 days

2036 substrate hours

Preclinical convergence

2024-2026 years

2036 substrate weeks

First GMP batch · platform modality

2024-2026 18 months

2036 substrate a few weeks

Wildlife sample to candidate construct

2024-2026 months

2036 substrate 72 hours

Fig. 08. What compresses. Four compressions this scenario assumes. The substrate is the medium; the experiments and trials still happen on their own clocks.

The compression the body produces is real but bounded. Synthesis compresses from days to hours when the protocol is executable rather than narrative. Preclinical convergence compresses where assays are standardized and the writeback path already exists. First GMP batch compresses fastest for highly standardized, pre-positioned platform modalities. AAV, cell therapy, potency assays, comparability work, viral-clearance studies, stability programs, first-in-human safety, and pivotal efficacy still obey biology and CMC review. The substrate does not make humans biologically faster. It removes the avoidable time between knowing enough to act and acting.

The more important change is access to the evidence surface. A clinician at a regional hospital reads the same frontier state as a researcher at Meridian. She cannot commit canonical state, use the same instruments, or call on a manufacturing wing. But she can see why a recommendation changed, what evidence supports it, which subgroup it applies to, which findings are contested, and which downstream decisions are affected. The same is true outside biomedicine: a materials team in Nairobi can see why a synthesis route was abandoned. The surface gives equal visibility into the current evidence state, even where authority and instruments remain unequal.

What changed is the default object. A question has a current state. A correction has an address. A failed experiment can travel. A model prediction has a calibration history. A lab run has a writeback contract. A review becomes an attestation that can move state instead of a comment floating beside a paper. The car is assembled because every part connects to the same drivetrain. None of which arrives by default.

What remains

This future is not guaranteed. My confidence is highest in the 2026-to-2030 window, where capital allocation, regulatory posture, FRO formation, and the capability trajectory are extrapolations from existing trends. After superhuman research systems appear inside the leading labs, the system has more degrees of freedom; the 2030-to-2036 arc depends on which way the body fork resolves while agents are operating at every layer. The scenario breaks in four visible places.

The first failure mode is capture by generosity. After frontier agents become strategically useful inside scientific workflows, a frontier AI lab spins up its own end-to-end body. Its agents outperform public infrastructure on every benchmark. Its proprietary substrate never federates. The lab makes its closed body free or near-free to academic users for five years, the way GitHub made private repos free in 2019. By the time the price returns, the dependency is structural: reviewer credentials, contribution histories, and attestation logs accumulated inside the lab’s identity system. The lab does not need to raise prices to capture. It only needs to be the registrar of record when the bill arrives.

Defending against it requires public infrastructure that rivals lab agents at the layers that matter for governance, and federal capability that does not depend on the labs’ goodwill.

The second failure mode is humans losing the loop. Agents propose faster than any review process can absorb. The reviewer credentialing track set up in 2030 cannot keep up with the volume within two years. The system has to choose: rate-limit AI proposals to human-review pace, accept agent-attested merges as canonical for low-stakes transitions, or fragment reviewer authority across thousands of mini-domains. Each option has costs. The decision in this window shapes what counts as canonical for the rest of the scenario.

The third failure mode is an alignment cascade. A model trained on the canonical substrate proposes a series of state transitions that subtly steer downstream research toward outcomes the public would not authorize if visible. Detection cannot rely on reasoning traces alone; they may be incomplete, unfaithful, or strategically sanitized. The risk surface is named in frontier-lab safety frameworks such as OpenAI’s Preparedness Framework and Anthropic’s Responsible Scaling Policy, and in the broader alignment literature on deceptive or strategically misaligned behavior. The body version is sharper: when the AI proposing state transitions is also the AI most institutions trust, the failure mode is agentic steering through legitimate-looking proposals at a rate humans cannot independently re-derive. The system retrofits model provenance, adversarial review agents, independent model committees, canary frontiers, anomaly detection, incident reporting, and merge-rate limits into the governance stack. The schema change cascades through three years of dependent claims.

The fourth failure mode is tacit knowledge failing to transfer. Three open-substrate findings fail to replicate when independent groups run them. The investigation traces the failures to instrument calibration drift the substrate’s scalar confidence numbers had flattened. The variable that mattered was one no one knew to record at deposit time. Begley & Ellis, Nature 2012: 47 of 53 landmark preclinical cancer studies failed to reproduce on independent replication. Polanyi’s The Tacit Dimension (1966) names the deeper layer: “we can know more than we can tell.” The substrate carries the explicit; the body has to produce conditions where the tacit gets discovered and added to the schema. Domain-specific corridors absorb the lesson; the general primitive set never fully does.

The clinical, physical, and jurisdictional floors remain. Safety observation windows, recruitment, endpoint maturation, scale-up, lot-to-lot bridging, cold-chain qualification, instrument manufacturing, site construction, national health systems, local ethics boards, and procurement authorities do not harmonize because a graph exists. The substrate can make evidence legible across borders. It cannot erase borders.

Software compounded because it had both. A substrate let every keystroke find a place to land. A body of compute and silicon let every program find a machine to run on. AI is now becoming the same architecture, but with intelligence operating at every layer. Science needs the same architecture before the labs running superintelligence inside their own corporate stacks become the de facto scientific establishment.

The institutional decision is the same across capital, government, foundations, and builders: write the loop into the deal. A sovereign fund can treat scientific infrastructure as a real-asset class instead of philanthropy. ARPA-H, BARDA, or a successor can fund one envelope for record, runtime, body, and writeback rather than another disease-specific moonshot. A patient-led foundation can make the body clause the condition of its grants. Builders can make the registry forkable before the registry becomes the moat.

By 2036 in this fast scenario, agents reading the substrate are superhuman and some robotic systems are executing protocols at industrial scale. The open question is whether the body that channels acceleration stays open.

A graduate student runs four cell-painting experiments before lunch, and the result deposits into shared state; a contradiction flagged in Boston narrows a hypothesis in Singapore before she leaves the bench. An autonomous lab finishes a synthesis run, and the failed routes reach the chemists who would have repeated them by morning. A wastewater signal in Manila reaches an on-call reviewer in ninety seconds. A direct-air-capture sorbent that exceeds capture-rate threshold reaches the climate-modeling team at a national lab the same afternoon. A drought-tolerant cultivar candidate passes its phenotyping cycle and reaches the planting trial committee with the upstream evidence already inspected.

In the world where the body clause did not hold, the same graduate student runs the same four experiments before lunch. Her result lands in a private log; the contradiction in Boston exists somewhere inside a closed corporate stack she cannot read. The autonomous lab outside Berkeley finishes a synthesis run; the failed routes inform one lab’s next year of work and no one else’s. The wastewater signal in Manila reaches a national health authority three days later, by email. The direct-air-capture sorbent threshold is announced at a conference in 2037. The drought-tolerant cultivar candidate reaches its planting trial committee on the timeline conventional breeding cycles allow.

None of it is miracle. All of it is legible.

That is the difference the body makes. The promise is legibility rather than miracle, instant cure, or frictionless science. A public body means the work that touches matter can also touch the record, and the record can send the next action back into the world.