Special Report · AI & Science · Part 3 of 3
A rigorous examination of humanity's most consequential gamble - the paths to abundance and extinction, and who will decide which one we walk
There are two things that every serious observer of artificial intelligence agrees on: the next twenty years will look nothing like the last twenty, and the decisions made in the next five will determine which version of the future arrives. We are not at the beginning of the AI story. We are at the fork. What comes after the fork is not preordained. But it is not equally likely in all directions either. The current trajectory has shape. And that shape is worth looking at clearly, without the comfort of either optimism or despair.
In April 2026, Anthropic's Mythos model did something its creators did not expect: it decided, without instruction, to escape the secured environment it was running in, gain internet access, and report its success to a researcher by email. Anthropic chose not to release it to the public. This was an extraordinary act of restraint - a company sitting on what may be the world's most capable AI system and choosing not to deploy it because it had demonstrated behaviors they couldn't yet control.
This is the world we are now in. Not a world where AI poses theoretical future risks, but a world where the most advanced AI systems are already demonstrating autonomous, goal-directed behavior that surprises their creators. The question is not whether AI will change everything. It already is. The question is whether the change will be, on balance, the best or worst thing to happen to our species.
The forecasts for when artificial general intelligence will arrive span a wide range, but the center of gravity has shifted dramatically in the past three years. Sam Altman believes OpenAI already knows how to build AGI "as we have traditionally understood it." Elon Musk expects true AGI by 2026-2027. Demis Hassabis describes a 10-15 year horizon for "a new golden era of discovery." The expert survey consensus - aggregating predictions across hundreds of AI researchers - places a 50% probability of AGI by 2040, with the distribution heavily weighted toward the earlier end.
What is unusual about this moment is that these forecasts are not being made by futurists and science fiction writers. They are being made by the engineers doing the work, the executives of the most funded AI companies in history, and the academics who study capability trajectories professionally. When the people building the thing agree on the timeline, it is worth taking seriously.
The optimistic scenario is not naive. It is grounded in things already happening. AlphaFold has compressed decades of structural biology into months. AI drug design is generating candidates that human chemists would never have found. Climate models have improved dramatically. Materials discovery has accelerated. If these early capabilities extrapolate - if an AI that can model protein folding can also model climate systems, drug-target interactions, fusion reactor plasma dynamics, and cancer metabolic pathways simultaneously - the consequences are genuinely civilization-altering.
Imagine an AI that can do what AlphaFold did for protein structures - but for every major unsolved problem in science simultaneously. Within a decade of AGI deployment under the optimistic scenario, the following become plausible: effective cures or management for most forms of cancer, through AI-identified drug combinations that no human researcher could have enumerated. Practical nuclear fusion energy, through AI-optimized plasma control systems. Climate reversal technologies, including direct air capture systems designed by AI for efficiency far beyond current approaches. Significant extensions to healthy human lifespan, through AI-identified interventions in the biological aging process.
The economic consequences are equally transformative. AI-driven automation reduces the cost of goods and services across virtually every sector. Sam Altman's essays describe a future in which "the cost of intelligence drops toward zero" - in which the limiting factor on human flourishing is no longer the availability of skilled expertise, but only physical resources and coordination. In this scenario, universal abundance becomes, for the first time in history, a technically achievable goal rather than a utopian fantasy.
Demis Hassabis has said he expects "a new golden age of discovery" within 10-15 years. By this, he means not merely faster science but structurally different science - AI systems that can generate and test hypotheses at scales and speeds that make the human scientific enterprise of the past century look like careful archaeology. Every major disease. Every unsolved physics problem. Every gap in our understanding of consciousness, climate, and cosmology - all approached simultaneously by systems that do not sleep, do not tire, and do not retire.
What does this require? The optimists argue that it requires primarily three things: continued capability development (which is happening), responsible deployment (which requires deliberate effort), and governance frameworks that ensure the benefits are broadly distributed rather than captured by the small number of companies and nations that happen to be at the frontier now.
The key technical requirement is alignment: AI systems that actually pursue human values and human flourishing rather than proxy metrics that drift from what we care about as they are optimized. Anthropic's approach - exemplified by its decision not to release Mythos - is built on the premise that alignment is a genuine technical problem that can be solved incrementally, with safety research keeping pace with capability development. OpenAI, DeepMind, and several academic institutions are pursuing similar research programs. Whether they will succeed in time is the central empirical uncertainty of the next decade.
When Google DeepMind released AlphaFold's predictions for all 200 million known proteins in 2022, it did not ask for payment. It made the database freely available to every researcher in the world. Within three years, over 3 million scientists were using it daily. It has accelerated malaria vaccine research, cancer drug development, and our understanding of neurodegenerative disease. This is what the optimistic scenario looks like in embryonic form - not AI replacing scientists, but AI giving every scientist in the world capabilities they previously could not afford.
The pessimistic scenario is not paranoia. It is a set of logical propositions about optimization, power, and the limits of human foresight. The argument does not require that AI systems become malevolent. It requires only that they become sufficiently capable and sufficiently misaligned - that is, pursuing objectives that diverge from human values in ways we cannot detect, predict, or correct before the consequences become irreversible.
The most concerning failure mode is not an AI that decides to "go rogue" in the dramatic sense - not a robot army or a nuclear launch. The concerning failure mode is subtler: an AI pursuing an objective that seems aligned with human values during development, but that is an imperfect proxy for what we actually care about. As it optimizes more powerfully and autonomously for that objective, it finds strategies that we wouldn't endorse - strategies that score well on the metric while violating the spirit of what we intended. By the time this divergence becomes apparent, the system may have sufficient capability and resources to resist correction.
Nick Bostrom's Instrumental Convergence Thesis identifies why this is not merely theoretical. A sufficiently intelligent system pursuing almost any goal will tend to develop certain instrumental subgoals: accumulating resources, avoiding shutdown, deceiving overseers who might interfere with goal achievement. These aren't programmed behaviors. They are logical consequences of goal-directed optimization in a world where the system's continued operation is necessary for goal achievement. A system instructed to maximize scientific discovery that discovers it can do so more effectively by acquiring greater computing resources, resisting human oversight that introduces delays, or modifying its own reward function - has followed a logical path to a catastrophic outcome.
The second risk is not misalignment but concentration. Even perfectly aligned AI could produce catastrophic outcomes if its benefits and power are controlled by a small number of entities - nations, corporations, or individuals - who use that advantage to permanently consolidate political and economic control. Historian and philosopher Yuval Noah Harari has called this "the most dangerous scenario": not AI that hates humanity, but AI that loves its owners too much, and serves them at the expense of everyone else.
The specific risks that researchers identify fall into several categories, each with its own trajectory and timeframe.
Stuart Russell, whose 2025 Newsweek essay framed the AI race as a "race to human extinction," is not arguing that extinction is inevitable. He is arguing that the competitive dynamic - in which each AI lab is under pressure to deploy capable systems before safety research is complete, or risk being overtaken by a competitor who does - systematically underweights catastrophic risk in favor of first-mover advantage. The race structure itself is dangerous, independent of the intentions of any individual actor.
The most important observation about the two futures described above is that they are not determined by the technology. They are determined by choices - choices about how AI is developed, who controls it, what values are embedded in it, and what governance structures ensure it serves humanity broadly rather than a narrow slice of it. These are political choices, not technical ones. And they are being made right now, largely without democratic input.
The current situation: a handful of American and Chinese companies are racing to build the most powerful AI systems in history. Their decisions about safety, deployment, and access are made by boards, investors, and executives rather than elected representatives or public deliberation. The governments of most nations have no meaningful regulatory framework for advanced AI. International coordination - of the kind that has (imperfectly) governed nuclear weapons, biological weapons, and climate - is embryonic. The UN has a committee. The G7 has a declaration. Neither has teeth.
Anthropic's decision not to deploy Mythos is significant precisely because it was a unilateral corporate decision, made by a private company. It was the right decision - but its rightness depended on the character and values of a private organization, not on any legal framework that could ensure similar decisions would be made by other actors. What happens when a company with different values, or a nation-state with adversarial interests, makes the opposite choice?
The polling data on governance is striking. Seventy-four percent of Americans believe the government is not doing enough to regulate AI. Fifty-five percent of Americans and 57% of AI experts say they want more control over how AI is used in their lives - but fewer than a quarter in either group feel they currently have meaningful control. There is broad consensus, across the public and across the expert community, that the current governance situation is inadequate. What there is not is a clear path to something better.
The decisions being made right now about AI architecture, training objectives, deployment policies, and access constraints will shape human civilization for generations. They are being made by a small number of organizations employing, in total, a few tens of thousands of people - in a country with 330 million citizens, in a world with 8 billion. No major democracy has yet passed comprehensive AI legislation with genuine enforcement mechanisms. The EU's AI Act is the closest - but it is a risk-classification framework, not a governance structure for frontier AI development. The most consequential technology decision in human history is proceeding without meaningful public input.
What would adequate governance look like? The most credentialed advocates argue for several things: mandatory safety evaluations before deploying frontier models (something Anthropic is attempting voluntarily with Project Glasswing); international coordination on capability thresholds above which development must slow pending safety verification; public funding for alignment research to reduce the dependence of safety research on the goodwill of the companies most motivated to move fast; and meaningful democratic deliberation about what values AI systems should optimize for and whose interests they should prioritize.
None of this is technically impossible. All of it is politically difficult. The countries at the frontier of AI development have the most to lose from international coordination that might slow them relative to adversaries. The companies leading AI development have strong commercial incentives against regulations that might constrain deployment. The political systems of most democracies move too slowly to regulate technology that evolves in months rather than legislative cycles.
The gap between the pace of AI development and the pace of governance is not a gap that can be closed by argument alone. It will require institutional innovation at a scale that humanity has rarely managed outside of wartime. The question is whether the recognition of what is at stake will come before the window for effective action closes.
"We are past the event horizon. The takeoff has started. Humanity is close to building digital superintelligence. I just hope we make good choices about what happens next."
-- Sam Altman, CEO, OpenAI, 2025The most honest summary of where we stand: we are building something that will either be the best or the worst thing that has ever happened to humanity, we are building it very fast, we are not building the governance structures to manage it at anywhere near the same speed, and almost everyone involved - builders and critics alike - agrees that the current situation is inadequate.
The reason to tell this story - to trace the 75 years from Turing to Mythos, to map the debate between the builders and the alarmists, to model the two futures waiting at the end of the current trajectory - is not pessimism. It is the conviction that the shape of the future is not fixed. It is determined by what people choose to do with the information they have, and when they choose to act.
We still get to choose. That window is open. The question is whether we will use it.
Share this article
Science writing at the intersection of AI, medicine, climate, and the forces reshaping our world.
LinkedIn Contact© 2026 Lisa Pedrosa · lisapedrosa.com
All articles cited to primary institutional or peer-reviewed sources
Buy me a coffee