Medicine · AI & Oncology
A stool sample. A machine learning model. And a 90% cancer detection rate. The gut microbiome just became medicine's most promising diagnostic.
Read
Your gut contains roughly 38 trillion microorganisms - bacteria, archaea, viruses, and fungi - whose combined genetic content is more than 100 times the size of the human genome. For most of the history of medicine, this ecosystem was essentially invisible. We knew it was there. We knew disrupting it was bad. We did not have the tools to read it at any meaningful resolution. A research team at the University of Geneva has now built those tools, and what they have found is this: the microbiome knows when you have cancer before you do.
The Finding
The paper, published in Cell Host and Microbe, describes a new machine learning model trained to detect colorectal cancer from a single stool sample by reading patterns in gut bacteria at a finer resolution than any previous method. The detection rate: 90% of colorectal cancers identified. The specificity: high enough for clinical use. The comparison point: a colonoscopy, the current gold standard, detects approximately 94% of colorectal cancers. The Geneva test, which requires no sedation, no bowel preparation, no clinical visit beyond a sample drop-off, achieves 90% from a non-invasive sample collected at home.
Existing non-invasive colorectal cancer screening tests - the fecal immunochemical test (FIT) and the commercial Cologuard DNA stool test - achieve sensitivity in the range of 70 to 79 percent. The Geneva model outperforms both, substantially, while costing less than a colonoscopy and requiring none of the procedure's logistical burden. In countries where colorectal cancer screening uptake is poor specifically because of the colonoscopy barrier - and it is the third most common cancer worldwide - this matters enormously.
The researchers went further. The same model can detect precancerous adenomas - the polyps that precede colorectal cancer - and can classify which stage of the cancer spectrum a patient is in. Early-stage detection is where most cancer screening fails; the gut microbiome signal appears early, before tumors become clinically evident. The promise here is not just diagnosis but prevention through early intervention.
The Science
The Geneva team's central innovation is not the machine learning model itself - it is the resolution at which they read the microbiome. Previous studies linking gut bacteria to colorectal cancer looked at the species level: Bacteroidetes, Firmicutes, Fusobacterium nucleatum. Species-level analysis is a useful signal, but it is too coarse. Different strains within the same species behave completely differently - some promote cancer, some have no effect, some may even be protective. The Geneva team used what they call subspecies-level classification.
To do this, they built the first comprehensive catalogue of human gut bacteria at intermediate resolution - precise enough to distinguish functionally distinct microbial subgroups, consistent enough to be replicated across diverse populations. This catalogue, generated from large-scale metagenomic sequencing, is the key technical contribution of the work. Without it, the machine learning model has no meaningful input. With it, the model can read disease-relevant microbial patterns that were previously invisible.
What Subspecies Resolution Means
Imagine trying to detect cancer from a blood test, but your test can only tell you whether the blood came from a human or a dog - not from which individual, and not from which organ. That is roughly the diagnostic power of species-level microbiome analysis. Subspecies resolution is the equivalent of getting a detailed individual profile. The patterns that predict cancer are not carried by Bacteroidetes as a whole - they are carried by specific lineages of specific strains. Reading those lineages is what makes the 90% detection possible.
The microbiome's relationship with colorectal cancer runs deeper than simple correlation. Specific bacterial communities are known to promote oncogenesis through several mechanisms: producing genotoxic compounds that damage colonic epithelial DNA, generating chronic low-grade inflammation that creates a pro-tumorigenic environment, metabolizing bile acids into forms that promote cell proliferation, and disrupting the mucosal barrier that protects the gut lining from bacterial penetration.
What the Geneva model reads is not individual bacteria making these things happen. It reads the ecological signature - the community-level pattern that emerges when a cancerous environment begins to shift the microbial balance. This is an important distinction. The model is not detecting a single bacterial biomarker. It is detecting a microbial ecosystem state. The signal is the whole community, read at high resolution, processed by a model trained on the difference between healthy and cancerous states.
The model was trained on stool samples from patients with confirmed colorectal cancer at various stages, patients with adenomas, and healthy controls. The training used combined metagenomic data with patient records to learn the microbial patterns associated with each state. The result is a classifier that generalizes across different populations - the researchers specifically designed the subspecies catalogue to be consistent across ethnically and geographically diverse cohorts, addressing a known failure mode of earlier microbiome-based cancer research.
Why This Matters
Colorectal cancer is the third most diagnosed cancer worldwide and the second leading cause of cancer death. It is also one of the most preventable and treatable cancers when caught early. A stage I colorectal cancer has a five-year survival rate above 90 percent. Stage IV has a five-year survival rate below 15 percent. The difference between those two outcomes is almost entirely about when the cancer is found - and whether the person had access to a screening test they were willing to undergo.
Colonoscopy is genuinely effective. It finds polyps, removes them during the same procedure, and catches cancers before they become deadly. It is also uncomfortable, time-consuming, requires bowel preparation the day before, sedation during the procedure, recovery time afterward, and usually requires a companion to drive the patient home. In countries with voluntary screening programs, uptake consistently falls short of public health targets. The people who most need early detection are often those least likely to complete a colonoscopy: people who are working multiple jobs, who lack health insurance, who do not speak the language of the healthcare system, who live far from a colonoscopy facility, or who simply find the procedure too off-putting to schedule.
The best cancer screening test is not the one with the highest sensitivity in a clinical trial. It is the one people actually do. A home stool test at 90% detection will save more lives than a colonoscopy program at 94% if it reaches the people who never went for the colonoscopy.
Lisa Pedrosa
The FIT test was designed to address exactly this problem - a simple stool sample mailed in, testing for blood in the stool. It is cheap, non-invasive, and widely used. But its sensitivity of roughly 70 to 79 percent means that 20 to 30 percent of colorectal cancers produce no false positive - they simply go undetected. Cologuard improved on this by adding DNA analysis to the stool sample, but at a cost that limits its accessibility in many countries. The Geneva model's 90% detection rate represents a meaningful jump above FIT performance, potentially from a test that is comparably low-cost and comparably easy to administer.
The further possibility - already being discussed - is integration with existing screening infrastructure. Rather than replacing colonoscopy, a microbiome-based test could function as a triage tool: a low-barrier first test that flags high-risk individuals for follow-up colonoscopy. If the 90% detector catches the cases that need the procedure, colonoscopy resources can be concentrated where they matter most. The math here is compelling: a larger base screened non-invasively, with a smaller proportion referred for the definitive procedure, could increase both total cancer detection and efficiency of specialist time.
What Comes Next
The Geneva study is a proof of concept and a demonstration of scale - not yet a validated clinical test. The next steps are prospective clinical trials, where the model is tested on new patients before their cancer diagnosis is confirmed, rather than on retrospective datasets where outcomes are already known. Prospective validation is the critical step between a promising research result and a test that regulators will approve and clinicians will use.
The research group is also working on expanding the model's disease applications. The same subspecies catalogue that enables cancer detection has potential utility in inflammatory bowel disease, where gut microbial signatures differ between Crohn's disease and ulcerative colitis and change with disease activity. It has potential in metabolic disease, where the microbiome interacts with host metabolism in ways that affect obesity, type 2 diabetes, and cardiovascular risk. The catalogue is the foundation; cancer detection is the first application demonstrated on it, but it is not the only one.
There is also a larger scientific story here, about what the microbiome is becoming as a field. For most of the past two decades, microbiome research has been characterized by spectacular correlations and frustrating failures to translate them into clinical tools. The connection between gut bacteria and disease is real and deep, but the species-level resolution of earlier work could not generate actionable diagnostics. The subspecies catalogue addresses this limitation directly. If it holds up in prospective trials across diverse populations, it represents a methodological advance that applies not just to colorectal cancer but to the entire project of reading disease in the gut.
The colonoscopy will not disappear. It remains the gold standard for detection, the only tool that is simultaneously diagnostic and therapeutic - finding a polyp and removing it in the same procedure. But the question of who gets screened, and when, and how, is a question about logistics and access as much as it is about biology. A 90% sensitive stool test that can be done at home, returned in the mail, and processed by a machine learning model changes that access equation fundamentally. The gut microbiome has been speaking for decades. We are only now starting to read it properly.
Primary Sources
Share this article
Science writing at the intersection of AI, medicine, climate, and the forces reshaping our world.
LinkedIn Contact© 2026 Lisa Pedrosa · lisapedrosa.com
All articles cited to primary institutional or peer-reviewed sources
Buy me a coffee