Generare Raises $21.5M to Decode Microbial Data for Drug Discovery
Jan de Vries ·
Listen to this article~4 min

Paris biotech startup Generare secures $21.5M to decode microbial genomes, creating a novel molecular dataset to solve drug discovery's 'data problem' and fuel AI-driven research.
Here's a story that might just change how we find new medicines. It's about a Paris-based startup called Generare, and they've just secured a major $21.5 million in Series A funding. Their mission? To solve what they call a fundamental 'data problem' in drug discovery by mining the vast, untapped chemistry written into microbial genomes over billions of years.
Think about it. For decades, drug discovery has been working with a pretty limited set of chemical data. It's like trying to write the next great novel using only the same 100 words over and over. The results are bound to get repetitive. That's the bottleneck Generare is tackling head-on.
### The Core Problem in Modern Drug Discovery
Guillaume Vandenesch, the CEO and co-founder, puts it bluntly. "Drug discovery has a data problem," he says. "The entire field trains its models on the same recycled chemistry and expects different outcomes." The real issue isn't a lack of fancy algorithms. It's the absence of genuinely new, high-quality molecular data to feed those algorithms. Without fresh ingredients, you can't cook a new dish.
Generare's solution is to go straight to nature's original library: microbial genomes. They've built a proprietary platform that reads these genomes, expresses their 'silent' chemistry, and generates novel molecular data. They're focused on small molecules—the type of compounds behind many of our most successful drugs.

### How Their Technology Unlocks Nature's Secrets
Their process is fascinating. They screen tens of thousands of microbial genomes, looking for gene sequences likely to produce bioactive molecules. Then they express those sequences and characterize the resulting compounds. It's a high-throughput method validated by academic research.
The potential here is staggering. The company states that roughly 97% of the molecular chemistry encoded in microbial genomes remains completely unread. That's three billion years of evolutionary R&D just sitting there, waiting to be discovered.
- They are building what they call the largest proprietary dataset of 'cryptic' small molecules.
- Each new molecule discovered is characterized for structure, biological activity, and drug potential.
- This creates a compounding cycle where the dataset grows and becomes more valuable with every discovery.
### Why This Matters in the Age of AI
This is where it gets really interesting for the future. Artificial intelligence is transforming drug discovery, but AI has a critical limitation: it can't invent completely new chemistry to investigate. It can only work with the data it's given. If you keep feeding it the same old data, you'll keep getting the same old results, no matter how smart the model is.
Generare's dataset provides that essential fuel—truly novel structures, pre-vetted by evolution and linked to biological activity. This fresh data can dramatically improve AI's performance in finding new drug candidates. Dr. Vincent Libis, the CSO and co-founder, believes "nature has been the number one source of innovative modes of action for drugs, and we've only scratched the surface."
### What the New Funding Enables
So, what does this $21.5 million infusion mean? It's all about scaling up. The funding was co-led by Alven and Daphni, with participation from existing investors. It will allow Generare to:
- Increase their molecule discovery capacity tenfold by 2027, aiming for over 2,000 molecules with a long-term goal of surpassing 10,000.
- Nearly double their current team of 25 experts, which includes computational biologists, chemists, and engineers.
- Further scale their proprietary discovery platform to supply pharmaceutical companies with genuinely new starting points for their research.
In a field hungry for innovation, Generare is betting that the best blueprint for the next century of medicine isn't in a lab notebook—it's in the microbial world, and they're building the tools to read it.