Compound
GrantsAbout

2026 Compound

Compound
GrantsAbout
Back to Database
The recalibration phase of AI in drug discovery
Bio

The recalibration phase of AI in drug discovery

What we need to do to see the promise of AI in drug discovery

Concept

Despite the years of hype of AI in drug discovery, there are few companies that have made it to the clinic. There are some gaps in our focus on number of molecules screened rather than underlying models, for example. Other difficulties include lack of feedback from clinical trials. If we address these two problems, we can finally see the renaissance of AI in drug discovery.

Longer Description

This was discussed in last week’s Decoding TechBio where they presented an article which displayed the similarities of Exscientia’s AI designed candidates to a drug, Haloperidol, which was approved by the FDA in 1967. Similarly, a paper in February of this year analyzed the chemical space of drugs created by AI-native companies, showing that adjacent, not new, chemical space was inhabited by the molecules (their figure below). To some degree this makes sense because the models were likely trained on successful drugs for the corresponding targets. However, the molecules inhabiting the same chemical space are clearly not living up to the expectations we had regarding AI in drug discovery - where ideally we would be finding more effective treatments by the models finding molecules outside the known chemical space. The ‘novel chemical space’ issue is part of the reason why I’m so bullish on metabolomic mining companies, such as Enveda and Brightseed. Other companies are mining genomes such as Erebagen and Hexagon Bio to discover new molecules are also promising. A recent article by the Eroom’s Law inventor, Jack Scannell et al., compares high-throughput methods for screening in low predictive disease models and low-throughput methods for screening in high disease predictive models. His analysis on the positive predictive value against the number of candidates selected shows the comparatively small effect of orders of magnitude more molecules compared to the predictive validity of the assay being used. Another nuance to drug discovery is that cost savings matter the most in the clinic - especially in Phase II clinical trials. Brender and Cortés-Ciriano modeled in a recent article that cost, quality, and speed savings are most salient in the clinic, especially in Phase II clinical trials (figure below). Both of these views are somewhat at odds with the current state of the field where companies focusing on target discovery and target binding abound.

Other thoughts

Moreover, the Scannell et al. 2022, and Brender and Cortés-Ciriano articles highlight the field’s emphasis on throughput (Scannell et al., 2022) or cost and speed (Brender and Cortés-Ciriano) as opposed to quality. Thankfully, improved quality can come in many forms. Although imperfect, below are some cited modes of drug discovery improvement gains:

  • Better disease models are crucial as these are the modes through which positive predictive value is significantly increased. (Summarized in Box 1 and 2 are various disease/screening models iterations of Hepatitis C and bacterial infections).
  • Improved elucidation of modes of interaction as opposed to targets.
  • Patient stratification - which in recent clinical trials has increased clinical trial success by 50%.
  • 3D cell cultures which are more representative than cell lines.
  • Animal models which correctly recapitulate disease and therapeutic effect.
  • Frequent measurements of cellular toxicity with human HepG2 cells, which is a major cause of failed drugs.
  • Better dosing windows which can be achieved through rat PK models.

Comparable companies

All AI in drug discovery companies.

Related reading

  • Drugs from AI that are in clinical trials are structurally similar to previously approved drugs https://www.cas.org/resources/cas-insights/drug-discovery/ai-designed-drug-candidates
  • New chemical space not seen: https://media.nature.com/original/magazine-assets/d41573-022-00025-1/20096834
  • More on that here: https://www.science.org/content/blog-post/ai-generated-clinical-candidates-so-far
  • MIT researchers shown that alphafold can’t do: "Predicting drug binding is probably one of the most difficult tasks in biology: these are many-atom interactions between complex molecules with many potential conformations, and the aim of docking is to pinpoint just one of them.” https://www.salon.com/2022/09/24/no-ai-probably-wont-revolutionizedrug-development/
  • Source of the compound libraries: https://www.nature.com/articles/nrd.2017.232/)

Related Theses

New bio data + AI businesses
01
Bio

New bio data + AI businesses

Full-stack business required when new data modalities come online

Read
Ozempic for Sleep
02
Bio

Ozempic for Sleep

A sleep-focused therapeutics startup

Read
Cancer Vaccines
03
Bio

Cancer Vaccines

Vaccinate yourself against your own tumor

Read

2026 Compound