AI in drug development

0.0(0)

Studied by 0 people

Call Kai

Learn

Practice Test

Spaced Repetition

Match

Flashcards

Knowt Play

Card Sorting

1/13

There's no tags or description

Looks like no tags are added yet.

Last updated 8:33 PM on 6/7/26

Name	Mastery	Learn	Test	Matching	Spaced	Call with Kai

No analytics yet

Send a link to your students to track their progress

14 Terms

New cards

what are the problems in drug development that could be solved with the use of AI

problems

Expensive- 2-3 billion dollars to make a single drug
Time consuming 10-15 years- researchers have to search through large amounts of biological and chemical data
High risk- 90% of candidate drugs fail before approval
- Failure usually due to lack of efficacy, toxicity, poor PK- which could have been predicted earlier
- Failure at phase II waste billions

how AI can help

AI can address this by processing biological, chemical and clinical data to allow for earlier and more accurate predictions at each stage of the pipeline
This helps reduce attrition, costs, shorten development (3-8 years), improves target selections, also allows for personalised medicine, improved safety profiles

New cards

describe the traditional computational methods used in drug development

In silico screening- HTS in computer, fits molecules into protein target and scores how well they fit. Top ranked molecules progress to in vitro assay
De novo design- screen library of fragments that fit into the target site and modify them to see how they affect the binding at target site
QSAR- predict biological activity of a molecule, detailed computer modelling of ligand properties and this is correlated with activity data at target site e.g. CoMFA predicts biological activity of molecules based on 3D shapes (3D QSAR)

New cards

describe machine learning and how it works

we don’t have to have preset algorithms and ideas been tested
the machine sources out the information. It uses large datasets to train the algorithms to improve through experience
2 types- supervised and unsupervised

New cards

describe the 2 types of machine learning including their use in drug development and pros/cons

supervised

Human input is used to train the AI what to look for
- classification- algorithm is used to assign data set into specific categories
- Regression- algorithm is used to understand relationship between dependent and independent variable
Uses- predict drug-target interactions, toxicity and efficacy
Pro- high accuracy and easier validation
Con- requires high quality labelled datasets

unsupervised

ML discovers hidden patterns in data without need for human intervention, though used at the end to filter what is needed
- clustering- grouping unlabelled data based on similarities and differences
- Association- uses different rules to find relationships between variables
- Dimensional reduction- used to remove noise from the data to reduce the dataset to manageable size
Uses- drug clustering, patient stratification, biomarker discovery
Pros- discover new relationships and new disease subtype
Con- identifies meaningless correlations

New cards

describe deep learning

allows the system to learn from data on its own and interrogate the data more
Uses neural networks made from layers of nodes that mirror the neural framework of our brain with the way it inputs, processes and outputs data
Structure-
- Input layer- this is where all the data goes in
- Hidden layer- do the heavy thinking, analyse patterns and learn from the data
- Output layer- gives the final prediction
Pro- more powerful, allows integration of highly complex data beyond traditional statistics

New cards

give an example of a deep learning AI system used for drug discovery

AlphaFold

Used for target identification and structure prediction
Problem- many proteins are potential drug targets but drug design often requires knowledge of the 3D structure
Traditional methods such as x ray crystallography, cryo-EM, NMR are expensive and technically difficult
Alpha fold uses deep learning to predict protein folding and protein structure directly from amino acid sequence
knowing protein structure is important as it determines ligand binding, receptor activation, enzyme activity and knowing structure allows rational drug design and identification of binding pockets
used in drug development to generate biological knowledge that was previously inaccessible

New cards

give an example of an AI-driven drug

Insilico Medicine (INS018_055)

Disease- idiopathic pulmonary fibrosis
Drug discovery for fibrosis is difficult because multiple pathways involved and poorly understood mechanisms and high failure rate
Machine learning analysed biological datasets and identified novel pathways involved in fibrosis
Generative AI designed candidate drugs rather than screening millions of existing compounds
Candidate drug progressed from target discovery to preclinical candidate in approx 18 months compared to 4-6 years

New cards

name the 4 phases in drug development pipeline where AI can be implemented

target identification

Hit identification/LI/LO

preclinical testing

clinical trial optimisation

New cards

describe how AI can be implemented in target identification phase

Used to analyse large scale biological datasets to identify novel therapeutic targets involved in disease pathology
AI can identify patterns, correlations and disease-associated molecular signatures that may not be apparent to humans
Can analyse differences between healthy and diseased tissues to identify genes, proteins, signalling pathways associated with disease
Accelerates target discovery but reduces likelihood that good targets will be overlooked due to the complexity of the biological data

New cards

describe how AI can be implemented in HIT identification (LI/LO) phase

Can screen vast numbers of compounds and predict which molecules are most likely to interact effectively with a therapeutic target
AI driven screening allows researchers to computationally evaluate billions of compounds
Can predict which molecules are most likely to bind to a target protein based on structure-activity relationships
Can estimate important relationships such as binding affinity, selectivity, solubility- poor candidates eliminated earlier and resources focused on promising candidates
Reduces time and cost required to identify a viable lead compound

New cards

describe how AI can be implemented in Preclinical testing phase

Used to predict how candidate drugs will behave in biological systems before extensive animal and laboratory testing
AI can predict ADME properties - used to estimate whether a compound will reach its target tissue and achieve therapeutic concentrations
Can predict potential toxicities and DDI and identify safety concerns earlier so unsuitable compounds are removed before costly animal studies are done
Can simulate how drugs interact with cells, tissues and biological pathways which decreases likelihood of late-stage failure

New cards

describe how AI can be implemented in Clinical trial optimisation phase

Being used to improve clinical trial design, patient recruitment, monitoring and data analysis
Increases trial success rates and reduces development costs
Challenges of clinical research- identifying patients who will most likely benefit from treatment
AI can analyse genetic data, biomarkers, medical records to identify patient populations more appropriate for a particular trial
Supports precision medicine by enriching clinical trials with patients who possess molecular characteristics relevant to the drugs MOA

New cards

describe an example of AI in drug repurposing

Baricitinib for COVID 19 (Benevolent AI)

Original indication- RA
MOA- JAK1/2 inhibitor that suppresses the inflammatory cytokine cascade driving the destruction of joints in RA
The role of AI
- used to identify current drug that inhibits viral entry or replication and/or suppress the inflammatory response
- used AI-driven knowledge graph which contained biomedical data on drugs, genes, proteins alongside mechanisms, processes and pathways of already approved drugs with anti-inflammatory activity
- KG identified AAK1 (AP2-associated protein kinase 1) as a key regulator of clathrin-mediated endocytosis- used by the SARS-CoV-2 virus to enter cells
- Among the already FDA-approved drugs that inhibit AAK1, baricitinib ranked the highest in efficacy; became key part of the COVID-19 treatment protocols
  - dual MOA- JAK1/2 +AAK1 inhibitor = anti-inflammatory and blocked viral entry
Timeline
- AI prediction to clinical trial entry ~ 3 months
- From prediction to FDA approval ~ 28 months compared to usual 12-15 yrs
- However baricitinb success was also helped by the high demand during that global pandemic; there was regulatory flexibility (emergency use authorisation)

New cards

describe the limitations of using AI in drug discovery

Data quality and availability
- AI is only as good as the data used and biological datasets are often incomplete, bias towards well-studied diseases (white EU populations) or unavailable in public domain
Black box problem
- Deep learning models produce accurate predictions without mechanistic explanations of the drugs MOA that is needed by regulatory agencies
- The internal decision-making process is difficult for humans to interpret
- Drug development requires scientifically justified reasons so without a mechanistic interpretation the data cannot be trusted
Novelty ceiling
- Generative AI can produce molecules that are synthetically inaccessible or non-producible and that puts researchers back at square 1