AI in drug development

0.0(0)
Studied by 0 people
call kaiCall Kai
learnLearn
examPractice Test
spaced repetitionSpaced Repetition
heart puzzleMatch
flashcardsFlashcards
GameKnowt Play
Card Sorting

1/13

encourage image

There's no tags or description

Looks like no tags are added yet.

Last updated 8:33 PM on 6/7/26
Name
Mastery
Learn
Test
Matching
Spaced
Call with Kai

No analytics yet

Send a link to your students to track their progress

14 Terms

1
New cards

what are the problems in drug development that could be solved with the use of AI

problems

  • Expensive- 2-3 billion dollars to make a single drug

  • Time consuming 10-15 years- researchers have to search through large amounts of biological and chemical data

  • High risk- 90% of candidate drugs fail before approval

    • Failure usually due to lack of efficacy, toxicity, poor PK- which could have been predicted earlier

    • Failure at phase II waste billions

how AI can help

  • AI can address this by processing biological, chemical and clinical data to allow for earlier and more accurate predictions at each stage of the pipeline

  • This helps reduce attrition, costs, shorten development (3-8 years), improves target selections, also allows for personalised medicine, improved safety profiles

2
New cards

describe the traditional computational methods used in drug development

  • In silico screening- HTS in computer, fits molecules into protein target and scores how well they fit. Top ranked molecules progress to in vitro assay

  • De novo design- screen library of fragments that fit into the target site and modify them to see how they affect the binding at target site

  • QSAR- predict biological activity of a molecule, detailed computer modelling of ligand properties and this is correlated with activity data at target site e.g. CoMFA predicts biological activity of molecules based on 3D shapes (3D QSAR)

3
New cards

describe machine learning and how it works

  • we don’t have to have preset algorithms and ideas been tested

  • the machine sources out the information. It uses large datasets to train the algorithms to improve through experience

  • 2 types- supervised and unsupervised

4
New cards

describe the 2 types of machine learning including their use in drug development and pros/cons

supervised

  • Human input is used to train the AI what to look for

    • classification- algorithm is used to assign data set into specific categories

    • Regression- algorithm is used to understand relationship between dependent and independent variable

  • Uses- predict drug-target interactions, toxicity and efficacy

  • Pro- high accuracy and easier validation 

  • Con- requires high quality labelled datasets

unsupervised

  • ML discovers hidden patterns in data without need for human intervention, though used at the end to filter what is needed

    • clustering- grouping unlabelled data based on similarities and differences

    • Association- uses different rules to find relationships between variables

    • Dimensional reduction- used to remove noise from the data to reduce the dataset to manageable size

  • Uses- drug clustering, patient stratification, biomarker discovery

  • Pros- discover new relationships and new disease subtype

  • Con- identifies meaningless correlations

5
New cards

describe deep learning

  • allows the system to learn from data on its own and interrogate the data more

  • Uses neural networks made from layers of nodes that mirror the neural framework of our brain with the way it inputs, processes and outputs data

  • Structure-

    • Input layer- this is where all the data goes in

    • Hidden layer- do the heavy thinking, analyse patterns and learn from the data

    • Output layer- gives the final prediction

  • Pro- more powerful, allows integration of highly complex data beyond traditional statistics

6
New cards

give an example of a deep learning AI system used for drug discovery

AlphaFold

  • Used for target identification and structure prediction

  • Problem- many proteins are potential drug targets but drug design often requires knowledge of the 3D structure

  • Traditional methods such as x ray crystallography, cryo-EM,  NMR are expensive and technically difficult

  • Alpha fold uses deep learning to predict protein folding and protein structure directly from amino acid sequence

  • knowing protein structure is important as it determines ligand binding, receptor activation, enzyme activity and knowing structure allows rational drug design and identification of binding pockets

  • used in drug development to generate biological knowledge that was previously inaccessible

7
New cards

give an example of an AI-driven drug

Insilico Medicine (INS018_055)

  • Disease- idiopathic pulmonary fibrosis

  • Drug discovery for fibrosis is difficult because multiple pathways involved and poorly understood mechanisms and high failure rate

  • Machine learning analysed biological datasets and identified novel pathways involved in fibrosis

  • Generative AI designed candidate drugs rather than screening millions of existing compounds

  • Candidate drug progressed from target discovery to preclinical candidate in approx 18 months compared to 4-6 years

8
New cards

name the 4 phases in drug development pipeline where AI can be implemented

target identification

Hit identification/LI/LO

preclinical testing

clinical trial optimisation

9
New cards

describe how AI can be implemented in target identification phase

  • Used to analyse large scale biological datasets to identify novel therapeutic targets involved in disease pathology

  • AI can identify patterns, correlations and disease-associated molecular signatures that may not be apparent to humans

  • Can analyse differences between healthy and diseased tissues to identify genes, proteins, signalling pathways associated with disease

  • Accelerates target discovery but reduces likelihood that good targets will be overlooked due to the complexity of the biological data

10
New cards

describe how AI can be implemented in HIT identification (LI/LO) phase

  • Can screen vast numbers of compounds and predict which molecules are most likely to interact effectively with a therapeutic target

  • AI driven screening allows researchers to computationally evaluate billions of compounds

  • Can predict which molecules are most likely to bind to a target protein based on structure-activity relationships

  • Can estimate important relationships such as binding affinity, selectivity, solubility- poor candidates eliminated earlier and resources focused on promising candidates

  • Reduces time and cost required to identify a viable lead compound

11
New cards

describe how AI can be implemented in Preclinical testing phase

  • Used to predict how candidate drugs will behave in biological systems before extensive animal and laboratory testing

  • AI can predict ADME properties - used to estimate whether a compound will reach its target tissue and achieve therapeutic concentrations

  • Can predict potential toxicities and DDI and identify safety concerns earlier so unsuitable compounds are removed before costly animal studies are done

  • Can simulate how drugs interact with cells, tissues and biological pathways which decreases likelihood of late-stage failure

12
New cards

describe how AI can be implemented in Clinical trial optimisation phase

  • Being used to improve clinical trial design, patient recruitment, monitoring and data analysis

  • Increases trial success rates and reduces development costs

  • Challenges of clinical research- identifying patients who will most likely benefit from treatment

  • AI can analyse genetic data, biomarkers, medical records to identify patient populations more appropriate for a particular trial

  • Supports precision medicine by enriching clinical trials with patients who possess molecular characteristics relevant to the drugs MOA

13
New cards

describe an example of AI in drug repurposing

Baricitinib for COVID 19 (Benevolent AI)

  • Original indication- RA

  • MOA- JAK1/2 inhibitor that suppresses the inflammatory cytokine cascade driving the destruction of joints in RA

  • The role of AI

    • used to identify current drug that inhibits viral entry or replication and/or suppress the inflammatory response

    • used AI-driven knowledge graph which contained biomedical data on drugs, genes, proteins alongside mechanisms, processes and pathways of already approved drugs with anti-inflammatory activity

    • KG identified AAK1 (AP2-associated protein kinase 1) as a key regulator of clathrin-mediated endocytosis- used by the SARS-CoV-2 virus to enter cells

    • Among the already FDA-approved drugs that inhibit AAK1, baricitinib ranked the highest in efficacy; became key part of the COVID-19 treatment protocols

      • dual MOA- JAK1/2 +AAK1 inhibitor = anti-inflammatory and blocked viral entry

  • Timeline

    • AI prediction to clinical trial entry ~ 3 months

    • From prediction to FDA approval ~ 28 months compared to usual 12-15 yrs

    • However baricitinb success was also helped by the high demand during that global pandemic; there was regulatory flexibility (emergency use authorisation)

14
New cards

describe the limitations of using AI in drug discovery

  • Data quality and availability

    • AI is only as good as the data used and biological datasets are often incomplete, bias towards well-studied diseases (white EU populations) or unavailable in public domain

  • Black box problem

    • Deep learning models produce accurate predictions without mechanistic explanations of the drugs MOA that is needed by regulatory agencies

    • The internal decision-making process is difficult for humans to interpret

    • Drug development requires scientifically justified reasons so without a mechanistic interpretation the data cannot be trusted

  • Novelty ceiling

    • Generative AI can produce molecules that are synthetically inaccessible or non-producible and that puts researchers back at square 1