Advanced Machine Learning for Drug Discovery

5

(78 reviews)

COHORT-BASED COURSE

NEXT COHORT STARTS IN

Advanced Machine Learning for Drug Discovery
Self-paced
NEXT COHORT

Nov 28-Dec 27, 2026

DURATION
SELF-PACED (SEE BELOW)
LEVEL

Advanced

HOSTED BY
Pankaj Mishra, PhD

Pankaj Mishra, PhD

Industrial Molecular AI Builder, Co-founder and CTO at Future Therapeutics, Co-founder of Neovarsity

Most Complete

Unlock Lifetime Access with a One-Time Payment

€480.00
Billed once

Lifetime access to lectures, recordings, and community from day one

Perfect for professionals and employer-supported learners seeking instant, complete access

Don't Miss Out!
Be the first to know when new cohorts open

About the Course

SELF-PACED | ONLINE | GO FROM BEGINNER TO ADVANCED LEVEL

🚨 NOTE 🚨: This course is offered only in self-paced mode. To access the course, enroll using the Enroll Now button and you will immediately get access to all course materials in your dashboard. There are no live cohort sessions. Please ignore any live start dates shown on the website. Support is available via Slack and email.

This is an end-to-end, hands-on machine learning program built specifically for molecular drug discovery. You will learn how to build real predictive models on chemical and bioactivity datasets, handle chemical data bias, validate models correctly, and deliver results that hold up under real-world constraints.

The curriculum is designed for professionals and advanced researchers who want practical ML capability in drug discovery, not generic ML theory. You will work with real molecular representations, partitioning strategies that prevent leakage, and model evaluation methods that reflect how discovery teams actually use ML.

This is not an “AI overview” course and it is not a collection of toy notebooks. You will implement full workflows from data collection to model deployment-ready evaluation, including interpretability and explainability methods that are critical when ML outputs drive scientific decisions.

WHO THIS IS FOR

This course is for you if you want to apply machine learning to real drug discovery problems, using real molecular datasets and workflows, not toy examples.

You are a strong fit if you are:

  • Medicinal or computational chemist transitioning into molecular machine learning
  • Cheminformatics or drug discovery scientist building predictive models
  • ML engineer working in life sciences who wants domain correct modeling workflows
  • PhD student or postdoc who wants industry-grade modeling skills in molecular property prediction

This course is also the right next step if your goal is generative AI for molecules, because it builds the foundations that determine whether your training data, objectives, and evaluation are even valid. Generative modeling without this is guessing with confidence.

WHAT YOU WILL BE ABLE TO DO

By the end of the program, you will be able to build end-to-end molecular ML pipelines that are reliable, defensible, and usable for drug discovery decision making.

You will be able to:

  • Collect and structure molecular drug discovery datasets from real sources
  • Run exploratory bioactivity data analysis and molecular visualization
  • Represent molecules using practical molecular representations for ML workflows
  • Build robust train-test splits using chemical clustering and scaffold-based partitioning to avoid leakage
  • Train and tune classical and advanced ML models for molecular prediction tasks, including bias-aware workflows
  • Handle chemical data bias using methods that reduce false confidence and improve generalization
  • Interpret model behavior using explainability tools like feature importance, LIME, and SHAP, so outputs can be trusted in scientific settings
  • Complete capstone-level molecular ML projects such as toxicity prediction, solubility prediction, and drug repurposing

Most importantly: you will be able to evaluate models like a real drug discovery team, instead of optimizing metrics that collapse in the real world.

Flexible learning options

  • Attend live (virtual) lectures
  • Access recorded lectures in your private dashboard

Practical application

  • Apply your skills through hands-on projects
  • Engage in real-world case studies

Personalized learning experiences

  • Tailored support and guidance
  • 24x7 support by our dedicated support team

Specialized community access

  • To our Members-only Slack community
  • To our invite-only deep tech global Slack community

Syllabus Overview

  • Reading

    Working with Python Virtual Environments
  • Reading

    Molecular ML Software Installation (Windows)
  • Reading

    Molecular ML Software Installation (macOS)
  • Reading

    Molecular ML Software Installation (Linux)
  • Video

    Introduction to Machine Learning in Drug Discovery
  • Video

    Molecular ML Software Stack
  • Video

    Dataset and Literature
  • Video

    Molecular Drug Discovery Data Collection
  • Video

    Hands-on data analysis with Pandas
Meet your Instructors
Pankaj Mishra, PhD
Pankaj Mishra, PhD
Instructor
Industrial Molecular AI Builder, Co-founder and CTO at Future Therapeutics, Co-founder of Neovarsity
I’m the Co-founder and CTO of Future Therapeutics, an AI-native biotech based in Berlin, where we build proprietary AI systems for drug discovery. I hold a PhD from the University of Freiburg, specializing in small molecule AI, and I’m trained in building models for low-data simulation and modeling, the reality most discovery teams operate in. I’ve been doing this long before it became mainstream, back in 2018 I was already building deep learning systems to explore “ultra-large chemical space”. Over the past few years, my focus has been generative molecular design. I’m also a Co-founder of Neovarsity, and since 2021 I’ve taught scientists and engineers across biopharma how to apply AI in real R&D workflows, including teams from J&J, Bayer, Takeda, Novartis, and others.

Earn Certificate of Achievement

Get certified and add this credential to your LinkedIn profile, resume, or CV. Share it on social media and in your performance review

certificate

What Our Learners Say

I'm working on new projects, applying skills learned at Neovarsity
I have learned how to use cheminformatics as a crucial and powerful tool in the design of small molecules, how to calculate molecular descriptors and fingerprints from scratch related to my system’s properties, and how to use
Stefani Gamboa
Stefani Gamboa
PhD
Aix-Marseille University, France
My Neovarsity coursework gave me the expertise needed
I have been actively applying my newly acquired skills to the medicinal chemistry projects I am involved in [at Rutgers]. This involved creating targeted libraries and generating various substituents for my lead compounds. Fu
Anastasiia Tsymbal
Anastasiia Tsymbal
Research Associate
Rutgers University, New Jersey, USA
Clear and Engaging: A Recommended Course in ML for Drug Discovery
The course is very well explained and very interesting. The teacher explains the concepts very well and highlights the key points in ML for Drug Discovery. In addition, the support is really good, fast and reliable if there a
Dr. Samuël Demin
Dr. Samuël Demin
Senior Scientist
Johnson & Johnson, Antwerp, Belgium
I highly recommend this course
I had the privilege of taking the Advanced Machine Learning for Drug Discovery course on Neovarsity, and it exceeded all my expectations. The instructor's expertise and engaging teaching style made the learning experience tru
Promila Sharan, PhD
Promila Sharan, PhD
Data Scientist
CCS Haryana Agricultural University

Enroll Now

Advanced Machine Learning for Drug Discovery
For customised payments options, contact us

28 Nov, 2026

One-Time Payment
€480.00

All taxes included

  • One year complete access
  • Shareable certificate on completion
  • Career guidance from instructors
Filling Fast!
Frequently Asked Questions

This course is currently offered only in self-paced mode. There are no live sessions and no live cohort.


No. Ignore any live start date shown on the website. You can start immediately after enrolling.


Click the Enroll Now button. After enrollment, you will get immediate access to all course materials in your dashboard.


Yes. Support is available via Slack and email.


It is hands-on and workflow-driven. The syllabus emphasizes data collection, analysis, partitioning, model development, bias handling, explainability, and capstone projects.


You do not need professional ML experience, but basic familiarity helps. Strong comfort with chemical data is more important. The course itself recommends taking cheminformatics first.


No. This is predictive molecular machine learning, not generative modeling. It is the primer that makes generative AI possible to do correctly.


Yes. You will receive a certificate after completing the course requirements, including the capstones.


Feel free to reach out anytime. You can start an online chat with Catherine or email her directly at [email protected]. We’re here to help!