Cheminformatics: Tools and Applications

5

(96 reviews)

COHORT-BASED COURSE

NEXT COHORT STARTS IN

Cheminformatics: Tools and Applications
Self-paced
NEXT COHORT

Oct 17-Dec 27, 2026

DURATION
Self-paced (see below)
LEVEL

Advanced

HOSTED BY
Pankaj Mishra, PhD

Pankaj Mishra, PhD

Industrial Molecular AI Builder, Co-founder and CTO at Future Therapeutics, Co-founder of Neovarsity

Most Complete

Unlock Lifetime Access with a One-Time Payment

€480.00
Billed once

Lifetime access to lectures, recordings, and community from day one

Perfect for professionals and employer-supported learners seeking instant, complete access

Don't Miss Out!
Be the first to know when new cohorts open

About the Course

SELF-PACED | ONLINE | GO FROM BEGINNER TO ADVANCED

🚨 NOTE 🚨: This course is offered only in self-paced mode. To access the course, enroll using the Enroll Now button and you will immediately get access to all course materials in your dashboard. There are no live cohort sessions. Please ignore any live start dates shown on the website. Support is available via Slack and email.

The most practical cheminformatics program for the AI-driven era

This course teaches you how to work with molecular data correctly. You will learn how chemical structures are represented, stored, searched, compared, and transformed in real discovery pipelines. The focus is applied cheminformatics for machine learning and generative modeling. You will build the data foundation.

You will build practical workflows for molecular similarity, substructure analysis, SAR exploration, virtual screening, and library design using industry-standard tools. The goal is operational competence. By the end, you should be able to take a raw chemical dataset and turn it into something usable, analyzable, and defensible.

This course forms the technical foundation required for any serious work in molecular modeling, QSAR, or generative chemistry. It is about understanding chemical data, not producing models that hide broken assumptions.

WHO THIS IS FOR

This course is for you if you work with chemical structures and datasets and want to do it properly. That includes medicinal chemists moving into data-driven workflows, computational chemists who want stronger fundamentals, cheminformatics engineers, and PhD students or postdocs working with molecular data.

This course is also the right primer if you plan to move into molecular AI or molecule generation later, because it gives you the chemical data foundations those models depend on.

WHAT YOU WILL BE ABLE TO DO

By the end of the course, you will be able to clean and standardize real-world chemical datasets, represent molecules using appropriate fingerprints and descriptors, and perform similarity, clustering, and SAR analyses with clear reasoning.

You will be able to design and evaluate virtual screening workflows, build and analyze combinatorial libraries, perform scaffold and substructure analysis, and make technically defensible decisions about chemical space exploration.

You will not train generative models or neural networks in this course. Instead, you will gain the core cheminformatics competence required to judge whether downstream modeling, including generative AI, is even being done correctly.

Capstone Projects and Certificate

You will complete 3 hands-on capstone projects that apply the full workflow end-to-end, from chemical data preparation to analysis and decision making. After successfully completing all capstones, you will receive a course completion certificate that you can request from your dashboard.

Flexible learning options

  • Attend live (virtual) lectures
  • Access recorded lectures in your private dashboard

Practical application

  • Apply your skills through hands-on projects
  • Engage in real-world case studies

Personalized learning experiences

  • Tailored support and guidance
  • 24x7 support by our dedicated support team

Specialized community access

  • To our Members-only Slack community
  • To our invite-only deep tech global Slack community

Syllabus Overview

  • Reading

    Python Virtual Environments for Cheminformatics
  • Reading

    Installation of Cheminformatics Software on Windows
  • Reading

    Installation of Cheminformatics Software on macOS
  • Reading

    Installation of Cheminformatics Software on Linux
  • Reading

    DataWarrior Installation Guide
  • Reading

    KNIME Installation Guide
  • Quiz

    Quiz 1: Software & Installation
  • Video

    Introduction to Drawing Molecules
  • Video

    Web-based Molecular Drawing
  • Video

    Local Tools for Molecular Drawing
  • Video

    Advanced Molecular Drawing: Additional Insights
  • Quiz

    Quiz 2: Drawing Techniques
  • Video

    Introduction to Chemical File Types
  • Video

    Chemical Files: SMILES Format Basics
  • Video

    Chemical Files: Advanced SMILES
  • Video

    Chemical Files: SMARTS Format Part I
  • Video

    Chemical Files: SMARTS Format Part II
  • Video

    SMARTS Format Challenges and Limitations
  • Video

    SDF File Format Explained
  • Video

    PDB File Format Overview
  • Quiz

    Quiz 3: Chemical File Types
  • Video

    Introduction to File Conversions
  • Video

    Using PandasTools for File Conversion
  • Video

    KNIME for Chemical File Conversion
  • Quiz

    Quiz 4: File Conversion Techniques
Meet your Instructors
Pankaj Mishra, PhD
Pankaj Mishra, PhD
Instructor
Industrial Molecular AI Builder, Co-founder and CTO at Future Therapeutics, Co-founder of Neovarsity
I’m the Co-founder and CTO of Future Therapeutics, an AI-native biotech based in Berlin, where we build proprietary AI systems for drug discovery. I hold a PhD from the University of Freiburg, specializing in small molecule AI, and I’m trained in building models for low-data simulation and modeling, the reality most discovery teams operate in. I’ve been doing this long before it became mainstream, back in 2018 I was already building deep learning systems to explore “ultra-large chemical space”. Over the past few years, my focus has been generative molecular design. I’m also a Co-founder of Neovarsity, and since 2021 I’ve taught scientists and engineers across biopharma how to apply AI in real R&D workflows, including teams from J&J, Bayer, Takeda, Novartis, and others.

Earn Certificate of Achievement

Get certified and add this credential to your LinkedIn profile, resume, or CV. Share it on social media and in your performance review

certificate

Enroll Now

Cheminformatics: Tools and Applications
For customised payments options, contact us

17 Oct, 2026

One-Time Payment
€480.00

All taxes included

  • One year complete access
  • Shareable certificate on completion
  • Career guidance from instructors
Filling Fast!
Frequently Asked Questions

This course is 100% self-paced. There are no live sessions and no live cohort.


No. Ignore the live date shown on the website. The course is currently offered only in self-paced mode, so you can start immediately after enrolling.


Click the Enroll Now button. Once you enroll, you will get immediate access to all course materials in your dashboard.


Yes. You can start immediately. There is no waiting period and no fixed batch schedule.


Yes. You will have support via Slack and email even though the course is not live.


Yes. You will receive a certificate of completion after you finish the course requirements, including the capstone projects.


Yes. You will complete 3 capstone projects to apply what you learn through real workflows.


It depends on your pace. Some learners finish faster by moving full-time, others take longer while balancing work. Since it’s self-paced, you control the timeline.


Not necessarily. This course focuses on cheminformatics foundations, not deep learning or generative models. Comfort with basic Python helps a lot.


No. This course teaches cheminformatics and molecular data workflows. It’s the right foundation if you want to move into molecular AI later, but it does not cover generative modeling itself.


Start an online chat with Catherine or email [email protected]. We are here to help.