DrugBank | Machine Learning

Explore DrugBank’s
Machine Learning Solutions

DrugBank provides machine-learning ready, structured and curated datasets that allow for the exploration of different algorithms, approaches, and features. Train and evaluate your machine-learning models using our detailed, labelled datasets, or build predictive models for drug targets, side effects, toxicity, and drug-drug interactions. Our customers have seen success in building ML models for drug development and discovery.

Explore our key datasets for Machine Learning

Chemical Structures

Access chemical structures and protein sequences for pre-clinical drugs, as well as every drug approved by the FDA and Health Canada with our chemical structures dataset. DrugBank includes structures of formulations and salts, and structures of drug metabolites to ensure customers are easily able to integrate the data into their models to make meaningful progress in drug discovery. The dataset is available in multiple formats including SDF, SMILES, and InChi and protein sequences are available in FASTA and include UniProt and Genbank identifiers.

Pharmacology

Customers use our machine-readable pharmacology dataset for building similarity-based predictors, training predictive models, and developing intelligent drug development solutions. Our pharmacology dataset includes detailed descriptions of the mechanism of action, metabolism, absorption, distribution, elimination and pharmacokinetic and pharmacodynamic parameters such as half-life, clearance, and LD50.

Pharmacogenomics

The pharmacogenomics dataset includes data on SNP mediated adverse drug reactions and SNP mediated pharmacological effects, including a description of the effect, affected drugs, references, SNP IDs, and allele name, gene identifier and affected genotype and coverage of predicted markers for some pre-clinical drugs. In addition, the Structured Indication Dataset provides detailed information on genetic variants that are part of the approved indication. This is useful for customers looking to create efficiencies within their drug repurposing ML models.

Drug-Protein Relationships

Adverse Effects

Our customers use the adverse effects dataset to build predictive models in their drug discovery solutions. This dataset includes more than 110,000 adverse effects linked to drugs, clinical trial data, drug labels, and post-market reporting, and also include incidence rates when available. Each listing includes the names and synonyms of the condition, and associated ICD10, MedDRA and SNOMED-CT identifiers to facilitate data integrations.

Indications

DrugBank offers an indication dataset that covers more than 10,000 drug indications approved by Health Canada and the FDA, as well as common off-label indications. They include a text description, type of indication, references to drug labels, clinical guidelines and scientific literature. Each condition is associated with ICD10, MedDRA and SNOMED-CT identifiers to facilitate data integrations, making it easy for our customers to build intelligent models.

Data Integration Identifiers

DrugBank makes data integrations easy by providing extensive synonyms, external identifiers, formulations, salt forms and chemical structures. Our customers use these to facilitate data integration and cross-mapping with other datasets. External mappings include MedDRA, ICD-10, SNOMED-CT, Uniprot, PDB, UNII, CAS, InChI, InChIKey, NDC, NDA, EMA and ATC codes.

Drug Metabolism

The drug metabolism dataset provides structured descriptions of every step in the metabolism of a drug including the enzymes involved and the chemical structures of every metabolite.

Healx integrates DrugBank into their internal databases, empowering them to use a wide range of data to train their drug repurposing algorithms. By using DrugBank datasets, Healx is able to lower the time and cost of their R&D and get repurposed drugs to market sooner.

Read the customer story

We’ve been very happy with the DrugBank data and service. The data is well structured and DrugBank is always very responsive to requests.

Richard Smith

Senior Software Developer, Healx

Molecular Health offers software solutions for evidence-based healthcare decision support and smarter drug development. They integrate DrugBank data seamlessly into in-house products to enhance outcomes for data-driven decision making.

Read the customer story

We value DrugBank as a well-established, comprehensive and constantly improving drug database.

Explore DrugBank’s
Machine Learning Solutions

Use DrugBank Data to

Train your machine
learning models

Enhance your
data pool

Build predictive
models

Explore our key datasets for Machine Learning

Chemical Structures

Pharmacology

Pharmacogenomics

Drug-Protein Relationships

Adverse Effects

Indications

Data Integration Identifiers

Drug Metabolism

Healx integrates DrugBank into their internal databases, empowering them to use a wide range of data to train their drug repurposing algorithms. By using DrugBank datasets, Healx is able to lower the time and cost of their R&D and get repurposed drugs to market sooner.

Read the customer story

Molecular Health offers software solutions for evidence-based healthcare decision support and smarter drug development. They integrate DrugBank data seamlessly into in-house products to enhance outcomes for data-driven decision making.

Read the customer story

Learn more about our
Machine Learning Solutions

Our products and services can be tailored to your company’s needs. Contact us today to talk about which solution is right for you.

Drug Datasets

Structured drug data for data science & ML

Drug Datasets

Free drug data for students & profs

Clinical API

Clinical intelligence tool for your software

Drug Search

Customizable drug search options

Drug-Drug Interaction Checker

Search for drug interactions with our API

Drug Allergy

Get drug allergy and cross-sensitivities info

US Drug Labels

Integrate drug manufacturer information

Drug Discovery & Repurposing

Speed up & uncover bigger discoveries with ML

In Silico Testing

Validate targets quickly & accurately

Precision Medicine

Build evidence-based tailored treatment plans

Clinical Trial Matching

Match patients to emerging trials faster

Telehealth

Power your remote care software

Electronic Medical Records

Empower providers with reliable drug information

Help Center

Got questions? We've got answers

Blog

Front-page news and deep-dive content

Customer Stories

Learn how our customers are changing the world

Books & Webinars

Check out hours of guides, books, and more

Publications Directory

Growing list of DrugBank-cited publications

Explore DrugBank’s Machine Learning Solutions

Use DrugBank Data to

Train your machinelearning models

Enhance yourdata pool

Build predictivemodels

Explore our key datasets for Machine Learning

Chemical Structures

Pharmacology

Pharmacogenomics

Drug-Protein Relationships

Adverse Effects

Indications

Data Integration Identifiers

Drug Metabolism

Healx integrates DrugBank into their internal databases, empowering them to use a wide range of data to train their drug repurposing algorithms. By using DrugBank datasets, Healx is able to lower the time and cost of their R&D and get repurposed drugs to market sooner.

Read the customer story

Molecular Health offers software solutions for evidence-based healthcare decision support and smarter drug development. They integrate DrugBank data seamlessly into in-house products to enhance outcomes for data-driven decision making.

Read the customer story

Learn more about ourMachine Learning Solutions

Our products and services can be tailored to your company’s needs. Contact us today to talk about which solution is right for you.

Explore DrugBank’s
Machine Learning Solutions

Train your machine
learning models

Enhance your
data pool

Build predictive
models

Learn more about our
Machine Learning Solutions