Access chemical structures and protein sequences for pre-clinical drugs, as well as every drug approved by the FDA and Health Canada with our chemical structures dataset. DrugBank includes structures of formulations and salts, and structures of drug metabolites to ensure customers are easily able to integrate the data into their models to make meaningful progress in drug discovery. The dataset is available in multiple formats including SDF, SMILES, and InChi and protein sequences are available in FASTA and include UniProt and Genbank identifiers.
Customers use our machine-readable pharmacology dataset for building similarity-based predictors, training predictive models, and developing intelligent drug development solutions. Our pharmacology dataset includes detailed descriptions of the mechanism of action, metabolism, absorption, distribution, elimination and pharmacokinetic and pharmacodynamic parameters such as half-life, clearance, and LD50.
The pharmacogenomics dataset includes data on SNP mediated adverse drug reactions and SNP mediated pharmacological effects, including a description of the effect, affected drugs, references, SNP IDs, and allele name, gene identifier and affected genotype and coverage of predicted markers for some pre-clinical drugs. In addition, the Structured Indication Dataset provides detailed information on genetic variants that are part of the approved indication. This is useful for customers looking to create efficiencies within their drug repurposing ML models.
Access chemical structures and protein sequences for pre-clinical drugs, as well as every drug approved by the FDA and Health Canada with our chemical structures dataset. DrugBank includes structures of formulations and salts, and structures of drug metabolites to ensure customers are easily able to integrate the data into their models to make meaningful progress in drug discovery. The dataset is available in multiple formats including SDF, SMILES, and InChi and protein sequences are available in FASTA and include UniProt and Genbank identifiers.
Our customers use the adverse effects dataset to build predictive models in their drug discovery solutions. This dataset includes more than 110,000 adverse effects linked to drugs, clinical trial data, drug labels, and post-market reporting, and also include incidence rates when available. Each listing includes the names and synonyms of the condition, and associated ICD10, MedDRA and SNOMED-CT identifiers to facilitate data integrations.
DrugBank offers an indication dataset that covers more than 10,000 drug indications approved by Health Canada and the FDA, as well as common off-label indications. They include a text description, type of indication, references to drug labels, clinical guidelines and scientific literature. Each condition is associated with ICD10, MedDRA and SNOMED-CT identifiers to facilitate data integrations, making it easy for our customers to build intelligent models.
DrugBank makes data integrations easy by providing extensive synonyms, external identifiers, formulations, salt forms and chemical structures. Our customers use these to facilitate data integration and cross-mapping with other datasets. External mappings include MedDRA, ICD-10, SNOMED-CT, Uniprot, PDB, UNII, CAS, InChI, InChIKey, NDC, NDA, EMA and ATC codes.
The drug metabolism dataset provides structured descriptions of every step in the metabolism of a drug including the enzymes involved and the chemical structures of every metabolite.
Healx integrates DrugBank into their internal databases, empowering them to use a wide range of data to train their drug repurposing algorithms. By using DrugBank datasets, Healx is able to lower the time and cost of their R&D and get repurposed drugs to market sooner.
Read the customer story