Data Sources
Galen integrates 10+ biomedical databases into a unified knowledge graph. Each database contributes specific types of evidence — from drug bioactivity to clinical genomics to protein interactions.
ChEMBL
Drug activityBioactivity measurements for 2.4M+ compounds
cBioPortal
Clinical genomicsSomatic mutations across 498 cancer studies
STRING
Protein interactionsProtein-protein interaction networks (11M+ interactions)
UniProt
Protein functionUniversal protein function annotations
Reactome
Biological pathwaysCurated biological pathway database (2,600+ pathways)
ClinVar
Genomic variantsClinical significance of genetic variants
IntOGen
Cancer driver genesCancer driver gene predictions across 28 tumor types
BioGRID
Genetic interactionsGenetic and physical interactions including synthetic lethality
GO
Molecular functionGene Ontology functional annotations
TCGA
Cancer genomicsThe Cancer Genome Atlas multi-omic data
PubMed
Biomedical literatureBiomedical literature mining (34.7M+ abstracts)
GTEx
Gene expressionNormal tissue gene expression across 54 tissues
ClinicalTrials.gov
Clinical trialsActive interventional trials matched by mutation, cancer type, and eligibility criteria.