usegalen.com Get API Key

The Causal Inference Engine for Cancer Biology.

Compute causal effects from experimentally verified relationships. Every answer traces to its original experiment. Deterministic. Auditable. Not generated.

Peer-reviewed databases. Formal do-calculus. Provenance on every edge.

Get Your API Key →Explore the Docs

Already have an account? Sign in to Dashboard

Every fact has an address

In a world where AI generates plausible-sounding cancer claims with no way to check them, Galen returns structured data where every relationship cites a specific ChEMBL assay, cBioPortal study, or STRING interaction score. This isn’t a policy choice — it’s an architectural guarantee. The system cannot return a fact without its provenance.

Each relationship is annotated with a Pearl Causal Hierarchy layer: L1 (observed association), L2 (experimental intervention), or L3 (counterfactual). This tells you not just that a relationship exists, but how strong the evidence is behind it.

This isn’t just a technical choice. Cancer’s complexity exceeds human cognitive capacity, and existing tools treat pieces in isolation. Galen is building the first system that reasons across all of cancer biology from first principles. Learn about our approach →

How Galen is built

Not a fine-tuned LLM. Not a RAG pipeline. A purpose-built causal inference engine.

10+ databases

Verified, versioned local copies of ChEMBL, cBioPortal, STRING, UniProt, and 9 more. Not API calls — local data.

Autonomous RL agent

Runs 24/7. Decides what to research next. Discovers, validates, and promotes causal relationships.

Structural Causal Model

Formal do-calculus and counterfactual inference over experimental evidence. Not statistical correlations.

Pearl Causal Hierarchy

Every relationship graded: L1 (observed), L2 (experimental), L3 (counterfactual). Evidence quality is structural.

Learn more about the Pearl Causal Hierarchy →

Three kinds of cancer information tools

Galen is not a better AI. It’s a different category of tool — one that combines the verifiability of databases with reasoning capabilities neither databases nor LLMs can provide.

Capability	Databases PubChem, Open Targets	LLMs ChatGPT, Claude, Gemini	Galen
What it does	Stores experimental data	Generates text from training data	Computes causal inferences from experiments
"What's associated with EGFR?"	Yes	Yes (unreliably)	Yes
"What happens if you inhibit EGFR?"	No	Generates a guess	Computes from structural causal model
"What would have happened with Drug B?"	No	Generates a guess	Computes via counterfactual inference
Cites specific experiments	Yes (it IS the database)	No	Yes (provenance per relationship)
Deterministic	Yes	No	Yes

See detailed Galen vs LLM comparison

Provenance-traced evidence

Galen

Every relationship cites its original data source (e.g., ChEMBL assay, cBioPortal study, STRING interaction). You can verify any fact.

General AI (ChatGPT, Claude, Gemini)

Generated from training data. No way to trace a specific claim to its original experiment or study.

Formal causal inference

Galen

Computes genuine causal effects using do-calculus P(Y|do(X)) over a Structural Causal Model. Not statistical correlations.

General AI (ChatGPT, Claude, Gemini)

Can describe causal concepts but cannot compute causal effects. Relies on associative patterns in text.

Counterfactual reasoning

Galen

Twin-network counterfactual inference: "What would have happened if this patient had received Drug B instead of Drug A?"

General AI (ChatGPT, Claude, Gemini)

Can speculate about counterfactuals but has no formal model to compute them. Answers are plausible-sounding guesses.

Evidence grading

Galen

Every relationship is annotated with a Pearl Causal Hierarchy layer: L1 (observed), L2 (experimental), L3 (counterfactual).

General AI (ChatGPT, Claude, Gemini)

No evidence quality distinctions. A correlation mentioned in a blog post and a randomized trial result look the same.

Multi-mutation interpretation

Galen

Interprets specific mutation combinations against 7.7M+ verified relationships. Returns evidence-graded treatment options.

General AI (ChatGPT, Claude, Gemini)

Summarizes general information about individual mutations. Cannot reason about specific multi-mutation interactions.

Reproducibility

Galen

Same query always returns the same structured result. Deterministic. Auditable.

General AI (ChatGPT, Claude, Gemini)

Different answers to the same question. Non-deterministic. Cannot be audited or validated.

What you can build with Galen

Concrete problems, concrete solutions — for every team that needs verifiable cancer biology data.

Pharma & Biotech R&D

Screen drug candidates for off-target effects and resistance mechanisms before wet-lab validation.

Example

Simulate inhibiting KRAS G12C and trace every causal downstream effect — with the original experimental source for each one.

/causal/intervention

Clinical Decision Support

Interpret a patient's multi-mutation tumor profile with evidence-graded treatment options — not LLM-generated summaries.

Example

Submit BRAF V600E + TP53 R175H + CDKN2A deletion. Get structured treatment options with evidence quality grades and cited sources.

/patient/profile

Academic Research

Explore causal mechanisms in cancer biology with formal do-calculus — not keyword search over static databases.

Example

Compute P(apoptosis | do(inhibit EGFR)) with provenance for every causal edge in the derivation.

/causal/do-calculus

AI Agent Builders

Give your AI agent structured, verifiable cancer biology data instead of hoping it generates correct facts.

Example

Your agent queries verified experimental data from 10+ databases — every response includes provenance, confidence scores, and evidence grades.

/entities/{entity_id}

7 API modules, 40+ endpoints

From entity search to causal inference to patient interpretation — everything connected through a unified, provenance-traced knowledge graph.

Free

Knowledge Graph

Search, explore, and traverse 855K+ cancer biology concepts and 7.7M+ relationships.

GET/entities/{entity_id}

GET/entities/search

GET/entities/{entity_id}/relationships

+3 more endpoints

View documentation →

Free

Databases

Query integrated biomedical databases through a unified API.

GET/chembl/compound/{name}

GET/cbioportal/mutations/{gene}

GET/gtex/expression/{gene}

+3 more endpoints

View documentation →

Researcher

Causal Inference

Simulate interventions, run do-calculus queries, and explore counterfactual scenarios.

POST/causal/intervention

POST/causal/explain

POST/causal/do-calculus

+5 more endpoints

View documentation →

Researcher

Predictions

Predict drug responses, gene essentiality, and mutation effects with confidence scores.

POST/predictions/drug-response

POST/predictions/essentiality

POST/predictions/mutation-effect

+1 more endpoints

View documentation →

Free

Hypotheses

Discover knowledge gaps and explore the frontier of cancer biology.

GET/hypotheses/frontier

GET/hypotheses/knowledge-gaps

View documentation →

Pro

Patient Interpretation

Interpret multi-mutation profiles, find treatments, match clinical trials.

POST/patient/profile/interpret

POST/patient/treatments

POST/patient/resistance/explain

+2 more endpoints

View documentation →

Free

System

Health checks, system statistics, and data source information.

GET/health

GET/status

GET/stats

+2 more endpoints

View documentation →

Causal inference in a few lines of code

Simulate inhibiting a cancer target and trace every downstream effect — with the original experimental source, evidence grade, and confidence score for each one.

example.py

import requests

API_KEY = "gk_live_your_key_here"
BASE = "https://research.usegalen.com/api/v1"

# Query: What happens downstream when you inhibit EGFR?
# This uses BFS over a Structural Causal Model — not
# statistical correlations. Every effect is traceable.
causal = requests.post(
    f"{BASE}/causal/intervention",
    headers={"X-API-Key": API_KEY},
    json={
        "target": "EGFR",
        "intervention_type": "inhibit"
    }
)

# Each downstream effect includes:
# - The affected entity and mechanism
# - Effect size and confidence score
# - Pearl Causal Hierarchy layer (L1/L2/L3)
# - Original data source (e.g., "ChEMBL", "cBioPortal")
for effect in causal.json()["downstream_effects"]:
    print(f"{effect['entity']}: {effect['effect_type']}"
          f" (confidence: {effect['confidence']:.2f})")
    print(f"  Source: {effect['provenance']}"
          f" | Pearl Level: L{effect['pch_layer']}")

# Example output:
# EGFR inhibition → suppresses MAPK signaling (confidence: 0.94)
#   Source: ChEMBL bioactivity | Pearl Level: L2 (experimental)
# EGFR inhibition → reduces cell proliferation (confidence: 0.87)
#   Source: cBioPortal clinical genomics | Pearl Level: L2 (experimental)
# EGFR inhibition → may restore apoptosis via BIM (confidence: 0.71)
#   Source: SCM counterfactual validation | Pearl Level: L3 (counterfactual)

Simple, transparent pricing

Start free. Scale when you need to. No surprise charges.

Explorer

Free

1,000 requests/day

Knowledge graph queries (entities, search, relationships)
All database integrations (ChEMBL, cBioPortal, GTEx, STRING)
Hypothesis frontier & knowledge gaps
Community support

Get Free API Key

Researcher

$49/month

5,000 requests/day

Everything in Explorer
Causal inference (intervention, explain, do-calculus)
Predictions (drug response, essentiality)
Hypothesis frontier queries

Most Popular

Pro

$199/month

50,000 requests/day

Everything in Researcher
Full causal stack (counterfactual, dynamic simulation)
Patient interpretation endpoints
Bulk data download

Enterprise

From $999

Unlimited

Everything in Pro
HIPAA-eligible deployment
SSO / SAML integration
Audit logs for all API access

View full pricing details →

10+ databases, locally integrated — not scraped or proxied

Each database is a verified, versioned local copy. Results are real experimental measurements, cross-referenced and provenance-traced.

ChEMBLcBioPortalSTRINGUniProtReactomeClinVarIntOGenBioGRIDGOTCGA

+3 more databases →

Ready to build with verified evidence?

Get your free API key in 30 seconds. No credit card required. 1,000 requests per day included.

Get Your API Key →Read the Docs