GALENAPI

The Causal Inference Engine for Cancer Biology.

Compute causal effects from experimentally verified relationships. Every answer traces to its original experiment. Deterministic. Auditable. Not generated.

Peer-reviewed databases. Formal do-calculus. Provenance on every edge.

Already have an account? Sign in to Dashboard

Every fact has an address

In a world where AI generates plausible-sounding cancer claims with no way to check them, Galen returns structured data where every relationship cites a specific ChEMBL assay, cBioPortal study, or STRING interaction score. This isn’t a policy choice — it’s an architectural guarantee. The system cannot return a fact without its provenance.

Each relationship is annotated with a Pearl Causal Hierarchy layer: L1 (observed association), L2 (experimental intervention), or L3 (counterfactual). This tells you not just that a relationship exists, but how strong the evidence is behind it.

This isn’t just a technical choice. Cancer’s complexity exceeds human cognitive capacity, and existing tools treat pieces in isolation. Galen is building the first system that reasons across all of cancer biology from first principles. Learn about our approach →

Three kinds of cancer information tools

Galen is not a better AI. It’s a different category of tool — one that combines the verifiability of databases with reasoning capabilities neither databases nor LLMs can provide.

CapabilityDatabases
PubChem, Open Targets
LLMs
ChatGPT, Claude, Gemini
Galen
What it doesStores experimental dataGenerates text from training dataComputes causal inferences from experiments
"What's associated with EGFR?"YesYes (unreliably)Yes
"What happens if you inhibit EGFR?"NoGenerates a guessComputes from structural causal model
"What would have happened with Drug B?"NoGenerates a guessComputes via counterfactual inference
Cites specific experimentsYes (it IS the database)NoYes (provenance per relationship)
DeterministicYesNoYes
See detailed Galen vs LLM comparison

Provenance-traced evidence

Galen

Every relationship cites its original data source (e.g., ChEMBL assay, cBioPortal study, STRING interaction). You can verify any fact.

General AI (ChatGPT, Claude, Gemini)

Generated from training data. No way to trace a specific claim to its original experiment or study.

Formal causal inference

Galen

Computes genuine causal effects using do-calculus P(Y|do(X)) over a Structural Causal Model. Not statistical correlations.

General AI (ChatGPT, Claude, Gemini)

Can describe causal concepts but cannot compute causal effects. Relies on associative patterns in text.

Counterfactual reasoning

Galen

Twin-network counterfactual inference: "What would have happened if this patient had received Drug B instead of Drug A?"

General AI (ChatGPT, Claude, Gemini)

Can speculate about counterfactuals but has no formal model to compute them. Answers are plausible-sounding guesses.

Evidence grading

Galen

Every relationship is annotated with a Pearl Causal Hierarchy layer: L1 (observed), L2 (experimental), L3 (counterfactual).

General AI (ChatGPT, Claude, Gemini)

No evidence quality distinctions. A correlation mentioned in a blog post and a randomized trial result look the same.

Multi-mutation interpretation

Galen

Interprets specific mutation combinations against 7.7M+ verified relationships. Returns evidence-graded treatment options.

General AI (ChatGPT, Claude, Gemini)

Summarizes general information about individual mutations. Cannot reason about specific multi-mutation interactions.

Reproducibility

Galen

Same query always returns the same structured result. Deterministic. Auditable.

General AI (ChatGPT, Claude, Gemini)

Different answers to the same question. Non-deterministic. Cannot be audited or validated.

What you can build with Galen

Concrete problems, concrete solutions — for every team that needs verifiable cancer biology data.

Pharma & Biotech R&D

Screen drug candidates for off-target effects and resistance mechanisms before wet-lab validation.

Example

Simulate inhibiting KRAS G12C and trace every causal downstream effect — with the original experimental source for each one.

/causal/intervention

Clinical Decision Support

Interpret a patient's multi-mutation tumor profile with evidence-graded treatment options — not LLM-generated summaries.

Example

Submit BRAF V600E + TP53 R175H + CDKN2A deletion. Get structured treatment options with evidence quality grades and cited sources.

/patient/profile

Academic Research

Explore causal mechanisms in cancer biology with formal do-calculus — not keyword search over static databases.

Example

Compute P(apoptosis | do(inhibit EGFR)) with provenance for every causal edge in the derivation.

/causal/do-calculus

AI Agent Builders

Give your AI agent structured, verifiable cancer biology data instead of hoping it generates correct facts.

Example

Your agent queries verified experimental data from 10+ databases — every response includes provenance, confidence scores, and evidence grades.

/entities/{entity_id}

7 API modules, 40+ endpoints

From entity search to causal inference to patient interpretation — everything connected through a unified, provenance-traced knowledge graph.

Free

Knowledge Graph

Search, explore, and traverse 855K+ cancer biology concepts and 7.7M+ relationships.

GET/entities/{entity_id}
GET/entities/search
GET/entities/{entity_id}/relationships

+3 more endpoints

View documentation →
Free

Databases

Query integrated biomedical databases through a unified API.

GET/chembl/compound/{name}
GET/cbioportal/mutations/{gene}
GET/gtex/expression/{gene}

+3 more endpoints

View documentation →
Researcher

Causal Inference

Simulate interventions, run do-calculus queries, and explore counterfactual scenarios.

POST/causal/intervention
POST/causal/explain
POST/causal/do-calculus

+5 more endpoints

View documentation →
Researcher

Predictions

Predict drug responses, gene essentiality, and mutation effects with confidence scores.

POST/predictions/drug-response
POST/predictions/essentiality
POST/predictions/mutation-effect

+1 more endpoints

View documentation →
Free

Hypotheses

Discover knowledge gaps and explore the frontier of cancer biology.

GET/hypotheses/frontier
GET/hypotheses/knowledge-gaps
View documentation →
Pro

Patient Interpretation

Interpret multi-mutation profiles, find treatments, match clinical trials.

POST/patient/profile/interpret
POST/patient/treatments
POST/patient/resistance/explain

+2 more endpoints

View documentation →
Free

System

Health checks, system statistics, and data source information.

GET/health
GET/status
GET/stats

+2 more endpoints

View documentation →

Causal inference in a few lines of code

Simulate inhibiting a cancer target and trace every downstream effect — with the original experimental source, evidence grade, and confidence score for each one.

import requests

API_KEY = "gk_live_your_key_here"
BASE = "https://research.usegalen.com/api/v1"

# Query: What happens downstream when you inhibit EGFR?
# This uses BFS over a Structural Causal Model — not
# statistical correlations. Every effect is traceable.
causal = requests.post(
    f"{BASE}/causal/intervention",
    headers={"X-API-Key": API_KEY},
    json={
        "target": "EGFR",
        "intervention_type": "inhibit"
    }
)

# Each downstream effect includes:
# - The affected entity and mechanism
# - Effect size and confidence score
# - Pearl Causal Hierarchy layer (L1/L2/L3)
# - Original data source (e.g., "ChEMBL", "cBioPortal")
for effect in causal.json()["downstream_effects"]:
    print(f"{effect['entity']}: {effect['effect_type']}"
          f" (confidence: {effect['confidence']:.2f})")
    print(f"  Source: {effect['provenance']}"
          f" | Pearl Level: L{effect['pch_layer']}")

# Example output:
# EGFR inhibition → suppresses MAPK signaling (confidence: 0.94)
#   Source: ChEMBL bioactivity | Pearl Level: L2 (experimental)
# EGFR inhibition → reduces cell proliferation (confidence: 0.87)
#   Source: cBioPortal clinical genomics | Pearl Level: L2 (experimental)
# EGFR inhibition → may restore apoptosis via BIM (confidence: 0.71)
#   Source: SCM counterfactual validation | Pearl Level: L3 (counterfactual)

Simple, transparent pricing

Start free. Scale when you need to. No surprise charges.

Explorer

Free

1,000 requests/day

  • Knowledge graph queries (entities, search, relationships)
  • All database integrations (ChEMBL, cBioPortal, GTEx, STRING)
  • Hypothesis frontier & knowledge gaps
  • Community support
Get Free API Key

Researcher

$49/month

5,000 requests/day

  • Everything in Explorer
  • Causal inference (intervention, explain, do-calculus)
  • Predictions (drug response, essentiality)
  • Hypothesis frontier queries
Get Started
Most Popular

Pro

$199/month

50,000 requests/day

  • Everything in Researcher
  • Full causal stack (counterfactual, dynamic simulation)
  • Patient interpretation endpoints
  • Bulk data download
Get Started

Enterprise

From $999

Unlimited

  • Everything in Pro
  • HIPAA-eligible deployment
  • SSO / SAML integration
  • Audit logs for all API access
Contact Sales

View full pricing details →

10+ databases, locally integrated — not scraped or proxied

Each database is a verified, versioned local copy. Results are real experimental measurements, cross-referenced and provenance-traced.

ChEMBLcBioPortalSTRINGUniProtReactomeClinVarIntOGenBioGRIDGOTCGA
+3 more databases →

Ready to build with verified evidence?

Get your free API key in 30 seconds. No credit card required. 1,000 requests per day included.