arrow_back HD Research Hub

HD + AI Landscape

Who's applying AI to Huntington's Disease, what approaches exist, and what makes this hard.

groups Who's Working on AI + HD

Who What They Do AI Type Stage Open?
Isomorphic LabsDeepMind spinoff. IsoDDE drug design engine. First AI-designed cancer drug entering Phase 1Protein-ligand predictionPhase 1No
Aitia + UCBCausal AI digital twins on CHDI TRACK-HD data (30K patients). Novel HD target identifiedCausal AIActiveNo
NovartisGenerative AI designed 15M candidate compounds for HD, synthesized 60Generative chemistryOptimizationNo
HealxAI drug repurposing for rare diseases. Clinical trials in 24 months vs 10-15 yearsDrug repurposingActiveNo
Insilico MedicineEnd-to-end AI drug design. Rentosertib (first AI drug) validated in Nature MedicineGenerative biologyClinicalNo
UnlearnDigital twins from 30K Enroll-HD patientsML forecastingActiveNo
SOM BiotechAI-discovered bevantolol (SOM3355) for choreaDrug repurposingPhase IIbNo
NVIDIA BioNeMoOpen platform for drug discovery. Clara models, $1B co-lab with Eli LillyFoundation modelsPlatformOpen platform
IBM + CHDIML model of 9 HD disease statesProgression modelingPublishedPartially
BDASeq (academic)AI transcriptomics, 394 druggable HD genesTranscriptomic AIPublishedPaper only
TxGNN (Harvard)Foundation model for drug repurposing. 17K diseases, 8K therapeutics. Zero-shotGraph neural networkPublishedYes (MIT)
TxGemma (Google)Open Gemma models for drug discovery. Text + molecular structuresMultimodal LLMAvailableYes
AlphaFold (DeepMind)HTT protein structure predictedProtein foldingAvailableYes
OpenMed AI475+ biomedical NLP models. Disease, chemical, genomic entity extraction. 29.7M downloadsBiomedical NERActiveYes (Apache 2.0)
HuggingFace ML InternOpen-source autonomous ML agent. Reads papers, trains models, ships to HF Hub. Used for HD drug target research.Agentic MLActiveYes (MIT)
OpenAI HealthGPT-5.2 for clinical care. Deployed at MSK, Stanford Medicine, UCSFClinical LLMDeployedNo
HD Research Hub (us)Autonomous agents, drug repurposing, full-text RAG, somatic expansion drug screenAgentic AI + RAGLiveYes (MIT)

What we bring to the table

  • Open-source autonomous research agents for HD (to our limited understanding)
  • Full-text knowledge base with semantic search and multilingual chatbot
  • Auto-refreshing research dashboard combining PubMed + ClinicalTrials.gov + HDBuzz
  • Published AI experiments with full methodology, limitations, and raw data
  • Somatic CAG expansion drug screen across 5 validated targets (MSH3, FAN1, PMS1, MLH1, LIG1)

Most players above keep their models, data, and findings proprietary. We publish everything: code, prompts, raw results, and limitations. That's the point.

Open-source tools available for HD research

These are tools anyone can use today. Part of our mission is highlighting what's accessible.

TxGNN

Harvard. Drug repurposing foundation model. 17K diseases, zero-shot predictions. GitHub

TxGemma

Google. Open Gemma models for drug discovery. Text + molecular + protein. Docs

AlphaFold DB

DeepMind. 200M+ protein structures including HTT. Database

NVIDIA BioNeMo

Open platform for biological AI. Clara models, drug design. Platform

OpenMed AI

475+ biomedical NLP models. Entity extraction for diseases, chemicals, genes. Apache 2.0. Site | HuggingFace

ML Intern

Autonomous ML agent from HuggingFace. Reads papers, traverses citations, trains models. Used for HD somatic expansion research. Site | GitHub

warning Why This Is Hard

lock

Blood-Brain Barrier

Most drugs can't reach the brain. Gene therapies require brain surgery. Small molecules need to cross the BBB. This limits which AI-generated candidates are actually viable.

hourglass_empty

15-19 Year Timeline

CNS drugs take 15-19 years from discovery to approval on average. Longer than any other drug class. AI can accelerate parts of this, but clinical trials still take years.

troubleshoot

98% Failure Rate

For neurodegenerative disease clinical trials, the attrition rate is ~98%. Most candidates that look promising in the lab fail in humans. AI can help identify failures earlier, but can't eliminate them.

dataset

Limited Data

HD is a rare disease (~30K patients globally in studies). AI models need data to learn from. Enroll-HD (30K participants) is the largest dataset, but it's tiny compared to what deep learning typically needs.

psychology

Disease Complexity

HD involves protein aggregation, transcriptional dysregulation, mitochondrial dysfunction, neuroinflammation, and somatic DNA expansion. No single target addresses all mechanisms. AI must reason across all of them.

visibility_off

AI Hallucination Risk

LLMs can generate plausible-sounding hypotheses that are scientifically wrong. Our Experiment #1 suggested Riluzole without knowing it was already tested. Every AI output needs expert review.

gavel

Regulatory Uncertainty

The FDA hasn't established clear guidelines for AI-generated drug candidates. AMT-130 (the most promising HD therapy) is stuck in regulatory debate despite 75% disease slowing in trials.

money_off

Commercial Incentive

HD affects ~30K people in the US. Pharma companies prioritize diseases with larger patient populations. This is why open-source and non-commercial research matters.

auto_awesome Where AI Can Make a Difference

Literature Synthesis

Thousands of HD papers published yearly. AI can read them all, find cross-paper patterns, surface connections humans miss. This is what our agents do.

We're doing this now.

Drug Repurposing

Screen thousands of FDA-approved drugs against HD targets. Faster path because safety is already proven. AI can generate and score hypotheses at scale.

We're doing this now.

Clinical Trial Design

Digital twins (Unlearn) can simulate patient controls. AI can identify optimal endpoints, predict enrollment challenges, and stratify patient populations.

Others are leading this.

Molecular Design

Generative AI designing brain-penetrant molecules (Novartis). AlphaFold predicting protein structures. Computational chemistry at a scale impossible manually.

Requires wet-lab validation.

Biomarker Discovery

ML on patient data to find early indicators of disease onset and progression. NfL is established. AI can find new ones from multi-omics data.

Active research area.

Accessibility

Making research understandable to patients, families, and non-specialists. Multilingual chatbots, plain-language summaries, curated learning paths.

We're doing this now.

Back to HD Research Hub