Illumina and D3b Launch 100 K Genome Effort to Advance Pediatric Cancer and Rare Disease Research
Key Takeaways
- Illumina and D3b launched a data partnership to advance pediatric cancer and rare disease research.
- D3b is analyzing 100,000 pediatric whole genomes, creating one of the largest unified datasets in this field.
- Using DRAGEN v4.4 and Illumina Connected Analytics, the initiative is designed to enable cross-cohort analysis, while early findings suggest it can identify structural variants missed by earlier tools.
- Data will support precision medicine and broader sharing across more than 200 U.S. pediatric institutions.
Illumina and the Center for Data-Driven Discovery in Biomedicine, D3b, have announced a new data partnership aimed at accelerating research in pediatric cancer and rare disease, two areas where fragmented data and delayed diagnosis continue to limit progress.
At the center of the collaboration is the analysis of 100,000 whole genomes from pediatric patients, making it one of the largest unified genomic datasets assembled for this population. By combining cloud-based infrastructure with scalable software, the initiative is designed to let researchers analyze pediatric genomic data within a single environment, support real-time comparisons across cohorts, and speed the path from discovery to clinical application.
The project brings together data from children with rare congenital conditions and cancers, including samples collected through federally funded programs such as the Gabriella Miller Kids First Data Resource Center and the Children's Brain Tumor Network. The resulting dataset will be made available through the Kids First DRC, with the goal of helping researchers and clinicians better understand disease biology and improve the use of precision medicine in pediatric care.
"Genomic datasets like these give researchers powerful insight for precision medicine," said James Han, vice president of Bioinformatics at Illumina. “Through advances in data, software, and AI, we are moving toward a future where genomic insights drive faster research breakthroughs.”
This matters because congenital conditions and cancer remain among the leading causes of death in children, yet the data needed to guide diagnosis and treatment are often scattered across institutions. A shared genomic ecosystem could help clinicians compare similar cases, validate diagnoses, identify meaningful biological patterns, and draw on prior patient experiences when making treatment decisions.
The genomic data are being analyzed using DRAGEN v4.4 and Illumina Connected Analytics, tools intended to provide the speed, accuracy, and scale needed for cross-sample analysis.
According to the companies, findings from the initial cohort already suggest that DRAGEN can correct known errors and detect structural variants that earlier bioinformatics approaches missed. That could be particularly important in rare disease and pediatric oncology, where hard-to-detect variants may shape both diagnosis and therapeutic strategy.
"We're excited to apply leading-edge software technology against some of the toughest challenges in pediatric cancer and congenital conditions," said Allison Heath, director of Data Technology and Innovation at D3b. “Our goal is to empower researchers to uncover new biological signals and to bring genomic insights into routine clinical decision-making, creating a new standard of care.”
Beyond the 100,000-genome initiative, Illumina is also joining D3b in supporting the Pediatric Care eXpansion, PCX, program of ARPA-H. That effort seeks to reduce the time to diagnosis and care from months to weeks by expanding data sharing across a national exchange involving more than 200 pediatric institutions in the United States.
The broader aim is to give clinicians faster access to previously disconnected health data and enable more personalized, evidence-informed care based on real-world patient trajectories across institutions.





Comments
No Comments Yet!