
DNA methylation signatures predict cytogenetic subtype and outcome in pediatric acute myeloid leukemia (AML)

posted on 2021-06-17, 10:20 authored by Olga Krali, Josefine Palle, Christofer Bäcklin, Jonas Abrahamsson, Ulrika Norén-Nyström, Henrik Hasle, Kirsi Jahnukainen, Ólafur Gísli Jónsson, Randi Hovland, Birgitte Lausen, Rolf Larsson, Lars Palmqvist, Anna Staffas, Bernward Zeller, Jessica Nordlund

This dataset contains genome-wide DNA methylation data generated from 142 pediatric acute myeloid leukemia (AML) samples, derived from bone marrow or peripheral blood taken at AML diagnosis (N=123) or relapse (N=19). Further details regarding the samples are available in Supplementary Table S1 from Krali and Palle et al., 2021 (

Genome-wide DNA methylation was analyzed at the SNP&SEQ Technology Platform, SciLifeLab, National Genomics Infrastructure Uppsala, Sweden. 200 ng of bisulfite-converted DNA was amplified, fragmented and hybridised to the Illumina Infinium HumanMethylation450 BeadChip following the standard Illumina protocol (iScan SQ instrument).

This metadata record contains information about the raw idat files generated from the Infinium DNA methylation arrays. The raw idat files were processed with Methylation Module (1.8.5) software in Genome Studio (V2010.3). The Methylprep Python library was used to generate and normalize the beta-value matrix (
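
As a rough illustration of what a beta value is, the sketch below computes it from the methylated and unmethylated signal intensities of a probe using the commonly cited Illumina-style formula, beta = M / (M + U + offset). The function names `beta_value` and `beta_matrix`, the default offset of 100, and the clipping of negative intensities are assumptions made for this example; it does not reproduce the normalization performed by the Methylprep library and is not the authors' pipeline.

```python
def beta_value(methylated, unmethylated, offset=100.0):
    """Return the beta value of one probe from its two signal intensities.

    Hypothetical helper: the offset of 100 is the commonly used default,
    assumed here rather than taken from the dataset description.
    """
    # Clip negative intensities (possible after background correction) to zero.
    m = max(float(methylated), 0.0)
    u = max(float(unmethylated), 0.0)
    # The offset stabilises the ratio when both intensities are low.
    return m / (m + u + offset)


def beta_matrix(meth_rows, unmeth_rows, offset=100.0):
    """Apply beta_value element-wise to two equally shaped nested lists
    (rows = CpG probes, columns = samples)."""
    return [
        [beta_value(m, u, offset) for m, u in zip(meth_row, unmeth_row)]
        for meth_row, unmeth_row in zip(meth_rows, unmeth_rows)
    ]


if __name__ == "__main__":
    # Two probes measured in two samples (made-up intensities).
    meth = [[900, 450], [0, 100]]
    unmeth = [[0, 450], [900, 300]]
    for row in beta_matrix(meth, unmeth):
        print(row)
```

Values close to 1 indicate a mostly methylated CpG site and values close to 0 a mostly unmethylated one; the offset keeps the ratio stable for probes with weak signal.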

The raw idat files, along with a sample sheet, the processed beta-value matrix, an annotation file for the CpG sites, and the signal intensity matrix, will be made available upon request. Limited phenotype information is available in Supplementary Table S1 of the manuscript. All scripts, covering the workflow from preprocessing of the raw idat files through to the machine learning modelling, can be found in the following GitHub repository:

Terms for access

The DNA methylation dataset is only to be used for research that is seeking to advance the understanding of the influence of epigenetic factors on leukemia etiology and biology.

The data should not be used for other purposes, e.g. investigating epigenetic signatures that may lead to the identification of a person.

To retrieve the data used in this publication, please contact


The Swedish Childhood Cancer Foundation (#TJ2011-0021, #PR2019-0046)

The Swedish Research Council (#2019-01976)

The Göran Gustafssons Foundation (#2011)



Uppsala University
