
All CEMT genotype data (aligned bam files, variant calls, and underlying raw fastqs) are available for download via European Genome-phenome Archive (EGA) study EGAS00001000552. The data are subject to protected access and are embargoed for nine months after submission to the archive. Please consult the data access policy for details.

For offline processing of processed tracks, we recommend the JSON data hub, which provides access to the data together with annotated metadata attributes. Please consult the IHEC GitHub repository for the current description of the JSON specification and the current official metadata recommendations from the IHEC metadata working group.

All files published by CEMT follow the convention that the first prefix in the filename is the library name, which is also the unique experiment identifier used in the JSON hub. Some details can be extracted from the filename itself, but the JSON document is the preferred source of metadata.
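
As a rough illustration of the naming convention, the sketch below pulls the library name out of a filename and uses it to look up a record in a hub document. Several things here are assumptions rather than statements about the actual CEMT files: the prefix separator is taken to be a period, the hub is assumed to hold its experiments in a top-level "datasets" object keyed by experiment identifier, and the filename, identifier, and metadata values are made up. The helper names library_name and lookup_experiment are hypothetical. Check the IHEC specification for the real layout before relying on any of this.

```python
import json
from pathlib import Path


def library_name(filename, sep="."):
    """Return the first prefix of a filename, taken to be the library name.

    The separator is an assumption; adjust `sep` to match the real files.
    """
    return Path(filename).name.split(sep, 1)[0]


def lookup_experiment(hub, filename, sep="."):
    """Return the hub record for a file, or None if the library is not listed.

    Assumes the hub keeps experiments in a top-level "datasets" object
    keyed by experiment identifier.
    """
    return hub.get("datasets", {}).get(library_name(filename, sep))


# Tiny made-up hub document, for illustration only.
hub = json.loads("""
{
  "datasets": {
    "LIB0001": {
      "sample_id": "sample_1",
      "experiment_attributes": {"experiment_type": "example"}
    }
  }
}
""")

record = lookup_experiment(hub, "/data/LIB0001.example_track.bw")
print(library_name("/data/LIB0001.example_track.bw"), record)
```

Looking metadata up by library name, rather than parsing further fields out of the filename, keeps the JSON document as the single source of truth, as recommended above.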

The methods pages, specified by the analysis version below, describe the file types and workflows used:

In case any clarification is required, please email