EthSEQ: ethnicity annotation from whole exome sequencing data.

Publication Type: Journal Article
Year of Publication: 2017
Authors: Romanel A, Zhang T, Elemento O, Demichelis F
Date Published: 2017 Aug 01
Keywords: Genetics, Population, Genomics, Humans, Molecular Sequence Annotation, Population Groups, Software, Whole Exome Sequencing

Summary: Whole exome sequencing (WES) is widely used both in translational cancer genomics studies and in precision medicine. Stratifying individuals by ethnicity is fundamental for correctly interpreting the impact of personal genomic variation. We implemented EthSEQ to provide reliable and rapid ethnicity annotation from an individual's whole exome sequencing data, validated it on 1000 Genomes Project and TCGA data (2700 samples), demonstrating high precision, and assessed its computational performance against other tools. EthSEQ can be integrated into any WES-based processing pipeline and exploits multi-core capabilities.

Availability and Implementation: R package available at and CRAN repository.

Contact: or

Supplementary information: Supplementary data are available at Bioinformatics online.

Alternate Journal: Bioinformatics
PubMed ID: 28369222
PubMed Central ID: PMC5818140
Grant List: 648670 / / European Research Council / International