Pages

Monday, 2 April 2012

Flash Report: the Amaz..ing 1000 Genomes in the cloud

Amazon and the U.S. National Institutes of Health (NIH) announced that the complete 1000 Genomes Project is being made available on Amazon Web Services (AWS) as free of charge public data set.
The project has grown to 200 terabytes of genomic data (!!!) including DNA sequenced from more than 1,700 individuals. The 1000 Genomes Project aims to include the genomes of more than 2,662 individuals from 26 populations around the world, and the NIH will continue to add the remaining genome samples to the data collection this year.
The move to put the data up on AmazonWeb Services, aims to help speed up access to the research. Previously, researchers had to download data from government data centers (either NCBI or EMBL-EBI) on their own systems.
On the AWS site you can find more info on how to access the 1000 Genomes data.

No comments: