Syndicated from PubMed RSS Feeds
An overview of the Hadoop/MapReduce/HBase framework and its current applications in bioinformatics.
BMC Bioinformatics. 2010;11 Suppl 12:S1
Authors: Taylor RC
Bioinformatics researchers now face the analysis of ultra-large-scale data sets, a problem that will only grow, and at an alarming rate, in the coming years. Recent developments in open source software, namely the Hadoop project and its associated software, provide a foundation for scaling to petabyte-scale data warehouses on Linux clusters and for fault-tolerant, parallelized analysis of such data using a programming style named MapReduce.
PMID: 21210976 [PubMed - indexed for MEDLINE]
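To make the MapReduce programming style named in the abstract concrete, here is a minimal single-process sketch in plain Python. It is not taken from the paper and does not use Hadoop's API: the k-mer counting task, the function names (map_read, shuffle, reduce_counts, kmer_counts), and the sample reads are all hypothetical, chosen only for illustration. On a real cluster the framework would run the map and reduce steps in parallel across nodes and handle the grouping and fault tolerance itself.

```python
from collections import defaultdict
from itertools import chain

def map_read(read, k=3):
    """Map step: emit a (k-mer, 1) pair for every k-length window in one read."""
    return [(read[i:i + k], 1) for i in range(len(read) - k + 1)]

def shuffle(pairs):
    """Shuffle step: group emitted values by key (done by the framework on a cluster)."""
    groups = defaultdict(list)
    for key, value in pairs:
        groups[key].append(value)
    return groups

def reduce_counts(key, values):
    """Reduce step: collapse all values for one key into a single total."""
    return key, sum(values)

def kmer_counts(reads, k=3):
    """Run map, shuffle, and reduce in sequence over a list of reads."""
    mapped = chain.from_iterable(map_read(r, k) for r in reads)
    return dict(reduce_counts(key, vals) for key, vals in shuffle(mapped).items())

# Hypothetical input: two short DNA reads.
reads = ["ACGTAC", "GTACGT"]
counts = kmer_counts(reads)
print(counts)
```

Because each read is mapped independently and each key is reduced independently, both phases can be distributed across machines without changing the logic, which is the property the abstract refers to as parallelized analysis.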