Welcome!
To use the personalized features of this site, please log in or register.
If you have forgotten your username or password, we can help.
My Menu
Saved Items

Multi-granularity Parallel Computing in a Genome-Scale Molecular Evolution Application

Jesse D. Walters18, 20, Thomas B. Bair18, Terry A. Braun17, 18, 20, 21, Todd E. Scheetz17, 18, 20, 21, John P. Robinson17, 20 and Thomas L. Casavant17, 18, 19, 20

(17)  Center for Bioinformatics and Computational Biology, University of Iowa, Iowa City, IA 52242, USA
(18)  Coordinated Laboratory for Computational Genomics, University of Iowa, Iowa City, IA 52242, USA
(19)  Department of Biomedical Engineering, University of Iowa, Iowa City, IA 52242, USA
(20)  Department of Electrical and Computer Engineering, University of Iowa, Iowa City, IA 52242, USA
(21)  Department of Ophthalmology and Visual Sciences, University of Iowa Email:clcg@eng.uiowa.edu, Iowa City, IA 52242, USA
Abstract
Previously [1], we reported a coarse-grained parallel computational approach to identifying rare molecular evolutionary events often referred to as horizontal gene transfers. Very high degrees of parallelism (up to 65x speedup on 4,096 processors) were reported, yet the overall execution time for a realistic problem size was still on the order of 12 days. With the availability of large numbers of compute clusters, as well as genomic sequence from more than 2,000 species containing as many as 35,000 genes each, and trillions of sequence nucleotides in all, we demonstrated the computational feasibility of a method to examine "clusters" of genes using phylogenetic tree similarity as a distance metric. A full serial solution to this problem requires years of CPU time, yet only makes modest IPC and memory demands; thus, it is an ideal candidate for a grid computing approach involving low-cost compute nodes. This paper now describes a multiple granularity parallelism solution that includes exploitation of multi-core shared memory nodes to address fine-grained aspects in the tree-clustering phase of our previous deployment of XenoCluster 1.0. In addition to benchmarking results that show up to 80% speedup efficiency on 8 CPU cores, we report on the biological accuracy and relevance of our results compared to a reported set of known xenologs in yeast.

Fulltext Preview (Small, Large)
Image of the first page of the fulltext

References secured to subscribers.



Export this chapter
Export this chapter as RIS | Text
 
Remote Address: 38.107.191.113 • Server: mpweb15
HTTP User Agent: CCBot/1.0 (+http://www.commoncrawl.org/bot.html)