Welcome!
To use the personalized features of this site, please log in or register.
If you have forgotten your username or password, we can help.
|
 |
Multi-granularity Parallel Computing in a Genome-Scale Molecular Evolution Application
| |
|
Multi-granularity Parallel Computing in a Genome-Scale Molecular Evolution Application
Jesse D. Walters18, 20, Thomas B. Bair18, Terry A. Braun17, 18, 20, 21, Todd E. Scheetz17, 18, 20, 21, John P. Robinson17, 20 and Thomas L. Casavant17, 18, 19, 20
| (17) |
Center for Bioinformatics and Computational Biology, University of Iowa, Iowa City, IA 52242, USA |
| (18) |
Coordinated Laboratory for Computational Genomics, University of Iowa, Iowa City, IA 52242, USA |
| (19) |
Department of Biomedical Engineering, University of Iowa, Iowa City, IA 52242, USA |
| (20) |
Department of Electrical and Computer Engineering, University of Iowa, Iowa City, IA 52242, USA |
| (21) |
Department of Ophthalmology and Visual Sciences, University of Iowa Email:clcg@eng.uiowa.edu, Iowa City, IA 52242, USA |
Abstract
Previously [1], we reported a coarse-grained parallel computational approach to identifying rare molecular evolutionary events
often referred to as horizontal gene transfers. Very high degrees of parallelism (up to 65x speedup on 4,096 processors) were
reported, yet the overall execution time for a realistic problem size was still on the order of 12 days. With the availability
of large numbers of compute clusters, as well as genomic sequence from more than 2,000 species containing as many as 35,000
genes each, and trillions of sequence nucleotides in all, we demonstrated the computational feasibility of a method to examine
"clusters" of genes using phylogenetic tree similarity as a distance metric. A full serial solution to this problem requires
years of CPU time, yet only makes modest IPC and memory demands; thus, it is an ideal candidate for a grid computing approach
involving low-cost compute nodes. This paper now describes a multiple granularity parallelism solution that includes exploitation
of multi-core shared memory nodes to address fine-grained aspects in the tree-clustering phase of our previous deployment
of XenoCluster 1.0. In addition to benchmarking results that show up to 80% speedup efficiency on 8 CPU cores, we report on
the biological accuracy and relevance of our results compared to a reported set of known xenologs in yeast.
Fulltext Preview (Small, Large)
 References secured to subscribers.
|
|
|
|
|
|