NSF PROJECT PAGE

III-CXT: Collaborative Research: A High-Throughput Approach to the Assignment of Orthologous Genes Based on Genome Rearrangement

Introduction: Orthologous genes, or orthologs, are genes in different species that have evolved directly from a common ancestral gene. Genome-scale assignment of orthologs is a fundamental and challenging problem in computational biology, and has a wide range of applications in comparative genomics and functional genomics. This project continues the development of the parsimony approach for assigning orthologs between closely related genomes which essentially attempts to transform one genome into another by the smallest number of genome rearrangement events including reversal, translocation, fusion, and fission, as well as gene duplication events. The project addresses three key algorithmic problems including (i) signed reversal distance with duplicates, (ii) signed transposition distance with duplicates, and (iii) minimum common string partition. Efficient solutions to each of these problems are combined and incorporated into a software system for ortholog assignment, called MSOAR . The project encompasses genome-wide analysis of orthologous (and paralogous) relationships on the human and mouse genomes to valdiate the approach, and more importantly, to address several important evolutionary biological questions including the characterization of gains and losses of duplicated genes in the two genomes, the elucidation of gene movements in one genome with respect to the other genome, and the quantification of different mechanisms of gene duplication.

Principal Investigators:

Tao Jiang (PI) & the UCR project website
Liqing Zhang (co-PI)

Graduate Students and Postdoc funded by the project:

Mark Lawson
Deng Pan

Relevant Publications:

  • M. Lawson, L. Heath, N. Ramakrishnam, and L.Q.Zhang. Using Cost-Sensitive Learning to Determine Gene Conversions. Advanced Intelligent Computing Technology and Applications ICIC2008.
  • Deng Pan and Liqing Zhang. Burst of young retrogenes and independent retrogene formation in mammals. submitted.
  • Deng Pan and Liqing Zhang. Tandemly Arrayed Genes in Vertebrate Genomes. minor revision.
  • Valia Shoja, T. M. Murali, and Liqing Zhang. Expression Divergence of Tandemly Arrayed Genes in Human and Mouse. Comparative and Functional Genomics. v2007, 2007.
  • D. Pan and L.Q. Zhang. Quantifying the major mechanisms of recent gene duplications in the human and mouse genomes: a novel strategy to estimate gene duplication rates. Genome Biology 2007, 8:R158.
  • Funding Sources:

    This project is funded by an NSF grant IIS-0710945 for the period of Sept. 15, 2007 - August 31, 2010. A collaborative grant was simultaneously awarded to the PI Prof. Tao Jiang for the same period.