Utilizing metagenomic data for the study of genes and gene cassettes

Itay Sharon, Technion

Abstract:

Metagenomics is an increasingly popular method for studying microbes and viruses directly from the environment in which they live, through their DNA. Using metagenomics it is possible to access the vast majority of microbes on earth, more than 99% of microbial species according to some estimates, that cannot be studied using other methods. The data consists of a random collection of very short DNA sequences originating from microbes and viruses in the examined environment. Through computational analysis it is possible to study aspects of microbial and viral lifestyle in the examined environment such as community structure, functional capabilities and more.
In this talk I will describe computational methods developed in our lab for the study of genes and gene cassettes from metagenomic data. I will focus on our latest finding of Photosystem I gene cassettes in marine virus genomes, and also describe the application of the methods for large-scale analysis aimed to find microbial genes on viral genomes with no prior knowledge.

References: (Kagan et al., Genome Biology, 2008, Sharon et al., Nature, 2009).