W114 SNP Discovery Across Buffalo Breeds

Date: Saturday, January 14, 2012
Time: 8:45 AM
Room: Pacific Salon 4-5 (2nd Floor)
Tad Sonstegard , USDA-ARS-ANRI-BFGL, Beltsville, MD
John Williams , Parco Tecnologico Padano, Lodi, Lombardy, Italy
Steven G. Schroeder , BFGL, ARS-USDA, Beltsville, MD
Jose Fernando Garcia , UNESP - Sao Paulo State Univ., Aracatuba, Brazil
Aleksey Zimin , University of Maryland, College Park, MD
Masroor Babar , University of Veterinary and Animal Sciences, Lahore, Pakistan
Daniela Iamartino , Parco Tecnologico Padano, Lodi, Italy
International Buffalo Genome Consortium , Parco Tecnologico Padano
There is a considerable interest to enhance dairy production for buffalos in Asia and rest of the world.  The importance of buffalo in helping provide economic security for developing economies by meeting growing demands for protein-based nutrition cannot be overstated.  To exploit the inherent potential of dairy buffalo, we initiated development of genome-based resources that can be used to identify important selective variation in these species and develop DNA-based tools for genetic improvement.  SNP discovery was performed using two methods:  1) k-mer hash error correction of Illumina reads during the genome assembly process of more than 100X sequence read depth generated from a Mediterranean, female water buffalo and 2) alignment of sequence contigs and Illumina paired end reads from 86 other animals representing 8 breeds and 2 subspecies of buffalo.  The expected sequence yield generated from this sampling should exceed 470X depth of genome coverage, and is composed of deep coverage (more than 12X) from 3 animals and 5X paired end coverage from 83 animals.  This strategy of sequencing also allows for detection of copy number variation between animals.  The results of this SNP discovery will be used to identify selective sweeps, establish a baseline for shared and unique diversity, and serve as a resource for high density SNP assay development.