ebook img

Genetic data analysis for plant and animal breeding PDF

409 Pages·2017·25.41 MB·English
by  Holland
Save to my drive
Quick download
Download
Most books are stored in the elastic cloud where traffic is expensive. For this reason, we have a limit on daily download.

Preview Genetic data analysis for plant and animal breeding

Fikret Isik · James Holland Christian Maltecca Genetic Data Analysis for Plant and Animal Breeding Genetic Data Analysis for Plant and Animal Breeding Fikret Isik (cid:129) James Holland (cid:129) Christian Maltecca Genetic Data Analysis for Plant and Animal Breeding FikretIsik JamesHolland DepartmentofForestryand UnitedStatesDepartmentofAgriculture–Agricultural EnvironmentalResources ResearchService NorthCarolinaStateUniversity DepartmentofCropandSoilSciences Raleigh,NC,USA NorthCarolinaStateUniversity Raleigh,NC,USA ChristianMaltecca DepartmentofAnimalScience NorthCarolinaStateUniversity Raleigh,NC,USA ISBN978-3-319-55175-3 ISBN978-3-319-55177-7 (eBook) DOI10.1007/978-3-319-55177-7 LibraryofCongressControlNumber:2017944107 #SpringerInternationalPublishingAG2017 Thisworkissubjecttocopyright.AllrightsarereservedbythePublisher,whetherthewholeorpartofthematerialis concerned,specificallytherightsoftranslation,reprinting,reuseofillustrations,recitation,broadcasting,reproduction on microfilms or in any other physical way, and transmission or information storage and retrieval, electronic adaptation,computersoftware,orbysimilarordissimilarmethodologynowknownorhereafterdeveloped. Theuseofgeneraldescriptivenames,registerednames,trademarks,servicemarks,etc.inthispublicationdoesnot imply,evenintheabsenceofaspecificstatement,thatsuchnamesareexemptfromtherelevantprotectivelawsand regulationsandthereforefreeforgeneraluse. Thepublisher,theauthorsandtheeditorsaresafetoassumethattheadviceandinformationinthisbookarebelieved tobetrueandaccurateatthedateofpublication.Neitherthepublishernortheauthorsortheeditorsgiveawarranty, expressorimplied,withrespecttothematerialcontainedhereinorforanyerrorsoromissionsthatmayhavebeen made. The publisher remains neutral with regard to jurisdictional claims in published maps and institutional affiliations. Printedonacid-freepaper ThisSpringerimprintispublishedbySpringerNature TheregisteredcompanyisSpringerInternationalPublishingAG Theregisteredcompanyaddressis:Gewerbestrasse11,6330Cham,Switzerland To Onur, Paula, my parents Akbal and Sevket (Isik) To Andrea, Pablo, Lucía, and my parents (Holland) To my parents and family (Maltecca) Plant and animal illustrations: Professor Hüsnü Dokak Director of Hacettepe University Art Museum Hacettepe University, Faculty of Fine Arts Ankara, Turkey Preface We wrote this book to fill the gap between textbooks of quantitative genetic theory and software manuals that provide details on analytical methods but little context or perspective on which methods may be most appropriate for particular applications. We do not cover thebasicsofquantitativegeneticstheory;werecommendreadersbefamiliarwithtwoofthe classic introductory texts on the subject, Introduction to Quantitative Genetics, 4th Ed. by D.S. Falconer and Trudy Mackay, and Genetics and Analysis of Quantitative Traits by Michael Lynch and Bruce Walsh. We hope to apply the concepts of quantitative genetics to particularanalyticalsolutionsthatwillbeusefultoplantandanimalbreeders,focusingmainly onmethodstopredictbreedingvalues.Weattempttodemonstrateanalysesinfreelyavailable software (such as R packages) where possible, but we also include considerable attention to the commercial software ASReml because it provides so much flexibility and utility to analysis of breeding program data. Free (but time-limited) trials of ASReml are available, and “Discovery” versions of the software are freely available to public institutions in many developing countries (http://www.vsni.co.uk/free-to-use/asreml-discovery). In addition, we include some information on SAS analyses for comparison, because SAS is widely used in thebreedingcommunity. This bookiscomposed oftwomajor sections. Thefirstsection(Chaps.1,2,3,4, 5,6,7, and8)coversthetopicofclassicalphenotypicdataanalysisforpredictionofbreedingvalues inanimalandplantbreedingprograms.InChap.1,weintroduceASRemlsoftwarebecauseit is one the more popular, and we believe one of the most powerful, softwares available for analyzingdatainbreedingprogramsusingmixedmodelsanalyses.Chapter2includesabrief review of linear mixed models and compares them to ordinary least squares analyses of variance, with which some readers may be more familiar. This is followed by a general introductiontovariance-covariancestructuresusedinmixedmodels(Chap.3).Chapters4and 5 cover prediction of breeding values using sire (or general combining ability) models and animal models. Chapter 6 is about multivariate models used when breeders want to analyze multiple traits simultaneously and estimate genetic correlations among traits. Chapter 7 introduces spatial analyses for field experimental designs used in tree and crop breeding to accountforenvironmentalheterogeneitywithinenvironments.Chapter8introducesgenotype- by-environment (GE) interactions in multi-environmental trails and various variance- covariance structures to model GE and the heterogeneity of error variation among environments. Inthesecondsection(Chaps.9,10,11,and,12),weprovidetheconceptsandanoverview of available tools for using DNA markers for predictions of genetic merit in breeding populations. With recent advancements in DNA sequencing technologies, genomic data, especially single nucleotide polymorphism (SNP) markers, have become widely available for animal and plant breeding programs in recent years. Analyses of DNA markers for prediction of genetic merit are a relatively new and very active research area, with new methods and improvements of older methods being proposed and tested constantly. The algorithmsandsoftwaretoimplementthesealgorithmsarechangingaswespeak.Therefore, Sect.2intendstobeanintroductiontothetopic, touchingonsomeofthemorewidelyused ix x Preface methods and softwares currently available. Readers should be aware that the methods discussed here are likely to be modified and improved in the near future, and that new statistical packages will be introduced. We present this material, however, in the hopes of providingasolidgroundinginthebasicsofhandlinglargemarkerdatasetsandusingthemto predict breeding values. InChap. 9, we describe characteristics of typical DNA marker data sets and introduce some software tools useful for exploratory analyses (visualization, sum- mary, and data manipulation) of marker data. Chapter 10 focuses on imputation of missing genotypes. Chapter 11 covers the use of DNA markers to predict genomic relationships between individuals in breeding populations and the use of genomic best linear unbiased prediction (GBLUP) to predict breeding values even of individuals that have not been phenotyped.Chapter12reviewsthestatisticalbackgroundofmoreadvancedgenomicselec- tionmethodswithseveralexamples. Thisbookisintendedforstudentsinplantoranimalbreedingcoursesandforprofessional breedersinterestedinusingthesetoolsandapproachesintheirbreedingprograms.Weloveto hearfromusersaboutsuggestionsforimprovementsandcorrectionstothetext. We tried our best to give credit to resources we used to write this book. Apologies if we missed something; please let us know so that we can include the source in a future edition. Manyfriends,colleagues,andgraduatestudentshelpedwiththewriting,revising,andediting theoriginallecturenotesandexercises,whichturnedouttobeahugetask.Weacknowledge thecontributionsofGregDutkowski,SalvadorGezan,SteveMcKeand,Je´roˆmeBartholome, Trevor Walker, Jaime Zapata-Valenzuela, Funda Ogut, Kent Gray, YiJian Huang, Patrick Cumbie, Alfredo Farjat, Terrance Ye, Jeremy Howard, Francesco Tiezzi, Brian Cullis, Tori BatistaBrooks,AustinHeine,AprilMeeks,PaulaBarnesCardinale,OnurTroyIsik,Amanda Lee, Mohammad Nasir Shalizi, Edwin Lauer, and Miroslav Zoric for reviewing drafts of chapters and providing feedback. Thiago Marino, Jason Brewer, Heather Manching, and Randy Wisser generously provided unpublished maize data to use as an example in Chap. 11. Christophe Plomion (INRA, France) provided maritime pine data to use in Chaps. 9, 10, 11, and 12. Tree Improvement Program at NC State University provided unpublishedpineprogenytestdatatouseinseveralchapters. OurcolleagueDr.RossWhettenofNCStateUniversityeditedChaps.1,2,3,4,and5and Chap.9.Rosshelpeddevelopedthetrainingworkshopsonwhichthisbookisbased,hetested manyscripts,andprovideduswithinvaluablefeedback.WearegratefultoRoss. We are also very grateful to Hüsnü Dokak, Professor of Arts at Hacettepe University, Ankara,Turkey,forsketches/drawingsofanimalsandplantsusedinthechapters. Raleigh,NC,USA FikretIsik JamesHolland ChristianMaltecca Software Requirements Severalprogramswereusedinthisbook.Ifyoudonotalreadyhavethefollowingprograms installedonyourcomputer,werecommendyoudownloadandinstallthem. ASReml:ASRemlisapowerfultoolforanalysisoflinearmixedmodels.Downloadfrom https://www.vsni.co.uk/downloads/asreml. Make sure you obtain a license from VSN, the company that distributes the program. See program website for details. It typically takes several days to a week from requesting to receiving a license. For starters, Luis Apiolaza’s websiteaboutASRemisanexcellentsource:http://uncronopio.org/ASReml/HomePage ConTEXT: ConTEXT is a small, fast, and powerful freeware text editor for Windows, availableathttp://www.contexteditor.org/.WeusedittowriteASRemlstandalonecommand filesandexamineoutput. R: Download R from http://cran.r-project.org/, choosing the Windows, Mac, or Linux version according to the OS on your computer. All R versions are free. For the exercises, you need to install several packages (and the other packages they depend on). After installing R, start the R program from the desktop shortcut and copy-paste the following R scriptintotheRwindowtoinstalltherequiredpackages. is.installed <- function(mypkg) is.element(mypkg, installed.packages()[,1]) source("http://bioconductor.org/biocLite.R") packBIOC=list("GeneticsPed","chopsticks") for(i in 1:length(packBIOC)){ if(!is.installed(packBIOC[[i]])){biocLite(packBIOC[[i]])} cat(paste("----------",packBIOC[[i]],"----------",sep="\t"));cat("\n") } #-copy to here, paste, and let R finish before copying the rest - packCRAN=list("MASS","pedigree","rrBLUP","BLR","multicore","plyr") for(i in 1:length(packCRAN)){ mip=as.character(packCRAN[i]) if(!is.installed(packCRAN[[i]])){install.packages(mip,dependencies = T)} cat(paste("----------",packCRAN[[i]],"----------",sep=" \t"));cat("\n") } Example data sets and code scripts: All of the example codes shown in this book are available for download from this book website: https://faculty.cnr.ncsu.edu/fikretisik/ breedingbook/. We recommend keeping all of the example data sets together in a common foldersotheexamplesshowninthisbookcanberun“as-is”exceptforchangingthefilepaths tothatfolder. xi xii SoftwareRequirements Installingpackagesfromlocalsource:Weusedasetofscriptsbundledinpackageforthe genomicselectionchapter.ThesearenotloadedonCRANandaremadeavailabletoreaders. The installation process is slightly different for Mac/Unix and Windows. Please do the installationofthepackageafteryouhaverunthesmallscriptabove. MAC/Unix:PlacethepackageGSa_1.0.tar.gzonyourdesktop.Openaterminaland changedirectorytoyourdesktop(inMacthiswillbesomethinglikecd/Users/”NAME”/ DesktopwhileinUnixitwilllikelybecd/home/”NAME”/Desktop).Runthefollowing command R CMD INSTALL GSa_1.0.tar.gz. Note that you can do the same from the GUIonaMacbutthisissimpler. Windows:PlacethefileGSa_1.0.ziponyourdesktop.Ifyouhavea64bitmachineR willinstallbothversions.InRchangedirectorytoyourdesktop(youcanusethebuttonsofthe GUI to do so) then run the following line install.packages(“GSa_1.0.zip”, repos¼NULL).

See more

The list of books you might like

Most books are stored in the elastic cloud where traffic is expensive. For this reason, we have a limit on daily download.