ebook img

Stochastic processes in genetics and evolution : computer experiments in the quantification of mutation and selection PDF

695 Pages·2012·7.692 MB·English
Save to my drive
Quick download
Download
Most books are stored in the elastic cloud where traffic is expensive. For this reason, we have a limit on daily download.

Preview Stochastic processes in genetics and evolution : computer experiments in the quantification of mutation and selection

8159.9789814350679-tp .indd 1 10/13/11 3:07 PM TThhiiss ppaaggee iinntteennttiioonnaallllyy lleefftt bbllaannkk Charles J. Mode Drexel University, USA Candace K. Sleeman NAVTEQ Corporation, USA World Scientific NEW JERSEY • LONDON • SINGAPORE • BEIJING • SHANGHAI • HONG KONG • TAIPEI • CHENNAI 8159.9789814350679-tp .indd 2 10/13/11 3:07 PM Published by World Scientific Publishing Co. Pte. Ltd. 5 Toh Tuck Link, Singapore 596224 USA office: 27 Warren Street, Suite 401-402, Hackensack, NJ 07601 UK office: 57 Shelton Street, Covent Garden, London WC2H 9HE British Library Cataloguing-in-Publication Data A catalogue record for this book is available from the British Library. STOCHASTIC PROCESSES IN GENETICS AND EVOLUTION Computer Experiments in the Quantification of Mutation and Selection Copyright © 2012 by World Scientific Publishing Co. Pte. Ltd. All rights reserved. This book, or parts thereof, may not be reproduced in any form or by any means, electronic or mechanical, including photocopying, recording or any information storage and retrieval system now known or to be invented, without written permission from the Publisher. For photocopying of material in this volume, please pay a copying fee through the Copyright Clearance Center, Inc., 222 Rosewood Drive, Danvers, MA 01923, USA. In this case permission to photocopy is not required from the publisher. ISBN-13 978-981-4350-67-9 ISBN-10 981-4350-67-2 Printed in Singapore. Devi - Stochastic Processes in Genetics.pmd 1 10/21/2011, 3:51 PM October6,2011 16:3 WorldScienti(cid:12)cBook-9inx6in 01-Dedication Dedication To the memory of my wife, Eleanore L. Perdelwitz Mode and my parents, Karl Charles and Fanny E. Hansen Mode To the memory of my father, Dr. Richard A. Sleeman TThhiiss ppaaggee iinntteennttiioonnaallllyy lleefftt bbllaannkk October6,2011 16:3 WorldScienti(cid:12)cBook-9inx6in 02-Prologue Prologue Attheoutsetitshouldbestatedthatthisisnotabookonphylogeneticsin whichmillionsofyearsofevolutionandrelationshipsamongexistingspecies are often under consideration. However, in chapters 6,7 and 8 stochastic models of nucleotide substitutions, which may be applied in research on phylogenetics, are reviewed and some useful extensions are suggested that accommodatenucleotidesubstitutionsatalargenumberofsitesofaDNA molecule rather than a single site or codons with three sites that are char- acteristic of most of the models introduced to the literature 3 to 4 decades ago. Incontrasttoresearchinphylogenetics,themainthrustofthisbookis toprovidemethodsforsimulatingthestochasticevolutionof singlespecies during short periods of evolutionary time consisting of 10,000 to 200,000 years, but in some models time is expressed on a scale of generations. A common theme of the computer simulation experiments reported in this book is the evolution of a population stemming from a small founder population. Of particular interest in these simulation experiments is the informative statistical summarization of a sample of Monte Carlo waiting times until a new beneficial mutation arises and become predominant in a population. Another distinguishing feature of this book is that all Monte Carlo simulation models are rooted in stochastic processes, and, in each case, an attempt has been made to present the mathematics in sufficient detail so that if an investigator were interested, he would, in principle, be possible to write software in a programming language of his choosing and duplicate any computer experiment reported in this book. The program- minglanguageusedthroughoutthisbookwasAPL2000,whichisaninter- national language that is popular among a minority of people who like the succinctness with which complex code may be written. Unfortunately, this programming language is not as popular as C++ and other languages. As vii October6,2011 16:3 WorldScienti(cid:12)cBook-9inx6in 02-Prologue viii Stochastic Processes in Genetics and Evolution mathematicsisauniversalinternationallanguagehowever,itishopedthat by inspecting the mathematics underlying a model, an investigator will be abletowritesoftwaretoimplementanymodeldiscussedinaprogramming language of his choosing. The following paragraphs of this prologue are devoted to suggestions that will be helpful to readers who wish to read this book thoroughly or merely skim through it or even skip some chapters to obtain an overall impressionofthecontentsofthebook. Itishopedthatthemodularnature of this book by topics will expedite this exploratory process. Chapter 1 is devoted to an axiomatic treatment of probability, which will be useful in setting the stage for the chapters that follow. A central theme of this chapter is the concept of a finite probability space, which encompasses a sample space of outcomes of a conceptual experiment, a collection of events or subsets of the sample space and the definition a probability function defined on the class of events with certain properties. Randomvariablesarethendefinedwithinthecontextofaprobabilityspace and the binomial, multinomial and Poisson distributions are derived and areappliedextensivelythroughoutthebook. Forthosereaderswhoarenot comfortablewiththeaxiomaticapproachtoprobability,itwillbesufficient tograsptheideasunderlyingthebinomial,multinomialandPoissondistri- bution. In this connection, a study of the many examples from Mendelian genetics involving applications of these distributions will be very helpful. Chapter 2 is devoted the parameterization of the gametic distribution with respect to a large number of linked Mendelian loci or markers such as singlenucleotidepolymorphisms,SNPs;onmoleculesofDNA:Aftersome suggestionsforassemblingdatabasestostudygeneticrecombinationatthe molecular level, a method is developed for parameterizing the gametic dis- tributionintermsofrecombinationprobabilitiesforsomearbitrarynumber N (cid:21)2 of linked loci. In the closing section of this chapter, suggestions are madeastohowtheideasdevelopedintheforegoingsectionscanbeapplied to pedigrees in which linked markers are under consideration. Chapter 3 is devoted large random mating diploid populations with no mutation or selection and the principal objective of the chapter is to develop a mathematical structure that can accommodate a large number of linked loci with a finite but arbitrary number of alleles at each locus so that convergence to a linkage equilibrium in such a population may be studied. This chapter begins with a classical account of convergence to linkage equilibrium for the case of two loci with two alleles at each locus, whichmaybefoundinmanytextbooksofpopulationgenetics. Thisresult October6,2011 16:3 WorldScienti(cid:12)cBook-9inx6in 02-Prologue Prologue ix is then extended to the case of a finite but arbitrary number of alleles at each locus and then finally to the general case of multiple loci mentioned above. Muchofthecontentofchapter3isbasedonresultsbyH.Geirenger, which were published in 1944. Among other things, this theory is based on a elegant application of set theory, and if this is discomforting to a reader, he can rest with the knowledge that this theory will not be used in subsequent chapters of the book. However, when genetic recombination is again encountered in chapter 14, the case of two linked loci discussed in chapters 2 and 3 will be applied to the case of two linked markers at the molecular level. Chapter4isdevoted toapresentationtheWright-Fisherprocesswithin the context of finite absorbing Markov chains in which applications of ma- trix theory are useful in reducing the structure of this process to simple terms that are familiar to anyone with a working knowledge of the theory of finite matrices. In particular, formulas of a set of conditional absorption probabilities are derived such that if the process starts in given transient state,theconditionalprobabilitythatitisabsorbedortheprocessintermi- nated in some particular absorbing state are expressed as elements of ma- trices. Furthermore, giving that the process terminates in some absorbing state,formulasfortheconditionalexpectationsandvariancesofthewaiting timestoabsorptionarederived. Ageneralformulaforthequasi-stationary distribution of a finite absorbing Markov chain are also derived, which will beusefulinconnectionswithbranchingprocessesintroducedinsubsequent chapters. Any mention of diffusion approximations to Wright-Fisher pro- cess have deliberately been avoided, because, for the most part, this book is devoted to computer intensive methods. In this chapter, Wright-Fisher processes with respect to a single autosomal locus with two alleles are the principal foci of attention and both the neutral case and the cases of mu- tation and selection as characterized within the Wright-Fisher paradigm in termsofprobabilities. AclassofWright-Fisherprocesseswithastatespace such that all states communicate with each other was also included in this chapter. Chapter 5 is devoted for the most part to Wright-Fisher process with multiple alleles at a single autosomal locus. As through trial and error it wasfoundthatmatrixformulasderivedinchapter4tendtobecomenumer- ically unstable when the size of a Markov transition matrix exceeds about 1000(cid:2)1000; it became necessary to use Monte Carlo simulation methods for dealing with process based on multiple alleles which usually entails the use of very large transition matrices. Fortunately, by using Monte Carlo

See more

The list of books you might like

Most books are stored in the elastic cloud where traffic is expensive. For this reason, we have a limit on daily download.