ebook img

Nuclear DNA Amounts in Angiosperms: Progress, Problems and Prospects PDF

46 Pages·2004·0.41 MB·English
by  
Save to my drive
Quick download
Download
Most books are stored in the elastic cloud where traffic is expensive. For this reason, we have a limit on daily download.

Preview Nuclear DNA Amounts in Angiosperms: Progress, Problems and Prospects

Annalsof Botany95:45–90, 2005 doi:10.1093/aob/mci003,available onlineatwww.aob.oupjournals.org Nuclear DNA Amounts in Angiosperms: Progress, Problems and Prospects M. D. BENNETT* and I. J. LEITCH Jodrell Laboratory, Royal Botanic Gardens, Kew, Richmond, SurreyTW9 3AB, UK Received:23July2004 Returnedforrevision:12August2004 Accepted:1September2004 CONTENTS INTRODUCTION 45 PROGRESS 46 Improvedsystematicrepresentation(speciesandfamilies) 46 (i)Firstestimatesforspecies 46 (ii)Firstestimatesforfamilies 47 PROBLEMS 48 Geographicalrepresentationanddistribution 48 Plantlifeform 48 Obsolescencetimebomb 49 Errorsandinexactitudes 49 Genomesize,‘complete’genomesequencing,and,theeuchromaticgenome 50 Thecompletelysequencedgenome 50 Weedingouterroneousdata 52 WhatisthesmallestreliableC-valueforanangiosperm? 52 WhatistheminimumC-valueforafree-livingangiospermandotherfree-livingorganisms? 53 PROSPECTSFORTHENEXTTENYEARS 54 Holisticgenomics 55 LITERATURECITED 56 APPENDIX 59 NotestotheAppendix 59 OriginalreferencesforDNAvalues 89 (cid:1)BackgroundThenuclearDNAamountinanunreplicatedhaploidchromosomecomplement(1C-value)isakey diversitycharacterwithmanyuses.AngiospermC-valueshavebeenlistedforreferencepurposessince1976,and pooledinanelectronicdatabasesince1997(http://www.kew.org/cval/homepage).Suchlistsarecitedfrequentlyand providedataformanycomparativestudies.Thelastcompilationwaspublishedin2000,soafurthersupplementary lististimelytomonitorprogressagainsttargetssetatthefirstplantgenomesizeworkshopin1997andtofacilitate newgoalsetting. (cid:1)ScopeThepresentworklistsDNAC-valuesfor804speciesincludingfirstvaluesfor628speciesfrom88original sources,notincludedinanypreviouscompilation,plusadditionalvaluesfor176speciesincludedinaprevious compilation. (cid:1)Conclusions1998–2002sawstrikingprogressinourknowledgeofangiospermC-values.Atleast1700firstvalues forspeciesweremeasured(themostinanyfive-yearperiod)andfamilialrepresentationrosefrom30%to50%. The loss of many densitometers used to measure DNA C-values proved less serious than feared, owing to the developmentofrelativelyinexpensiveflowcytometersandcomputer-basedimageanalysissystems.Newusesofthe term genome (e.g. in ‘complete’ genome sequencing) can cause confusion. The Arabidopsis Genome Initiative C-value for Arabidopsis thaliana (125 Mb) was a gross underestimate, and an exact C-value based on genome sequencingaloneisunlikelytobeobtainedsoonforanyangiosperm.Lack ofthisexpectedbenchmarkposesa quandary as to what to use as the basal calibration standard for angiosperms. The next decade offers exciting prospectsforangiospermgenomesizeresearch.Thedatabase(http://www.kew.org/cval/homepage)shouldbecome sufficiently representative of the global flora to answer most questions without needing new estimations. DNAamountvariationwillremainakeyinterestasanintegratedstrandofholisticgenomics. ª2005AnnalsofBotanyCompany Keywords: AngiospermDNAamounts,DNAC-values,nucleargenomesize,plantDNAC-valuesdatabase. INTRODUCTION in each successive decade. Work on plants has played a leading part in research to describe and understand the IthasbeenpossibletoestimatetheamountofDNAinplant origin, extent and effects of variation in the DNA amount nuclei for over 50years, andsince the keyroleofDNAin in the unreplicated haploid nuclear chromosome comple- biologywasdiscoveredin1953,suchresearchhasincreased ment(definedbySwift,1950,asthe1C-value)ofdifferent *[email protected] taxa.Indeed,angiospermsareprobablythemostintensively Annals of Botany 95/1 ª Annals of Botany Company 2005; all rights reserved 46 Bennett and Leitch — Nuclear DNA Amounts in Angiosperms studied major taxonomic ‘group’ of organisms, with and quality over a long period (Bennett and Leitch, published C-values for over 4100 species. 1995).Theimportanceofidentifyinggapsinourknowledge Early research to address questions such as possible concerning this key biodiversity character, of recommend- relationships between DNA C-value and the rate of cell ing targets for new work to fill them by collaboration of development (e.g. Van’t Hof, 1965) usually required internationalpartners,andofmonitoringprogresstoensure work to estimate C-values for most of the taxa concerned, that any shortfall is recognized, was confirmed by the first as these were unavailable. Later, as taxa with ‘known’ plantgenomesizeworkshopin1997(http://www.kew.org/ C-values increased, it was possible to use such data in cval/conference.html#outline, Bennett et al., 2000) and new comparisons (supplemented by further first estimates reviewed by participants at the second plant genome size made for sample taxa). However, it was often difficult to workshopin2003.Thus,whatfollowsismainlyasummary knowwhetheraC-valueexistedforaparticulartaxon,andif of the overall progress for angiosperms against key targets so,wheretofindit.Suchestimateswerewidelyscatteredin set in 1997 for the following quinquennium (1998–2002). the literature or even unpublished. Small lists of nuclear However, it also notes meaningful statistics for the data DNA amounts were published in reviews and research included in the Appendix table, or known to us from papers, but the first large list of DNA amounts for angio- personal communications made after the Appendix table sperms, compiled primarily as a reference source was was closed. publishedin1976.Thiscontaineddataforover750species In 1997 C-values for 2802 species (approximately 1 %) from 54 original sources (Bennett and Smith, 1976), and of angiosperm species had been estimated in the previous noted an intention to publish supplementary lists for refer- 40 years. The 1997 workshop concluded that the ideal of ence purposes at intervals. Five such lists, together giving a C-value for all taxa was unrealistic, but long-term, pooleddataforover2900speciesfrom323originalsources, estimates for 10–20 % of angiosperms seemed both ulti- have followed (Bennett et al., 1982, 2000; Bennett and mately achievable and adequate for all conceivable uses Smith, 1991; Bennett and Leitch, 1995, 1997). Data from provided they were carefully targeted to be representative the first five publications were pooled in an electronic of the various taxonomic groups, geographical regions, form – the Angiosperm DNA C-values database, which and life forms in the global flora. So the first recom- went live in April 1997. This was updated as release 3.1 mended target was to estimate first C-values for the and incorporated, with databases for gymnosperms, next 1 % of angiosperm species (i.e. another 2500 spe- pteridophytesandbryophytes,intothePlantDNAC-values cies) by 2003. Many saw this goal as aspirational, as database (release 1.0) in 2001. achieving it would mean estimating as many C-values These data are clearly much used, as the published lists in five years as in the past 40. Others thought that new have beencited over1400times,includingover700times technology (e.g. flow cytometry) would make it easy to since1997,whilsttheelectronicdatabasehasreceivedover achieve. 50000hits.Recentlytheyhaveprovidedthelargesamples ofdata needed formanydiversecomparative studies,such Improvedsystematicrepresentation(speciesand as testing for possible relationships between nuclear DNA families) amountandriskof extinction (Vinogradov, 2003), ecolog- ical factors in California (Knight and Ackerly, 2002), lead (i) First estimates for species. In September 1997 the pollution in Slovenia (B. Vilhar, University of Ljubljana, Angiosperm DNA C-values database contained data for Slovenia, pers. comm.); ploidy level (Leitch and Bennett, 2802 species. By September 2003 C-values were listed 2004), and land plant evolution (Leitch et al., 2005). for4119species,including689firstvaluesforspecieslisted Giventheirongoinguseasreferencesources,publication in Bennett et al. (2000) and 628 such values for species of a sixth supplementary list of angiosperm C-values is in the Appendix table. Progress toward the first target in timely,ifnotoverdue.ThepresentworklistsDNAC-values thefiveyearperiod(1998–2002)considerablyexceededthe for 804 species from 88 original sources, including first average of (cid:2)110 first values for species per annum in the estimatesfor628speciesnotincludedinanypreviouscom- early 1990s. Clearly, the 1997 workshop stimulated an pilation, plus additional estimates for 176 species already increase in the total output of first C-values for species to included in one or more previous compilation. Data in the its highest level for any five-year period (almost 200 per AppendixtablewerepreparedforanalysisatthesecondPlant annum; Fig. 1A). Moreover, the proportion of newly pub- GenomeSizeDiscussionMeetinginSeptember2003,soitis lished C-values that were also first estimates for species, fitting that they are included in this special supplement. whichhadpreviouslyfallen(Bennettetal.,2000),roseasa WhilsttheyrepresentmostofthenewC-valuedatapublished result of recent targeting and averaged 72(cid:3)5 % for values orestimatedin2000–2002,wearealreadyawareofafurther published since 1997 (Fig. 1B). Nevertheless, the total largesampleestimatedbutunpublishedeitherbylate2002, number of published first C-values for species (1032) orsubsequently. Thus, despite its large size, the presentlist listed since 1997 was only 41 % of the 1997 target of will soon be followed by a seventh supplement. approx. 2500. The real total of first C-values for angiosperms esti- matedafter1996butunpublishedby2003wasmuchhigher, PROGRESS but is difficult to determine exactly. For example, several Research on DNA C-values in angiosperms is unique in hundred values were measured by Ben Zonneveld (pers. having been subject to detailed analyses of its quantity comm.) using flow cytometry but not published. Listing Bennett and Leitch — Nuclear DNA Amounts in Angiosperms 47 A 60 300 50 250 of estimates 200 PG families 40 umber 150 % of A 30 n n 100 ve Mea ulati 20 50 m u C 10 0 4 4 4 4 4 2 5 6 7 8 9 0 9 9 9 9 9 0 1 1 1 1 1 2 – – – – – – 0 0 0 0 0 0 0 5 6 7 8 9 0 9 9 9 9 9 0 1950 1960 1970 1980 1990 2000 1 1 1 1 1 2 Year Period B FIG.2. Cumulativepercentageofangiospermfamiliesrecognizedbythe AngiospermPhylogenyGroup(APG)(APGII,2003)withafirstC-value 100 representedinthepresentAppendixtable,theAngiospermDNAC-values d e database (release 4.0, January 2003), plus eleven known to the present h ublis 80 authorsinSeptember2003. p s e C-value estimathat are ‘new’ 4600 ptaanrkeg(esiiieo)1ns0tpF0erirayrmtseetasraespcs,ehtsciiomieevaasint)negiusslt2fmiom0roa%rtefeassmgepoinelasicleiibseo.lsfe1.Tre0hpe%res1(ea9np9tp7artoiwoxn.o2rwk5soh0uo0ldp0 of t noted that a first C-value was available for only 30 % of e g angiospermfamiliesrecognizedatthattime.Thus,asecond nta 20 recommended target was ‘To obtain at least one C-value e c er estimateforaspeciesinallangiospermfamilies’.Monitor- P 0 ingfirstC-valuesforspecieslistedinBennettetal.(2000) 1965 1970 1975 1980 1985 1990 1995 2000 showed that progress towards this goal was initially very Year slow.Indeed,‘since1997firstC-valueshadbeenlistedfor 691angiospermspecies,butonly12(1(cid:3)7%)werealsofirst FIG.1. (A)Meannumberperyearoftotal(opensymbols)and‘first’(closed estimatesforfamilies’.WorktocorrectthisbeganatRBG, symbols)DNAC-valueestimatescommunicatedintensuccessive5-year periodsandthe3-yearperiod2000–2002,between1950and2002.Basedon Kewin1999.In2001twopapersreportedfirstC-valuesfor analysisofdatalistedinthepresentAppendixtable,andtheAngiosperm 50families(Hansonetal.,2001a,b),and30morefollowed, DNA C-values database (release 4.0, January 2003). (B) Percentage of including five basal angiosperm families (Leitch and C-valueestimatespublishedorcommunicatedduring1965–2002thatare Hanson, 2002; Hanson et al., 2003), all included in the first values for species listed in the present Appendix table and the present Appendix table. AngiospermDNAC-valuesdatabase(release4.0,January2003). Analysis of listed data for 4119 species shows that a first published value is available for at least 217 of the 457 angiosperm families currently recognized by the fortheAppendixtableclosedinAugust2002,readyforthe Angiosperm Phylogeny Group (APG) (APG II, 2003). workshop;however,wesaw158firstC-valuespublishedby Together with first estimates for 11 unlisted families other authors later in 2002, and 22 such values were esti- (Hanson, RBG, Kew, pers. comm.; Koce et al., 2003) matedatRBG,Kew.Addingthesedatatothoselistedinour measured or seen after listing for the Appendix table was compilationssuggeststhatthetotalnumberoffirstC-values closed,thetotalis228.Thus,since1997(afterlossesowing for species estimated in 1997–2002 was probably at least to new familial circumscriptions—APG II, 2003; Hanson 1700andhencenotlessthanapprox.66%ofthetargetset et al., 2003) first values for at least 85 such families have in1997.Analysisshowsthatthiswasachievedbyinterna- beenmeasured,sogoodprogresshasbeenmade.However, tionalcollaborationinvolvingatleast18researchgroupsin theproportionoffamiliesrepresentedroseonlyfrom30% ten countries. Whilst a target of 2500 was aspirational, it to49(cid:3)9%(Fig.2),whichislessthanonethirdofthetarget seemsattainableasafuturefive-yeargoal.However,atthe (100 %) set in 1997. Major factors limiting progress were 48 Bennett and Leitch — Nuclear DNA Amounts in Angiosperms TABLE 1. Thenumberandpercentage (inbrackets)oforiginalreferenceswithfirstauthorsfromvariousgeographicalareas amongthetotalof465sourcescontributingtothepresentAppendixtableandthesixlistsofangiospermDNAamountspreviously compiled for reference purposes that were pooled in the Angiosperm DNA C-values database (release 4.0, January 2003) DNAC-valuecompilation Area 19761 19822 19913 19954 19975 20006 PresentAppendix Total Europe 34(63.0) 38(71.7) 30(53.6) 43(40.6) 18(48.6) 38(51.4) 54(61.4) 255(54.8) UK 28(51.9) 13(24.5) 22(39.3) 23(21.7) 5(13.5) 8(10.8) 10(11.4) 109(23.4) NorthAmerica 14(25.9) 11(20.8) 16(28.6) 19(17.9) 5(13.5) 11(14.9) 13(14.8) 89(19.1) SouthandMesoAmerica 0(0.0) 0(0.0) 3(5.4) 9(8.5) 1(2.7) 6(8.1) 2(2.3) 21(4.5) Africa 1(1.9) 0(0.0) 0(0.0) 1(0.9) 2(5.4) 1(1.4) 2(2.3) 7(1.5) Asia 1(1.9) 3(5.7) 4(7.1) 30(28.3) 11(29.7) 8(10.8) 17(19.3) 74(15.9) India 1(1.9) 1(1.9) 2(3.6) 28(26.4) 11(29.7) 4(5.4) 11(12.5) 58(12.5) China 0(0.0) 0(0.0) 0(0.0) 0(0.0) 0(0.0) 0(0.0) 2(2.3) 2(0.4) Australasia 4(7.4) 1(1.9) 3(5.4) 4(3.8) 0(0.0) 7(9.5) 0(0.0) 19(4.1) Australia 4(7.4) 1(1.9) 3(5.4) 1(0.9) 0(0.0) 1(1.4) 0(0.0) 10(2.2) Total 54 53 56 106 37 71 88 465(100) 1BennettandSmith(1976); 2Bennettetal.(1982); 3BennettandSmith(1991); 4BennettandLeitch(1995); 5BennettandLeitch(1997); 6Bennett etal.(2000). discussedpreviously(Hansonetal.,2003).Unlikeprogress oftaxawhoseC-valuesareunknown.Bennettetal.(2000) towards the species target, which involved many research notedthat‘Africaremainsanunexploredcontinent’andthat groups,movementtowardsthegoalforfamiliessince1997 ‘Whereas six out of 377 original sources have first authors has depended mainly on work by one institution, as RBG with addresses in Africa, still none has an angiosperm Kewestimated65ofthe74(87%)firstvaluesforfamilies C-value estimated in Africa, as all six reported work done listed in the Appendix table. inEuropeortheUSA.’Analysisofthe88originalsources ThePlantGenomeSizeWorkshopin2003confirmedthat inthe present work showsnoimprovement, asthe number global capacity for estimating DNA C-values (determined of original sources from Africa (2), China (2), and South by available equipment, funding and trained operators) America(2)remainslow(Table1).(ii)Thesecondconcerns remains very limited. Consequently, any increased focus the small number of first C-values by any authors for on targets for other plant groups (e.g. bryophytes, pterido- species endemic to several large geographical regions. phytes, gymnosperms—reviewed in Leitch and Bennett, With some exceptions, the sample is still dominated by 2002b) inevitably reduces progress to improve representa- crops and their wild relatives, model species grown for tion for angiosperm targets, a problem discussed below. experimental use, and other species growing near labora- However, it should not detract from the highly successful tories in temperate regions, mainly in Western Europe and progresstomakeC-valuesmorerepresentativeoftheglobal North America. Analysis of data in the Appendix table flora described above. shows that none presented data for other taxa endemic to China, Japan, Brazil, Mexico or Central Africa. Similarly, althoughislandflorasareknowntoberichinendemics,no PROBLEMS original source has reported C-values for any large islands suchasBorneo,NewGuineaorMadagascar,where80%of Geographicalrepresentationanddistribution the 12000 described plant species are endemic (Robinson, Wefirstnotedtheneedtoimprovegeographicalrepresenta- 2004). tion for angiosperm C-values in Bennett and Leitch (1995). This was confirmed by the 1997 plant genome Plantlifeform size workshop, although no specific regional targets were recommended. Perhaps, in consequence, progress in this There is also a need for the overall sample to represent area has never been monitored in detail, although we better the full range of plant types and life forms. We have been at pains to advertise the problem in a general previously identified several associations and life forms way and to provoke action to rectify it in particular as being poorly represented in the database (Bennett and regions, such as southern Africa (Leitch and Bennett, Leitch, 1995), yet taxa from bog, fen, tundra, alpine 2002a). and desert environments, and halophytic, insectivorous, Therearetwocriticalconcernsregardingthegeographical parasitic, saprophytic andepiphytic speciesand their asso- distribution of angiosperm C-value work. (i) The first ciated taxa are all still under-represented. concerns the small number of publications with original Solving this problem needs a proactive approach, as C-values by first authors in many regions (Table 1). This recent experience with first C-values for angiosperm reflects a serious imbalance between the geographical dis- families shows. First, a target must be set for each gap. tributionofresearchscientistsworkingongenomesizeand Second, monitoring newly published data against targets Bennett and Leitch — Nuclear DNA Amounts in Angiosperms 49 must begin. Third, if work on poorly-represented floras or capacity-building may be thwarted by a lack of in-country life forms does not increase, then established research support by the suppliers of flow cytometers. groups must re-focus on target material available in exist- A second solution to the problem is a new availability ing collections. Unless global capability for estimating of relatively inexpensive computer-based image analysis plant DNA C-values is significantly increased by new (CIA) systems, which can estimate DNA amounts using technology, funds or skilled operators, then this change Feulgen-stainedcytologicalpreparationsinplaceofamicro- in strategy will reduce progress towards achieving other densitometer.Althoughproprietaryhard-wiredCIAsystems targets. However, the prime objective remains: togenerate have been available since the 1970s (e.g. Zeiss Quantimet a sample representative of the global flora that is able system),theycostmuchmorethanmicrodensitometers,and to support most comparative studies. Managers of the analysis of the literature shows they have notbeen used to limited global capacity for estimating genome size should estimate plant C-values. However, in the 1990s, with keep this firmly in mind when targeting taxa for new advances in computer technology, less expensive systems work. weredeveloped(e.g.CIRESsystem)primarilyformedical use, andthesehave also been used togoodeffectfor plant C-value estimations (e.g. Temsch et al., 1998; Greilhuber Obsolescencetimebomb et al., 2000). Several methods have been used to measure plant DNA Sadly, the CIRES system that adapted well for this pur- C-values, butmostvalues have been estimatedbyFeulgen poseisnolongeravailable,asthesoftwareisincompatible microdensitometry (Fe), both overall and since 1997. with the operating system used on modern PCs. However, In 1997 we identified an imminent problem, likely to computer-literate groups can assemble the kit needed to limit future estimations. This was the failure and non- estimateC-valuesusingCIA,andseveralsoftwarepackages replacement of densitometers long used by many groups written specifically for this purpose are available (Vilhar to estimate DNA C-values. Manufacturers were ending et al., 2001; Hardie et al., 2002). Hardie et al. (2002) give theirsupportforsuchequipment,andusersfaceddifficulty an excellent review of this technique and practical issues infundingnew equipment for thispurpose. Moreover,this concernedwithitsuseforanimalmaterials,andVilharetal. problemwaslikelytobemostacuteinregionswheresome (2001) have compared CIA, Fe and FC, to help define of the greatest gaps in our knowledge lay. Reviewing best practice for CIA, demonstrating that CIA can be the position at the second Plant Genome Size Workshop applied to plants with an approx. 100-fold range of confirmedthat,aspredicted,the‘obsolescencetimebomb’ C-values. Vilhar et al. (2001) concluded that ‘DNA image had exploded. By 2003 several laboratories that had photometry gives accurate and reproducible results, and long published C-values listed in the Angiosperm DNA may be used as an alternative to photometric cytometry C-values database were now unable to estimate C-values in plant nuclear DNA measurements’. CIA can use an by this (e.g. in Mexico, the USA), or any method (e.g. in existing microscope, costs less than FC to set up, and is Argentina). Vickers Instruments no longer supports their easier to service in countries that lack FC manufacturers’ M85microdensitometer,andsparepartsforitareunobtain- support. The field would benefit from development of a able. A few laboratories, including ours, can still use such standard inexpensive CIA ‘kit’, an agreed best practice machines,butnowwithoutservicingandonlyuntiltheyfail CIA technique, and easy access to leading laboratories catastrophically. fortrainingandtechnologytransfer.Giventhis,CIAcould As expected, one response to this problem was the soonbecomethemethodofchoiceforestimatingC-values increaseduseofflowcytometry(FC)toreplaceFe.Analysis in angiosperms, replacing Fe as a method of choice along ofdataintheAppendixtableshowsahigherproportionof with FC, but with the advantage that, unlike FC, it uses C-valuesobtainedbyFC(58(cid:3)4%),mostlysince1997,than microscopeslidepreparations,allowinguserstomakecyto- noted previously (48(cid:3)6 %) for data listed in Bennett and logical observations. Leitch(2000),whilstinBennettandLeitch(1995,1997)FC averaged 26(cid:3)7 %. Several groups have undertaken careful Errorsandinexactitudes studies to compare DNA estimates made by FC and Fe, to define best practice for FC, or to show that FC can be Swift(1950)definedtheDNAcontentofanunreplicated applied widely to most plants across the full range of haploid complement as its 1C-value (C standing for known DNA C-values (e.g. see review by Dole(cid:1)zzel and ‘constant’). Thus, replicated diplophase nuclei have a Bartos, 2005, this volume). Fortunately, the cost of a 4C DNA amount and produce two unreplicated 2C nuclei basicflowcytometerforsuchworkhasfallen,andsuitable bymitoticdivision,andfour1Cgameticnucleiaftermeio- models(e.g.PartecPAII)haverecentlybeensetupforthis sis, irrespective of the organism’s ploidy level. This con- use for approx. £20K (US$30K). If this technology con- vention applies well to polyploid taxa with diploidized tinues toimprove,anditscostscontinue tofall, FCshould meioticchromosomepairingsuchashexaploidbreadwheat, be more easily available. However, FC easily yields poor which produce mainly functional, balanced polyhaploid data in unskilled hands and by itself does not provide the gametes with 1C DNA amounts at meiosis (Rees and cytologicalviewoftestmaterial(s)thatisessentialtocount Walters, 1965). Consequently, for several previous refer- chromosome number(s). Its use in some less-developed ence lists, 4C DNA estimates for all taxa were divided countries (where the greatest gaps in our knowledge still by 2 and 4 to generate 2C- and 1C-values respectively remain) will depend on training local operators, but such (e.g. Bennett et al., 2000). However, a problem with this 50 Bennett and Leitch — Nuclear DNA Amounts in Angiosperms practice was identified for the few taxa with odd ploidy Further potential for confusion comes from new uses of levels in release 3.1 of the Angiosperm DNA C-values theterm‘genome’recentlyspawnedbygenomesequencers. database (namely 45 out of 3493 listed taxa, (cid:2)1(cid:3)3 %), as Theseconcernthecounter-intuitivemeaningofa‘wholly’, theresulting1C-valuesarenotbiologicallymeaningful.For ‘completely’or‘entirely’sequencedgenome,orofequating example,triploidswitha4Camountintheirfullyreplicated ‘genome’ with ‘euchromatic genome’—a confusing con- metaphase nuclei do regularly produce two 2C nuclei at cept in which ‘genome’ equals the parts which could be mitosis, but do not regularly produce four 1C products at clonedandsequenced,butnottherest(seebelow).Noneof meiosis.Theauthorsaregratefultoseveralcolleagueswho thesequalitativenewusesofgenomeequatestoitsquanti- noted this problem and suggested solutions. tative use to mean either a 1C-value, or one monoploid This problem has several practical consequences. parental genome in a polyploid. (i) Regrettably, researchers who use 1C data from the literature or downloaded from the Angiosperm DNA Thecompletelysequencedgenome C-values database may have included this error in the samplesthattheyusedforcomparativeanalyses.However, Since 2000 the scientific and popular press has reported thisisunlikelytohaveinfluencedtheirconclusionssignific- andcelebratedthe‘complete’sequencingofthefirstinsect antly, since the magnitude of the error is relatively small (Drosophilamelanogaster)andplantgenome(Arabidopsis (ranging between (cid:4)0(cid:3)25C for a triploid, to +0(cid:3)25C for a thaliana) and the human genome (in 2001). For example, pentaploid—which tend to cancel out), and affects only atitleinNaturereported:‘Thesequencingofanentireplant 1(cid:3)3 % of all taxa listed. Overall, errors in mean DNA genome is now complete.’ Readers could be forgiven for amounts for samples are probably less than 0(cid:3)5 %. Studies assuming this meant the entire linear sequence of the that used data from the 2C or 4C columns for samples of nuclear DNA had been sequenced and assembled, so that odd-ploidtaxaareunaffectedbytheerror.(ii)Toensurethat thetotalsizeofthenucleargenomeintheseorganismswas researchersareawareoftheproblemanddonotgenerate1C nowknownwithcertainty,andhencemuchmoreaccurately datafortaxawithoddploidylevelsinthefuture,release4.0 thananypreviousestimatebasedonothermethodssubject of the Angiosperm DNA C-values database gives 2C- and to various experimental errors. The popular and scientific 4C-values for the 45 out of 3493 entries with odd ploidy literature easily gives that impression, and unfortunately levels, plus a warning note in response to any queries for that is what many, incorrectly, understood. The truth is 1C-values. This approach is also followed in the present otherwise, as a ‘completely sequenced’ genome is a very Appendixtable(seefootnotet).(iii)Thisproblemalsohigh- relativeconcept.InthesameissueofSciencewhereBrenner lights a general need to re-assess definitions of ‘C-value’ (2000) wrote ‘We have the complete sequence of the and‘genomesize’inlightofrecentusageandnewtheore- 125-megabasegenomeofthefruit-flyDrosophila’,Pennisi tical understanding, a topic explored by Greilhuber et al. (2000) noted that ‘the fly sequence still has c. 1000 small (2005).Indeed,theaboveproblemshowstheneedforcare gaps’—referring only to the sequenced euchromatin part. when handling data, and the danger of using computer- Butwhatoftherest?Speakingofheterochromatin,Adams generated numbers uncritically. It is clearly perilous to et al. (2000) explained that the ‘genomes of eukaryotes ignore basic biology or the literature, as the recent history generally contain heterochromatic regions surrounding ofgenomesize,‘complete’genomesequencing,andinterest thecentromeresthatareintractabletoallcurrentsequencing in the smallest angiosperm genome clearly shows. methods’ and that ‘Because of the unclonable repetitive DNA surrounding the centromeres it is highly unlikely that the genomic sequence of chromosomes from eukar- Genomesize,‘complete’genomesequencing, yotessuchasDrosophilaorhumanwilleverbe‘complete’. and,theeuchromaticgenome Moreover,Adamsetal.(2000)statedthattheunsequenced A growing semantic problem concerns different uses of centricheterochromatinregionscomprised‘onethird’ofthe the term ‘genome’ (Greilhuber et al., 2005). As originally approx.180MbgenomeofDrosophila.Buthowwasitssize definedbyWinkler(1920),genomereferredtoamonoploid determined? Careful reading revealed that the Mb size of chromosomecomplement.Sinceamonoploidisdefinedas these unsequenced centromeric heterochromatic segments ‘having one chromosome set with the basic (x) number of was measured not by any modern molecular method, but chromosomes’ (Rieger et al., 1991), it followed by defini- by using a ruler on one cell of a plate in a paper by tion that any polyploid taxon had three or more genomes. Yamamotoetal.(1990).Thisimportantdetailisnotstated However, an alternative meaning, now in common usage, inthemain text, butinthe legendtofig.1inAdams et al. uses genome as an interchangeable alternative for the (2000).AsBorkandCopley(2001)clearlyexplain,‘There 1C-value to refer to the DNA content of an unreplicated are regions, often highly repetitive, that are difficult or gametic nuclear complement, irrespective of ploidy level. impossibletoclone(oneoftheinitialstepsinasequencing Unless the meaning intended is clearly defined on each project)orsequencewithcurrenttechnology.... Theextent occasion, this can be confusing, especially when authors of these regions varies widely in different species. So, use both meanings for a polyploid taxon in the same ratherthanapplyingauniversalgoldstandard,eachsequen- paper. For example, Devos and Gale (1997) used the cing project has made pragmatic decisions as to what term ‘genome’ to refer to both the entire complement of constitutes a sufficient level of coverage for a particular nuclear DNA in a hexaploid wheat nucleus and to the genome. For example, as much as one-third of the individual A, B and D ‘genomes’. sequence of the fruitfly Drosophila melanogaster was not Bennett and Leitch — Nuclear DNA Amounts in Angiosperms 51 stable in the cloning systems used, and so was not those words their natural meaning. Other molecular work sequenced.’ has confirmed this conclusion (e.g. Hosouchi et al., 2002). Thus, workers interested in C-values should clearly Morerecently,thedraftDNAsequenceoftherice(Oryza understand that a ‘completely’, ‘entirely’ or ‘wholly’ sativa) genome was published (O. sativa ssp. japonica, sequenced genome is not what those words might imply Goff et al., 2002; O. sativa ssp. indica, Yu et al., 2002). iftakenatfacevalue,andthesizegivenforsuchagenome However,whiletheestimatedgenomesizesbasedonDNA may indicate either the amount of DNA sequenced, or the sequencingdidnotsufferfromtheseriousshortcomingsof size of that euchromatic genome sequenced plus a best- theArabidopsisestimate,neither didtheyfulfilthecriteria guess estimate of a lot of unsequenced heterochromatin. essentialforanewbenchmarkcalibrationstandard.Yuetal. Further, it can mean that every type of sequence in an (2002) gave a new C-value of 466 Mb for O. sativa ssp. organism has been sequenced, but it need not mean that indicacalculatedbyaddinguptheDNAsequencingdatafor all copies of all types have been sequenced, or that their 362 Mb of sequenced scaffolds and 104 Mb of ‘unas- copy numbers are known. Without this information total sembled data’. In contrast Goff et al. (2002) reported the genome size (the DNA C-value) cannot be determined sequencingofDNAwhichcoveredatotalof389809244bp based on genome sequencing (Bennett et al., 2003). oftheO.sativassp.japonicagenome.Theystatedthatthis Swift (1953) stated that, ‘in general estimates of the represented 93 % of the 420 Mb rice genome but did not nucleic acids in cells are at present accurate to 10 or give a reference to the source of 420 Mb. It is therefore 20 %’. Later, Bennett and Smith (1976) concluded that unclearwhethertheC-valueof420MbgivenbyGoffetal. ‘While a few estimates are not accurate even to within (2002)representsanewC-valuebasedongenomesequen- 20%,carefulmeasurementsof4CDNAamountsinspecies cing alone. The 1C-value for rice may yet prove to be with 0(cid:3)5–2(cid:3)0 times that of a standard species are probably slightly higher than the values assumed by Goff et al. accuratetowithin5–10%’.Greilhuber(1998)noted‘much (2002)andYuetal.(2002),andapproach490Mb,equival- suspectordemonstrablywrongdatahaveaccumulatedand ent to the 0(cid:3)5 pg estimated by Bennett and Smith (1991). continue to be accumulated in the literature’. Sadly, the Exact C-values based on complete genome sequences ‘complete’genomesequencingofArabidopsis(Arabidopsis would be invaluable (Bennett et al., 2003). The need to Genome Initiative, 2000), which was expected to provide complete sequencing gaps in Arabidopsis remains tech- a new baseline, only added to this phenomenon. nically difficult, and it is unclear how, when, or if it will Plant genome size researchers have long recognized the beachieved.Genomesequencingbecomesmoredifficultas needforanexactcalibrationstandard,whoseC-valueisnot genome size increases, and experience with Arabidopsis subjecttotechnicalerrors.Thus,thepublicationofaprecise implies that exact C-values are unlikely to be obtained in C-value for the first plant to have its genome completely this way soon for any larger plant genomes, including the sequenced was eagerly awaited, as it was expected to established plant C-value standard Oryza sativa. provide a baseline, gold-standard reference point, against The current situation poses a quandary for the plant which all other plants could be compared and expressed. genome size community, who have long paid serious Arabidopsis thaliana ecotype ‘Columbia’ was chosen for attention to trying to maximize the accuracy and compar- complete genome sequencing, partly because its tiny ability of plant DNA C-values by using agreed calibration genome should be less costly to sequence than larger standards (both materials and assumed values; e.g. see genomes in other species. http://www.rbgkew.org.uk/cval/conference.html#outline, In 2000 the Arabidopsis Genome Initiative (AGI) pub- Bennettetal.,2000),whileeagerlyawaitingthefirstabsol- lishedthegenomesizeofArabidopsisthalianaas125Mb, ute measurement for a plant obtained by really complete comprising115(cid:3)4Mbinthesequencedregionsplusarough DNA sequencing. Current options include: (i) continue to estimate of 10 Mb in unsequenced centromere and ribo- use the existing small group of plant calibration standards somal DNA regions. The accuracy of this estimate was until a plant C-value which meets the required criteria setnotbytheprecisionofsequencingandassemblingcon- becomes available; (ii) adopt an animal C-value which tigs,butbythetotalinaccuracyinthesizesassumedforthe meetsthesecriteriaasthebaselinereferenceforexpressing unsequenced gaps (Bennettet al., 2003) andhence was no all other plant species values, e.g. Caenorhabditis elegans moreaccuratethanmanyestimatesintherange150–180Mb Bristol N2, whose C-value is known with confidence to made by other methods. Further analysis showed that the within 1 % from genome sequencing to be just above AGI’s rough estimate of 10 Mb in the unsequenced gaps 100 Mb (or roughly 0(cid:3)1 pg); (iii) adopt a plant value was highly inaccurate. Thus, new comparisons using flow based on direct comparisons with C. elegans, as the base cytometry, which co-ran A. thaliana ecotype ‘Columbia’ calibration standard for plants, and create a ladder of withthreeanimalspeciesincludingCaenorhabditiselegans secondary calibration standards all measured against it in BristolN2(whosegenomesizeisaccuratelyestablishedby a study replicated between several groups able to use best genome sequencing as just over 100 Mb), gave C-value practice.TheC-valueforArabidopsisthaliana(1C=157Mb estimates for A. thaliana in the range 154–162 Mb (with or0(cid:3)16pg),recentlymeasuredagainstC.elegans(Bennett 157MbwhenC.eleganswasusedasthestandard)(Bennett etal.,2003),couldbeadoptedasthebasalplantcalibration etal.,2003).Thisvalueisabout25%largerthantheAGI standard. Seed is readily available from stock centres and estimate of 125 Mb which was clearly a gross under- gives small, easily grown plants. Moreover, the ladder of estimate,andhenceisnotthelong-awaitedfirstbenchmark valuesforitsmanyendopolyploidnucleiwouldalsoprovide C-valueforacompletelysequencedplantgenome—giving convenientcalibrationreferencepointsforhighervaluesup 52 Bennett and Leitch — Nuclear DNA Amounts in Angiosperms to approx. 2500 Mb or 2(cid:3)5 pg (i.e. 0(cid:3)64 (cid:4) 4C, 1(cid:3)28 (cid:4) 8C, A and 2(cid:3)56 (cid:4) 16C). Weedingouterroneousdata Thevalueofthedatabaseisdeterminedbytheaccuracyof s the data it contains. Ideally, values should be exact, but in ate m reality they are all subject to various technical and other errors,asnotedabove.Thisraisesquestionsastohowaccur- esti atedataare,andwhatleveloferrorisacceptableinpractice, er of or makes a datum valueless for a particular use or study. b m Theexistenceofadatabaseitselfisavaluablemeansof Nu AAAbbbooouuuttt identifyingrealorpotentialerrors,andhenceofimproving rrriiiggghhhttt the accuracy and quality of the whole body of data. For Too low Too high example,whereestimatesforthesametaxon(withthesame chromosome number) disagree greatly this suggests an error. Further, where a body of data for a taxon shows close agreement except for one major departure, this iden- tifiestheoutlierasalmostcertainlyincorrect.Forexample, 1C DNA amount in the Appendix table, the 2C-value for diploid Acacia dealbata (1(cid:3)7 pg) reported by Blakesley et al. (2002) is 60 B similar to that reported by Bukhari (1997) of 2C = 1(cid:3)6 pg (listed in Bennett et al., 2000), but both values differ con- siderablyfromthe2C-valueof2(cid:3)9pgreportedbyMukerjee and Sharma (1993b) (see Notes to the Appendix bb). Another example concerns Brachypodium distachyon. es 40 ci In 1991, the PhD thesis of Shi reported a 1C-value of pe s 0(cid:3)15 pg, but later Shi et al. (1993) gave its 1C-value as of 0(cid:3)3 pg. To resolve this discrepancy, RBG, Kew obtained er b some original material studied by Shi and estimated its m 1C-value to be 0(cid:3)36 pg, confirming the larger C-value for Nu 20 this species (see also footnote br). Thus, real errors can be identifiedwithcertainty,andpotentialerrorsflaggedupfor users in cautionary footnotes following Appendix tables. DNA C-values in angiosperms vary approx. 1000-fold 0 (over three orders of magnitude) from approx. 0(cid:3)1 pg to 0–0·10 0·11–0·20 0·21–0·30 over 100 pg. It is, therefore, often useful to know whether 1C DNA amount range (pg) aspecies’1CDNAamounthasapproximately0(cid:3)1,1,10or 100pg,evenifthereisstilluncertaintyregardingwhethera FIG.3. (A)ExpectederrorvariationinalargepopulationofDNAC-value species with approximately 1 pg is really closer to 0(cid:3)8 pg estimates for one genotype as underestimates (in the lower tail) and overestimates (in the upper tail) surround more accurate, intermediate, than to 1(cid:3)2 pg (an error 620 %). In terms of its predictive genomesizeestimates.(B)HistogramshowingfrequencyofC-valuesfor valueinnucleotypiccorrelations,suchanerrorstillpermits the85smallestspeciesinthedatabaseorAppendix. useful conclusions to be drawn. The Arabidopsis com- munity laboured long under the misapprehension that its 1C DNA amount was approx. 70 Mb (Leutwiler et al., draw important conclusions, to make valuable predictions, 1984), and later approx. 100 Mb (Meyerowitz, 1994), and as a basis for necessary planning. when in reality it is much higher (about 157 Mb, Bennett et al., 2003). The level of inaccuracy involved (approx. WhatisthesmallestreliableC-valueforanangiosperm? 50–100 %) was considerable, yet it did not prevent the selection of Arabidopsis as the model plant for first The above examples show how seeing data in the complete genome sequencing, in no small part on the comparative context of the database can help to identify basis of its ‘small genome size’ (NSF, 1990; Somerville real or potential errors in particular species. It can also and Somerville, 1999). The Convention on Biological facilitate broader enquiries such as ‘what is the smallest Diversity (United Nations Environment Programme, reliableC-valueforanangiosperm?’.Again,thecomparat- 1992) noted the need to make biodiversity data available, ive approach has enabled researchers to be active in iden- despite imperfections; a view which merits support tifyingpotentialerrorsinspecieswiththesmallestreported (Bennett, 1998). Thus, it is better to list available C-value C-values, and to be transparent in correcting mistakes. datasubjecttoerrors,untilimproveddatawithfewererrors Because of error variation, a population of 1C-value become available. The body of data is needed by the estimates for one taxon should vary according to a normal scientific community and can clearly already be used to curve, so those in the lower tail are all too low (Fig. 3A). Bennett and Leitch — Nuclear DNA Amounts in Angiosperms 53 TABLE2. The24lowestangiosperm1CDNAestimatesamong The1C-valueweobtainedwasaround0(cid:3)24pg(almostfive- data listed inthe present Appendix table and the Angiosperm fold the earlier report). This is in close agreement with DNA C-values database (release 4.0, January2003) independent estimates made elsewhere (e.g. see Bennett and Leitch, 2005; Johnston et al., 2005). Taxon 1C(pg) Originalreference Once the underestimates for Arabidopsis thaliana and Cardamine amara are discounted, few 1C-values of Arabidopsisthaliana 0.051 Francisetal.(1990) 0(cid:3)125 pg or below remain for other species. One is the Cardamineamara 0.055 BandSR(pers.comm.1984) 1C estimate of 0(cid:3)125 pg for Rosa wichuriana, estimated Arabidopsisthaliana 0.073 Leutwileretal.(1984) Fragariaviridis 0.108 AntoniusandAhokas(1996) using callus material from Dr Andy Roberts (Bennett and Rosawichuriana 0.125 BennettandSmith(1991) Smith 1991). This value seemed questionably low in the Aesculushippocastanum 0.125 Bennettetal.(1982) context of the database, especially as it became clear that Arabidopsisthaliana 0.128 ArabidopsisGenomeInitiative(2000) culturing may induce stain inhibitors. This concern led to Sedumalbum 0.145 Hart(1991) a new collaboration with RBG, Kew using non-callous Arabidopsisthaliana 0.150 ArumaganathanandEarle(1991) Carexnubigera 0.150 Nishikawaetal.(1984) material, and our doubts were confirmed when it was Carexpaxii 0.150 Nishikawaetal.(1984) re-estimated as 1C = 0(cid:3)575 pg (Yokoya et al., 2000). Epilobiumpalustre 0.150 BandSR(pers.comm.1984) Perhaps all estimates below 0(cid:3)125 pg should be Hypericumhirsutum 0.150 Hanson,LeitchandBennett doubted until confirmed. Another candidate was Aesculus (pers.comm.2002) Thlaspialpestre 0.150 BandSR(pers.comm.1984) hippocastanum whose 1C-value was listed as 0(cid:3)125 pg Arabidopsisthaliana 0.153 Bennettetal.(2003) (Bennett et al., 1982). This material is rich in tannins Arabidopsisthaliana 0.160 Bennettetal.(2003) andalikelycandidateforunderestimatingitsDNAamount Arabidopsisthaliana 0.160 Galbraithetal.(1991) (Noirotetal.,2000,2005).Recentworkusingflowcytome- Arabidopsisthaliana 0.165 Galbraithetal.(1991) tryatRBG,Kew,showedthatthe1C-valueof0(cid:3)125pgwas Arabidopsisthaliana 0.167 KrisaiandGreilhuber(1997) Arabidopsisthaliana 0.167 Bennettetal.(2003) clearlyanunderestimate,asthetruevalueisapprox.0(cid:3)60pg Amoreuxiawrightii 0.168 Hansonetal.(2001a) (L.Hanson,RBG,Kew,pers.comm.).With0(cid:3)125pgforA. Arabidopsisthaliana 0.170 Galbraithetal.(1991) hippocastanum rejected, only one estimate below 0(cid:3)14 pg Arabidopsisthaliana 0.175 BennettandSmith(1991) remains,namely0(cid:3)11pgfortheGreenstrawberry,Fragaria Arabidopsisthaliana 0.175 MarieandBrown(1993) viridis. Since there is considerable interest in knowing the smallest possible angiosperm genome, checks to establish whether this estimate is robust are now urgently required. Such values are lost in the frequency histogram for all WhatistheminimumC-valueforafree-livingangiosperm angiosperm C-value estimates except at its lowest tail andotherfree-livingorganisms? where some of the lowest C-values claimed are expected to be too low. This expectation is strongly supported in Suchcomparative approachescan alsofacilitatebroader practice, as shown below. There are 53 1C estimates in questions such as: ‘what is the minimum genome size in the Angiosperm DNA C-values database or the present angiosperms and other free-living organisms?’. There is a Appendix with 0(cid:3)21–0(cid:3)30 pg, 29 with 0(cid:3)11–0(cid:3)20 pg, minimum compendium of nuclear genes essential for the but only three with 0(cid:3)10 pg or below (Fig. 3B). Table 2 life of any organism. This concept was behind Craig liststhe24lowestestimateslistedwith0(cid:3)175pgorless,but Venter’s declared intension to synthesize from scratch a how robust are they? minimal bacterial genome (Check, 2002), and a project Thirteen of the 24 estimates in Table 2 are for for a minimal eukaryote genome may eventually follow. Arabidopsis thaliana. A comparative approach suggests Meanwhile we can only speculate on how small the that some, which featured among the lowest C-values minimum genome is for an angiosperm, and how closely reported for angiosperms, are too low. Thus several extantspeciesapproachtheminimum.Itis,ofcourse,below C-value estimates made by molecular means in the range thelowestrobustC-valueforthegroup,i.e.lessthan0(cid:3)16pg 0(cid:3)05–0(cid:3)125pg(Leutwileretal.,1984;Francisetal.,1990; established for Arabidopsis thaliana. The presence of six ArabidopsisGenomeInitiative,2000)arenowseenasgross other species with C-values of 0(cid:3)15–0(cid:3)169 pg in Table 2 underestimates,whilemanyothersintherange0(cid:3)15–0(cid:3)18pg stronglysupportsthisconclusion.Theestimate(s)ofapprox. areshowntospanthetruevalueofabout0(cid:3)16pg(Bennett 0(cid:3)108 pg for Fragaria viridis may indicate a minimum et al., 2003). C-value for extant angiosperms of about 100 Mb, but if After discounting the 1C estimate for Arabidopsis so, is it a diploid, or a polyploid with three or more even thaliana of 0(cid:3)051 pg by Francis et al. (1990), the next smaller ancestral genomes? smallest estimate listed is 0(cid:3)055 pg for Cardamine amara Whilsttherobust1C-valueforA.thalianais0(cid:3)16pg,this (communicatedfromS.R.Bandin1984).Withonlyathird includes>25%ofrepeatedDNA(Bennettetal.,2003)and theDNAamountofitsrelatedcruciferA.thaliana(0(cid:3)16pg), analysis of sequenced regions shows that >70 %ofcoding itseemedsuspiciouslylow.Cardamineamaraseedcannot genesareduplicated(Bowersetal.,2003).Thus,intheory, survive drying, so it is unavailable from seed banks. aminimalgenomewithoutduplicatedcodinggenesorrep- However, we recently used flow cytometry to compare etitive DNA should not exceed approx. 50 Mb. Currently, diploidC.amaracollectednearSheffieldwithseveralcali- there is no robust 1C estimate below 0(cid:3)1 pg for an angio- brationstandardsincludingA.thalianaecotype‘Columbia’. sperm, but if any of the seven species with C-values 54 Bennett and Leitch — Nuclear DNA Amounts in Angiosperms TABLE 3. Robust minimum 1C-value estimates for several widely different groups of free-living, multicellular, higher organisms obtained by genome sequencing (*), other best practice techniques, or static cytometry using the fluorochrome DAPI for algae1 Group Species Mb Originalreference ANIMALS Nematode Caenorhabditiselegans 100* C.eleganssequencingConsortium(1998) Platyhelminthes(flatworms) Stenostomumbrevipharyngium 59 Gregoryetal.(2000) Crustacea Scapholeberiskingii(waterflea) 157 Beaton(1988) Annelid Dinophilusgyrociliatus(polychaeteworm) 59 Soldietal.(1994) Tardigrades(waterbears) Isohypsibiuslunulatus 78 RediandGaragna(1987) Insect Peristenusstygicus 98 TRGregory(pers.comm.) Arachnid Tetranychusurticae(spidermite) 78 TRGregory(pers.comm.) Urochordates(tunicates) Oikopleuradioica 72 Seoetal.(2001) PLANTS Chlorophyta(greenalga) Caulerpapaspaloides 88 Kapraun(2005) Rhodophyta(redalgae) Heydrichiawolkerlingii 69 Kapraun(2005) Phaeophyta(brownalgae) Stilophorarhizodes 98 Kapraun(2005) Bryophyte Holomitriumarboreum 167 Voglmayr(2000) Lycophyte Selaginellakraussiana 157 Obermayeretal.(2002) Angiosperm Arabidopsisthaliana 157 Bennettetal.(2003) 1Theuseofthebase-specificfluorochromeDAPIforestimatingDNAamountsmaybelessreliablethanusingintercalatingfluorochromessuchaspropidium iodide(e.g.Dole(cid:1)zzeletal.,1992). between0(cid:3)14–0(cid:3)15pgis atetraploid, this wouldindicate a If so, there is a reasonable prospect that the number of minimum genome sizeinextanttaxa ofapprox.75Mb,or species with a C-value estimate will reach, or significantly approx. 50 Mb if it is a hexaploid. exceed 7500 by 2014. Thecomparativeapproachisusefullyextendedtoinclude More important than the expected increase in total othergroupsoforganisms.Table3showsminimumC-value numbers is the predicted improvement in the spread of estimates for multicellular organisms in several widely new values across taxa, geographical regions and life differing groups obtained by genome sequencing or other forms,makingthesamplemorerepresentativeoftheglobal methods.Suchminimaforgroupsasdiverseasnematodes, angiospermflora,basedoncarefultargetingtoidentifyand insects, algae and angiosperms range from 59 to 160 Mb. fill knowledge gaps. The next decade should see almost Thus, the minimum C-value known in extant free-living completerepresentationforfamilies,andagreatlyincreased multicellular higher organisms is around 60 Mb. All may representationforgenera(especiallyinmonocots),aswork bediploidized paleopolyploids(Wendel,2000), butexcept focusesincreasinglyatthistaxonomiclevel.Representation for one early and unconfirmed report of a 1C-value of at the generic level is currently approx. 1042 out of an approx. 39 Mb in a most simple multicellular placozoan estimated 14000 genera (7(cid:3)4 %) and is targeted to rise to animal (Ruthmann and Wenderoth, 1975) there is no evi- 10 % by 2009, and might reasonably be expected to denceforextantdiploidmulticellulareukaryoticlifeforms approach 15 % within a decade. Moreover, this may withonly40–50Mb.Thistantalizingpossibilitywillbean approach 100 % for monocots, as they are targeted for interesting driver for new work to find a first angiosperm holisticgenomicstudies(includingC-values)fortheglobal whose 1C-value is <100 Mb, or a first free-living multi- Monocot Checklist Project (Govaerts, 2004). cellular plant or animal with a robust 1C-value <50 Mb. Recent experience shows that identifying a gap and setting a target may still not provoke the work needed to fill it. Positive monitoring of trends in published C-value PROSPECTS FOR THE NEXT TEN YEARS datamayalsoberequiredtoachieveasignificantchangein Apart from better defining the limits of genome size researchactivity(e.g.aswiththeleveloffamilyrepresenta- variation, what key developments are targeted, or likely, tion in angiosperms; Hanson et al., 2003). Thus it will be to occur in angiosperm genome size research in the next importanttomonitorby2009whetherthegapingchasmsin decade? therepresentationofAfrican,SouthAmericanandChinese The first concerns the expected progress to increase the florasnotedpreviouslyhaveyetresultedinasignificantrise total number and representation of angiosperms in the in first estimates for taxa from thoseregions.If not,then a C-values database. As noted above, estimating first values majoreffortwillbeneededtocorrectthis.Thesameapplies for species reached a historic high during recent years toothergroupsofplantsthathavebeenidentifiedaspoorly (Fig. 1). At least 1700 such values were added in 1997– represented in the Angiosperm DNA C-values database 2002, and the total number of species’ C-value estimates (e.g.halophytes,parasiticspeciesandtheirhostsandtundra probably reached around 4300. In 2003 the second Plant species). Whilst less certain, there is a good prospect that Genome Size Workshop set a goal of estimating a further this vital process will occur in the next decade, driven by 1%(i.e.approx.2500species)inthenextfiveyears,anda theGenomeSizeInitiative(GESI:seeBennettandLeitch, similar target is likely for the following quinquennium. 2005).

Description:
species or subspecies already listed by Bennett and Smith. (1976, 1991) .. (belonging to the separate subspecies truncata) had the Fe. 298. Co riaria myrtifo lia. L. No. Co riariaceae. E c. 72. 8. P. 326. 0 . 30 . 71 . 3. 378. O. J. Fe. 299. Co smos atrosan guineus l. No. Co mpositae. E. 4. 8. —
See more

The list of books you might like

Most books are stored in the elastic cloud where traffic is expensive. For this reason, we have a limit on daily download.