Transcriptome analyses to investigate symbiotic relationships between marine protists Sergio Balzano, Erwan Corre, Johan Decelle, Roberto Sierra, Patrick Wincker, Corinne da Silva, Julie Poulain, Jan Pawlowski, Fabrice Not To cite this version: Sergio Balzano, Erwan Corre, Johan Decelle, Roberto Sierra, Patrick Wincker, et al.. Transcriptome analyses to investigate symbiotic relationships between marine protists. Frontiers in Microbiology, 2015, 6, pp.98. 10.3389/fmicb.2015.00098. hal-01143962 HAL Id: hal-01143962 https://hal.sorbonne-universite.fr/hal-01143962 Submitted on 27 Apr 2015 HAL is a multi-disciplinary open access L’archive ouverte pluridisciplinaire HAL, est archive for the deposit and dissemination of sci- destinée au dépôt et à la diffusion de documents entific research documents, whether they are pub- scientifiques de niveau recherche, publiés ou non, lished or not. The documents may come from émanant des établissements d’enseignement et de teaching and research institutions in France or recherche français ou étrangers, des laboratoires abroad, or from public or private research centers. publics ou privés. Distributed under a Creative Commons Attribution| 4.0 International License ORIGINALRESEARCHARTICLE published:17March2015 doi:10.3389/fmicb.2015.00098 Transcriptome analyses to investigate symbiotic relationships between marine protists SergioBalzano1,2*†,ErwanCorre3,JohanDecelle1,2,RobertoSierra4,PatrickWincker5, CorinneDaSilva5,JuliePoulain5,JanPawlowski4 andFabriceNot1,2 1UMR7144,UniversitéPierreetMarieCurieUniversitéParis06,SorbonneUniversités,StationBiologiquedeRoscoff,Roscoff,France 2CentreNationaldelaRechercheScientifique,UMR7144,StationBiologiquedeRoscoff,Roscoff,France 3CentreNationaldelaRechercheScientifiqueandUniversitéPierreetMarieCurie,FR2424,ABiMS,StationBiologique,Roscoff,France 4DepartmentofGeneticsandEvolution,UniversityofGeneva,Geneva,Switzerland 5Commissariatàl’EnergieAtomiqueetauxEnergiesAlternatives,Genoscope,France Editedby: Rhizaria are an important component of oceanic plankton communities worldwide. A ManuelMartinezGarcia,University numberofspeciesharboreukaryoticmicroalgalsymbionts,whicharehorizontallyacquired ofAlicante,Spain in the environment at each generation. Although these photosymbioses are determinant Reviewedby: for Rhizaria ability to thrive in oceanic ecosystems, the mechanisms for symbiotic IñakiRuiz-Trillo,InstitutdeBiologia interactions are unclear. Using high-throughput sequencing technology (i.e., 454), we Evolutiva(Commissariatàl’Energie AtomiqueetauxEnergies generatedlargeExpressedSequenceTag(EST)datasetsfromfourunculturedRhizaria,an Alternatives),Spain acantharian(Amphiloncheelongata),twopolycystines(Collozoumsp.andSpongosphaera MichaelCunliffe,MarineBiological streptacantha), and one phaeodarian (Aulacantha scolymantha). We assessed the main AssociationoftheUnitedKingdom, geneticfeaturesofthehost/symbiontsconsortium(i.e.,theholobiont)transcriptomesand UK found rRNA sequences affiliated to a wide range of bacteria and protists in all samples, *Correspondence: SergioBalzano,Departmentof suggesting that diverse microbial communities are associated with the holobionts. A MarineOrganicBiogeochemistry, particularfocuswasthencarriedouttosearchforgenespotentiallyinvolvedinsymbiotic RoyalNetherlandsInstituteforSea processes such as the presence of c-type lectins-coding genes, which are proteins that Research,POBox59,1790ABDen play a role in cell recognition among eukaryotes. Unigenes coding putative c-type lectin Burg,Netherlands e-mail:[email protected] domains(CTLD)werefoundinthespeciesbearingphotosyntheticsymbionts(A.elongata, †Presentaddress: Collozoum sp., and S. streptacantha) but not in the non-symbiotic one (A. scolymantha). SergioBalzano,Departmentof Moreparticularly,phylogeneticanalysesgroupCTLDsfromA.elongataandCollozoumsp. MarineOrganicBiogeochemistry, on a distinct branch from S. streptacantha CTLDs, which contained carbohydrate-binding RoyalNetherlandsInstituteforSea motifstypicallyobservedinothermarinephotosymbiosis.Ourdatasuggestthatsimilarly Research,DenBurg,Netherlands to other well-known marine photosymbiosis involving metazoans, the interactions of glycans with c-type lectins is likely involved in modulation of the host/symbiont specific recognitioninRadiolaria. Keywords:radiolarian,ESTs,c-typelectins,plankton,photosymbiosis,Rhizaria INTRODUCTION as dinoflagellates (Gast and Caron, 1996), chlorophytes (Gast Marine planktonic microorganisms are major constituents of et al., 2000), and haptophytes (Decelle et al., 2012a). The oceanicecosystems,beingresponsibleforhalfofglobalprimary intracellular algae provide photosynthetic products to the host production and participating massively to carbon sequestra- which in turn creates a nutrient-rich micro-environment facil- tion to the deep ocean (Falkowski and Raven, 2013). Despite itating algal growth, the symbiotic interactions turning out to its recognized global impact, plankton represents one of the be critical for marine food webs and biogeochemical processes leastexploredcompartmentsofthebiosphere,particularly with (Stoecker et al., 2009). Protists without a clear role as photo- regards to biotic interaction such as symbiosis. Radiolaria are symbiontsmayalsobepresentwithinradiolariancells,asfound abundantandwidespreadunicellularplankton,theirlargecells, for the acantharian Acanthochiasma sp. (Decelle et al., 2012b). typically ranging from 0.1 to 1mm in size, exhibit mineral Similarly marine invertebrates such as corals, anemones and skeletons made up either of silica for Polycystinea or stron- sponges are known to harbor diverse microbial communities tiumsulfateforAcantharia.Radiolariacanaccountforasignif- alongwiththeirphotosymbionts(Radaxetal., 2012;Sun etal., icantproportionofplanktoncommunitiesinthewatercolumn 2014). (Dennett et al., 2002; Nimmergut and Abelmann, 2002) and Well-knownphotosymbioticinteractionsinthemarineenvi- are considered active predators (Swanberg and Caron, 1991). ronmentincludethoseoccurringbetweenCnidariaanddinoflag- A number of radiolarian species exhibit also a mixotrophic ellatefromthegenusSymbiodinium(LaJeunesse,2001;Schwarz metabolism,hostingeukaryoticphotosymbioticmicroalgaesuch et al., 2008; Weis, 2008), and to a lesser extent those involving www.frontiersin.org March2015|Volume6|Article98|1 Balzanoetal. Rhizariatranscriptomeandphotosymbiosis sponges (La Jeunesse, 2001) and benthic Foraminifera (Lee, and amplification was performed using the SMARTer RACE 2006).Littleisknownabouttheinteractionmechanismsbetween cDNA amplification kit (TaKaRa BIO/Clontech, Mountain the host and the symbionts: lectin/glycan interactions were View, CA) followed by the library preparation protocol avail- found to be involved in host/symbiont recognitions for differ- able on Matzlab website (http://www.bio.utexas.edu/research/ ent cnidarian taxa (Wood-Charlson et al., 2006; Vidal-Dupiol matz_lab/). Approximately 2.5μg of the cDNA pool from pre- et al., 2009; Wood-Charlson and Weis, 2009). c-Type lectins paredlibrarieswereusedforatitrationrunusingone-quarterof are carbohydrate binding proteins involved in a range of dif- aplateforeachsampleontheRoche454GenomeSequencerFLX ferent functions including cell adhesion, pathogen recognition usingGS-FLXTitaniumseriesreagents.Sequencingwascarried andphagocytosis(ZelenskyandGready,2005).Althougharole out at Genoscope Institute (Evry, France). Data have been sub- of c-type lectins in symbiosis has been demonstrated for meta- mittedtotheGenBankundertheSequenceReadArchive(SRA) zoa (Meyer and Weis, 2012) and to a lesser extent fungi (Singh numbers SRR1734678 (A. elongata), SRR1744093 (Collozoum andWalia,2014),theseproteinsarepresentinarangeoforgan- sp.),SRR1734679(S.streptacantha),and SRR1734688(A.scoly- ismsandgenomecomparisonbasedonhidden-Markovmodels mantha). suggest that putative c-type lectins are also present in sev- eral protists (http://supfam.org/SUPERFAMILY/cgi-bin/taxviz. ASSEMBLY,DATACLEANING,ANDANNOTATION cgi?sf=56436).PlanktonicRhizariaareessentiallyunculturedand Raw 454 reads were cleaned and assembled using Newbler rd tedious to collect, making investigations of symbiotic interac- 454_mapasm_08172010 (454.com/products/analysis-software). tionsdifficult.Consequently,themechanismsforhost/symbiont Sequencing adaptors were trimmed from the dataset using recognition in Rhizaria are essentially unknown. In this con- the–vt option and reads were finally assembled in contigs text,ourprimaryobjectivewastoprovideageneraloverviewof using the Newbler option -cDNA, which specifies a cDNA thegenesoccurringinthesespeciesforwhichgenomicdataare based assembly project. All sequences shorter than 200bp were notavailableandtoidentifygenespotentiallyinvolvedinsymbi- removed from the dataset. Seqclean (https://sourceforge.net/ otic processes. To achieve our goal we produced and compared projects/seqclean) was then used to remove sequences of low large Expressed Sequence Tag (EST) datasets from three photo- quality. Overall the number of contigs recovered was 3252 for symbioticRadiolaria,anacantharian(Amphiloncheelongata)and A.elongata,4888forCollozoumsp.,3229forS.streptacantha,and two polycystines (Collozoum sp. and Spongosphaera streptacan- 3243forA.scolymantha(Table1). tha),andoneheterotrophicRhizaria,thephaeodarianAulacantha Contigs and singletons associated to rRNA were identified scolymantha(Cercozoa). and extracted from the dataset by a blastn similarity search (Altschul et al., 1997) against the NCBI-nt database and the MATERIALSANDMETHODS following keywords were searched within the description line SAMPLECOLLECTION of the blast results: 5.8S, 16S, 18S, 23S, 26S, 28S, ITS, rRNA, ThePhaeodaria(Cercozoa)A.scolymantha,andthePolycystinea rDNA, ribosomal DNA, ribosomal RNA, LSU, and SSU. Reads (Radiolaria) Collozoum sp. and S. streptacantha were collected matchingwithreferencesequencescontainingoneofthosekey- withaplanktonnetinthebayofVillefranche-sur-mer(43◦41N, words, sharing >80% identity with them, and an alignment 7◦18 E, Mediterranean Sea) whereas the Acantharia A. elongata length >100bp were considered as ribosomal sequences. The ◦ ◦ wascollectedintheGulfofEilat(29 32N,34 57E,RedSea). remaining sequences were further screened for the presence of The specimens were sorted manually from the plankton mix rRNA reads as described in Radax et al. (2012). Unigenes were usingastereomicroscopebypipettingoutofthesamples,oneby blastedagainstbothLSUandSSUdatabases(releases115,http:// one, the cells of interest. The taxonomically homogeneous pool www.arb-silva.de/download/archive) (Quast et al., 2013) using ofsortedcells(onecolonywasisolatedforCollozoumsp.,about theSILVANGSplatform(https://www.arb-silva.de/ngs/)andno 150cellsforA.elongata,and50cellsforbothA.scolymantaand additional sequences related to rRNA were found in the in the S. streptacantha) were maintained at least 2h (up to 8h) in a dataset.TheremainingsequenceswerethenconsideredasmRNA petri dish filled with 0.22μm-filtered seawater. Since digestion andannotatedbyhomologyusingblastxsimilaritysearchagainst of preys usually occurs within the time frame of minutes (Not theNCBI–nrdatabase(version04/2012). personal observation) this procedure allowed minimizing sam- The taxonomic composition of both ribosomal and non- ple contamination by marine microorganisms other than those ribosomal sequences was inferred from the blast results using closelyassociatedwiththesortedorganisms.Cellswerethencol- Megan (Huson and Mitra, 2012). The GC ratio distribution lectedindividuallyfromthepetridishandrapidlytransferred(ca. fromthedifferentsequencesofourputativemRNAdatasetwas 1min) through two successive rinsing steps in clean, 0.22μm- inferredusingahomemadepythonscript(github.com/frederic- filtered seawater, to a sterile 1mL tube. RNAlater was added to mahe/bioinformatics-scripts/blob/master/GC_content.py). ◦ thetubewhichwasstoredovernightat4 Candthentransferred to−80◦Cuntilfurtheranalysis. GENEONTOLOGY(GO)TERMDISTRIBUTIONANDMETABOLIC PATHWAYS RNAISOLATION,cDNALIBRARYPREPARATIONANDSEQUENCING Geneontologytermswereassignedtounigenesagainstthenon- Total RNA was isolated from the samples using the NucleoSpin redundantproteindatabaseNR(April2012)usingtheBlast2GO XS (Macherey-Nagel, Germany) following the manufacturer tool (Conesa et al., 2005). Overall our EST were assigned to instructions. Immediately after RNA isolation, cDNA synthesis 6264 different GO terms (Supplementary Table S1) which were FrontiersinMicrobiology|MicrobialPhysiologyandMetabolism March2015|Volume6|Article98|2 Balzanoetal. Rhizariatranscriptomeandphotosymbiosis Table1|StatisticsoftheESTrawreadsandassembly. A.elongata Collozoumsp. S.streptacantha A.scolymantha Total Totalsequenced Rawreads 195,890 214,475 220,239 195,070 825,674 Cleanedreads 138,867 121,079 120,924 126,494 507,364 BP 5.0×107 4.2×107 4.2×107 4.1×107 1.7×108 Averagelength 362 337 348 369 Medianlength 365 324 338 378 Afterassembly Unigenes 14,210 42,784 25,590 42,251 124,835 BP 6.5×106 1.6×107 1.0×107 1.7×107 4.9×107 GC% 46.3 44.0 50.9 40.6 Coverage 8.6 3.3 5.2 3.2 Contigs No 3252 4888 3229 3243 14,612 Assemblysize(BP) 2.8×106 3.4×106 2.8×106 2.9×106 1.2×107 %BPassembled 42.4 21.6 26.9 16.9 Averagelength 858 699 883 893 Medianlength 663 544 606 637 Annotations Blastn 410 153 1486 409 2458 Blastx 13,800 42,631 24,104 41,842 122,377 UnigenesannotatedtoGO 2798 4839 4662 6938 19,237 GOterms 2002 3027 3239 4318 6077 UnigenesannotatedtoKEGG 1303 1665 2264 2047 7279 KEGGpathways 98 101 105 111 123 then grouped into 117 categories (Supplementary Table S2) while in symbiosis with Symbiodinium spp., compared to non- using the web-based tool CateGOrizer (Zhi-Liang et al., 2008) symbioticstagesofthesamespecies(IversenandSeuthe,2011). with the GO_slim classification method (http://geneontology. Similarly we analyzed the occurrence of candidate symbiosis org/page/go-slim-and-subset-guide). To reconstruct the known genes which were previously found in different anemone and metabolic pathways occurring in our samples, the assembled coralspecies(MeyerandWeis,2012).Incoral/dinoflagellatesym- unigenes were annotated with corresponding enzyme commis- biosis these genes are involved in several functions such as pat- sion(EC)numbersagainsttheKyotoEncyclopediaofGenesand tern recognition, cell adhesion, vesicular trafficking, regulation Genomes(KEGG)database(KanehisaandGoto,2000)usingthe ofincominglight,apoptosis,nutrientandmetabolitetransport, Blast2Goprogram(http://www.blast2go.com/b2ghome). lipid storage and transport, and response to oxygen reactive species(MeyerandWeis,2012). CLUSTERING c-TYPELECTINEXTRACTIONANDPHYLOGENETICANALYSES Similarity between our different samples was evaluated by clus- The presence of genes putatively expressing c-type lectins was tering the four mRNA libraries. Sequences were translated evaluatedbyatblastnsimilaritysearchofallthesixpossibleread- into the six possible reading frames and clustered using Cd- ingframesofthefoursamplesmRNAagainstaselectionofc-type hit (Li and Godzik, 2006) with 50% sequence identity as lectinsdomainsdownloadedfromGenbankusingGeneioussoft- describedinToselandetal.(2013).Clusterswereexaminedand ware (Drummond et al., 2011). The lectins used in this search sortedusingaRubyscript(https://github.com/georgeG/bioruby- werederivedfromthescleractiniancoralsPocilloporadamicornis cd-hit-report). A Venn diagram was constructed to compare (Vidal-Dupiol et al., 2009) and Acropora millepora (Kvennefors the number of clusters shared by each specimen, using the et al., 2008), the sea anemone Nematostella vectensis (Wood- VennDiagram package with the R version 2.15.0 (http://cran.r- CharlsonandWeis,2009),andthenematodesLaxusoneistusand project.org). Stilbonema magnum (Bulgheresi et al., 2011). Several unigenes from S. streptacantha, and two unigenes from both A. elongata SYMBIOSIS-RELATEDGENESSEARCH andCollozoumsp.showedsimilaritieswithreferencec-typelectin To evaluate the occurrence of ESTs likely coding for symbiosis- domains.AllbutoneunigeneextractedfromourS.streptacantha related proteins, keyword searches were carried out on the librarycontainedseveraldifferentputativec-typelectindomains. description lines of both blastx and GO results. Enzymes and According to previous studies (Zelensky and Gready, 2005) we receptors involved in different functions to the interactions defineac-typelectindomain(CTLD)asasequencecontaining between cnidaria and dinoflagellates were searched. Specifically fourconservedcysteinemotifsandWIGL-likeregions.Unigenes wesearchedforgenesoverexpressedinthecoralAcroporapalmata fromS.streptacanthacontainingmultipleCTLDwerethensplit www.frontiersin.org March2015|Volume6|Article98|3 Balzanoetal. Rhizariatranscriptomeandphotosymbiosis 10,000 Amphilonche elongata Collozoum sp. 450 ns 7500 N o Singletons o singlet 5000 Contigs 300 of co No of 2500 150 ntigs 0 0 0 500 1000 1500 2000 0 500 1000 1500 2000 10,000 Spongosphaera streptacantha Aulacantha scolymantha 450 s on 7500 N No of singlet 25500000 135000 o of contigs 0 0 0 500 1000 1500 2000 0 500 1000 1500 2000 Length (bp) Length (bp) FIGURE1|Distribution of contig and singleton lengths for 2400bp, n=220 for A. scolymantha, max length 11,307, n=110 for A. elongata, Collozoum sp., S. streptacantha, and A. scolymantha. Collozoum sp., max length 4000, n=75 for S. streptacantha, max length Contigs longer than 2000bp (n=139 for A. elongata, max length 30,120) are not shown. in the different domains and all the 28 domains obtained from and Collozoum sp. holobionts were the most poorly assembled S.streptacanthawerethenalignedtochecksimilarityandremove with a coverage of 3.2X and 3.3X and 21.9 and 26.6% ofbp redundant sequences. We finally obtained eight unique c-type assembled,respectively(Table1).Most(93–97%)ofourcontigs lectin domains from S. streptacantha, which were aligned with were ≤2000bp, and the median contig length ranged from 544 putativec-typelectindomainsfromA.elongata,Collozoumsp.as (Collozoum sp.) to 663 (A. elongata, Table1) bp and the contig wellasreferencec-typelectinsfrommarinesymbiosisdescribed lengthpeakedat500bpforallthesamples(Figure1). aboveandsevenputativec-typelectinsfromotherprotistssuchas Apicomplexa(Cryptosporidiumparvum,Cryptosporidiummuris, RIBOSOMALrRNA Gregarina niphandrodes, Neospora canicum), Pelagophyceae All the unigenes obtained were blasted against the GenBank (Aureococcus anophagefferens), Cryptophyta (Guillardia theta), nucleotidedatabase(blastn)todiscriminatebetweenrRNAand and Foraminifera (Reticulomyxa filosa). Sequences were aligned putativemRNAsequences.BlastnhitsfromrRNAreadsincluded usingclustalomega(Sieversetal.,2011)andthealignmentwas both eukaryotic (18S, ITS, and 28S rRNA) and as prokaryotic curedusinggblocks(Castresana,2000).Thefinalalignmentcon- (16S,ITSand23SrRNA)genes(Table2).ThenumberofrRNA tained 43 sequences with 246 unambiguously aligned positions, unigenes ranged from 153 (Collozoum sp.) to 1406 (S. strepta- phylogenetic inferences were conducted using phyml (Guindon cantha). Among rRNA unigenes 10% (S. streptacantha) to 33% et al., 2010). The maximum likelihood results are shown as an (A.elongata)wereaffiliatedtoprokaryotes.Prokaryoticunigenes unrootedtree. were dominated by γ-Proteobacteria and Bacteroidetes, with α-Proteobacteria and Planctomycetes also frequently occurring RESULTS throughoutoursamples(Table2). ASSEMBLAGEQUALITY EukaryoticrRNAunigenesincludedfrom13%(A.scolyman- Atotalof825,674rawreadsweregenerated,andafterremovalof tha) to 71% (S. streptacantha) of rhizarian sequences. Hundred badqualityandshort(<200bp)sequencesweobtained507,364 and one rRNA unigenes from A. elongata were affiliated to sev- reads for a sequencing size of 1.7×108bp (Table1). Both the eralAcanthariagenera,withonlyfiveunigenesassociatedwiththe number of reads and the sequencing size did not differ signifi- genusAmphilonche(SupplementaryTableS3).Polycystineawere cantly(<20%)betweenthedifferentholobionts.Readsassembled poorly represented in the rRNA library of Collozoum sp. since using Newbler led to a number of unigenes (contigs + sin- only10outof103eukaryoticrRNAreadssharedhighsimilarities gletons) from 14,210 (A. elongata) to 42,784 (Collozoum sp.). with Polycystinea. In contrast, about 25% of rRNA reads from Collozoum sp. yielded the highest number of contigs (4888) S.streptacanthawereaffiliatedtoPolycystinea(Table2),butnone and largest assembly size (3.4×106bp). A. elongata ESTs were of them could be affiliated to the genus Spongosphaera, very the best assembled in terms of coverage (8.1X) and propor- likely because reference sequences from this genus are missing tion of bp assembled (42.4%, Table1), whereas A. scolymantha inGenBank.RhizarianrRNAunigenesfromA.scolymanthawere FrontiersinMicrobiology|MicrobialPhysiologyandMetabolism March2015|Volume6|Article98|4 Balzanoetal. Rhizariatranscriptomeandphotosymbiosis Table2|Taxonomicdistributionofribosomal(rRNA)andmessengerRNA(mRNA)readsa. A.elongata Collozoumsp. S.streptacantha A.scolymantha rRNA mRNA rRNA mRNA rRNA mRNA rRNA mRNA α-Proteobacteria 2 7 1 47 5 30 8 19 β-Proteobacteria 0 11 3 15 0 6 2 5 γ-Proteobacteria 135 1645 16 242 53 1315 44 255 δ/ε-Proteobacteria 0 12 8 220 2 24 1 39 Unidentified 0 45 0 139 0 121 0 130 TotalProteobacteria 137 1720 28 663 60 1496 55 448 Bacteroidetes 0 22 5 167 2 51 9 304 Planctomycetes 0 0 2 5 1 0 7 11 Cyanobacteria 1 0 0 32 1 20 1 21 Firmicutes 0 32 2 169 0 38 2 178 Actinobacteria 0 26 3 39 0 53 2 52 Chlamidiae 0 0 0 24 0 0 3 23 Unidentifiedbacteria 0 277 0 824 0 575 1 999 Other 0 0 2 20 1 21 5 6 Totalbacteria 138 2077 42 1943 65 2254 85 2042 Acantharea 101 0 9 0 49 66 16 0 Cercozoa 32 0 2 0 119 0 28 7 Foraminifera 13 47 1 0 4 0 0 0 Polycystinea 12 0 10 0 521 0 2 0 Sticholonchida 10 0 1 0 7 0 0 0 UnclassifiedRhizaria 3 7 2 100 93 66 1 103 Gromiidae 0 0 0 0 0 0 4 0 TotalRhizaria 171 54 25 100 793 132 51 110 Apicomplexa 12 37 2 231 3 48 3 289 Dinophyceae 3 35 45 43 3 0 20 15 Perkinsea 0 6 0 40 0 42 0 49 Ciliophora 7 82 1 278 9 108 0 305 Unidentified 0 14 0 40 0 24 0 31 TotalAlveolata 22 174 48 632 15 222 23 689 Blastocystae 0 10 0 13 0 15 0 25 Diatoms 7 66 4 100 4 66 3 103 Developayella 0 0 0 0 2 0 1 0 Dictyochophyceae 0 0 0 0 2 0 0 0 Labyrinthulida 0 0 0 0 0 0 4 20 Oomycetes 0 34 0 171 1 78 15 225 Pelagophyceae 0 77 0 122 0 137 0 217 PXclade 0 22 0 81 3 44 2 107 Synurophyceae 0 0 1 0 0 0 0 0 UnidentifiedHeterokonts 0 38 0 127 0 85 0 155 TotalHeterokonts 7 247 5 614 12 425 25 852 Haptophyta 9 28 0 0 0 0 2 6 Cryptophyta 4 0 0 0 1 0 0 6 Excavata 0 65 17 261 1 78 0 269 Fungi 7 187 2 947 82 424 31 777 Hyphochytriomycetes 0 0 0 0 3 0 1 0 Metazoa 19 756 1 3101 42 2457 162 5730 Choanoflagellates 0 46 0 130 0 96 0 534 UnidentifiedOpisthokonta 0 346 0 1077 0 769 0 1675 Amoebozoa 2 64 0 209 0 84 2 578 Rhodophyta 2 0 0 0 0 0 0 0 Chlorophyta 2 91 2 0 31 137 1 2 (Continued) www.frontiersin.org March2015|Volume6|Article98|5 Balzanoetal. Rhizariatranscriptomeandphotosymbiosis Table2|Continued A.elongata Collozoumsp. S.streptacantha A.scolymantha rRNA mRNA rRNA mRNA rRNA mRNA rRNA mRNA Streptophyta 6 209 0 0 6 421 0 2 UnidentifiedArchaeoplastida 0 36 0 996 0 93 0 971 TotalArchaeoplastida 10 336 2 996 37 651 1 975 Unidentifiedeukaryotes 19 2285 3 7040 134 4349 20 6523 Totaleukaryotes 270 4588 103 15,107 1120 9687 318 18,724 Archea 0 0 0 0 0 0 0 11 Virus 0 0 0 10 0 29 0 12 Nohits 0 5782 0 22,583 0 10,238 0 18,303 Notassigned 0 1353 0 2988 0 1962 0 2754 Total 408 13,800 145 42,631 1185 24,170 403 41,846 %Rhizariasequences 42 0.4 17 0.2 67 0.5 13 0.2 aResultswereobtainedbasedonblastnandblastxsearch,respectively. mostlyaffiliatedwithCercozoa(Table2),withonlyoneoutof28 related to mRNA sequences from the cercozoan Bigelowiella sp. cercozoan unigenes assigned to A. scolymantha (Supplementary A minor proportion of the unigenes were assigned to Bacteria TableS3). (Table2). OthereukaryoticsequencesrecoveredwithintherRNAfrom our holobionts were mainly affiliated to Alveolata (Table2), GCCONTENT specifically Apicomplexa for A. elongata, ciliates for S. strep- We analyzed the GC content of our samples to assess whether tacantha and dinoflagellates for Collozoum sp. and A. scoly- our holobionts showed a plurimodal distribution reflecting the mantha. rRNA reads from known acantharian symbionts such occurrence of different microorganisms. The overall GC con- as Phaeocystis sp. (Decelle et al., 2012a) were found for tent of our samples ranged from 40.6% for A. scolymantha to A. elongata holobionts along with reads from other photosyn- 50.9%forS.streptacantha(Table1).Wethencompared,within thetic protists such as Bacillariophyceae, Prasinophyceae, and eachofourholobionts,theGC-contentdistributionfromthedif- Cryptophyceae(SupplementaryTableS3).S.streptacanthaholo- ferent domain level groups (Bacteria, Eukaryotes, no-hits, and biontalsocontained>100microalgalrRNAreadsassociatedwith not-assigned)andfoundthattheGCcontentfromourbacterial the Lotharella sp. (Chlorarachniophyceae) and some reads from mRNA unigenes differed from that of the eukaryotic unigenes, BacillariophyceaeandCryptophyceae. especiallyforS.streptacantha(SupplementaryFigureS1). However the GC-content of the eukaryotic mRNA from our PUTATIVEmRNA:TAXONOMICCOMPOSITION holobionts showed a unimodal distribution and we could not Putative mRNA represented the majority of our dataset, corre- discriminate the contribution of the different microorganisms sponding to 97% of the total number of unigenes for A. elon- withinourlibraries.Significantdifferencesoccurredbetweenthe gata, 99.7% for Collozoum sp., 88% for S. streptacantha, and different samples, including the taxonomically closest ones, our 98.3% for A. scolymantha. The putative mRNA unigenes cor- two polycystinean species, with most unigenes from Collozoum respond to transcripts occurring in our holobionts at the time sp.havingalowerGCcontentthanthemajorityofS.streptacan- of sampling and likely derive from the hosts, symbionts and/or thareads(Figure2). other microorganisms present within and around the rhizar- ian cells. Overall we blasted 122,377 putative mRNA unigenes GENEONTOLOGYANDKEGG against the GenBank protein database (blastx) to obtain a gen- Overall19,237unigenesfromourmRNAdatasetwereannotated eral annotation of the expressed genes and to assign a putative to a total of 6264 different GO terms which were then grouped metabolicfunctionbasedonsimilarity.MostofthemRNAuni- into 117 GOslim categories (Supplementary Tables S1,S2). The genes matched eukaryotic sequences or did not have significant most represented categories were related to catalytic activity hitswithknownreferencesequences(nohits).Themostrepre- and cell components (Figure3). The proportion of unigenes sentedmatcheswereassociatedwithlandplants(Streptophyta), expressed for each category was similar for the different sam- Metazoa and Fungi (Table2), very likely reflecting the most ples,preventingidentificationofGOcategoriesunder-expressed important occurrence of genomes from these groups in cur- in our non-photosymbiotic sample (A. scolymantha) and thus rentdatabases.Thescarcityofgenomicsequencesfromrhizarian likelyrelatedtophotosyntheticactivities. groupsotherthanForaminiferaandCercozoa(i.e.,nogenomes Moreover1303–2264unigenesfromourdifferentholobionts of Polycystinea) implied that A. elongata, Collozoum sp. and werealsoannotatedtoKEGGandatotalof123KEGGpathways S.streptacanthalibrariesincludedanumberoftranscriptsblast- were identified within our dataset (Supplementary Table S4). ing with Foraminifera. A few A. scolymantha unigenes were The most represented pathways were related to nucleotides, FrontiersinMicrobiology|MicrobialPhysiologyandMetabolism March2015|Volume6|Article98|6 Balzanoetal. Rhizariatranscriptomeandphotosymbiosis 600 500 Amphilonche elongata s Aulacantha scolymantha ne 400 e g ni 300 u of o 200 N 100 0 20 30 40 50 60 70 GC content(%) 500 400 Collozoum sp. es Spongosphaera streptacantha n e 300 g ni u of 200 o N 100 0 20 30 40 50 60 70 GC content(%) FIGURE2|GCcontentofputativeeukaryoticmRNAunigenesforthefourspecimensanalyzedinthepresentstudy. A. elongata Collozoum sp. S. streptacantha A. scolymantha metabolism R metabolism A Thi metabolism catalytic activity Ox. Phosphorylation B cell development Y metabolism Nitrogen metabolism intracellular C fixation pathways in proks cell organization and biogenesis Citrate cycle (TCA cycle) Aminoacyl-tR biosynthesis binding Pyruvate metabolism nucleic acid metabolism Methane metabolism Val, leu, and ile degradation biosynthesis Glycolysis / Gluconeogenesis cell communication Propanoate metabolism transport Fatty acid metabolism Glyoxyl. and dicarboxyl. Met. protein metabolism T cell receptor sigling pathway hydrolase activity Glyc, ser, and thr metabolism Butanoate metabolism morphogenesis Glutathione metabolism transferase activity Ala, asp and glm metabolism Phe metabolism cytoplasm Pentose phosphate pathway 0 2 4 6 8 10 0 5 10 15 20 % Unigenes % Unigenes FIGURE3|Comparisonofthe(A)GeneontologytermssummarizedintoGoslimcategoriesand(B)KEGGassignmentsfromthetranscriptomesof theholobiontsanalyzedinthepresentstudy. thiamine and nitrogen metabolisms as well as citrate cycle and phosphorylation and nitrogen metabolisms was significantly carbon fixation pathways in prokaryotes (Figure3). A higher lower for A. elongata. Consistent with GOslim data we could proportion of Collozoum sp. unigenes were related to purine not find any KEGG pathway associated with non-symbiotic and thiamine metabolisms compared to the other holobionts, A. scolymantha over-represented or under-represented with whereas the proportion of ESTs associated with oxidative respect to the other holobionts. Therefore, it was not www.frontiersin.org March2015|Volume6|Article98|7 Balzanoetal. Rhizariatranscriptomeandphotosymbiosis possible to identify genes or pathways likely associated with ThealignmentofputativeCTLDfromoursampleswithCTLD photosymbiosis. from other marine holobionts or putative CTLD from other protists consisted of four highly conserved cysteine residues as CLUSTERING well as four motifs (Supplementary Figure S3). The WIGL-like Themajorityoftheclustersidentifiedhere,basedona50%sim- motif which is crucial in the identification of CTLD (Zelensky ilarity, only include sequences from one sample (Figure4). For and Gready, 2005) was found to vary for our holobionts, being example83%oftheclusterscontainingsequencesfromA.elon- WIGV for A. elongata, WLGF for Collozoum sp. and WIGL, gatadonotcontainunigenesfromothersamples(Figure4),and WLGL,andWVGLforthedifferentCTLDfromS.streptacantha this percentage rises up to 94% for A. scolymantha. Very few (SupplementaryFigureS3).Theglucose/mannosebindingmotif clusters thus contain sequences from multiple samples. In spite EPN(ZelenskyandGready,2005)wasfoundonlyinS.streptacan- ofthetaxonomicrelatednessofourspecimensonly130clusters tha whereas such motif was replaced by DSS in A. elongata and include unigenes from all our samples whereas higher propor- by WSD in Collozoum sp. Similarly the galactose binding motif tionsoftheclustersaresharedbetweenA.elongataandCollozoum LND (Zelensky and Gready, 2005) was only found in S. strep- sp.(899),Collozoumsp.andS.streptacantha(1170),andA.elon- tacantha CTLD. In A. elongata the LND motif was replaced by gata and S. streptacantha (951) compared to the clusters shared VNDwhereaswecouldnotfindanequivalentmotifinCollozoum between A. scolymantha and any of the three other holobionts sp. (Supplementary Figure S3). The resulting phylogenetic tree (Figure4).Thesedifferencesmightberelatedtothelackofpho- reconstruction is shown as an unrooted tree (Figure5), and tosymbionts within A. scolymantha, in contrast with the other includesputativeCTLDextractedfromtheholobiontsaswellas samples. a range of reference sequences from Metazoa and protists. Five well-supportedc-lectingroupscanbeidentifiedfromthephylo- SYMBIOSIS-RELATEDGENESANDc-TYPELECTINDOMAINS genetictree.GroupAstrictlyincludescoralCTLDsthatcouldbe Wesearchedforthepresenceofgenespreviouslyfoundoverex- relatedtoseveralungroupedsequencesfromalveolates.Ingroup pressed in the symbiotic stages of cnidaria/dinoflagellate holo- BwefindtheprotistCTDLsofA.elongataandA.anophageffer- bionts(Kvenneforsetal.,2008;IversenandSeuthe,2011;Meyer ensmorecloselyrelatedtoothermetazoansCTDLsthantoother and Weis, 2012; Yuyama et al., 2012). A number of putative acantharians and apicomplexan protists, respectively. Group C mRNAreadsfromourdatasetarerelatedtogenescodinggluta- is almost exclusively composed by S. streptacantha except for a matedehydrogenase,glutathioneS-transferase,lectins,glutamine sequence from N. vectensis. Group D is an assembly of various synthetase, lipase, phospholipase and ferritin (Supplementary metazoanCTDLswiththepresenceoftheprotistG.theta.Finally, TableS5).Withinthesegenesweexaminedspecificallytheoccur- groupEcontainstheCTDLsofCollozoumsp.andthelectinsof renceoflectins.Overall142unigenesfromalloursamplesshared theSalpingoecarosettaandCyprinuscarpio. similarities with lectins (Table3). Their number ranged from Thephylogenyfoundheredoesnotreflectthetaxonomicposi- 19 (A. elongata) to 57 (S. streptacantha). We blasted reference tionofthespeciesfromwhichthesequenceswereextractedbut sequences from CTLDs involved in marine symbiosis (Wood- depictstherelatednessofthelectinproteinspresentinthespecies Charlsonetal.,2006;Vidal-Dupioletal.,2009;Bulgheresietal., shown(Figure5).Severalclusters,eachincludingsequencesfrom 2011) against our dataset of mRNA translated into amino acid both protists and Metazoa were indeed found. Putative CTLDs sequences in order to identify putative CTLD within our data. within each holobionts group together forming well-supported OveralleightsequencesfromtheholobiontsofA.elongata(two), clusters (100 support for A. elongata, 98 for Collozoum sp., and Collozoumsp.(two),andS.streptacantha(four)showedsimilari- 72 for S. streptacantha CTLDs). Contrarily to what would be tieswithCTLDs. expected, the putative CTLDs extracted from the radiolarian Collozoum sp. S. streptacantha A. elongata 43,338 (93 %) 21,211 (86 %) A. scolymantha 1170 899 713 13,720 184 135 34,825 (94 %) (83 %) 130 951 763 99 125 372 FIGURE4|Venndiagramforpooledsequenceclusteringbasedoncd-hit.Atotalof124,979clusterswereobtainedusing50%similarity.Valuesindicate thenumberofclustersrelativetoeachdiagramsection.Percentagesshowportionsofclusterswhichonlyoccurinonespecimen. FrontiersinMicrobiology|MicrobialPhysiologyandMetabolism March2015|Volume6|Article98|8 Balzanoetal. Rhizariatranscriptomeandphotosymbiosis Table3|Blastxhitsfortheunigenesrelatedtolectins. Proteintype A.elongata Collozoumsp. S.streptacantha A.scolymantha Agglutininisolectin1precursor 0 0 0 1 Cellulosebindingelicitorlectin 0 0 2 0 Chitinbindinglectin 0 0 1 0 c-typelectin 2 12 6 0 Dextranbindinglectin 0 0 0 1 Endoplasmicreticulumlectin 2 1 0 0 Fucolectin 0 0 2 0 Galectin 0 0 1 1 g-typelectin 1 2 1 0 Jacalinrelatedlectins 0 1 2 1 Lactosebindinglectin 0 2 0 0 Lectindomain-containingprotein 2 0 1 3 Lectinhomolog 0 0 0 1 Lectinprotein 2 0 0 0 Lectinreceptorkinase 0 3 1 0 Lectintype2 0 0 2 0 Legumelectinβcontainingprotein 0 1 1 3 Mannosebindinglectins 4 1 0 5 Rhamnosebindinglectin 0 0 0 2 Ricinblectin 5 1 31 13 Selectin 2 0 0 2 Sialicacidbindinglectin 1 1 0 0 Otherlectins 0 1 6 5 Total 21 25 51 33 %oftotalblastresults 0.15 0.05 0.09 0.05 holobiontsshowedtobemorecloselyrelatedtootherprotistsor Only, one rhizarian genome, from Bigelowiella natans metazoanlectinsthantoradiolariansthemselves(Figure5). (Chloroarachniophyta), has been fully sequenced to date (Bhattacharya et al., 2013) and other data only include par- DISCUSSION tially sequenced genomes and transcriptomes, mostly from HOLOBIONTSEQUENCING Foraminifera (Burki et al., 2010; Bulman et al., 2011; Habura Nowadays, high throughput sequencing approaches are largely et al., 2011; Pillet and Pawlowski, 2013; Sierra et al., 2013). In applied to investigate the interactions of microbes with mul- the present study a number of A. elongata reads were assigned ticellular hosts, especially in the marine environment. Such to Foraminifera likely because of the evolutionary relatedness approaches were used to elucidate symbiotic interactions in of Acantharia and Foraminifera (Sierra et al., 2013). These two Cnidaria (Sabourault et al., 2009; Yuyama et al., 2011) and classesshowhighergeneticdistancesfromPolycystineaimplying sponges(Radaxetal.,2012).Incontrastfewerstudiesinvestigated thathighproportionsofreadsfromCollozoumsp.andS.strepta- interactionswhenbothpartnersareprotists.Recently,ESTanaly- canthawerenotassigned,evenathightaxonomiclevel. seshavebeenappliedtounderstandtheabilityofForaminiferato The high proportion of mRNA sequences related to maintainchloroplastsderivingfrompreviouslyingesteddiatoms Streptophyta, unidentified Archaeplastida, Fungi, and Metazoa (PilletandPawlowski,2013).Inourinvestigation,becauseofthe within our eukaryote-assigned transcripts could be explained relativelylowsequencingdepth,ourcoverageisnotsufficientto by some contamination. Yet considering that proportions of inferexpressionpatternsbutallowsaqualitativeassessmentofthe rRNA reads affiliated to these taxa does not match with the genesexpressedwithinradiolarianholobionts.Stillweobserved mRNA assignments, it is also likely that it exists a bias related different coverage between our datasets, with fewest unigenes to the unevenness of taxonomic coverage within the public recoveredfromthebestassembledsample(A.elongata),suggest- database (Keeling et al., 2014). As for a number of eukaryotes, ingsomevariabilityintranscriptdiversitybetweentheholobionts a transcriptomic approach provides a valuable alternative to analyzed, the lower coverage the higher diversity. Also, the high genome sequencing for uncultured protistan taxa. The use of proportion of unassigned mRNA unigenes (Table2) might be transcriptomics is also particularly suited to study uncultured relatedtothefactthatRhizariaistheleastsequencedeukaryotic Radiolaria as their symbionts are mainly dinoflagellates (Gast super-grouptodate(BurkiandKeeling,2014;delCampoetal., et al., 2003; Probert et al., 2014), which have an enormous 2014). genome size (Lin, 2011) making sequencing very difficult. In www.frontiersin.org March2015|Volume6|Article98|9
Description: