ORIGINALRESEARCH published:13October2016 doi:10.3389/fmicb.2016.01597 Molecular Detection and Environment-Specific Diversity of Glycosyl Hydrolase Family 1 β-Glucosidase in Different Habitats RameshwarTiwari1,2,KanikaKumar3,SurenderSingh2,LataNain2and PratyooshShukla1* 1EnzymeTechnologyandProteinBioinformaticsLaboratory,DepartmentofMicrobiology,MaharshiDayanandUniversity, Rohtak,India,2DivisionofMicrobiology,ICAR-IndianAgriculturalResearchInstitute,NewDelhi,India,3ICAR-National ResearchCentreonPlantBiotechnology,LBSCentre,IndianAgriculturalResearchInstitute,NewDelhi,India β-glucosidase is a crucial element of the microbial cellulose multienzyme complex since it is responsible for the regulation of the entire cellulose hydrolysis process. Editedby: Therefore, the aim of the present work was to explore the diversity and distribution VijaiKumarGupta, of glycosyl hydrolase family 1 β-glucosidase genes in three different environmental NationalUniversityofIreland,Galway, niches including, Himalayan soil, cow dung and compost by metagenomic approach. Ireland Preliminary evaluation through metabolic profiling using BIOLOG based utilization Reviewedby: HarinderSinghOberoi, patterns of carbon, nitrogen, phosphorus and sulfur revealed the environment and CentralInstituteofPostHarvest substrate specific nature of the indigenous microbial population. Furthermore, clonal EngineeringandTechnology,India PradeepB.V., library selection, screening and sequence analysis revealed that most of the GH1 KarpagamUniversity,India β-glucosidase proteins had low identities with the available database. Analysis of HaoLiu, the distribution of GH1 β-glucosidase gene fragments and β-glucosidase producing SouthChinaUniversityofTechnology, China microbial community revealed the environment specific nature. The OTUs obtained *Correspondence: fromHimalayansoilandcompostmetagenomiclibrariesweregroupedinto19different PratyooshShukla genera comprising 6 groups. The cow dung sample displayed the least diversity of [email protected] GH1 β-glucosidase sequences, with only 14 genera, distributed among three groups- Specialtysection: Bacteroidetes, Firmicutes, and Actinobacteria. The metagenomic study coupled with Thisarticlewassubmittedto metabolicprofilingofGH1β-glucosidaseillustratedtheexistenceofintricaterelationship Microbiotechnology,Ecotoxicology andBioremediation, betweenthegeochemicalenvironmentalfactorsandinherentmicrobialcommunity. asectionofthejournal FrontiersinMicrobiology Keywords: GH1 β-glucosidase, metagenomics, metabolic profiling, operational taxonomic units, microbial community Received:30July2016 Accepted:26September2016 Published:13October2016 INTRODUCTION Citation: TiwariR,KumarK,SinghS,NainL Cellulose is the most abundant fixed form of carbon (100 billion ton global production per andShuklaP(2016)Molecular year) in the biosphere. This can be converted for sustainable energy production through its DetectionandEnvironment-Specific sequential bioconversion (Dusselier et al., 2014; Brown, 2015). Cellulose microfibrils contain, DiversityofGlycosylHydrolaseFamily closelypackedcrystallinetwotothreedozen(1→4)-β-D-glucanchainsanddisorderedamorphous 1β-GlucosidaseinDifferentHabitats. Front.Microbiol.7:1597. regions that are randomly distributed along with their length (Percival Zhang et al., 2006). The doi:10.3389/fmicb.2016.01597 complexnativecellulosearchitecture,explainstheevolutionarydevelopmentinmicrobialcellulose FrontiersinMicrobiology|www.frontiersin.org 1 October2016|Volume7|Article1597 Tiwarietal. Environment-SpecificDiversityofβ-Glucosidase degradation mechanism (Béguin, 1990). Cellulose hydrolysis Bawejaetal.,2016).Additionally,themoderntechniquesofgene can be performed under polysaccharide-enzyme synergy, even editing,systemsbiology,molecularmodelingofenzymearethe in the absence of microbial cells. The complete enzymatic notableareas,wheretheinnovationcanprovidenovelproducts hydrolysis of cellulose is accomplished by a set of cellulase and applications (Singh and Shukla, 2011; Karthik and Shukla, enzymesbelongingtotheGlycosylHydrolase(GH)family(EC: 2012;Bawejaetal.,2015;GuptaandShukla,2016b;Kumaretal., 3.2.1) (Davies and Henrissat, 1995; Franková and Fry, 2013; 2016;Singhetal.,2016). Elleuche et al., 2014). These enzymes hydrolyze the glycosidic Metagenomicsoffersanopportunitytoexplorebothmicrobial bondbetweenvariouscarbohydratesorbetweenacarbohydrate diversityandminenovelbiocatalystsfromenvironmentalDNA andanon-carbohydratemoiety.Typically,enzymatichydrolysis (Ferrer et al., 2005; Wang et al., 2009; Vey and Moreno- of cellulose requires three different types of cellulase enzymes, Hagelsieb, 2012). This approach has been used successfully namely,endoglucanase(EC3.2.1.4),exoglucanase(EC3.2.1.91) to identify enzymes with defined activities (Kim et al., 2008; and β-glucosidase (EC 3.2.1.21) (Bayer et al., 1998; Percival Wang et al., 2009), it initially depends on relatively low- Zhang et al., 2006). The endoglucanase degrades cellulose to throughput function-based screening of environmental DNA shortercellooligosaccharides;whiletheexoglucanasecleavesthe clone libraries (Simon et al., 2009; Uchiyama and Miyazaki, reducing and/or non-reducing ends of the cellulose moiety 2009; Gupta and Shukla, 2016a). Most of the research work to produce cellobiose, which are finally hydrolyzed by β- focusesonmicrobialecologybasedtaxonomicdivergencewhich glucosidase into glucose. β-glucosidase is a key rate limiting contributesalottowardunderstandingtherichnessofmicrobial factorofthecellulosecomplex,sinceithydrolyzestheenddimer populations in specific environmental niche (Achtman and “cellobiose”(CairnsandEsen,2010).Thisenzymeisresponsible Wagner, 2008; Haegeman et al., 2013). Although, few attempts for the regulation of the entire cellulose hydrolysis process have been made to explore the microbial diversity on the by easing cellobiose-mediated suppression and producing the basis of their metabolic or functional potential (Liang et al., final product glucose (Singhania et al., 2013). This enzyme 2011; Engel et al., 2012), metagenomic approach may facilitate is not only involved in hydrolyzing the glycosyl bond, but mimickingthebiologicalprocessesinthespecificenvironmental also reverse of hydrolysis which is catalyzed under specific condition in a more meaningful manner (Baweja et al., 2015). conditions(Bhatiaetal.,2002).Theglycosylhydrolasefamilyβ- Moreover, the functional diversity analysis of a biocatalyst glucosidaseisgroupedintoGH1andGH3families(http://www. throughmetagenomicapproachalsodefinesthehiddenmajority cazy.org/)(Cantareletal.,2009).Amongthem,theGH1family ofunculturedmicrobes(Arnosti,2011;KumarSinghandShukla, proteinsexhibitβ-glucosidases,β-galactosidasesorproteinswith 2015). Many β-glucosidase genes have been extracted from both activities, having considerably higher Km values for the various environmental niches like alkaline polluted soil (Wang galactosides (Cairns and Esen, 2010). At present, a total of etal.,2012),gutmicrobiome(DelPozoetal.,2012),hotspring 329GH1familyβ-glucosidaseshavealreadybeencharacterized (Schröderetal.,2014),agriculturalsoil(Biveretal.,2014),marine frombacteria,archaeaandeukaryotes.Theproteindatabasealso water(Fangetal.,2010),mangrovesoil(Lietal.,2012),compost comprises47crystalstructuresofGH1familyproteins,including (McAndrew et al., 2013) and anaerobic digester (Wang et al., those of several microbial β-glucosidases, β-galactosidases, β- 2015) through unculturable metagenomic approaches but only rutinosidases, alkyl β-glucosidases, 6-P-β-glucosidases and 6-P- a few of their functional proteins have been characterized in β-galactosidases (Aguilar et al., 1997; Wierzbicka-Wo´s et al., detail (Fang et al., 2010; Li et al., 2012; Wang et al., 2012). 2013). Moreover, GH1 family proteins possess a single active Therefore,metagenomicbasedminingofβ-glucosidasegenes,in domain with an α/β barrel topology, while GH3 proteins are order to explore the similarities and differences can strengthen made up of two domains. GH1 β-glucosidases are more active our knowledge about the functional diversity of β-glucosidase thanGH3ondi-oroligosaccharides(Cotaetal.,2015)andalso from different environmental niches. Congregated compilation tolerateendproduct(glucose)uptoahighlimit(Giuseppeetal., ofthisgeneticinformationacrossdifferentenvironmentalniches, 2014). Due to these properties, GH1 β-glucosidases are being including extreme conditions, of GH1 β-glucosidase, using recognizedasversatilebiocatalysts,withnumerousapplications metagenomicapproachwillbevitalforimprovingtheprospects inbiorefinery,industry,medicineandfood.Therefore,inviewof ofimprovingcellulosicbioethanolproduction. its versatility and utility, it is very important to decipher GH1 β-glucosidase producing microbial communities from diverse environmentalniches. MATERIALS AND METHODS Previously, various microorganisms were evaluated on the Sample Collection for Phenotypic and basis of their β-glucosidase production potential (Tiwari et al., 2013, 2014, 2015). However, current cultural laboratory Metagenomic Analysis practices are unable to illustrate the microbial diversity due ThecolddesertofsoutheasternHimalayanregion,Kargildistrict ◦ ′ ◦ ′ to spatial heterogeneity, non-ubiquitous nutrient requirements (KD),India(34 32 Northlatitudeand76 07 Eastlongitude) and taxonomic ambiguity among the microbial population in was selected for soil sample collection at high altitude (10,500 the environment. Also, few studies on ecological screening, ft above sea level). The annual temperature varies from −20 to ◦ process optimization and enzymatic applications in industries 5 C. Compost (CM) was prepared from paddy straw collected are broadening the scope and new niche for their utilization from the premises of Division of Microbiology, IARI, New (Shukla et al., 2007; Zhang et al., 2010; Banerjee et al., 2016; Delhi,India.Atthetimeofcollection,compostheapwasinthe FrontiersinMicrobiology|www.frontiersin.org 2 October2016|Volume7|Article1597 Tiwarietal. Environment-SpecificDiversityofβ-Glucosidase ◦ thermophilic phase with temperature ranging between 60 and Thephenotypicexpressionlevelandt-testbasedsignificance ◦ 65 C.Cowdung(CD)wascollectedfromdairyfarmsituatedin was considered by using Multiple array viewer (MeV 4.9.0) ◦ Todapourvillage,NewDelhi,India.Allsampleswerekeptat4 C software. duringtransportationandtherafterat−20◦Cinthelaboratory. Degenerate Primer Designing for Community-Level Substrate Utilization by β-Glucosidase Gene Amplification Different Environmental Samples by Atotalof247glycosylhydrolaseFamily1β-glucosidaseamino BIOLOG Phenotypic Microarray acid sequences were retrieved from NCBI Genbank. Protein ThesoilsamplesfromKargilDistrict(KD),Compost(CM),and sequences were aligned using PROMALS3D (http://prodata. Cow dung (CD) were analyzed based on utilization of carbon, swmed.edu/promals3d/promals3d.php) to predict conserved nitrogen, phosphorus and sulfur substrate using the BIOLOG regionsandsecondarystructurebasedonavailablehomologous phenotypicmicroarrayPMMicroPlates(Biolog,Inc.,Hayward, 3D structures in database. All sequences were edited and California, USA). Phenotype MicroPlates PM1-2, PM3B and trimmed to remove short, ambiguously aligned regions. The PM4A measure the carbon, nitrogen and phosphorus/sulfur trimmed alignment was analyzed and graphical representation metabolism,respectively.Thesamplesweresuspendedinsterile of conserved region was generated through weblogo3 interface salinewater(0.8%w/v)andincubatedundershakingcondition (http://weblogo.threeplusone.com/create.cgi) (Crooks et al., (100rpm)for3hatroomtemperatureforpropermixing.The 2004). Based on lowest degeneracy, equal GC content and Tm mixturewasallowedtosettleandaliquotsweredilutedin15ml value,thefollowingprimersweredesigned:BGLTF(27):5′ ACI of inoculation fluid (IF-0; Biolog) to adjust the optical density YTITAYCAYTGGGAYYTICCICAR3′;BGLTR(27):5′RTA at420nmof0.2whichwerethenusedastheinoculumforthe ICC YTC IGC CCA YTC RAA RTT RTC 3′; BGL21F (21): 5′ Microplates (Weber et al., 2007). For PM3B and PM4A plates, TAYCAYTGGGAYYTICCICAR3′;BGL21R(21):5′RTAICC 15 ml inoculation fluid was added to the mixture of sodium YTCIGCCCAYTCRAA3′;BGL17F(21):5′ GCGCCTAYC succinateandferriccitrateascarbonsources.The100Xadditive AYTGGGAYYTICC3′;BGL17R(17):5′CCYTCIGCCCAY solutionswerepreparedbydissolving5.402gofsodiumsuccinate TCR AA 3′. Inosine (I) was used to decrease the degeneracy. and 0.49mg ferric citrate in sterile saline water. A total of 150 The efficiency of these primer sets was confirmed by in silico µl from the 100 X solution were added to the 15 ml of IF- PCR through a NCBI primer BLAST search with different 0 inoculating fluid before preparing the sample suspensions. microorganismname,microorganismgrouportaxonomyID. Thereafter,100µlofsamplesuspensionwasinoculatedintoeach PM well. All PM plates were used in triplicate for each sample Extraction of Metagenomic DNA from ◦ andwereincubatedat30 C.After72hofincubation,eachwell Samples wasinoculatedwith100µloftheBiologredoxdyemix.Forcolor ◦ DNA extraction from soil sample of Kargil District (KD), development,theplateswereincubatedat37 Cuntilvioletcolor Compost (CM), and Cow dung (CD) was performed using appeared.Absorbancereadingsweretakenat590nmwithaplate the Power Soil DNA Isolation Kit (MoBio Laboratories Inc., reader(MicroPlate.Reader)(Weberetal.,2007). Carlsbad,USA)asperthemanufacturer’sinstruction. Data Normalization and Analysis Amplification of Partial β-Glucosidase The absorbance data were normalized to observe the average good color development (AWCD) as mentioned by (Garland, Gene 1999)andfurtherselectedforanalysis. PCR amplification was performed using high fidelity Pfu DNA Thenormalizedabsorbanceforeachwellwascalculatedas: polymerase(Fermentas)withproofreadingactivitytoavoidTaq polymerasebasedlowreplicationfidelity.Reactionswerecarried AWCD=6ODi/95 outinaBioRedPCRsystemwithmetagenomicDNAastemplate. PCR condition was optimized for all parameters of PCR by Where ODi is the absorbance value from each well, corrected gradient PCR (50–58◦C), different MgCl concentration (1–2.5 subtractingtheblankwell(A1)valuesfromeachplatewelland 2 mM) and primer concentration (0.1–0.4 mM). The final PCR 95isthenumberoftotalsubstrates. ′ wasconductedbyusingprimerssetsforwardprimerBGL17F(5 Substraterichness(S)inselectedsamplewascalculatedasthe ′ GCG CCT AYC AYT GGG AYY TCC 3) and reverse primer number of wells with a corrected absorbance greater than 0.25 ′ ′ BGLTR (5 RTA CCY TCG CCC AYT CRA ART TRT C 3). (Siragusaetal.,2009). Thereactionmixturecontained:10µl:10XPfuDNApolymerase The Shannon index (H) or “diversity” was calculated to bufferwithMgCl ,0.3mM:dNTPmix(10mM),0.1µMeach: understand the common ecological metric and the microbial 2 Primers,2units:PfuDNApolymerase,10–50ng:metagenomic communities shift over space and time. The Shannon–Weaver DNA and MQ water make the final volume up to 100 µl. The indexwascalculatedasfollows: ◦ PCRprogramwasasfollows:initialincubationat95 Cfor5min, ◦ ◦ ◦ H=−6pln(pi) followedby40cycles(95 Cfor50s,55 Cfor1min,and72 C i ◦ for90s),andthenbyafinalextensionat72 Cfor10min.The Wherep istheratiooftheactivityoneachsubstrate(OD)tothe amplifiedproductswereanalyzedbyelectrophoresisat60Vfor i i sumofactivitiesonallsubstrates6OD. 1hin1.2%agarosegelin1XTAEbuffer. i FrontiersinMicrobiology|www.frontiersin.org 3 October2016|Volume7|Article1597 Tiwarietal. Environment-SpecificDiversityofβ-Glucosidase Construction of Metagenomic Library in phosphorus (n = 59) and sulfur (n = 35) utilization pattern. pGEM-T Easy Vector Figure1showscleardifferencesamongthesubstrateutilization ThepGEM-TEasyvector(Promega)wasusedtoconstructthe patterns in microbial communities of diverse environmental threemetagenomiclibrariesofpartialβ-glucosidasegene.Since, samples,asevidentfromAWCD,richness,anddiversityvalues. PfuDNApolymeraseisunabletoadd“A”overhang,“A”tailwas The P values were < 0.05 which showed the significance of generated in the PCR product. The PCR product was purified the experiment. For carbon and nitrogen sources, all the three using the Qiaquick PCR purification kit (Qiagen, Valencia, AWCD and richness values were greater for Cow dung (CD), CA, USA) as per manufacture’s instruction. The purified PCR followedbyCMandKD. products of three environmental samples were first cloned in The normalized data were further utilized for hierarchal pGEM-TEasyvector.ThepGEM-Tandβ-glucosidaseconstruct clustering to explore the possible linkages between substrate was transformed into competent cells of E. coli strain (DH5α) utilization patterns and three different samples. On the basis and colonies were screened by blue/white colony selection on of 1.0 distance threshold, all 190 carbon sources were grouped theLBplatescontainingX-gal(80µg/ml),IPTG(0.5mM)and into5majorclusters(SupplementaryFigure1).Himalayansoil ampicillin(100mg/ml). and cow dung sample were grouped closely which showed a common carbon utilization pattern. The clustering of nitrogen Plasmid Isolation and Amplification of sources showed a very diverse utilization pattern, as only one Partial β-Glucosidase Genes majorclusterwasformed.Thephosphorusandsulfursubstrates were grouped into 3 clusters. According to the t-test based, on All positive white colonies were further inoculated to the 5 ml –log (p) values, 86% (n = 164) carbon sources were utilized LB broth for plasmid isolation. QIAprep Spin Miniprep Kit 10 by the microbial community of all three samples, while the (Qiagen, Valencia, CA, USA) was used for plasmid extraction as per manufacture’s instruction. Clones containing putative β- rest of the 14% (n = 26) registered non–significant utilization. Similarly,fornitrogen,phosphorusandsulfursubstrates,97,90, glucosidasegeneswerescreenedbyamplificationoftheisolated and86%substrateswereutilizedsignificantlybyallthesamples, plasmidswithM13FandM13Rprimers. respectively. Screening for Clonal Duplication In order to avoid sequencing several identical β-glucosidase Sequence Analysis of GH1 Family genes, Restriction Fragment Length Polymorphism (RFLP) β-Glucosidase for Primer Designing analysiswasperformedbytreatingthePCRproducts(amplified with M13 primers) with four base pair-cleaving restriction Thealignmentof247glycosylhydrolaseFamily1β-glucosidase endonuclease AluI (MBI Fermentas) at 37◦C for 12 h, and the aminoacidsequenceswasanalyzedandgraphicalrepresentation resulting electrophoretic mobility at 3% agarose was used to of conserved region was generated (Figure2). The alignment grouptheisolates. showedcharacteristicfeaturesofGH1withsecondarystructure of a classical (α/β) sheets. The presence of two catalytic 8 Sequencing and Analysis glutamate residues being 200 residues apart from each The PCR products were sequenced by the Sanger sequencing other and located at the C-terminal ends of β-strands 4 method at Xcelris Labs Ltd, Ahmedabad, India. The obtained (acid/base) and 7 (nucleophile). Other characteristic identical sequences were checked for chimera and homology were and conserved residues are also shown in Figure 2. The analyzed by the BLASTx search in the National Center for peptide sequences [T/C]-[L/I/V/M/A]-[Y/F]-H-W-D-L-P-Q Biotechnology Information (NCBI) database (http://www.ncbi. and D-[N/L]-[F/L]-[E/S]-W-[A/L/G/V/S]-[E/N/F/W/M/L]-G- nlm.nih.gov) and were submitted to NCBI Genbank. The [Y/I/E/L/F] were found to be the most conserved throughout nucleotide sequence was further translated into amino acid the consensus sequence according to the maximum length sequence (http://web.expasy.org/translate). All sequences of the of the peptide. Based on the maximum probability peptide sameenvironmentalnicheswerealignedusingClustalWversion sequence TLYHWDLPQ and DNFEWAEGY were selected to 1.8.Theevolutionarydistanceswerecalculatedandphylogenetic designforwardandreversedegenerateprimers,respectively.The dendrogram was constructed by neighbor-joining method and selectedregionwaspretendedtoamplifyapprox.876bpofPCR treetopologieswereevaluatedbyperformingbootstrapanalysis product. of 1000 data sets using MEGA 5.0 (Molecular Evolutionary GeneticsAnalysis). Amplification of Partial β-Glucosidase RESULTS Gene The metagenomic DNA was extracted from all three samples Phenotypic Microarray Based Substrate and used as a template for amplifying the partial β-glucosidase Utilization Pattern gene. The synthesized primer sets were used to optimize the Themetabolicprofilingoftheindigenousmicrobialpopulation PCR condition for amplification of PCR products with the ofthreesamplesfromtheHimalayanregion,Kargildistrict(KD), desiredlength.Genefragments,about876bpforβ-glucosidase India, Cow dung (CD) and Compost (CM) was analyzed by gene, were amplified from the primer pair BGL17F/BGLTR BIOLOG based substrate carbon (n = 190), nitrogen (n = 95), (SupplementaryFigure2). FrontiersinMicrobiology|www.frontiersin.org 4 October2016|Volume7|Article1597 Tiwarietal. Environment-SpecificDiversityofβ-Glucosidase FIGURE1|Diversityindicesbasedoncommunitylevelphysiologicalprofiling(CLPP)p-valuesbasedont-tests.(A)Richness(R);(B)Averagewell-color development(AWCD)and(C)Shannon–Weaverindex(H)ofmetabolizedsubstrates. FrontiersinMicrobiology|www.frontiersin.org 5 October2016|Volume7|Article1597 Tiwarietal. Environment-SpecificDiversityofβ-Glucosidase FIGURE2|Thesequencelogoof247glycosylhydrolaseFamily1β-glucosidaseaminoacidsequencesbasedonalignmentusingPROMALS3D.The identicalandconservativeresiduesareshowninwhiteandblacktriangle,respectively.Thetwocatalyticglutamateresiduesareshownbystars.Theblackandgray cylindersshownβandαsheets,respectively.Thearrowrepresentedtheselectedconservedregionfordegenerateprimerdesign. FrontiersinMicrobiology|www.frontiersin.org 6 October2016|Volume7|Article1597 Tiwarietal. Environment-SpecificDiversityofβ-Glucosidase TABLE1|Sequencingdetailsofselectedclonesfromeachmetagenomic TABLE1|Continued librarybasedontheirdeducedaminoacidsequenceidentities. S.No. CloneID Nearestneighbor Division Identity(%) S.No. CloneID Nearestneighbor Division Identity(%) 15 CD15 Pedobactersp. Bacteroidetes 84 HIMALAYANSOIL(KD) 16 CD16 Pedobacter Bacteroidetes 83 1 KD1 Vibriovulnificus γ-Proteobacteria 60 heparinus 17 CD17 Cellulophagabaltica Bacteroidetes 99 2 KD2 Meiothermusruber Deinococcus 59 18 CD18 Clostridium] Firmicutes 98 3 KD3 Arcticibacter Bacteroidetes 83 termitidis svalbardensis 19 CD19 Clostridium Firmicutes 97 4 KD4 Olivibactersitiensis Bacteroidetes 69 straminisolvens 5 KD5 Segetibacter Bacteroidetes 69 20 CD20 Clostridium Firmicutes 97 koreensis cellulovorans 6 KD6 Flavobacteriumsp. Bacteroidetes 76 COMPOST(CM) 7 KD7 Exiguobacteriumsp. Firmicutes 99 1 CM1 Arenibacteralgicola Bacteroidetes 73 8 KD8 Tumebacillus Firmicutes 98 2 CM2 Verrucomicrobiae Chlamydiae 64 flagellatus bacterium 9 KD9 Alteromonas γ-Proteobacteria 98 3 CM3 Zunongwangia Bacteroidetes 63 macleodii profunda 4 CM4 Pontibacterkorlensis Bacteroidetes 66 10 KD10 Ensiferadhaerens α-Proteobacteria 96 5 CM5 Cellulophagabaltica Bacteroidetes 74 11 KD11 Azospirillum α-Proteobacteria 99 6 CM6 Mucilaginibacter Bacteroidetes 70 brasilense paludis 12 KD12 Rhodopseudomonas α-Proteobacteria 97 7 CM7 Maribactersp. Bacteroidetes 72 palustris 8 CM8 Spirosomalingual Bacteroidetes 68 13 KD13 Rhodovulumsp. α-Proteobacteria 98 9 CM9 Adhaeribacter Bacteroidetes 69 14 KD14 Burkholderia β-Proteobacteria 98 aquaticus mimosarum 10 CM10 Elizabethkingia Bacteroidetes 73 15 KD15 Halobacillussp Firmicutes 97 miricola 16 KD16 Thermobacillus Firmicutes 98 11 CM11 Vibriovulnificus γ-Proteobacteria 99 composti 12 CM12 Gynuellasunshinyii γ-Proteobacteria 100 17 KD17 ThermoanaerobacteriumFirmicutes 98 13 CM13 Vibriocampbellii γ-Proteobacteria 59 xylanolyticum 14 CM14 Devosia α-Proteobacteria 89 18 KD18 Paenibacillussp. Firmicutes 98 15 CM15 Sinorhizobiumfredii α-Proteobacteria 99 19 KD19 Devosialimi α-Proteobacteria 87 16 CM16 Paenibacillus Firmicutes 99 20 KD20 Sphingobacterium Bacteroidetes 99 17 CM17 Bacillusakibai Firmicutes 98 sp. 18 CM18 Actinospica Actinobacteria 97 COWDUNG(CD) acidiphila 1 CD1 Formosaagariphila Bacteroidetes 89 19 CM19 Streptomycessp. Actinobacteria 96 20 CM20 Clostridium Firmicutes 98 2 CD2 Arenibacteralgicola Bacteroidetes 73 cellulovorans 3 CD3 Streptosporangium Actinobacteria 71 roseum 4 CD4 Paenibacillus Firmicutes 98 Construction of Metagenomic Library in 5 CD5 Pontibacterkorlensis Bacteroidetes 98 pGEM-T Easy Vector 6 CD6 Spirosoma Bacteroidetes 98 The purified PCR products of three environmental samples radiotolerans wereseparatelyclonedinpGEM-TEasyvectorformetagenomic 7 CD7 Hymenobacter Bacteroidetes 97 libraryconstruction.Thepositivecloneswereselectedbasedon swuensis blue/white colony screening. After random confirmation of 80 8 CD8 Streptomycessp. Actinobacteria 95 clonesfromeachlibrarybyPCRwithprimersM13FandM13R, 9 CD9 Flavobacteriumsp. Bacteroidetes 74 a total of 60 clones was sequenced from three metagenomic 10 CD10 Flavobacterium Bacteroidetes 74 subsaxonicum libraries. 11 CD11 Flavobacteriumsp. Bacteroidetes 77 Restriction Fragment Length 12 CD12 Flavobacterium Bacteroidetes 74 hibernum Polymorphism (RFLP) Analysis of PCR 13 CD13 Sphingobacterium Bacteroidetes 84 Products Amplified from Selected Clones sp. The identical clones were selected by Restriction Fragment 14 CD14 Cytophaga Bacteroidetes 60 Length Polymorphism (RFLP) analysis of 90 PCR products hutchinsonii obtained by M13 primers from selected clones by restriction (Continued) endonuclease AluI (Supplementary Figure 3). On the basis FrontiersinMicrobiology|www.frontiersin.org 7 October2016|Volume7|Article1597 Tiwarietal. Environment-SpecificDiversityofβ-Glucosidase NCBI-BLASTp analysis, 20, 5, and 35% sequences had low identities (<70%) with known GH1 family β-glucosidase sequencesinGenBankfromtheHimalayansoil,cowdungand compostsample,respectively(Figure3). Thephylogeneticdiversityofdeducedproteinsequenceswas analyzed by building the unrooted protein-level phylogenetic tree for BGL sequence OTUs (Operational Taxonomic Units) using the 20 divergent sequences from each clone library with their respective neighbor sequences retrieved from NCBI GenBank (Figure4). Metagenomic sequences from Himalayan soil (KD) showed similarities with 19 belonging to 6 groups, including Deinococcus, α-Proteobacteria, β-Proteobacteria, γ-Proteobacteria, the Bacteroidetes and Firmicutes. The Compost (CM) OTUs also showed 19 genera comprising 6 groups, including Chlamydia, Bacteroidetes, α-Proteobacteria, γ-Proteobacteria,FirmicutesandActinobacteria.TheCowdung (CD) sample showed least diversity which arranged into 14 genera with only three groups - Bacteroidetes, Firmicutes and Actinobacteria(Figure5). DISCUSSION Phenotypic Microarray Based Substrate Utilization Pattern The microbial metabolic activities of any particular environmental system depend on the utilization of specific nutrients. Hence, the functional diversity of environmental niches can be compared through the analyses of substrate utilization profiles (Torsvik and Øvreås, 2002). Phenotypic microarray provided by the BIOLOG may be the best way to compare the utilization of diverse group of essential substrates includingcarbon,nitrogen,phosphorusandsulfur(Stefanowicz, 2006). Specific substrate utilization patterns were generated by the microbial communities of three different environmental samples. The AWCD values showed that among all three samples, Himalayan soil showed least active utilization of substrates,ascomparedwithtwoothersamples.Cowdungand FIGURE3|AminoacidsequenceidentitiesofGH1β-glucosidasegene compost represent microbially induced active environmental fragmentsfromthreeenvironmentalnichesbasedonNCBI-BLASTp systems as compared to the cold desert Himalayan soil. search.(A)Himalayansoil(KD);(B)Cowdung(CD);(C)Compost(CM). However,thediversityindicesforcarbonutilizationwashighest for the microbial community samples from the Himalayan soil, illustrating that microbial abundance is high, but not metabolically active. Carbon was utilized more efficiently by of restriction pattern, 20 clones were selected from each all the three environmental systems, as compared with other metagenomiclibraryforfurthersequencing. substrates.Itisalreadyreportedthatthemicrobialecosystemis mainlygovernedbasedbythecarboncycle(PaerlandPinckney, Sequencing and Phylogenetic Analysis of 1996). The carbon, phosphorus and sulfur substrates were Partial β-Glucosidase (BGL) Gene grouped into 5, 3, and 3 clusters based on their combined A total of 20 sequences of β-glucosidase genes were obtained utilization by all three samples (Supplementary Figure 1). from each metagenomic library. All sequences were identified Interestingly, nitrogen sources were utilized diversely by the as glycosyl hydrolase family 1 β-glucosidase (BGL) gene by microbial community and no visible clustering was observed BLASTx search and contained the two catalytic glutamic acid even at 1.0 distance threshold. Nitrogen mineralization residues. The sequences were submitted to NCBI GenBank or transformation is carried out by different groups of under accession numbers, KT898053- KT898072 for compost microorganisms in environmental systems, which maybe (CM),KT898073-KT898092forcowdung(CD)andKT898093- responsibleforlackofdiscerniblegroupsinthedifferentsamples KT898112 for Himalayan soil (KD) (Table1). Based on the (Francis et al., 2007). This further ilustrates that nitrogen may FrontiersinMicrobiology|www.frontiersin.org 8 October2016|Volume7|Article1597 Tiwarietal. Environment-SpecificDiversityofβ-Glucosidase FIGURE4|PhylogeneticanalysisbasedonthepartialaminoacidsequencesofGH1β-glucosidasegenesdetectedfromthethreemetagenomic libraries(A)Himalayansoil,(B)Cowdungand(C)CompostsamplesandtheirrelationshipwiththereferencesequencesretrievedfromGenBank.The treewasconstructedusingtheneighbor-joiningmethod. notbeaselectivefactorfordistinguishingamongthemicrobial populations from the three samples, although carbon based utilization pattern revealed that all three environments share selectiveandcommonmetabolicprofiles. Diversity and Distribution of Glycosyl Hydrolase 1 Family β-Glucosidase Gene Potential producers of β-glucosidase were identified by the conventional culture based method, during previous investigations(Tiwarietal.,2014,2015).Although,thecultural diversity is more relevant due to its applications, the unseen potential of β-glucosidase gene embedded in uncultured microbial population of diverse environmental niches needs exploration. Therefore, the GH1 family β-glucosidase protein FIGURE5|DiversityofGH1β-glucosidasegenesbasedonthe sequences were searched extensively and based on the most identifiedbacterialgroups. conservedregion,degenerateprimersweredesigned.Although, the putative amplified region is not representative of the full lengthgenesequence,itencodesthesequencebelongiongtothe completeconserveddomainsequenceoftheBglBsuperfamily. The low identities implying that these β-glucosidases may be The primer pair was used successfully to amplify the as yet undiscovered. In all three environmental niches, GH1 PCR product of desired size (approx. 876 bp) from all three β-glucosidase was distributed in Bacteroidetes, Deinococcus, environmental systems. Furthermore, the metagenomic library α, β, and γ-Proteobacteria, Firmicutes, Actinobacteria and wasconstructedtoperformthediversityanddistributionstudies. Chlamydiaegroupsbelongingtothetotalof39differentgenera. Aftertheselectionofpositiveclones,RFLPbasedscreeningand This result validated the potential of the designed degenerate sequencing,twentyOTUs(OperationalTaxonomicUnits)from primersandconfirmedthatprimerscanworkoverabroadrange each metagenomic library were selected for diversity studies. ofGH1β-glucosidasesandfacilitatefuturemetagenomicstudies Himalayan soil (KD), compost (CM) and cow dung (CD) was ofdiverseecologicalniches. having 6, 11, and 10 novel OTUs based on the identities The highest diversity index and number of microbial of < 75% with known GH1 β-glucosidase protein sequences. groups were detected in the Himalayan soil sample. FrontiersinMicrobiology|www.frontiersin.org 9 October2016|Volume7|Article1597 Tiwarietal. Environment-SpecificDiversityofβ-Glucosidase Himalayan samples were obtained from a cold desert β-glucosidase genes in the environment. These results provide where no microbial transformation was possible due to theevidencethatthedistributionanddiversityofβ-glucosidase the extreme cold climate conditions; this was supported arehabitatspecificandinfluencedsignificantlybytheavailability by the results of the phenotypic microarray experiment ofnutrients,particularlycarbon,asillustratedbythemetabolic revealing lesser metabolic activity as compared with other profilingofabundantmicrobialpopulations. samples. But, geographically the environment contained a hidden organic pool, as recently evidenced by Hood and AUTHOR CONTRIBUTIONS co-workers (Hood et al., 2015). They informed that the mountain glaciers are storing high organic carbon content RT and LN designed the study and RT carried out all the and releasing the carbon into the environment due to experiment. SS and KK helped in analyzing the data. PS and microbial transformation, as and when the ice shield melts LN critically written and revised the manuscript for important due to the climate change. This concluded that cold desert intellectual content. All authors read and approved the final environmentishavinglowactive,buthighabundantmicrobial manuscript. population. The lowest diversity was displayed by cow dung samples, ACKNOWLEDGMENTS whereonlythreegroups,includingBacteroidetes,Firmicutesand Actinobacteriawerepresent.Theselectivepressureonmicrobial The authors are thankful to the National Fund for Basic, load while passing through the animal gut may be the possible Strategic and Frontier Application Research in Agriculture reasonforthislowdiversity(Callawayetal.,2010). (ICAR-NFBSFARA)forfundingtheresearchactivitiesforbiofuel project(NFBSFARA/AE-2006/2010-11). CONCLUSION SUPPLEMENTARY MATERIAL Inconclusion,diverseGH1β-glucosidase geneswereidentified from three different environmental niches using metagenomic The Supplementary Material for this article can be found approach. The low identities of deduced protein sequences online at: http://journal.frontiersin.org/article/10.3389/fmicb. revealed the abundance of unexplored diverse and novel 2016.01597 REFERENCES andglucose.J.Indus.Microbiol.Biotechnol.41,479–488.doi:10.1007/s10295- 014-1400-0 Achtman,M.,andWagner,M.(2008).Microbialdiversityandthegeneticnature Brown, T. R. (2015). A techno-economic review of thermochemical cellulosic of microbial species. Nat. Rev. Microbiol. 6, 431–440. doi: 10.1038/nrmi biofuel pathways. Bioresour. Technol. 178, 166–176. doi: 10.1016/ cro1872 j.biortech.2014.09.053 Aguilar,C.F.,Sanderson,I.,Moracci,M.,Ciaramella,M.,Nucci,R.,Rossi,M., Cairns, J. R. K., and Esen, A. (2010). β-Glucosidases. Cell. Mol. Life Sci. 67, etal.(1997).Crystalstructureoftheβ-glycosidasefromthehyperthermophilic 3389–3405.doi:10.1007/s00018-010-0399-2 archeonSulfolobussolfataricus:resilienceasakeyfactorinthermostability.J. Callaway,T.R.,Dowd,S.E.,Edrington,T.S.,Anderson,R.,Krueger,N.,Bauer, Mol.Biol.271,789–802.doi:10.1006/jmbi.1997.1215 N.,etal.(2010).Evaluationofbacterialdiversityintherumenandfecesof Arnosti,C.(2011).Microbialextracellularenzymesandthemarinecarboncycle. cattlefeddifferentlevelsofdrieddistillersgrainsplussolublesusingbacterial Ann.Rev.Mar.Sci.3,401–425.doi:10.1146/annurev-marine-120709-142731 tag-encodedFLXampliconpyrosequencing.J.Anim.Sci.88,3977–3983.doi: Banerjee, C., Dubey, K. K., and Shukla, P. (2016). Metabolic engineering 10.2527/jas.2010-2900 of microalgal based biofuel production: prospects and challenges. Front. Cantarel, B. L., Coutinho, P. M., Rancurel, C., Bernard, T., Lombard, V., and Microbiol.7:432.doi:10.3389/fmicb.2016.00432 Henrissat,B.(2009).TheCarbohydrate-ActiveEnZymesdatabase(CAZy):an Baweja, M., Nain, L., Kawarabayasi, Y., and Shukla, P. (2016). Current expert resource for glycogenomics. Nucleic Acids Res. 37, D233–D238. doi: technological improvements in enzymes toward their biotechnological 10.1093/nar/gkn663 applications.Front.Microbiol.7:965.doi:10.3389/fmicb.2016.00965 Cota, J., Corrêa, T. L., Damásio, A. R., Diogo, J. A., Hoffmam, Z. B., Garcia, Baweja,M.,Singh,P.K.,andShukla,P.(2015).“Enzymetechnology,functional W.,etal.(2015).ComparativeanalysisofthreehyperthermophilicGH1and proteomics and systems biology towards unraveling molecular basis for GH3familymemberswithindustrialpotential.N.Biotechnol.32,13–20.doi: functionality and interactions in biotechnological processes,” in Frontier 10.1016/j.nbt.2014.07.009 Discoveries and Innovations in Interdisciplinary Microbiology, ed P. Shukla Crooks,G.E.,Hon,G.,Chandonia,J.-M.,andBrenner,S.E.(2004).WebLogo:a (Heidelberg:Springer-Verlag),207–212. sequencelogogenerator.GenomeRes.14,1188–1190.doi:10.1101/gr.849004 Bayer,E.A.,Chanzy,H.,Lamed,R.,andShoham,Y.(1998).Cellulose,cellulases Davies, G., and Henrissat, B. (1995). Structures and mechanisms of glycosyl and cellulosomes. Curr. Opin. Struct. Biol. 8, 548–557. doi: 10.1016/S0959- hydrolases.Structure3,853–859.doi:10.1016/S0969-2126(01)00220-9 440X(98)80143-7 Del Pozo, M. V., Fernández-Arrojo, L., Gil-Martínez, J., Montesinos, A., Béguin,P.(1990).Molecularbiologyofcellulosedegradation.Ann.Rev.Microbiol. Chernikova,T.N.,Nechitaylo,T.Y.,etal.(2012).Microbialβ-glucosidases 44,219–248.doi:10.1146/annurev.mi.44.100190.001251 fromcowrumenmetagenomeenhancethesaccharificationoflignocellulosein Bhatia, Y., Mishra, S., and Bisaria, V. (2002). Microbial β-glucosidases: combinationwithcommercialcellulasecocktail.Biotechnol.Biofuels5,1.doi: cloning,properties,andapplications.Crit.Rev.Biotechnol.22,375–407.doi: 10.1186/1754-6834-5-73 10.1080/07388550290789568 Dusselier,M.,Mascal,M.,andSels,B.F.(2014).“Topchemicalopportunitiesfrom Biver,S.,Stroobants,A.,Portetelle,D.,andVandenbol,M.(2014).Twopromising carbohydratebiomass:achemist’sviewofthebiorefinery,”inSelectiveCatalysis alkalineβ-glucosidasesisolatedbyfunctionalmetagenomicsfromagricultural forRenewableFeedstocksandChemicals,edK.M.Nicholas(Berlin;Heidelberg: soil,includingoneshowinghightolerancetowardsharshdetergents,oxidants Springer),1–40. FrontiersinMicrobiology|www.frontiersin.org 10 October2016|Volume7|Article1597
Description: