AEM Accepted Manuscript Posted Online 28 October 2016 Appl. Environ. Microbiol. doi:10.1128/AEM.02826-16 Copyright © 2016, American Society for Microbiology. All Rights Reserved. 1 Title: Bacteria as emerging indicators of soil condition 2 Running Title: Bacterial indicators of soil health 3 1Syrie M. Hermans, 2Hannah L. Buckley, 3Bradley S. Case, 4Fiona Curran-Cournane, 5Matthew 4 Taylor, 1Gavin Lear# D 5 o w n 6 1School of Biological Sciences, University of Auckland, 3A Symonds Street, Auckland, New lo a d e 7 Zealand. d f r 8 2Department of Ecology, Faculty of Agriculture and Life Sciences, PO Box 85084, Lincoln o m h 9 University, Lincoln 7647, Canterbury, New Zealand. t t p : / 10 3Department of Informatics and Enabling Technologies, Faculty of Environment, Society and /a e m 11 Design, PO Box 85084, Lincoln University, Lincoln 7647, Canterbury, New Zealand. . a s m 12 4Auckland Council, 135 Albert Street, Auckland, New Zealand . o r 13 5Waikato Regional Council, Private Bag 3038, Waikato Mail Centre, Hamilton 3240, New g/ o n 14 Zealand A p r 15 il 1 , 2 16 0 1 9 17 #Correspondence: b y g 18 Dr. Gavin Lear u e s 19 School of Biological Sciences, University of Auckland t 20 3A Symonds Street, Auckland 1010, New Zealand. 21 E-mail: [email protected]. 22 Tel: +64-9-3739799 1 23 Abstract 24 Bacterial communities are important for the health and productivity of soil ecosystems, and have 25 great potential as novel indicators of environmental perturbations. To assess how they are 26 affected by anthropogenic activity, and determine their ability to provide alternative metrics of 27 environmental health, we sought to define which soil variables bacteria respond to across D o w 28 multiple soil types and land uses. We determined, through 16S rRNA amplicon sequencing, the n lo 29 composition of bacterial communities in soil samples from 110 natural or human-impacted sites, a d e d 30 located up to 300 km apart. Overall, soil bacterial communities varied more in response to f r o m 31 changing soil environments than changes in climate or increasing geographic distance. We h t t 32 identified strong correlations between the relative abundance of members of Pirellulaceae and p : / / a 33 soil pH, members of Gaiellaceae and carbon to nitrogen ratio, Bradyrhizobium and the levels of e m . 34 Olsen P, and members of Chitinophagaceae and aluminium concentrations. These relationships a s m . 35 between specific soil attributes and individual soil taxa not only highlight ecological o r g / 36 characteristics of these organisms, but also demonstrates the ability of key bacterial taxonomic o n A 37 groups to reflect the impact of specific anthropogenic activities, even when comparing samples p r il 38 across large geographic areas and diverse soil types. Overall, we provide strong evidence that 1 , 2 0 39 there is scope to use relative taxa abundances as biological indicators of soil condition. 1 9 b y 40 Importance g u e s 41 The impact of land use change and management on soil microbial community composition t 42 remains poorly understood. Therefore, we explored the relationship between a wide range of soil 43 factors and soil bacterial community composition. We included variables related to 44 anthropogenic activity, and collected samples across a large spatial scale to interrogate the 45 complex relationships between various bacterial community attributes and soil condition. We 2 46 provide evidence of strong relationships between individual taxa and specific soil attributes even 47 across large spatial scales, soil, and land use types. Collectively, we were able to demonstrate the 48 largely untapped potential of microorganisms to indicate the condition of soil, and thereby 49 influence the way we monitor the effects of anthropogenic activity on soil ecosystems into the 50 future. D o w n lo a d e d f r o m h t t p : / / a e m . a s m . o r g / o n A p r il 1 , 2 0 1 9 b y g u e s t 3 51 Introduction 52 Soil bacterial communities provide a multitude of ecosystem services which directly, and 53 indirectly, affect the overall functioning of the soil environment (1–3). This has resulted in many 54 studies describing variation in bacterial community composition (4, 5) and functional roles (6–8); 55 however, less effort has been invested to explore how this variation correlates with soil health. D o 56 There is great promise for using bacterial community composition, or the relative abundances of w n lo 57 individual taxa, as indicators of the state of soil environments at regional or even continental a d e 58 scales. Recent advances in next-generation sequencing technologies now make this a plausible d f r o 59 and attractive avenue of research, leading bacterial community data to be proposed as capable of m h 60 providing alternative metrics of environmental health and production potential (9). If shown to tt p : / / 61 be reliable, microbial community indicators could offer significant advantages over traditional a e m 62 chemical and biological measures in terms of the relative speed and ease of data analysis, and the .a s m 63 minimisation of site disturbance during sample collection (10). . o r g 64 For bacterial community attributes to be a viable indicator of soil condition, it is desirable / o n 65 that natural spatial variation in bacterial community composition be less than variation caused by A p r 66 anthropogenic factors. A consensus appears to have emerged that environmental factors, rather il 1 , 2 67 than dispersal limitation, are dominant drivers of bacterial community composition (5, 11). The 0 1 9 68 reduced role of dispersal limitation for determining the beta-diversity of microbial communities, b y g 69 even across broad spatial scales, is presumed to be supported by a global dispersal of microbial u e s t 70 cells, including on major atmospheric (12) and oceanic currents (13). While there have been 71 numerous bacterial biogeography studies that have employed DNA-fingerprinting techniques 72 over large spatial scales (4, 5), DNA-sequencing studies at similar scales are comparatively 73 scarce. To date, most sequencing studies have analysed relatively small numbers of samples, 74 environments and land uses, or only across small spatial scales where the effect of dispersal 4 75 limitation can already be presumed to be minimal. More studies that simultaneously analyse 76 large spatial scales, and a variety of soil and land use types, are required to confirm if 77 relationships observed between bacterial communities and soil environmental factors are 78 pervasive, or if instead, they are strongly mediated by geographic location. This would be the 79 first step in supporting the broad-scale use of bacterial data as a viable indicator of soil health. D o 80 In support of their ability to indicate the condition of the soil environment, previous w n lo 81 biogeographic studies have identified several variables that correlate with changes in soil a d e 82 bacterial community composition. Most notable is the evidence that pH influences bacterial d f r o 83 communities at regional (14), and continental scales (4, 5). Other variables such as the carbon to m h 84 nitrogen ratio, moisture content, and soil temperature also correlate with changes in soil bacterial tt p : / / 85 communities (5). However, relationships between bacterial communities and critical variables a e m 86 associated with the nature and intensity of human land use are frequently overlooked or are .a s m 87 studied in isolation. These include concentrations of the many heavy metals that accumulate and . o r g 88 impact biological communities in urban (e.g., zinc, lead; (15)) and rural settings (e.g., copper, / o n 89 chromium; (16)), as well as core soil physical attributes, such as porosity, which can correlate A p r 90 negatively with stock density, and ultimately impact the production potential of agricultural land il 1 , 2 91 (17, 18). The pairing of large scale surveys of soil bacterial communities to data gathered 0 1 9 92 through long-term soil monitoring programs (5, 19, 20) provides opportunities to uncover and b y g 93 quantify the strength of relationships between bacterial communities and a much wider range of u e s t 94 soil physicochemical variables than what has been previously achieved. This would then define 95 which chemical and physical stresses on the soil environment are reliably portrayed by bacterial 96 communities. 97 To date, many studies have only assessed changes in bacterial communities as a whole, 98 rather than the responses of individual taxa (4, 19). Others have restricted their analyses to 5 99 investigate changes in the most dominant phyla (21, 22), although these are not necessarily the 100 most important, or only ones, driving the changes observed in the overall community (23). 101 Assessing community responses at lower taxonomic levels, such as genera, could highlight 102 important trends that might not always be observed in the higher taxonomic ranks (24). The 103 absence of studies assessing the responses of important taxa on an individual basis not only D o 104 hinders our ability to expand our knowledge of the ecological attributes of these important w n lo 105 community members, but also our ability to truly assess the potential of bacterial taxa to serve as a d e 106 biological indicators of ecosystem health. d f r o 107 By pairing bacterial community data and extensive metadata gathered from 110 sites, we m h 108 asked three main questions. Firstly, is variation in bacterial communities more strongly related to tt p : / / 109 environmental changes or geographic distance separating the communities? We predict that soil a e m 110 bacterial communities will be more strongly correlated with soil environmental factors, rather .a s m 111 than purely spatial factors. This would suggest that bacterial community data may be suitable for . o r g 112 the assessment of soil status across the geographic area from which samples were taken. Our / o n 113 second question is: Which environmental variables correlate with changes in bacterial A p r 114 community composition? While we expect to find, consistent with other studies, that pH will il 1 , 2 115 have a dominant effect on bacterial communities, we also anticipate that we will be able to 0 1 9 116 uncover important relationships between bacterial community structure and other soil variables b y g 117 that are indicators of soil condition. Our final question is: Can the abundances of different u e s t 118 individual taxa be used to monitor soil condition? Determining how individual taxa respond to a 119 range of environmental factors, and especially changes in soil variables brought about by 120 anthropogenic activity, may have important implications for how we monitor the health of our 121 soils in the future. 6 122 Materials and Methods 123 Sample collection 124 We collected samples between 2013 and 2014 from 110 sites in northern New Zealand 125 (Fig 1). Our sampling area covered approximately 29,500 km2 of land consisting of diverse soil 126 types; over half of this area is used for pastoral farming and horticulture, the remainder is D o 127 covered in forest, bare rock (volcanic cones), native tussock or urban areas ((25); Waikato w n lo 128 Regional Council, 2015 [http://www.waikatoregion.govt.nz/Environment/Environmental- a d e 129 Information/Environmental-indicators/Land-and-soil/Land/land1-key-points/]). We classified d f r o 130 sites to Soil Order level according to the New Zealand Soil Classification (26) and the World m h 131 Reference Base for Soil Resources (27). Soil Order classification included (and equivalent in tt p : / / 132 WRB) were Granular Soils (Ferralsols, n=23), Allophanic Soils (Andosols, n=25), Ultic Soils a e m 133 (Acrisols, n=17), Pumice Soils (Andosols, n=14), Gleys (n=11), Organic Soils (Histosols, n=8), .a s m 134 Brown Soils (Cambisols, n=8) and Recent Soils (Fluvisols & Arenosols, n=4). Sites are further . o r g 135 categorised as being dominated by indigenous forest, exotic forest, dairy pasture, dry stock / o n 136 pasture or horticulture ((28); Table S1). A p r 137 For microbial analyses, we collected five soil cores at each site (0-10 cm deep, 2.5 cm il 1 , 2 138 diameter), after removing leaf litter and plant biomass, across a transect at 10 m intervals. These 0 1 9 139 samples were kept on ice until they could be transferred to -20°C storage until further use. For b y g 140 soil chemical analyses (Table 1), we composited an additional 25 soil cores collected from the u e s t 141 same transect at 2 m intervals, while for soil physical analyses (Table 1) we took intact soil cores 142 (0-9 cm deep, 10 cm in diameter) each separated by 15 m (28). 143 Molecular methods 144 Before DNA extraction, we homogenised each thawed soil sample by manual mixing. 145 We used PowerSoil® -htp 96 well DNA Isolation Kits (Mo Bio Laboratories Inc., CA, USA) 7 146 following the manufacturer’s instructions, but with minor modification: (i) mechanical lysis was 147 performed by agitating the plates in a Qiagen TissueLyser II (Retch) for 4 minutes at a frequency 148 of 30 Hz, (ii) ethanol air drying time was extended to 15 min, and (iii) plates incubated at room 149 temperature for five minutes after elution buffer was added. In total, we extracted DNA from 550 150 samples, which we stored at -20 °C until further analysis. D o 151 To characterise the diversity and composition of soil bacterial communities in each w n lo 152 sample, we amplified V3/V4 regions of bacterial 16S rRNA genes from each soil extract using a d e 153 modifications of the primers 341F (5’- d f r o 154 TCGTCGGCAGCGTCAGATGTGTATAAGAGACAGCCTACGGGNGGCWGCAG-3) and m h 155 785R (5’- tt p : / / 156 GTCTCGTGGGCTCGGAGATGTGTATAAGAGACAGGACTACHVGGGTATCTAATCC- a e m 157 3’). This primer pair has been demonstrated to provide good coverage for bacteria and is .a s m 158 purposefully designed for optimal use on Illumina MiSeq DNA sequencing platforms (29). The . o r g 159 primers include Illumina adapter sequences (underlined) that are required for downstream / o n 160 sequencing. We amplified DNA from each sample, as well as mock community DNA (BEI A p r 161 Resources; item HM-783D), with the following amplification conditions: (i) 95 °C for 3 minutes; il 1 , 2 162 (ii) 25 cycles of 95 °C for 30 seconds, 55 °C for 30 seconds, 72 °C for 30 seconds, and then (iii) 0 1 9 163 72 °C for 5 minutes. We individually purified PCR products using SequalPrep Normalisation b y g 164 Plates (Invitrogen) or DNA Clean & Concentrator kits (Zymo Research), as per manufacturer’s u e s t 165 instructions. Finally, we measured and recorded the concentration of purified PCR products 166 using a Qubit® dsDNA HS Assay Kit (Life Technologies, USA), and normalised the 167 concentrations where required. The amplified material was then submitted to New Zealand 168 Genomics Ltd., for sequencing on an Illumina MiSeq instrument using 2 x 300 bp chemistry. 169 Prior to DNA sequencing, the sequencing provider attached a unique combination of Nextera XT 8 170 dual indices (Illumina Inc., USA) to the DNA from each sample, to allow for multiplex 171 sequencing. 172 Bioinformatic methods 173 The DNA sequence data were quality-filtered, after which we picked de novo operational 174 taxonomic units (OTUs) using USEARCH v 7.0 (30). Forward and reverse reads were merged D o w 175 using the fastq_mergepairs command. We truncated reads at the first position that had a quality n lo a 176 score (Q score) of less than 3, and set the minimum length of the merged read to 200 bp. We d e d 177 then trimmed the first 20 bp from the start of all the merged sequences using the -fastq_filter f r o m 178 command since this region had a high probability of error. Reads with more than one expected h t t 179 error were discarded. Finally, we dereplicated sequence data (-derep_fulllength), removed p : / / a 180 singletons (-sortbysize), and clustered sequences into OTUs at 97% sequence similarity, using e m . a 181 the UPARSE-OTU algorithm (31). s m . 182 We performed taxonomic assignment within QIIME (Quantitative Insights into Microbial o r g / 183 Ecology, version 1.8) by comparison against the Greengenes reference database version 13.8 o n A 184 (32) before randomly rarefying to a depth of 2,000 sequences per sample to achieve a standard p r il 1 185 sequencing ‘depth’ across all samples. , 2 0 1 186 Analysis of soil physicochemical and climatic data 9 b y 187 We coupled our bacterial and soil physicochemical datasets with further environmental g u e 188 data collated for each sampling location using ArcGIS 10.3 s t 189 (Environmental Systems Research Institute (ESRI), Redlands, CA). The extraction tools within 190 the spatial analyst toolset were used to obtain climate and site aspect data (Table S2), based on 191 the site location data (NZTM Eastings and Northings). When a large range of soil and climatic 192 variables are measured, several of these are usually correlated with each other, and this could 9 193 have undesirable effects for downstream analyses. We therefore identified highly correlated 194 explanatory variables (with a Pearson’s correlation value greater than 0.6; either negative or 195 positive), and only included one of the representative variables in downstream analyses. This led 196 us to keep 22 variables (we discarded 22 variables; Table S3). For all reported results, any 197 significant correlation of bacterial community composition, or taxa abundance, with a D o 198 representative explanatory variable could equally be caused by variability any of the removed w n lo 199 variables that correlated with the representative variable. a d e d 200 Statistical analyses fr o m 201 We removed 16 samples from our analysis because they had fewer than 2,000 DNA h t t p 202 sequence reads. To eliminate any biases associated with unequal coverage across sites, we : / / a e 203 calculated centroid bacterial community data for each site by taking the mean abundance value m . a 204 for each OTU from three randomly selected samples on each transect. We then assessed s m . o 205 differences in bacterial community composition by calculating the Bray-Curtis dissimilarities for r g / o 206 each pair of samples, using the averaged OTU abundances for each site. We also assessed n A p 207 differences in community composition at the taxonomic levels of phylum, class and genus. Bray- r il 1 208 Curtis dissimilarity matrices were generated in R v3.2.1 using the package ‘vegan’. , 2 0 1 209 We used non-metric multidimensional scaling (nMDS) of the Bray-Curtis dissimilarity 9 b y 210 matrix from phylum abundances to obtain site scores in compositional space using the Primer v.7 g u e 211 computer program (33). Using the ArcGIS kriging function in the extrapolation toolset, we then s t 212 mapped the first and second nMDS axis scores to generate a geographical representation of the 213 spatial patterns in community composition. 214 We used distance decay analysis and linear regression to further investigate patterns in 215 the bacterial communities, for each of the five land uses separately. For this, pairwise 10
Description: