ebook img

2ab assembly: a methodology for automatable, high-throughput assembly of standard biological parts. PDF

2.5 MB·English
Save to my drive
Quick download
Download
Most books are stored in the elastic cloud where traffic is expensive. For this reason, we have a limit on daily download.

Preview 2ab assembly: a methodology for automatable, high-throughput assembly of standard biological parts.

Leguiaetal.JournalofBiologicalEngineering2013,7:2 http://www.jbioleng.org/content/7/1/2 METHODOLOGY Open Access 2ab assembly: a methodology for automatable, high-throughput assembly of standard biological parts Mariana Leguia1,2,3, Jennifer AN Brophy1, Douglas Densmore1,4, Angel Asante1 and J Christopher Anderson1,2,3* Abstract There is growing demand for robustDNA assembly strategies to quickly and accuratelyfabricate geneticcircuits for synthetic biology. One application of this technology is reconstitution of multi-gene assemblies. Here, weintegrate a new software tool chain with2ab assembly and show that it is robust enough to generate 528 distinct composite parts withan error-free success rate of 96%. Finally,we discuss our findings in thecontext ofits implications for biosafety and biosecurity. Keywords: 2ab reaction, Automatedassembly,DNAfabrication, Synthetic biology Background first step, we use cost effective, fast and scalable DNA Synthetic biology uses a ground-up approach to genetic fabrication methodsto generate largesetsof explicit DNA engineering with a vision to solve the technical and constructs automatically. In the second step, we function- conceptual bottlenecks associated with the field. Standard allyscreenthoseDNAsviareadilyavailableassaysthatare biological parts have emerged as a mechanism through alsoautomatable,costeffective,fastandscalable.Addition- which researchers can emphasize the engineering compo- ally, we integrate these activities under a software tool nent of genetic engineering. To viably implement such a chainspecificallydevelopedforthesetypesofapplications. vision, however, a new experimental approach that can Bylinkingthesefunctionstogetherintoasingleprocesswe systematically identify and optimize biological function is not only uncover optimal solutions to a design problem, needed. Currently there are no established methodologies but also encapsulate important data about the parts from to encapsulate biological part function such that meaning- which they were composed. In turn, this leads to greater ful,quantitativepredictionsaboutgeneticcompositioncan insightandfasteroptimization. bemadeapriori.Althoughinpracticepost-hocanalysesof Toillustratehowthisalternateapproachcanbeimple- projects provide insights into the individual properties of mented in a synthetic biology lab we have focused on parts, most engineering strategies require either an ad-hoc the production of multi-subunit protein complexes. The search of intuitively-chosen constructs or library-based ability to produce functional protein from recombinant approaches. In the hands of experienced researchers these sources is essential in synthetic biology and for the methods can be quite effective for systems containing up scientific community at large. Nevertheless, producing to6genes[1].Nevertheless,beyondthisscalethesemeth- these proteins quickly and successfully, particularly in ods appear to be insufficient. We have implemented an the case of multi-subunit complexes, can be challenging. alternate approach to experimental design that uncovers Given an overall lack of tools available to quickly and optimalsolutionstodesignproblemsbylinkingautomated effectively identify appropriate conditions a priori, DNA fabrication to effective construct screening. In the production efforts are often inefficient, resulting in poor or failed attempts. In these cases, original constructs *Correspondence:[email protected] need to be modified, which is often costly, labor- 1DepartmentofBioengineering,UniversityofCalifornia,Berkeley,CA94720, intensive and time-consuming, particularly if multiple USA 2QB3:CaliforniaInstituteforQuantitativeBiologicalResearch,Universityof roundsofmodification arenecessary.Anattractivealter- California,Berkeley,CA94720,USA native to sequential, slow, expensive, multiple rounds of Fulllistofauthorinformationisavailableattheendofthearticle ©2013Leguiaetal.;licenseeBioMedCentralLtd.ThisisanOpenAccessarticledistributedunderthetermsoftheCreative CommonsAttributionLicense(http://creativecommons.org/licenses/by/2.0),whichpermitsunrestricteduse,distribution,and reproductioninanymedium,providedtheoriginalworkisproperlycited. Leguiaetal.JournalofBiologicalEngineering2013,7:2 Page2of16 http://www.jbioleng.org/content/7/1/2 modification is the generation, in parallel, of several show that 2ab assembly is robust enough to generate permutations of a particular genetic architecture, followed hundreds of large error-free DNAs of the type required by rapid functional screening of those permutations. in a single experiment within the paradigm of synthetic Currently however, the assembly of long, error-free DNAs biologywith a96%success rate. of the type required to build complex, tunable genetic circuitsistheprimarybottleneckinthefield. Results In recent years several DNA assembly strategies have SelectionanddesignofanappropriateDNAassembly been described, including Gibson, SLIC, Golden Gate, set:bi-cistronicoperonsofMediatorcomplexforprotein and various BioBrick approaches (for reviews see [2-5]). production Each methodhasarangeofadvantagesanddisadvantages, To test this new experimental approach we set out to and thus, different construction scenarios are best served generatealargesetofDNAconstructsthatwouldallowus by different technologies. Some of these methods can be to measure the robustness of our assembly methodology. used to assemble DNAs simultaneously. However, in the Further, we wanted to make use of assembled DNAs to context of the design paradigm described here, a key par- address a challenging biological problem of significant ameter to consider is whether a particular strategy can be current relevance. The expression and purification of implementedinanautomatedplatformthatisamenableto multi-subunit protein complexes in soluble form is one scaling.Clearly,somechemistrieswillautomatebetterthan such challenge, particularly because there are no tools that others. Thus, indentifying DNA fabrication protocols that can be used at the outset of an experiment to select strat- are compatible with automation and scaling is an essential egiesthatwillyieldsuccessfulresults.Proteinproductionis step towards easing the construction jam. Furthermore, influenced by multiple variables, including expression host, quick functional screens of assembled permutations, via construct composition and architecture, timing, rate, and establishedassaysthatcanalsobeautomatedandscaled,is temperature of expression, purification method, and so on. essential for the isolation and control of most, if not all, of Inthecaseofmulti-subunitcomplexes,potentialchallenges thevariablesaffectingaparticularproject. canbeexacerbatedbecauseindividualsubunitsmayrequire Here, we describe the construction of a set of 528 co-expressionand/orindependentregulationofexpression. distinct composite devices that sample a large combina- Thus, it may be necessary to consider separate control of torial space. Specifically, every device is a bi-cistronic additional variables, such as expression timing, rate and operon constructed out of 8 standard basic parts. stoichiometry,whichcanfurthercomplicateaproject. Operons encode two separate protein fusions containing As proof-of-concept for our 2ab assembly methodology an epitope tag either at the N- or C- terminus that is we put together a set of 528 unique plasmids encoding used to establish function. With support from software permutationsofbi-cistronicoperonsthatcouldbeusedto tools specifically developed for these types of DNA express, identify and purify distinct epitope-tagged fabrication experiments [6,7], we have generated all proteins known to form a stable complex in E. coli. The constructs automatically, using a methodology called proteins selected were Med7 and Med21, two subunits of 2ab assembly. 2ab assembly is a single iterative process, the S. cerevisiae Mediator complex. Mediator is a large based on the BglBricks standard [8], that is amenable to proteinmachine(morethan1MDainsize,containing20+ high-throughput automation and scaling [7]. Like other independent subunits) that functions as a key regulator of ™ BioBricks -based schemes, the BglBricks standard is eukaryotic transcription [9,10]. Despite its importance, based on unique restriction enzyme sites that flank the however,thesizeandcomplexityofMediatorposespecific 0 0 part (BglII at the 5 end and BamHI at the 3 end) and researchchallenges.Thus,toolsthatcanhelpaddresssuch generate compatible cohesive ends that can be joined difficultiesareusefultobegintoaddressexistingobstacles. back together. When two parts are joined using simple Med7 and Med21 were specifically selected as a suitable restriction digestion and ligation reactions, the resulting pair for this assembly because they have been previously composite part also contains unique BglII and BamHI purified in E. coli and because E. coli-expressed Med7 and 0 0 sites flanking the 5 and 3 ends, respectively, as well as a Med21 can form stable hetero-dimers [11]. Thus, such an small scar sequence encoding Gly-Ser separating input assemblysetwouldallowustotestthefunctionalityofour parts. As we show here however, 2ab assembly is not constructspost-assemblyinavarietyofways:forexample, solely dependent on the BglBricks standard. In fact, 2ab by measuring relative protein expression profiles of assembly is flexible enough to accommodate alternate individualsubunits;orbymeasuringthedegreeofprotein- assembly standards as well. Using simple ELISA assays proteininteractionbetweenseparatesubunits. we have probed a variety of parameters that influence We tagged each Mediator subunit with a small epitope protein expression in functional form. Further, we that would enable a variety of downstream assays, surveyed and identified optimal architectures that result includingbutnotlimitedto,detectionofindividualsubunit in the expression of functional protein complexes. We expressionlevelsandprotein-proteininteractions.Weused Leguiaetal.JournalofBiologicalEngineering2013,7:2 Page3of16 http://www.jbioleng.org/content/7/1/2 12 different tags (6X-His, Avi, E, FLAG, HA, HSV, followed by the second ORF (also consisting of 3 basic Myc, S,Strep,T7,V5andVSV)(fulldetailsprovidedin parts:Med21andoneepitopetag,separatedbyaTEVsite), Additional file 1) for which there are commercially- followed by a terminator. The Eugene programming lan- available antibodies (listed in Additional file 2). We varied guage,whichwasspecificallycreatedforthespecificationof thelocationofthetagrelativetotheproteinORF(eitherat synthetic biological designs from a collection of individual the N- or the C-terminus) and we sampled all possible standard parts [6], was used to generate our assembly set architectures:bothsubunitstaggedattheN-terminus[NN] given a defined set of composition constraints. These con- orC-terminus[CC];firstsubunittaggedattheN-terminus straints are declarative rules that either prevent or ensure and second subunit tagged at the C-terminus [NC]; and the appearance of specific tag combinations in a construct. vice-versa[CN](Figure1).Ineachcase,tagswereseparated The general bi-cistronic operon architecture can then be fromtheircorrespondingORFsbyaTEVproteasecleavage permuted while adhering to these constraints. Given 12 in order to facilitate tag removal if desired. Expression of tagsand4possibletaggingarchitectures(NN,NC,CNand every operon was placed under the control of an CC), the number of bi-cistronic operons needed to sample arabinose-inducible Pbad promoter and terminator. theentirecombinatorialspacewas528. All bi-cistronic operons were composed of 8 basic All 528 unique bi-cistronic operons were assembled parts each: a promoter to drive expression, followed by from a total of 31 basic parts (Additional file 1). Basic the first ORF (consisting of 3 basic parts: Med7 and parts encoding tags and ORFs were gene synthesized in one epitope tag, separated by a TEV cleavage site), two versions in order to simplify construction: either Figure1Assemblyofbi-cistronicoperons.(A)Basicpartsusedintheassembly.ORFsandtagsweresynthesizedaseitherN-orC-terminal sets.N-terminalsetscontainRBSsasacomponentofthepart,whileC-terminalsetscontainstopcodonsattheendofthepart.(B)Sample combinatorialarchitecturesforasetofconstructscontainingtwoORFs(Med7inthefirstpositionandMed21inthesecondposition)taggedat theN-orC-terminuswithdifferenttags(VSV-tagforMed7andE-tagforMed21).Everyoperoniscomposedof8basicparts.Forthis arrangement,4differentconstructarchitecturesarepossible:intheNNconfigurationbothORFsaretaggedattheN-terminus;intheNC configurationthefirstORFsistaggedattheN-terminusandthesecondORFistaggedattheC-terminus;intheCNconfigurationthefirstORFsis taggedattheC-terminusandthesecondORFistaggedattheN-terminus;intheCCconfigurationbothORFsaretaggedattheC-terminus. Leguiaetal.JournalofBiologicalEngineering2013,7:2 Page4of16 http://www.jbioleng.org/content/7/1/2 with RBSs and start codons at the beginning, or with composite parts starting from basic standard biological stop codons at the end. Tags in the first group, referred to parts.2abreactionsproceediniterativecyclesofrestriction as N-terminal tags, were designed to sit at the N-terminus digestion, ligation, transformation and selection of desired of Mediator subunits. They included an RBS (determined products.Thereactionenablesligationbydoubleantibiotic using the RBS calculator algorithm [12] set at a target selectionof50and30parts,designatedas“lefty”and“righty” translation rate of 10,000), an ATG start codon for initi- respectively,locatedontwodifferentassemblyvectors. ation of translation, and lacked an in-frame stop codon at The core of 2ab assembly is a set of highly engineered the end. Tags in second group, referred to as C-terminal plasmids containing two different antibiotic resistance tags, were designed to sit at the C-terminus of Mediator genes (from a total of three: A = ampicillin, C = chloram- subunits.TheydidnotincludeanRBSorastartcodon,but phenicol, K = kanamycin) separated by restriction site. they did contain anin-frame stop codonat theend. Medi- Given these three antibiotics, six different combinations of ator subunits were similarly synthesized. N-terminal ORFs assembly vectors are possible: AC, CK, KA, AK, KC and accepted N-terminal tags and did not include an RBS, but CA. For a given vector pair to work properly during 2ab did contain an in-frame stop codon. C-terminal ORFs assembly, the choice of vector pair is pre-determined acceptedC-terminaltagsandincludedanRBS(determined such that, once two plasmids recombine to form new as above), but lacked a stop codon at the end. Both Med7 architectures,desiredchildproductscanbeselectedaway and Med21 basic parts were made as truncations that from undesired products, as well as from parents, using expresswellinE.coliandretaintheirabilitytointeractwith differentialantibioticselection. each other [11]. Med7 was truncated N- and C-terminally Every assembly reaction is subject to the following andincludedaminoacids102–205,whileMed21wastrun- three constraints: First, the antibiotic resistance markers cated C-terminally only and included amino acids 1–132. on every assembly vector must be distinct from one The protease cleavage site part was gene synthesized another (vectors with two AA, CC, or KK genes are not withoutstartorstopcodonsbecauseitwouldseparatetags permitted). Second, the resistance marker on the right fromtheircorrespondingMediatorsubunits. position of the assembly parent donating the “lefty” part Assembly of all target plasmids was completed in 3 must be the same as the resistance marker on left stages of 2ab assembly, according to an assembly tree position of the assembly parent donating the “righty” generated by the AssemblyManager software tool, using part. As illustrated in Figure 2, an AK-KC vector pair a robotics platform containing a Biomek 3000 liquid can be used to assemble parts 1 and 2 together, in that handling robot [7,13]. AssemblyManager examined the order, provided that part 1, the “lefty” or 50 part, is on set of devices to be constructed as a group and created an AK, while part 2, the “righty” or 30 part, is on KC. optimalassemblytree.Optimalitywasdefinedbyaminimal Similarly, a CK-KA pair can be used to assemble parts 1 depth tree, where the stages indicated sets of parallel 2ab and 2 together, in that order, provided that part 1 is on assembly reactions. The number of steps (a single 2ab as- CK and part 2 is on KA, and so on. The final constraint semblyreaction)inthetreewasalsominimized.Stepswere is that the antibiotic combination associated with the furtherreducedbysharingintermediatereactions.Thetree correctly assembled child must be distinguishable from was then post-processed to assign antibiotic resistance the antibiotic combination associated with the child markers to input nodes. In the event of a conflicting anti- byproduct, as well as both of the parents. As illustrated biotic assignment, the tree was modified to accommodate in Figure 2, the AK-KCvector pair produces ACand KK thecorrectresistanceattheexpenseoftheoverallassembly combinations when recombined. Both of these are dis- cost. A set of heuristics were used to accomplish this tinguishable from each other and from the parents. Al- process[13].Specificallyforthisassembly,weused31basic though in theory 2ab assembly can be carried out using parts to make 56 composite parts (made of 2 basic parts a variety of assembly standards, we have developed this each)instage1.Instage2,weusedthepartsmadeinstage methodology for use with the previously described 1 to assemble 48 composite parts (made of 4 basic parts BglBricksstandard[8].Thus,mostofthepartsinassem- each).Inthe final stage, we used thepartsmade instage2 bly pairs described here are BglBricks parts flanked by 0 0 to assemble 528 composite parts (made of 8 basic parts BglII and BamHI restriction sites on their 5 and 3 ends, each). The target set of 528 plasmids contained a total of respectively. Further,thetwo antibiotic markersin every 3696 junctions between parts. However, most of these assemblyvectorareseparatedbyanXhoIsite.Modifica- junctions were not unique, and therefore in practice, only tions to this set-up, where alternate assembly standards 632totaljunctionswereneededtocompletetheset. like the BBF RFC10 have been used to compare the ro- bustness of BglBricks-based 2ab assembly, are specifically Definitionof2abassembly notedthroughout. 2ab assembly is a DNA fabrication methodology that Priortothefirstroundofassembly,“lefty”and“righty” uses 2ab reactions to build progressively more complex parts are defined by specific methylation. For this, lefty Leguiaetal.JournalofBiologicalEngineering2013,7:2 Page5of16 http://www.jbioleng.org/content/7/1/2 Figure2The2abreaction.Thesearecarriedoutin-vitroandusedtomakejunctionsbetweenpartslocatedondifferentassemblyvectors.The choiceofvectorpairforeachjunctionisdeterminedbyanassemblytreegeneratedbyAssemblyManager.AssemblyManagertakesintoaccount antibioticresistancemarkersonbothleftyandrightyinputplasmidsandgeneratesabinarytreecontaininglegalleftyandrightypairs.In practice,leftyandrightyelementsaremadebyharvestingplasmidsfromE.colistrainsthatspecificallymethylateeitherBglII(partJ72015)or BamHI(partJ72015)restrictionsites(methylatedsitesareshowninredandgrayedovertoindicateprotectionfromdigestion)[7,8].Isolated plasmidsarethencombinedinonepotanddigestedwithanenzymecocktailcontainingBglII,BamHIandXhoI.Giventhemethylationstateof eachplasmid,leftiesarecutwithBamHIandXhoI,whilerightiesarecutwithBglIIandXhoI.Followingdigestion,aligationreactionrecombines allfragmentstogenerateanewcompositepartwithadistinctpermutationofantibioticmarkers.Thedesiredchildproductcanthenbeselected awayfromundesiredproducts,includingparentplasmids,bygrowingitintheappropriatecombinationofantibiotics.Eachnewcompositepart cannowbeusediterativelyinadditional2abreactionstocreateprogressivelymorecomplexparts. and righty plasmids are transformed and harvested dictates differential patterns of restriction digestion. In from strains of E. coli that specifically methylate BglII “lefties,”BglIIrestrictionsitesaremethylated,andthus (part J72015) and BamHI (part J72015) restriction protected from digestion, while in “righties,” BamHI sites, respectively [7,8]. The methylation state of each restriction sites are methylated and protected from plasmid, which is different in “lefty” and “right” parts, digestion. Leguiaetal.JournalofBiologicalEngineering2013,7:2 Page6of16 http://www.jbioleng.org/content/7/1/2 In practice, 2ab reactions are carried out in-vitro in were performed in combination in order to ensure proper one pot. Following isolation from their corresponding partassemblyateverystage. methylation strains, “lefties” and “righties” are combined In the first screen, single colonies were streaked onto together and digested with an enzyme cocktail containing bothdouble(AC,CKorKA,dependingontheantibiotic BglII, BamHI and XhoI. Given the methylation state of combination marking the new 2ab junction generated) each, “lefty” is cut with BamHI and XhoI, while “righty” is andtriple(ACK)antibiotic combinations.Transformants cut with BglII and XhoI. This results in the generation of containingcorrectlyassembledpartswereabletogrowon twolinearfragmentsperplasmid.AsillustratedinFigure2, the appropriate double antibiotic combination (indicating AKisdigestedsuchthattheAresistancemarkerisnowon they contained growth markers associated with correctly the same fragment of DNA as the “lefty” or 50 part, while assembled parts), but were not able to grow on the the K resistance marker is on its own. Similarly, KC is triple antibiotic combination (indicating possible co- digested such that the C resistance marker is now on the transformation of one of the parent plasmids and/or samefragmentofDNAasthe“righty”or30part,whilethe inappropriate assembly). As shown in Table 1, the aver- K resistance marker is on its own. Following digestion, a age co-transformation rate was about 5% (111 of 2208), ligation reaction recombines all the fragments to generate indicating that the vast majority of transformants (2097 four distinct possible plasmid architectures. Two are the of 2208) contained growth markers associated with same as the parent plasmids, while the other two are properly assembled parts. Co-transformation rates recombinedchildproducts. ranged from 2% for junctions between AK pairs, to 26% Child plasmids are generated when BamHI overhangs for junctions between KC pairs. We routinely observe a of the “lefty” vector and BglII overhangs of the “righty” wide range of co-transformation rates. The variability is vectorareligatedtogetherononeend,andthetwoXhoI mostly linked to two factors: first, the identity of the overhangs are ligated together on the other end. The antibiotic pair (CK and KC pairs show higher rates of desired child product contains anewcomposite part and co-transformationthanAC,CA,AKandKApairswhen a new distinct permutation of antibiotic markers (AC in equivalent amounts of plasmid DNA are used); and this example), while the child byproduct contains no second, the concentration of plasmid DNA used for part and two copies of the remaining antibiotic marker transformation (higher amounts of plasmid DNA give (KK in this example). Given that each possible plasmid risetohigherratesofco-transformation). architecture contains a different antibiotic combination, Transformants showing the correct growth phenotype desired child products can be selected away from were further screened either by restriction mapping with undesired products, as well as from parents, simply by BglII, BamHI and XhoI, or by colony PCR amplification. growing them in the appropriate antibiotic combination. Even though colony PCR amplification is preferable as a Each new composite part can now be used iteratively in screening tool because it does not require extraction of additional 2ab reactions to create progressively more plasmid DNA, which is costly and time-consuming, the complex parts because all the elements necessary for sizes of many stage 1 and 2 products could not be 2ab assembly have been maintained: the new part is still clearly resolved from those of their parents with this 0 0 flanked by BglII and BamHI sites on its 5 and 3 ends, technique,andthus,wererestrictionmapped.Allproducts respectively; and two antibiotic resistance genes are still of stages 1 and 2 (104 total) were screened by restriction present and separated by an XhoI site. In order for the mapping, while all products of stage 3 (528 total) were next cycle of 2ab assembly to begin, new composite parts screenedbycolonyPCR. need to be specified as lefties or righties (and methylated A few clones from every stage of assembly, including accordingly), and compatible assembly vector partners those with correct,questionableandincorrectrestriction selected. In practice, these choices have been determined maps or amplicons, were selected for sequencing to bytheassemblytreegeneratedusingAssemblyManagerat verify part identity. As shown in Table 1, about 96% of theoutsetoftheassembly. all colonies screened contained correctly assembly parts. The percentages of properly assembled junctions ranged Screeningof2abassembly from 93% for CA and KC pairs, to 100% for AK and KA Tomeasuretherobustnessof2abassembly,everyjunction pairs. The results obtained by sequencing verify that our wasassembledtwice,inseparatereplicateexperiments.For restriction enzyme mapping and colony PCR screens every junction made, at least three independent colonies were stringent enough to assess assembly success accur- containing putative composite parts were screened. All ately. Of the clones showing correct restriction maps or colonies were subjected to several screens: the first was PCR amplicons sequenced (103 total), 100% contained growth phenotyping under various antibiotic combina- properly assembled parts. Of the clones with question- tions; the second was restriction mapping or colony PCR able maps or PCR amplicons sequenced (8 total), 50% amplification; and the last was sequencing. These tests contained properly assembled parts, indicating that we Leguiaetal.JournalofBiologicalEngineering2013,7:2 Page7of16 http://www.jbioleng.org/content/7/1/2 Table1SummaryofBglBricks-based2abassemblyresults Assemblystage 1 2 3 Totals junctiontype CA AK KC CK KA CA junctionsmade 28 14 14 24 24 528 632 coloniesscreenedbygrowthphenotype 84 42 42 156 156 1728 2208 co-transformants 3 1 11 12 7 77 111 %co-transformation 3.57 2.38 26.19 7.69 4.49 4.46 5.03 coloniesscreenedbymappingorPCR 28 14 14 48 48 622 774 coloniesw/correctmaporPCRphenotype 26 13 13 47 48 535 682 coloniesw/questionablemaporPCRphenotype 2 1 1 0 0 75 79 coloniesw/incorrectmaporPCRphenotype 0 0 0 1 0 12 13 coloniessequenced 9 1 14 9 8 73 114 coloniesw/correctmaporPCRsequenced 7 1 13 9 8 65 103 correctsequencingresult 7 1 13 9 8 65 103 coloniesw/questionablemaporPCRsequenced 2 1 1 n/a n/a 4 8 correctsequencingresult 0 1 0 n/a n/a 3 4 coloniesw/incorrectmaporPCRsequenced n/a n/a n/a 0 n/a 4 4 correctsequencingresult n/a n/a n/a n/a n/a 0 0 %correctassembly 92.86 100 92.86 97.92 100 95.06 96.45* *Averageof%assemblyratesforeveryjunctionatvariousassemblystages. had been conservative in our scoring of proper assembly colonies screened (3221 of 3486) were positive for GFP as assessed by restriction enzyme mapping and colony expression. The percentages of properly assembled junc- PCR. Of the clones with incorrect restriction maps or tions ranged from 91% for CK pairs, to 95% for AC pairs. PCR amplicons sequenced (4 total), none contained Toensurethatwewerenotunder-reportingthenumberof properlyassembled parts. correct assemblies, we sequenced the assembly products To further test the robustness of 2ab assembly, we extractedfrom26whitecolonies.Ofthese,nonecontained compared the success rate obtained by BglBricks-based properly assembled parts, indicating that our screens were 2ab assembly to that obtained using a different assembly robustenoughtoassessassemblysuccessaccurately. standard also in the context of 2ab assembly. For this, we chose the original BBF RFC10 standard described by Characterizationoferrors Knight [14], which uses SpeI and XbaI restriction sites In the process of assessing the robustness of our 2ab 0 0 flanking the 5 and 3 ends of parts, respectively. Specific- assemblymethodologyweidentifiedthreemainsourcesof ally,wecarriedoutasmallassemblyusingasplitversionof error. The first was co-transformation of plasmids, which super folder-GFP [15], expressed under a constitutive pro- has been briefly mentioned above. Co-transformation moter, that results in visibly green colonies in E. coli when results in colonies that are able to grow in 2ab combina- properly assembled. The lefty input part consisted of the tionsmarkingthenewlyassembledjunction,butthatmay first173aminoacidsofsf-GFPdrivenbythetetpromoter, or may not contain properly assembled parts. Instead, while the righty input part consisted of the last 64 amino these colonies contain antibiotic resistance markers acidsofsf-GFP.Inputpartsweresubclonedintoallsix2ab associated with correctly assembled junctions in one or assemblyvectorstotestassemblyofalljunctions.Following moreplasmidsthatmayormaynotincludetheassembled assembly, lefty and righty input parts are separated by a plasmidofinterest.Althoughtherewasaslightqualitative small scar sequence (ACTAGA) encoding Thr-Arg, which correlation between co-transformation and certain anti- does not interfere with the fluorophore. As shown in biotic combinations (CK and KC pairs showed higher Table 2, the average co-transformation rate was about 14% ratesofco-transformationthanAC,CA,AKandKApairs (174 of 1199), indicating that the vast majority of transfor- whenequivalentamountsofplasmidDNAareused)there mants(1025of1199)containedgrowthmarkersassociated was no correlation to part size or part source. Instead, with properly assembled parts. Co-transformation rates co-transformationwasdirectlylinkedtotheconcentration ranged from 12% for junctions between KA pairs, to 19% of plasmid DNA used in transformation reactions. for junctions between CK pairs. Proper assembly was Higher amounts of plasmid DNA give rise to higher further assessed by counting green colonies. About 92% of rates of co-transformation, thus, a simple way to reduce Leguiaetal.JournalofBiologicalEngineering2013,7:2 Page8of16 http://www.jbioleng.org/content/7/1/2 Table2SummaryofBBFRFC10-based2abassemblyresults Assemblystage 1 Totals junctiontype CA AK KC CK KA AC totaljunctionsmade 1 1 1 1 1 1 6 coloniesscreenedbygrowthphenotype 200 198 201 200 199 201 1199 co-transformants 29 26 27 37 24 31 174 %co-transformation 14.50 13.13 13.43 18.5 12.06 15.42 14.51 totalcoloniescounted 580 1011 274 366 688 567 3486 whitecolonies 45 83 24 34 48 31 265 greencolonies 535 928 250 332 640 536 3221 totalcoloniessequenced 5 6 5 2 7 1 26 whitecoloniessequenced 5 6 5 2 7 1 26 correctsequencingresult 0 0 0 0 0 0 0 greencoloniessequenced 0 0 0 0 0 0 0 correctsequencingresult n/a n/a n/a n/a n/a n/a 0 %correctassembly 92.24 91.79 91.24 90.71 93.02 94.53 92.40 themistodilutetheamount of DNAusedfor transform- limitation of the methylation strains themselves (which ation. Although co-transformation rates were not high havethepotentialtobesignificantlyimproved),ratherthan enough to present a problem in our assemblies, these of 2ab assembly per se, our reported success rate of 96% is colonies can be easily screened and eliminated by simul- a conservative estimate of the actual rate, indicating that taneously re-streaking on double and triple antibiotic our technology is indeed robust enough to construct large combinations. Colonies containing properly assembled error-freeDNAsetswithaveryhighsuccessrate. parts will be able to grow on two antibiotics marking the newly assembled junction, but they will not be able to Screeningthefunctionalityofassembledbi-cistronic growonallthree.Anycolonythatgrowsontripleantibiotic operons combinationsshouldbediscarded. Addressing the bottleneck around DNA fabrication is A second, less common source of error (12 instances only the first step towards developing high-throughput out of 774), results in colonies that exhibit the appropri- solutionsinsyntheticbiology.Thus,inadditiontoshowing ate biotope, but that contain unrecognizable plasmids. that our DNA assembly methodology is sufficiently robust In every one of these cases, the failure could be corre- to build large error-free DNAs with a ~96% success rate, lated back to poor quality parent input plasmid DNA wewereinterestedintestingthefunctionalityofassembled andcorrected byre-extracting theinputplasmid. constructs. To do this in a cellular context, we induced The last source of error resulted in the production of expression and used indirect ELISA assays to quantify junctionslackingwholeinputparts.Intotal,weidentified steady-state relative protein expression levels from every 80 instances of BglBricks-based 2ab assembled junctions, epitope-tagged Mediator subunit in all 528 bi-cistronic and 9 instances of BBF RFC10-based 2ab assembled operons (entire data set is provided in Additional files 3 junctions,containingassemblyscarsbutlackingeitherthe and4).Fora select subsetofoperons, wealsousedsand- entire lefty or righty input part. In the case of BglBricks- wich ELISA assays to probe protein-protein interactions based assembly, empty junctions arise as a result of and verify functionality of individually expressed proteins incomplete methylation of parent plasmid DNA. When (data not shown). For the purposes of this assembly input plasmids are incompletely methylated they are sub- however,itwasmuchmoreimportanttohaveacomplete ject to part loss during digestion steps with BglII and datasetthatwouldallowustocomparativelyassesoperon BamHI, which effectively eliminates the input part from functionality (quantified by measuring relative protein ex- the reaction but maintains the vector backbone available pressionlevelsviaindirectELISAs),thantoassesindividual for 2ab assembly. In the case of BBF RFC10-based assem- subunit functionality (quantified by measuring protein- bly, empty junctions arise as a result of incomplete heat- proteininteractionsviasandwichELISAs).Giventhesepri- inactivation of restriction enzymes following digestion orities, and given the large number of operons in the as- steps,whichhasthesameeffectasincompletemethylation sembly set (528 plasmids, assayed for two tags each, in of parent input DNA. Given that partial methylation is a triplicate),weused indirect ELISAs,which aresimplerand Leguiaetal.JournalofBiologicalEngineering2013,7:2 Page9of16 http://www.jbioleng.org/content/7/1/2 faster to carry out, to generate a complete data set that this we assembled a small sub-set of 24 bi-cistronic allowed us to make side-by-side comparisons and operonswhereweswitchedthelocationofthetwo ORFs, predictions. placing Med21 in the first position, and Med7 in the To assay the operons, they were first sorted by tag and second. We selected three tags (Avi, HSV and T7) that grouped together such that those with tags in common had shown discernible patterns in the first assembly and were assayed in tandem in replicate experiments. tested them as before. Figure 4 shows that, while the tag Figure 3 shows indirect ELISA data for a representative location patterns remained the same (both ORFs set of constructs sharing VSV and E tags. Given that expressed better when tagged C- rather N-terminally), each operon contains two different tags, and given 4 switching the location of the ORFs equalized protein possible tagging architectures (NN, NC, CN and CC), expression significantly. In the first assembly, where Med every pair of tags appears in 8 different constructs. In 7 was in the first ORF position and Med21 in the second the first four constructsVSV-Tag is fused to the first ORF ORF position,thefolddifference betweenthe two highest (Med7) and E-Tag to the second ORF (Med21). In the last expressingarchitecturesinthesub-set(Med7at44.82and four constructs, which are “reciprocal” architectures,VSV- Med21 at 12.06) was ~33 fold. In the second assembly, Tag is fused to the second ORF (Med21) and E-Tag to the where Med 21 was in the first position and Med7 the first (Med7). The results show that varying the location of second, the fold difference between the two highest thetagfromtheN-totheC-terminusofanORFcanhave expressingarchitecturesinthesub-set(Med21at9.48and a dramatic effect on protein expression levels. In this set, Med7at8.35)decreasedto~1fold.Controlofthisvariable Med7 expressed well when tagged at the C-terminus, but alone can have a significant impact on experimental significantlylesssowhentaggedattheN-terminus.Inturn, outcomes, which is particularly noteworthy given that Med 21 expressed at lower levels than Med7, either when producing protein in stoichiometric equivalences that taggedattheN-orC-terminus. enable formation of functional soluble complexes is often High-throughput approaches such as these enable more important than producing large quantities of non- rapid processing of large numbers of samples in parallel, functional single subunits. Taken together, our results which in turn allows the simultaneous visualization of confirmthatrecombinantproteinexpressionisinfluenced multiple sets of data such that patterns can become by multiple distinct variables that correlate with specific readily apparent. From these patterns, many of the ways sequences of DNA at specific locations in a plasmid. By in which multiple variables influence experimental enabling the rapid and effective construction of plasmid outcomes begin to emerge. To test the hypothesis that variants, 2ab assembly enables access and further control varying the location of the tags would have a more over various experimental conditions which, in turn, significant effect on protein expression levels than facilitate tuning of gene expression circuits used in varying the identity of the tag itself, we plotted protein syntheticbiology. concentrationvaluesobtainedviaELISAsinto“heatmaps” that included all possible tags and tagging combinations Discussion (Additional file 3). In the context of a heat map, the Wesharethevisionofsyntheticbiologyasafielddedicated pattern of expression for Med7 becomes particularly to solving the technical and conceptual bottlenecks of obvious. Regardless of which tag was used, Med7 expres- genetic engineering. Although we currently face several sion levels were significantly higher when the protein was challenges, it is clear that many of these obstacles can be tagged C-terminally. In some instances, Med7 expression attributed either to an insufficient survey of the design levelsinexcessof55foldoverbackgroundwereobserved. space or to a lack of knowledge about the molecular and AnexceptiontothispatternwasFLAG,whichresultedin cellular functions of added components. Both of these expression levels close to background in all architectures problems can be addressed by robust, cost-effective, rapid, tested.ThepatternofexpressionforMed21wasabitless and outsourced DNA fabrication platforms that lower obvious than that for Med 7 because overall Med21 barriers to fabrication and streamline assembly. Basic ™ expressed at lower levels. In cases where patterns were standard biological parts, such as BioBricks , as well as discernible, as in the case of tag combinations between standardassemblystrategiestojointhemtogetherhead-to- Avi, HSV and T7, for example, it was apparent that tail, have emerged as a way to deal with the potential Med21 expression tends to be higher when the protein is cloning problems associated with the uniqueness of each taggedC-terminally. DNA sequence. The rationale behind these approaches is Given the observed patterns we hypothesized that the that by standardizing parts and part junctions to conform location of an ORF with respect to the driving promoter to a particular set of rules, considerations of design and would also influence expression, and specifically, that function can be separated away from those pertaining to expressionfromORFslocatedproximallytothepromoter assembly.Furthermore,becausetheproductsgeneratedvia would be higher than from ORFs located distally. To test these methods are themselves standardized parts, a single, Leguiaetal.JournalofBiologicalEngineering2013,7:2 Page10of16 http://www.jbioleng.org/content/7/1/2 Figure3QuantificationofproteinexpressionfromdifferentiallytaggedORFs.(A)Sampleset-upforindirectandsandwichELISAassays usedtoquantifyproteinexpressionofindividualsubunitsandprotein-proteininteractions,respectively.(B)IndirectELISAdataforarepresentative setof8constructssharingVSVandEtags.Foranytwotags,8differentarchitecturesarepossible.InthefirstfourconstructsVSV-Tag(inblue)is fusedtothefirstORF(Med7)andE-Tag(inred)tothesecondORF(Med21).Inthenextfourconstructs,whichare“reciprocal”architectures,VSV- TagisfusedtothesecondORF(Med21)andE-Tagtothefirst(Med7).Constructswithtagsincommonareassayedintandem,replicate experiments.Reportedvaluesarenormalizedforproteinconcentrationandbackground.Controlexperimentsincludeantibodydrop-outs(not shown)andproteinextractslackingepitopetags.(C)ELISAvaluesplottedinto“heatmaps”tobettervisualizepatternsofexpression.Thetoptwo mapsshowaveragedvaluesfromreplicateexperiments,whilethebottomtwomapsshowthecorrespondingstandarddeviations.Mapsonthe leftshowMed7expressionpatternsfromthefirstORFposition(onthe“y”axis)andshouldbereadfromlefttoright.Mapsontherightshow Med21expressionpatternsfromthesecondORFposition(onthe“x”axis)andshouldbereadfromtoptobottom.Redandyellowcolorsindicate highandlowrelativeexpressionlevels,respectively.

See more

The list of books you might like

Most books are stored in the elastic cloud where traffic is expensive. For this reason, we have a limit on daily download.