
Nucleotide Sequences 1986/1987: Structural RNA, Synthetic, and Unannotated Sequences. A Compilation from the Genbank® and EMBL Data Libraries PDF

597 Pages·1987·9.087 MB·English


NUCLEOTIDE SEQUENCES 1986/1987

VOLUME VII
STRUCTURAL RNA, SYNTHETIC, AND UNANNOTATED SEQUENCES

A Compilation from the GenBank® and EMBL data libraries

Compiled by
Edwin J. Atencio,* Howard S. Bilofsky,† June Bossinger,† Christian Burks,* Graham N. Cameron,‡ Michael J. Cinkosky,* Carol E. England,* Victor I. Esekogwu,* James W. Fickett,* Brian T. Foley,* Walter B. Goad,* Gregory H. Hamm,‡ David J. Hazeldine,‡ Patricia Kahn,‡ Leslie Kay,* Frances I. Lewitter,† Natalie Lopez,* Kersti A. MacInnes,* Mia J. McLeod,* Debora L. Melone,† Gerald Myers,* Debra Nelson,* Judith L. Nial,‡ Joanna K. Norman,* Eric D. Rasmussen,* Andrea A. Revels,* Wayne P. Rindone,† Carol R. Schermer,* Maura T. Smith,* Guenter Stoesser,‡ C. David Swindell,† Brian L. Trujillo,* and Chang-Shung Tung*

* GenBank, T-10, Mail Stop K710, Los Alamos National Laboratory (LANL), Los Alamos, New Mexico 87545
† GenBank, BBN Laboratories Incorporated, 10 Moulton Street, Cambridge, Massachusetts 02238
‡ EMBL Nucleotide Sequence Data Library, European Molecular Biology Laboratory, Postfach 10 2209, D-6900 Heidelberg, Federal Republic of Germany

1987
ACADEMIC PRESS, INC.
Harcourt Brace Jovanovich, Publishers
Orlando San Diego New York Austin Boston London Sydney Tokyo Toronto

COPYRIGHT © 1987 BY ACADEMIC PRESS, INC. ALL RIGHTS RESERVED. NO PART OF THIS PUBLICATION MAY BE REPRODUCED OR TRANSMITTED IN ANY FORM OR BY ANY MEANS, ELECTRONIC OR MECHANICAL, INCLUDING PHOTOCOPY, RECORDING, OR ANY INFORMATION STORAGE AND RETRIEVAL SYSTEM, WITHOUT PERMISSION IN WRITING FROM THE PUBLISHER.

ACADEMIC PRESS, INC.
Orlando, Florida 32887

United Kingdom Edition published by
ACADEMIC PRESS INC. (LONDON) LTD.
24-28 Oval Road, London NW1 7DX

By purchasing or otherwise obtaining Nucleotide Sequences 1986/1987, recipient understands that the information contained in this compendium, which has been produced from the information contained in the European Molecular Biology Laboratory (EMBL) Nucleotide Sequence Data Library and the GenBank® database ("the information"), has come from a variety of sources, published and perhaps unpublished. The information has been deposited in the GenBank® database and the EMBL Nucleotide Sequence Data Library, and it has been reproduced for inclusion in this compendium via a reliable and quality controlled procedure, but no such process is infallible. Therefore, Academic Press, Inc. (AP), Bolt Beranek and Newman Inc. (BBN), Los Alamos National Laboratory (LANL), The European Molecular Biology Laboratory (EMBL), and the U.S. Government make no representations or warranties regarding the content or accuracy of the information. By way of example, but not of limitation, AP, BBN, LANL, EMBL, and the U.S. Government make no representations or warranties of merchantability or fitness for a particular purpose, or that the use of the information will not infringe any patent, copyright, trade secret, or trademark of any third person. AP, BBN, LANL, EMBL, and the U.S. Government accept no responsibility for any expenses, losses, or action incurred or undertaken by the recipient as a result of the receipt or use of the information.

Note that GenBank® is a registered trademark for the Genetic Sequence Data Bank established by BBN and LANL under contract with the U.S. National Institutes of Health and should be used only in that context.
Information from this compendium may be duplicated, reproduced, or otherwise used by the recipient, but in no event may the GenBank® trademark be associated with such re-generated information, and in no event shall there be any remedy furnished by AP, BBN, LANL, EMBL, or the U.S. Government for such re-generated information, including but not limited to financial remuneration or technical interaction.

Please note that the proper attribution of Nucleotide Sequences 1986/1987 as the source of your data and the public availability of this information in computer-readable form from BBN and EMBL will be appreciated.

Library of Congress Cataloging in Publication Data
Nucleotide sequences 1986/1987.
Includes indexes.
Contents: v. 1. Primates -- v. 2. Rodents -- v. 3. Other vertebrates and invertebrates -- [etc.]
1. Nucleotide sequence--Tables--Collected works. I. Atencio, Edwin J. II. GenBank. III. Europaeiske molekylaerbiologiske laboratorium. IV. BBN Laboratories. V. Los Alamos National Laboratory.
QP625.N89N85 1987 547.7'9 87-1782
ISBN 0-12-512517-8 (v. 7 : alk. paper)

PRINTED IN THE UNITED STATES OF AMERICA
87 88 89 90 9 8 7 6 5 4 3 2 1

Preface

This eight-volume compendium of nucleotide sequences found in the GenBank and EMBL databases is the third edition resulting from the combined efforts of all of the technical and administrative staff at Los Alamos National Laboratory, the European Molecular Biology Laboratory, and BBN Laboratories Incorporated listed on the title page.

Both the EMBL and GenBank databases have continued to grow at a remarkable rate, with each database doubling in size nearly once each year. We have organized this compendium in eight self-contained volumes, each of which is available separately. The first seven volumes each contain the same introductory and explanatory material, one or more sections of sequence entries, and several indices to the entries in that volume. Volume VIII contains a database directory and master indices to all of the volumes.

As a result of comments and suggestions we received in response to the previous edition, we have made several improvements in this edition. We hope that some slight adjustments in the layout and presentation of the sequence entries, including increasing use of mixed-case text and improvements in punctuation, will result in making them more easily readable than in the past.

Both databases are available in a variety of computer-readable forms. Additional information about obtaining the GenBank database can be obtained by writing to

GenBank
BBN Laboratories Incorporated
10 Moulton Street
Cambridge, Massachusetts 02238
USA

Further information about the EMBL Nucleotide Sequence Data Library can be obtained by writing to

EMBL Nucleotide Sequence Data Library
European Molecular Biology Laboratory
Postfach 10 2209
D-6900 Heidelberg
Federal Republic of Germany

Wayne P. Rindone
Cambridge, Massachusetts
November 17, 1986

Introduction

Outline

1. Introduction
   1.1 Description of the compendium
   1.2 The two databases
   1.3 New features of this edition
2. Contents of the Compendium
   2.1 General organization of the compendium
   2.2 Finding an entry
3. How to Read an Entry
   3.1 Summary of the entry fields
   3.2 The fields in detail
4. Two Sample Entries

1. Introduction

Nucleotide Sequences 1986/1987 is the third database compendium published as one result of a unique international collaboration between two leading nucleotide sequence data libraries, one based in the United States and one in Europe. The two databases are the EMBL Nucleotide Sequence Data Library, established by the European Molecular Biology Laboratory (EMBL), and the GenBank(R) Genetic Sequence Data Bank, which is a U.S. Government-sponsored nucleic acid sequence repository. Both databases serve molecular biologists and other investigators worldwide by collecting the large number of reported DNA and RNA sequences and making them available in computer-readable form.

1.1 Description of the compendium

The printed compendium makes the entire collection of information in both databases available to every member of the scientific community who wishes to use it, including investigators without access to computers. This compendium, drawn from the American and European databases, is the third printed compilation of substantially all nucleic acid sequences reported since 1967. These sequences and their associated annotations have been compiled from the published literature and from direct submissions from the authors by the GenBank staff at Los Alamos National Laboratory and by the EMBL data library staff at EMBL.

Although the format chosen for entries in the printed compendium differs somewhat from that in either database, every entry contains information contributed both by EMBL and by GenBank. The final preparation of the data in the compendium was carried out by the GenBank staff at BBN Laboratories Incorporated (BBN); therefore, the format and conventions used in the compendium are somewhat closer to those used in the GenBank database than to those used in the EMBL data library. Technical Appendix E illustrates how the compendium format relates to the formats used in the two databases from which it was constructed. One of the goals of the collaboration between GenBank and EMBL is continued movement toward common standards and conventions for the two databases.

The primary distribution medium for both databases is magnetic tape. The data in the compendium reflect the information found in GenBank Release 44.0 of August 1986. This information has been combined with the data included in EMBL Release 8.0, which was made available in May 1986. Regularly updated distribution tapes containing the EMBL Sequence Data Library are available four times annually. A new set of distribution tapes containing the entire GenBank database is also made available four times annually, and update tapes containing only entries that have been added or changed are available midway between each full GenBank release.

The sequences in this compendium are also available from GenBank on floppy diskettes. Because of limited storage capacity, only the sequences, some basic identifying information, and some of the biological annotations are included on this distribution medium. The remaining annotated information can be found in the compendium.

The GenBank database is available online on the DRR/NIH/PROPHET computer system, which can be accessed over Telenet, an international telecommunications network. The online database is updated every six weeks on the same schedule as the magnetic tape releases. This online service also provides users with access to the GenBank Software Clearinghouse, which contains information about commercially available software packages for analyzing and manipulating sequences.

For more information on the services provided by the GenBank and EMBL sequence libraries, please write:

GenBank
BBN Laboratories Inc.
10 Moulton St.
Cambridge, MA 02238 USA

European Molecular Biology Laboratory
Nucleotide Sequence Data Library
Postfach 10.2209
D-6900 Heidelberg
West Germany

1.2 The two databases

The EMBL Nucleotide Sequence Data Library was established in 1980 by the European Molecular Biology Laboratory, an international center of fundamental research with its main emphasis in the fields of cell biology, molecular structures, differentiation, and instrumentation. EMBL, whose headquarters is in Heidelberg, Germany, is currently funded by the following member states: Austria, Denmark, France, Federal Republic of Germany, Finland, Greece, Israel, Italy, the Netherlands, Norway, Spain, Sweden, Switzerland, and the United Kingdom. The first release of the EMBL data library was in April 1982.

The GenBank database was created in 1982 by the National Institute of General Medical Sciences (NIGMS) of the U.S. National Institutes of Health (NIH). Los Alamos National Laboratory (LANL), which is operated by the University of California for the Department of Energy, is located in Los Alamos, New Mexico. LANL gathers, annotates, and organizes the database and transmits it to BBN Laboratories Incorporated, a research and consulting firm in Cambridge, Massachusetts. The collected information is prepared for release by BBN and distributed to subscribing institutions and scientists in regular updates. Cosponsors of the GenBank project include the National Cancer Institute, the National Institute of Allergy and Infectious Diseases, the National Library of Medicine, the National Institute of Arthritis, Diabetes, and Digestive and Kidney Diseases, and the Division of Research Resources (DRR) of NIH, as well as the National Science Foundation, the U.S. Department of Energy, and the U.S. Department of Defense. GenBank's first release was in October 1982.

1.3 New features of this edition

The Citation Index has been added to assist readers in finding bibliographic citations for journal articles. This new index lists journal title, volume number, page numbers, and year of publication for each article cited.

As a result of limited resources and an ever-increasing rate of sequence publication, it has not been possible to collect and present all sequences in the fully annotated form that we would like. It is nevertheless vitally important that at least as much raw sequence data as possible be presented. Therefore, we have a new section entitled Unannotated Sequences, which contains unannotated and unclassified sequences and citations. We hope that in the future we will have the resources to move this information rapidly into its proper position in the main database.

A separate volume is now available that contains master indices for the entire database as well as a master directory for all of the entries in the database.

2. Contents of the compendium

As combined in this compendium, the two databases contain a total of nearly 8.5 million bases from 6700 articles.

The Structural RNA section includes the sequences of mature transfer RNA, ribosomal RNA, small nuclear RNA, and other structural RNA molecules. All structural RNA genes and most structural RNA precursor sequences are listed with their organisms in their particular sections.

The Synthetic Sequences section includes any nucleic acid sequence that is created in a laboratory and does not occur naturally, including synthetic plasmids that are not included with the other bacterial sequences. The major exceptions to this rule are cDNA sequences, since they are regarded as a means of sequencing naturally occurring RNA sequences.

2.1 General organization of the compendium

The entries in the compendium are presented in thirteen sections; within each section the entries are grouped according to the source organism. These sections are arranged in eight volumes, as follows:

Volume I. Primates
   Section 1. Primate Sequences
Volume II. Rodents
   Section 2. Rodent Sequences
Volume III. Other Vertebrates and Invertebrates
   Section 3. Other Mammalian Sequences
   Section 4. Other Vertebrate Sequences
   Section 5. Invertebrate Sequences
Volume IV. Plants and Organelles
   Section 6. Plant Sequences
   Section 7. Organelle Sequences
Volume V. Bacteria and Bacteriophage
   Section 8. Bacterial Sequences
   Section 9. Bacteriophage Sequences
Volume VI. Viruses
   Section 10. Viral Sequences
Volume VII. Structural RNA, Synthetic, and Unannotated Sequences
   Section 11. Structural RNA Sequences
   Section 12. Synthetic Sequences
   Section 13. Unannotated Sequences
Volume VIII. Database Directory and Master Indices

Each volume of the compendium contains this introduction, one or more sections of data, technical appendices, and indices to that volume. The Author Index, the Citation Index, the Taxonomic Classification Index, the Keyword Phrase Index, the Accession Number Index, the EMBL Entry Index, and the GenBank Entry Index in Volume VIII are master indices to all of the volumes in this edition.

2.2 Finding an entry

Users approaching the database for the first time must determine which section contains the sequence they are looking for. Most of the sections are self-explanatory, but it is helpful to point out the following conventions:

Yeast and fungal sequences are in the Plant Sequences section.

Plasmids and transposons isolated from bacteria are listed in the Bacterial Sequences section.

The following indices are provided to assist users in finding the information they need: the Keyword Phrase Index, the Taxonomic Classification Index, the Author Index, the Citation Index, the Accession Number Index, the EMBL Entry Index, and the GenBank Entry Index. Most of the entries are annotated to indicate the locations within the reported sequences of coding regions and other experimentally determined sites of biological significance. Full bibliographic information is included in every entry, and many of the entries also include comments abstracted from the original papers. Technical appendices located after the main data sections in each volume contain detailed explanations of information in the entries.

The individual entries within each section are arranged alphabetically by entry name. Summary tables and section directories are included at the beginning of each section to provide some guidance for locating the entries. Table 1 is an overall summary table of the entire database. This table shows the names of the sections, as well as the numbers of reported sequences, distinct entries, and nucleotide bases in each section. There are typically more reported sequences than entries because overlapping sequences are frequently merged into a single, combined entry.

A table that summarizes the entries appears at the beginning of each section. This table is called the Section Summary. The Section Summary for the Primate Sequences section, for example, lists, by organism (e.g., Ape), the corresponding organism code (e.g., APE), the number of reported sequences for that organism, the number of entries, the number of bases, and the page number on which this group of entries begins.

Note that the page numbers throughout are arranged separately for each section. The numbers are printed on each page with a short section prefix. For example, the first three pages of Section 1: Primate Sequences are numbered PRIMATE-1, PRIMATE-2, and PRIMATE-3. Table 1 shows the page number prefix for each section.

A detailed alphabetized directory for the section appears immediately after the Section Summary. The section directory contains one line of information for each entry in the section and serves as a complete table of contents for that section, listing the full entry name, the description and length of each entry (i.e., number of base pairs), and the page on which each entry appears.

3. How to Read an Entry

The entries for each section begin after the section directory. Each entry is separated from the next by a dashed line running the width of the page. There are two types of entries in the compendium: (1) self-contained, and (2) segmented. Segmented entries are used when noncontiguous pieces of the same nucleic acid molecule have been sequenced and the ordering of the pieces is known.
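The section-prefixed page numbering described above (PRIMATE-1, PRIMATE-2, and so on) can be sketched in a few lines of Python. This is an illustration only: the names `SECTION_CODES`, `page_label`, and `parse_page_label` are hypothetical, and the sketch assumes that each section's page prefix is the section code listed in Table 1, which the text confirms only for PRIMATE.

```python
# Hypothetical sketch of the section-prefixed page labels (e.g. "PRIMATE-3").
# Assumption: the page prefix of a section equals its Table 1 section code.
SECTION_CODES = {
    1: "PRIMATE", 2: "RODENT", 3: "MAMMAL", 4: "VERT", 5: "INVERT",
    6: "PLANT", 7: "ORGANELLE", 8: "BACT", 9: "PHAGE", 10: "VIRAL",
    11: "RNA", 12: "SYNTHETIC", 13: "UNANNOTATED",
}


def page_label(section_number: int, page: int) -> str:
    """Build a label such as 'PRIMATE-3' for a page within a section."""
    if page < 1:
        raise ValueError("pages are numbered from 1 within each section")
    return f"{SECTION_CODES[section_number]}-{page}"


def parse_page_label(label: str) -> tuple[int, int]:
    """Split a label back into (section number, page within the section)."""
    code, _, page = label.rpartition("-")
    for number, known_code in SECTION_CODES.items():
        if known_code == code:
            return number, int(page)
    raise ValueError(f"unknown section prefix: {code!r}")


if __name__ == "__main__":
    print([page_label(1, p) for p in (1, 2, 3)])
    print(parse_page_label("RNA-12"))
```

Because pages restart in every section, a label is only meaningful together with its prefix, which is why the parser returns the section number alongside the page.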
Table 1: Summary of Sequences Presented in Each Section

Section  Section      Section                     Number of  Number of  Number of
Number   Code         Description                 Sequences  Entries    Bases
-------  -----------  --------------------------  ---------  ---------  ---------
 1       PRIMATE      Primate Sequences                1492       1028    1240779
 2       RODENT       Rodent Sequences                 1638       1272    1161212
 3       MAMMAL       Other Mammalian Sequences         293        245     244554
 4       VERT         Other Vertebrate Sequences        557        474     400509
 5       INVERT       Invertebrate Sequences            696        605     435280
 6       PLANT        Plant Sequences                   717        594     643365
 7       ORGANELLE    Organelle Sequences               434        368     485666
 8       BACT         Bacterial Sequences              1310        749    1034165
 9       PHAGE        Bacteriophage Sequences           338        160     271871
10       VIRAL        Viral Sequences                  1748       1093    1517025
11       RNA          Structural RNA Sequences          734        637      69232
12       SYNTHETIC    Synthetic Sequences               259        224      72029
13       UNANNOTATED  Unannotated Sequences            1377       1374     919833

Overall Summary:                                      14113       8823    8442357

3.1 Summary of the entry fields

Each entry is composed of several kinds of information, referred to here as fields. Not every field appears in every entry, but the full list of possible fields, in the order in which they appear, is as follows:

Entry Name - a short, unique name providing the label for the entry.

Definition - a brief description of the sequence, beginning with the name of the source organism.

Segment - indicates which segment this entry is in a series of separated sequences from the same molecule.

EMBL ID - entry name(s) in the EMBL database that correspond to the entry names in this work.

Accession Numbers - short codes that provide unique, unchanging identifiers for the data in each entry; the first number in the list is known as the primary accession number of the entry.

Date - the year, month, and day when this form of the entry appeared in the GenBank version of the database, plus information on whether the entry is preliminary or complete.

References - citations for all references used to construct each entry.

Keywords - short phrases describing gene products and other information pertinent to looking up an entry.

Source - most commonly used name of the source organism, followed by a formal scientific name.

Comment - information that does not readily fall into the other fields, including information abstracted from the references and cross-references to other entries.

Features and Sites Tables - tables designed to describe locations and regions of biological significance within the sequence.

Origin - describes the start of a sequence in relation to an experimentally determined site.

Sequence - statistics on the numbers and kinds of bases in the sequence, followed by the sequence itself.

3.2 The fields in detail

ENTRY NAME
EMBL "ID" Names and GenBank "Locus Names"

The entry name is a short, unique name that provides the label for an entry. In order to organize this compendium in a coherent fashion, it was necessary to choose a uniform method for naming all of the entries, regardless of which database the information was extracted from. By mutual agreement, we have presented the entries under the names assigned to them in the GenBank database. The conventions for choosing these names, which include abbreviations for the organisms from which the nucleic acids were isolated, are described in detail in Technical Appendix A: Entry Name and Molecule Type Conventions.

The GenBank entry names have been called "locus names" throughout this book, and there are many occasions where one entry refers to another "locus" or another group of "loci;" this terminology is simply a way of referring to other entries. The entry names used for corresponding information in the EMBL Sequence Library are given after the label "EMBL ID:" in the second line of each entry. Not all entries have been assigned EMBL ID names at this stage of our collaboration, but eventually all entries will be assigned names in both databases, and we are actively moving toward a common naming system for corresponding entries in the two databases.

The GenBank Entry Index lists all of the GenBank entry names alphabetically, together with the section name and page number on which the entry begins. The other indices refer to GenBank names, not entry page numbers, since these are the names used in organizing the book. The page numbers must be looked up in the GenBank Entry Index.

DEFINITION

The definition of an entry provides a brief description of the sequence. This definition is used to construct the listing for the entry in the section directory. Typically it includes the name of the organism and other important information describing the entry. Information about the type of molecule and whether the sequence presented is circular or a complete tandem repeat is included in brackets at the end of the definition for most entries. The conventions used in specifying the molecule type are described in detail in Technical Appendix A: Entry Name and Molecule Type Conventions.

See Example 1 for an example of a typical pair of entries.

ANIMTCYB1: a.nidulans mt apocytochrome b (cob) gene; exon 1. [DNA]    SEGMENT: 1 of 2
EMBL ID: MIAN02    ACCESSION NUMBERS: J01388 V00165    DATE: updated 83-11-01
REFERENCES: [1] (bases 1 to 838) Waring,R.B., Davies,R.W., Lee,S., Grisi,E., Berks,M.M. and Scazzocchio,C.; "the mosaic organization of the apocytochrome b gene of aspergillus nidulans revealed by dna sequencing"; Cell 27, 4-11 (1981)
KEYWORDS: cytochrome; apocytochrome.
SOURCE: aspergillus nidulans. Mitochondrion Aspergillus nidulans
COMMENT: Single intron of about 1050 bp occupies same position as I3 in "long" S.cerevisiae gene. Open reading frame of exon 1 continues at least 200 bp into ivs. TGA codes for trp. See <hummt> and <ystmtcyb>. See other loci beginning <animtcyb>
SITES:
  key        site  span  description
  refnumbr      1     1  numbered -125 in [1]; zero not used.
  ->pept      126     1  coba coding sequence start
  pept/IVS    632     0  coba ivs1 start (exon1 end)
FEATURES:
  key    from      to  description
  pept    126  +  631  apocytochrome b (exon 1)
FEATURES:
  key    from      to  description
  CDS     126     631  apocytochrome b part 1 (631 is 2nd base in codon)
  IVS     632    >838  intron I
ORIGIN: near hindiii site in bglii fragment 4.
SEQUENCE: 838 bp  320 a  112 c  132 g  274 t
    1 ataataaacg ataattaaaa ttaaaatatt aatcattaat ctttaagttt tatatacatg taaataaaaa aaaaaaaaaa ttaaaattat aaagaatagg
  101 aaaaaaaata aaaaaaaaaa aaataaatag ttatgtaaaa agtcatctca tcttaaaaga taattacagt tataatataa gttcacacct ccaagctaat
  201 tatagttatt ataaattttc gggatctaat tatgctttat gtttaggtaa ctaaataacg atggagtttc aatgtcatag tcattactca atcgatgtat
  301 cagaagtcaa attttcttag gagctattat agagaattgg aataaatctc agattacggt tatacctact tacattaacg acttctatgc ctttcttttt
  401 agtatttaca cacaagtgag aaggtattta ttagtgatct tacaaaacct aaacagtcta tacatgaatg gtcgtaacta agatacatga tattaattgg
  501 gccaccattg ccatgtgttt gattttacct tagtgtcata agagtttata gggtgcgtta tctaaatcta accatatgta cggtatatca gcattaggtc
  601 aagattgatt tgatgtattt tgaaggggtt tatcaacatg agaaccacta cgaggtgatc atgtgttaaa aatcccttat gagtctagag aatccactca
  701 cttggaattg tcatactgaa tcctttattt aataagtttt ataatgagcg tagaaattcg aatgacgagc agaagtacac gaggggagga tagttatcat
  801 acttgcaaac gctcagtaca gctacagtgg gacaatct
------------------------------------------------------------------------------------------------------------------
ANIMTCYB2: a.nidulans mt apocytochrome b (cob) gene; exon 2. [DNA]    SEGMENT: 2 of 2
EMBL ID: MIAN03    ACCESSION NUMBERS: J01389 V00652    DATE: updated 83-11-01
REFERENCES: [1] (bases 1 to 1082) Waring,R.B., Davies,R.W., Lee,S., Grisi,E., Berks,M.M. and Scazzocchio,C.; "the mosaic organization of the apocytochrome b gene of aspergillus nidulans revealed by dna sequencing"; Cell 27, 4-11 (1981)
KEYWORDS: cytochrome; apocytochrome.
SOURCE: aspergillus nidulans. Mitochondrion Aspergillus nidulans
COMMENT: Single intron of about 1050 bp occupies same position as I3 in "long" S.cerevisiae gene. Open reading frame of exon 1 continues at least 200 bp into ivs. TGA codes for trp. See <hummt> and <ystmtcyb>. See other loci beginning <animtcyb>
SITES:
  key        site  span  description
  IVS/pept     77     0  coba exon2 start (ivs1 end)
  pept<-      734     1  coba coding sequence end
FEATURES:
  key    from      to  description
  pept  +  77     734  apocytochrome b (exon 2)
FEATURES:
  key    from      to  description
  CDS      77     731  apocytochrome b part 2 (77 is 3rd base in codon)
  IVS      <1      76  intron I
ORIGIN: about 750 bp after animtcyb1
SEQUENCE: 1082 bp  373 a  123 c  140 g  446 t
    1 gatcaagtaa aaatattagt ctgtataagg atagatgtta atatttatta taaatctgat catcaatact aaaatgctgc ttaaatcaaa attctgataa
  101 caagttctgt ctattatcca attttttact cttttagttg actgcttcta tatagagtca tttaattaag tcgcatcgaa gttaagagga gttccacatt
  201 ttatgagttc ttcgatattc agatagtaac tcttctttcc gtatttttat attataagat tatatataat catatttatc ttttattatt tgtacaatat
  301 ttgtttttac tctgcttaag tcttatgggt atagatagat atatgttgac ttagatcctt cgaaataccc acctgcgtta ttacagcaag tatcattttt
  401 acctttgcct taattatatg tatctaactc taattataat aatggtgttt aagtcatgtt cgttcgatta tatgcatttg gattaatccg tataactgat
  501 ttattacaat taagagagca aagttttagc acttagttaa aagttaagtt ctatttatgt ttagcttaca tcttaaataa ttgtcagaga ttgacaaaac
  601 acgtataagt cccattgtaa atttgttgaa caatttctta acttattttt tagtcattac tttttataga tgttccgttg ttattgatat ttagaaatac
  701 tttagtatgt aaaagagcta aaaaaacatt tatattcttg atcctctgta aaagaacaaa aaatattaat aataagctca gaattattat tataagaatg
  801 atattagaac aaaatatatt aaaaagtatg aatagcttta atctgatata atcaatttat taaattttgt tgtttcatac ttcattttag gttaatcata
  901 agtatgaaat tgataaataa tgattttctt taaagttatg agacccttta aatattatta tataatatta ttatttgctt tagagattaa ttaaatacaa
 1001 tataattaat tttagatgga gttatggcaa atggttgtcg tttttagttg caaattaaga taatgggact gattttcgcg cc

Example 1. Two segmented entries from the Organelle Sequences section.
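The sequence block of each entry in Example 1 follows a regular layout: a summary line giving the length and the count of each base, then numbered lines of one hundred bases in groups of ten. A minimal Python sketch of that layout is given below. The function names and the width of the line-number column are assumptions made for illustration; nothing here is taken from the software used to produce the compendium.

```python
from collections import Counter


def summary_line(seq: str) -> str:
    """Return a summary like '838 bp 320 a 112 c 132 g 274 t' for a DNA sequence."""
    counts = Counter(seq.lower())
    parts = [f"{len(seq)} bp"]
    parts += [f"{counts[base]} {base}" for base in "acgt"]
    return " ".join(parts)


def sequence_lines(seq: str, per_line: int = 100, group: int = 10) -> list[str]:
    """Lay out a sequence as numbered lines of `per_line` bases in groups of `group`."""
    lines = []
    for start in range(0, len(seq), per_line):
        chunk = seq[start:start + per_line]
        groups = [chunk[i:i + group] for i in range(0, len(chunk), group)]
        # Each line is labelled with the 1-based position of its first base.
        # The 5-character number column is an assumption for this sketch.
        lines.append(f"{start + 1:>5} " + " ".join(groups))
    return lines


if __name__ == "__main__":
    demo = "acgt" * 30  # 120 bases, so two output lines
    print("SEQUENCE:", summary_line(demo))
    for line in sequence_lines(demo):
        print(line)
```

Comparing the counts in the summary line with the listed bases is a cheap consistency check when an entry has been retyped or scanned.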
SEGMENT

In those cases where an entry is segmented, a segment field is used to indicate which segment this entry is in a series of noncontiguous sequences from the same molecule. Segmented entries contain a label after the molecule type, at the end of the first line of the entry, which indicates the position of this entry in the group of segmented entries. The number at the end of the entry name also indicates which segment of a complete sequence is contained in this entry.

ACCESSION NUMBERS

Accession numbers have been assigned to all entries in the EMBL Sequence Library and the GenBank database, using a system that was worked out jointly by the two database teams. These arbitrary labels for the data included in an entry consist of a single letter followed by five digits; unlike the entry names, they carry no information about the type or nature of the entry. Accession numbers never change, but rather follow along with the data they point to no matter how the entry or entries in question might be reorganized in future versions of either database. For example, if two different entries with different accession numbers are merged into a single entry, both accession numbers are included in the new entry.

The first number in the list of accession numbers is the primary accession number for the entry. If you cite information contained in an entry in either database, you are encouraged to include the primary accession number in your citation, since this number will enable your readers to find the data in question in future releases of either database. The Accession Number Index lists all accession numbers that have been assigned to date and the entries to which they have been assigned.

DATE

The date field contains a date in the form year-month-day, preceded by the word "entered", "updated", "pre-entry", or "unannotated". The date given is the date of the most recent GenBank release in which this entry underwent major revisions. If the word "entered" appears before the date, this means that this entry was first entered in the database on the date given and that it has not undergone any substantial revisions since that release. If there has been some substantial revision, such as the addition of another reference, the date of the most recent revision is given preceded by the word "updated". If the word "pre-entry" appears, it indicates that although the complete sequence and some pertinent annotations appear in this entry, there are additional annotations that will be included once review of the article in question is completed. Pre-entries sometimes undergo several rounds of updates and revisions before they are upgraded to full entries. The date field in all of the entries in the Unannotated Sequences section begins with the word "unannotated".

REFERENCES

This field includes the number (in brackets) assigned to each cited paper; a brief statement of which information in the entry comes from each reference (shown in parentheses); and the actual citation of the article. Sequence data submitted directly to the EMBL or GenBank databases and not published elsewhere are listed as unpublished references. These references generally have no title; they simply list the contributors as the authors and include the word "Unpublished" followed by the year in parentheses and the address of the contributor. See Technical Appendix B: Citation Reference Conventions for further information on reference format.

KEYWORDS

The keywords field contains short words or phrases that identify gene products and other useful identifying characteristics of the entries. The Keyword Phrase Index provides a means for looking up all entries that have common keyword phrases; if a particular entry has no keyword phrase, this indicates that none of the phrases in the keyword index apply to it. A small number of pre-entries show the word "unassigned" in place of any keyword phrases; this word indicates that the entry has not yet been reviewed to determine which, if any, of the keyword phrases apply to it.

SOURCE

The source field is an English language statement about the organism and tissue from which the sequence was isolated. If the entry contains a viral sequence, the host organism is usually also named. This statement is followed by a formal designation of the source organism, typically consisting of the full scientific name. The Taxonomic Classification Index lists all entries according to their formal taxonomic classifications.

COMMENT

The comment includes information that does not readily fall in other fields, such as statements abstracted from the references and cross-references to other entries.

FEATURES AND SITES TABLES

Three different tables can appear in each compendium entry: the EMBL features table, the GenBank features table, and the sites table. All three are designed to describe regions and locations of biological significance within the sequence. The two features tables show regions, with starting and ending points for each feature of interest. The sites table, on the other hand, shows individual locations of interest within the sequence, together with a number that indicates whether the location is a single point or encompasses multiple bases. The features tables presented come from both databases; those with lowercase keywords in the key column are in GenBank format, while those with uppercase keywords are in EMBL format. The EMBL features tables are included in those cases where the numbering system used in the EMBL entry corresponds to the numbering used in the GenBank entry and the EMBL table provides some information that augments the information found in the other two tables. There is often considerable redundancy in the information contained in these three tables. Full information about the conventions used in constructing these tables appears in Technical Appendix C: Sites and Features Tables.

ORIGIN

The origin field describes the start of the sequence (the 5' terminus) in relation to some experimentally determined site, such as a restriction enzyme cutting site.

SEQUENCE

The first line in the sequence field gives the total number of base pairs, and the number of adenines, cytosines, guanines, thymines (or uracils), and other bases reported in the sequence. After this information, the actual sequence is listed. The bases in the sequence are numbered line by line, and they are presented one hundred bases per line, in groups of ten.

4. Two Sample Entries

Example 1 is a reproduction of a typical pair of segmented entries found in the Organelle Sequences section. In this example, the first entry consists of the 5' portion of a particular sequence and the second consists of the 3' portion of the same sequence. The entries in this excerpt have only been partly converted to full mixed-case presentation. When there are free text portions of an entry that are not converted to mixed case, they are shown in lower-case characters. The common prefix for the two entry names is ANIMTCYB; therefore, the first entry is called ANIMTCYB1 and the second ANIMTCYB2. The segment field at the end of the first line of each of these entries states explicitly that the first entry is the first of two segments and the next is the second of two segments.

The entry names in this example consist of several parts. The first three characters in both names are "ANI", an abbreviation for the fungus Aspergillus nidulans. The letters "MT" indicate that these are mitochondrial sequence entries, and "CYB" is an abbreviation for apocytochrome b. The last character in each entry name is the segment number.

These two entries also illustrate the use of some of the special symbols that can appear in both styles of features tables. The "+" signs in the GenBank features tables indicate that the coding region resumes in another segment. The "<" and ">" characters in the EMBL features tables indicate that the intervening sequence extends beyond the ends of the reported sequences. The apparent differences in the numbers reported in the three tables in these entries are due to systematic differences in the conventions that have been used in the EMBL and GenBank databases. For example, the GenBank features table reports that the coding sequence in ANIMTCYB2 terminates at base 734, while the corresponding EMBL features table reports the end of the coding region as base 731. This apparent difference merely reflects the GenBank convention of including the termination codon in the reported coding region and the EMBL convention of excluding the termination codon.

The origin lines, which follow the features and sites tables in these entries, indicate that the sequence in ANIMTCYB1 starts near a particular restriction enzyme cut site and that the sequence in ANIMTCYB2 is separated from that in ANIMTCYB1 by
See approxaitme7l5y0 b as.e s Techcnail AppendAi xf ora desrcipitono ft he connvteoinsa nda brbeivatnisuo seidn asisnginge nyt r Thes equefniceeli sdt hel ast fiinte hleed n tyr. naems. Thef irsltnie o ft hef iresntt rsyh owtsh atth eraere 838b aspeai risn t hiss equceen,w hicihcn uldes3 20 Thed efiniftolilowoni ntgh ee nty rnagmiev etsh e adennei,s 121 cytonse,is 132g uannei,s and2 74 nameo f thes ourcoera gnism andi noofmrtahiteorn thymeis.n Fianlly, thea ctusaelq ueinsc lesi teda nd describtihnesg e qucee.n Fore xapml,e "a.n idaunlsm t clearnlumyrb ee.d apocytroocmheb (ocb)a gen;e exoln" i ndicattheast thee ntrcya lleAdNI MTCcYoBnla tisn exon1 of the Plesaer efetort het echniacpapelin cdeasf tetrh e mitcohondrcioabla gceondeif nogra p oyctorcohmbe i n data sectioinns e achv olmuef orad dtioinadle tlasi Aspregisl nliudaunl.s Then otat"i[oNADn] i"m emdiately aboutth ceo vnenitnosu sed priens entthieen ngt riiens follwoign thed efintiino indicattheatsth es equence thicsol lcetnio. Thei ndicaetts h een do fe acvho ulme represae dnotusb lrea-nsdtDeNdAm oleec.u l incdleub rieexfp ltainnoas ainndsr tutcinos fort hier us.e On the secdo nlnie of eache nytr, the correspondEiMnBgLI D nameasr eg iv.e nThee nrty callAeNdI MTCcYorBrel spotnodt sh eEM BLe nty rcalled MIA0N2, andA NIMTCcYoBr2re spontdosM IAN.0 3The accessniuomnb earpsp enaexrt. Eacohf theseen trs ie has tawcoc essniuomnrb s,e snicet hespear tuilcar entrwieersoe r iingalelnytr eedi ndepnetnldyie n each datasbe.a Thed atfei elatdt heen d tohfe osnedlc nie indicattheatsth eseen trs iweere rmoescte nrtelvyi sed int hGee nBarneekla sdea te1dN ovem1b9e8.r3 Thel sito fr eeferncebse gionnst het hdi lrnieo f theseen tr.i eTshen otati"o[nl( ]ab se1s t o8 3)8" indicattheatsth er eefrencei nt hef irsetn yt ris reefrerd toa sr eferen[clea] n dt haitt i st hes ource oft hbea sensu mbefrreod1m t o 8i3nt8 h ee nytr. The remaindoefr t her eferenclesi tinigs a farily connvteoinacli tatfioortnh pea rticruelefarrec ne. 
Thek eyworfdise ladp paersn ex.t Theset wo entriceasn bleok oedu p usintgh et wok eywords "cytoomceha"rn d" paoyctocmher"io nt hKee yrwdo Phrase Ind.e xThen extf ieltdh,es our,c lesitsf irsthte commounsleynd a moef t hes ourocrgea nsim,A sepriglsl u nidaunl,s folwleodb y thes ciefnitcin ameu setdo classiyf theo ragnisimnt het axonocmliacsf siication inedx, MitocnhdroioAns eprgilnliuudsl a.n sInm any cas,e ssucahs t hosient hiesx amep,tl het wop artosf thseo urfciee alrd eso mewrheaudtn dta.n Thec omemn,tb eginnointn hge n extl nie oft he enytr, givebsr ieifn formaatbiooutnth ee nyt r(atken frotmh er eefrnec)ea ndr eefrst he reaodetrh elrot coi (netri)e tshahta vnea mebse ginn"iAnNgI M"TC.Y B Thretea blaepsp enaerx itn e acohf t ehsee ntr:i es a sitetsa blaen dt wof eatutraebesls. Allt hreoef thestea bliensd ictahtpeeo ritnoso ft hessee quences that coadpeo cyftororoc mhbe; t hiisn cdleust hbea ses nubmerefdr o1m2 6t o1 31i nA NIMTCaYnBdbl a se7s7 to 734i nA NIMTC. YTB2heE MBLf eatures tables were inucdleidn t hestew oe ntiress incet heye xpilcitly statteh er eadifnrga mfeo r tchdoeo nsi n these xiv NucleiodteS equen1c9e8s6 /1987 Seciton1 1:S turcturRaNlAS equences SectiSounmm ary Code SourocfeS equence ReportEsn trieBsa sePsa ge AAURR A.a uriucl-ajuda(eu Mshrmo)ro RNA 1 1 11R8N A1-2 AC ARR A.c asltaleni( mAoe)b raRNA 2 2 281RN A1-2 ACYTR AnaycstitsRN A 3 3 251RN A1-2 AE DR R A.e duli(suM shr)o roRmNA 1 1 11R8N A-13 AFLRR Aseprgliulsf lavruRsN A 1 1 191 RN1A3- AHYRR Aeormonas hiyldrarR oNpAh 1 1 11R8N A1-4 ANGRR Aseprgliulsn igerrRN A 1 1 191 RN1A4- ANI RR Aspregliulsn idulraRnNsA 4 4 476R NA1-4 A PERRA cremonpieurimsc nuim rRNA 1 1 191 RN1A5- A UPRR Anthocpeurnocst a(toHunrswo r)t rRNA 2 2 227RN A1-5 AQUTR Agmeellnumq uadlriuapctutmR NA 1 1 76R NA1-5 AVIRR Azotaocbtevri nenldiai rRNA 1 1 120RN A1-5 BACRR Baicllusa cidocaldraRrNiAu s 1 1 171 RN1A6- BBRR Baiclulsb revriRsN A 1 1 171 RN1A6- BFIRR Baciulslf irmursR NA 1 1 161 RN1A6- BHARR Benechkaerav eyi 1 1 122RN 
A1-6 BJARR Blephasrmiaj paonciumr RNA 1 1 120RN A1-7 BLYRR Baryl reRNA 1 1 38R NA1-7 BLYTR BarlteRyN A 1 1 76 R1N7A - BMERR Baicllusm eagteirum 1 1 161 RNA-17 BMORR Bombmyoxr riR NA 2 2 243R NA-18 BMOTR Bombmyoxr (iiS lwkomr)t RNA 8 7 549R NA-18 BNATR Brassniacpau( saR peesd)e tRNA 1 1 76R NA-20 BOVMTTBRo vei MnitoocdnhritaR NA 14 12 858R NA1- 2 BOVTR Bovei tnRNA 12 9 715RN A-24 BPAR R Baiclluspa tseurii rRNA 1 1 171 RNA-27 BRARR BrachoipordR NA 1 1 11R9N A-27 BSI RR Blsatcoldaielsliapm lerxR NA 1 1 11R8N A-27 BSTTR Bacilluss tearomtohpiehlsri tRNA 4 4 313RN A-28 BSURR Baiclluss ubtilriRsN A 1 1 11R6N A-29 BSUTR Baiclluss ubitlsi tRNA 13 11 844R NA-29 BVORR Bresusalv aorarsR NA 1 1 120R NA-32 CCRIR Copruiscn ienerusr RNA 1 1 11R8N A-32 CCOUR Cryptheicuomcd oihnnuiRiN A 2 2 261 RNA-33 CELTR Caenhoarbtdiiesle gans 1 1 82R NA-33 CHKCR Chicketnar nlsatoniaclon troRNlA 1 1 102 RNA-33 CHKRR Chikcenr RNA 5 4 437R NA-34 CHKTR ChciketnR NA 1 1 75R NA-35 CHKUR ChcikeunR NA 6 6 771R NA-35 CRARR Copirunsr adiartRuNsA 1 1 181 RNA-36 CRERR ChlamydormRoNnAa s 4 4 571 RNA-36 CSTTR Clostruimds itrckilndaiitR NA 1 1 25R NA-37 CTURR Coelopsoirumt ussliaginriRNsA 1 1 181 RNA-37 CYAT R Cyanotbeiarucm( lBue GArlegae)en t RNA 1 1 76R NA-38 DACCPRDRry oprtieasc muinatCah lrooplarsRtN A 2 2 225 RNA-38 DACRR Dryoprtieasc uimnatraR NA 1 1 12R1N A-38 DDE RR Dacrymdyelcqeiuse secnsr RNA 1 1 181 RNA-39 DJA RR Duegsijapa onircRaN A 1 1 120 RNA-39 DOGSR Other sDtourgc turRaNlA 1 1 149R NA-39 DR ORRD rosohpilraR NA 5 4 383R NA-39 DROSR OtheDrr oshoiplsatu rcturRaNlA 1 1 299R NA-40 DROTR DroshoipltaR NA 7 7 530R NA-40 DRO UR DroshoipluaR NA 5 4 531RN A-43 DUK URD ucukR NA 1 1 78R NA-44 EARRR Eqiusuemta rven(soHer steai lr)R NA 2 1 120 RNA-44 EBI RR Eisenibai ccyli(srB owAnl ag)r RNA 1 1 181 RNA-44 ECO PRE .c olein zaytimcc omponReNnAt 1 1 375 RNA-45 ECO RRE .c olriR NA 47 34 4604 RNA-45 ECTOR E.c oltiR NA 48 37 2945 RNA-53 EGCRR Emplecntemogar aiclreR NA 2 2 239R NA-65 EGRCPTERu glegnraai cliCsho 
lropsltta RNA 1 1 76R NA-65 EGRRR Euglegnraai clirsR NA 1 1 12R1N A-66 EGRTR Euglegnraai clitsR NA 2 2 15R1N A-66 ES ERRE ndoyplhlm usepmervirvRiN A 1 1 181 RNA-67 ESPTR Eupuhsaisap ertbRaN A 1 1 75R NA-67 EVARR Exobiadisumv acicniriR NA 1 1 11R8N A-67 EWORR Eupoltewso oudfrfriR NA 1 1 120R NA-68 FSBRR Bonfyi srhR NA 4 4 522 RNA-68 FSBTR Bonfyi sthR NA 1 1 75R NA-69 GCLRR Gynmosporma cnlgvaiaiureaformreR NA 1 1 11R8N A-69 GCORR Graicalricao mprersRsNaA 1 1 12R1N A-69 RNA-1
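
The numeric conventions described in the introduction for the sample entries lend themselves to a quick check. The following is a minimal sketch in Python, not part of the original compilation: the function names are hypothetical, the figures are the ones quoted for ANIMTCYB1 and ANIMTCYB2, and the exact position of the line number in `format_sequence` is an assumption (the text states only that bases are numbered line by line, one hundred per line, in groups of ten).

```python
STOP_CODON_LENGTH = 3  # a termination codon is three bases long


def base_counts_consistent(total, a, c, g, t, other=0):
    """Return True when the per-base counts add up to the reported total."""
    return a + c + g + t + other == total


def embl_coding_end(genbank_end):
    """Convert a coding-region end from the GenBank convention
    (termination codon included) to the EMBL convention (excluded)."""
    return genbank_end - STOP_CODON_LENGTH


def format_sequence(seq, per_line=100, group=10):
    """Lay out a sequence per_line bases to a line, in groups of `group`.

    Assumption: each line is prefixed with the number of its first base.
    """
    lines = []
    for start in range(0, len(seq), per_line):
        chunk = seq[start:start + per_line]
        groups = [chunk[i:i + group] for i in range(0, len(chunk), group)]
        lines.append(f"{start + 1:>6} " + " ".join(groups))
    return "\n".join(lines)


if __name__ == "__main__":
    # First line of the ANIMTCYB1 sequence field: 838 bp in total.
    print(base_counts_consistent(838, 320, 121, 123, 274))  # True
    # GenBank reports the coding sequence ending at base 734.
    print(embl_coding_end(734))  # 731
    # A made-up 120-base sequence, used only to show the layout.
    print(format_sequence("acgt" * 30))
```

The three-base offset accounts for the 734 versus 731 discrepancy between the two features tables, and the count check is a simple way to catch transcription errors in the first line of a sequence field.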
