niman Posted October 22, 2016 Report Share Posted October 22, 2016 (edited) The Broad Institute has release full Santo Domingo Zika sequences. Edited October 22, 2016 by niman Link to comment Share on other sites More sharing options...
niman Posted October 22, 2016 Author Report Share Posted October 22, 2016 Zika virus isolate Zika virus/H.sapiens-wt/DOM/2016/BB-0208-SER polyprotein gene, complete cds 10,659 bp linear RNA Accession: KY014300.1 GI: 1087986131 GenBankFASTAGraphics Zika virus isolate Zika virus/H.sapiens-wt/DOM/2016/BB-0216-SER polyprotein gene, complete cds 10,466 bp linear RNA Accession: KY014302.1 GI: 1087986135 GenBankFASTAGraphics Zika virus isolate Zika virus/H.sapiens-wt/DOM/2016/BB-0127-SER polyprotein gene, complete cds 10,658 bp linear RNA Accession: KY014303.1 GI: 1087986143 GenBankFASTAGraphics Zika virus isolate Zika virus/H.sapiens-wt/DOM/2016/BB-0180-SER polyprotein gene, complete cds 10,659 bp linear RNA Accession: KY014304.1 GI: 1087986146 GenBankFASTAGraphics Zika virus isolate Zika virus/H.sapiens-wt/DOM/2016/BB-0076-SER polyprotein gene, complete cds 10,346 bp linear RNA Accession: KY014305.1 GI: 1087986151 GenBankFASTAGraphics Zika virus isolate Zika virus/H.sapiens-wt/DOM/2016/BB-0436-SER polyprotein gene, complete cds 10,601 bp linear RNA Accession: KY014314.1 GI: 1087986175 GenBankFASTAGraphics Zika virus isolate Zika virus/H.sapiens-wt/DOM/2016/BB-0269-SER polyprotein gene, partial cds 10,341 bp linear RNA Accession: KY014318.1 GI: 1087986185 GenBankFASTAGraphics Zika virus isolate Zika virus/H.sapiens-wt/DOM/2016/BB-0115-SER polyprotein gene, complete cds 10,643 bp linear RNA Accession: KY014321.1 GI: 1087986191 GenBankFASTAGraphics Link to comment Share on other sites More sharing options...
niman Posted October 24, 2016 Author Report Share Posted October 24, 2016 LOCUS KY014300 10659 bp RNA linear VRL 21-OCT-2016 DEFINITION Zika virus isolate Zika virus/H.sapiens-wt/DOM/2016/BB-0208-SER polyprotein gene, complete cds. ACCESSION KY014300 VERSION KY014300.1 DBLINK BioProject: PRJNA344504 BioSample: SAMN05861584 KEYWORDS . SOURCE Zika virus ORGANISM Zika virus Viruses; ssRNA viruses; ssRNA positive-strand viruses, no DNA stage; Flaviviridae; Flavivirus. REFERENCE 1 (bases 1 to 10659) AUTHORS Baniecki,M.L., Barnes,K.G., Bosch,I., Freije,C.A., Gehrke,L., Gladden-Young,A.D., Gnirke,A., Luo,C.Y., MacInnis,B., Matranga,C.B., Metsky,H.C., Park,D.J., Qu,J., Sabeti,P.C., Tomkins-Tinch,C.H., West,K.L., Winnicki,S., Wohl,S. and Yozwiak,N.L. TITLE Direct Submission JOURNAL Submitted (20-OCT-2016) Viral Genomics, Infectious Disease Program (Infectious Disease Initiative), Broad Institute, 75 Ames St, Cambridge, MA 02142, USA COMMENT ##Assembly-Data-START## Assembly Method :: github.com/broadinstitute/viral-ngs v. v1.12.0-51-g8588fdb Assembly Name :: DOM_2016_BB-0208_SER-1 Coverage :: 141x Sequencing Technology :: Illumina; Swift LC ##Assembly-Data-END## FEATURES Location/Qualifiers source 1..10659 /organism="Zika virus" /mol_type="genomic RNA" /isolate="Zika virus/H.sapiens-wt/DOM/2016/BB-0208-SER" /isolation_source="serum" /host="Homo sapiens" /db_xref="taxon:64320" /country="Dominican Republic: Santo Domingo" /collection_date="20-Apr-2016" /collected_by="Hospital General de la Plaza de la Salud, Santo Domingo, Dominican Republic" 5'UTR <1..88 CDS 89..10360 /note="contains structural and nonstructural proteins" /codon_start=1 /product="polyprotein" /protein_id="AOY08521.1" /translation="MKNPKKKSGGFRIVNMLKRGVARVSPFGGLKRLPAGLLLGHGPI RMVLAILAFLRFTAIKPSLGLINRWGSVGKKEAMEIIKKFKKDLAAMLRIINARKEKK RRGADTSVGIVGLLLTTAMAAEVTRRGSAYYMYLDRNDAGEAISFPTTLGMNKCYIQI MDLGHMCDATMSYECPMLDEGVEPDDVDCWCNTTSTWVVYGTCHHKKGEARRSRRAVT LPSHSTRKLQTRSQTWLESREYTKHLIRVENWIFRNPGFALAAAAIAWLLGSSTSQKV IYLVMILLIAPAYSIRCIGVSNRDFVEGMSGGTWVDVVLEHGGCVTVMAQDKPTVDIE LVTTTVSNMAEVRSYCYEASISDMASDSRCPTQGEAYLDKQSDTQYVCKRTLVDRGWG NGCGLFGKGSLVTCAKFACSKKMTGKSIQPENLEYRIMLSVHGSQHSGMIVNDTGHET DENRAKVEITPNSPRAEATLGGFGSLGLDCEPRTGLDFSDLYYLTMNNKHWLVHKEWF HDIPLPWHAGADTGTPHWNNKEALVEFKDAHAKRQTVVVLGSQEGAVHTALAGALEAE MDGAKGRLSSGHLKCRLKMDKLRLKGVSYSLCTAAFTFTKIPAETLHGTVTVEVQYAG TDGPCKVPAQMAVDMQTLTPVGRLITANPVITESTENSKMMLELDPPFGDSYIVIGVG EKKITHHWHRSGSTIGKAFEATVRGAKRMAVLGDTAWDFGSVGGALNSLGKGIHQIFG AAFKSLFGGMSWFSQILIGTLLMWLGLNTKNGSISLMCLALGGVLIFLSTAVSADVGC SVDFSKKETRCGTGVFVYNDVEAWRDRYKYHPDSPRRLAAAVKQAWEDGICGISSVSR MENIMWRSVEGELNAILEENGVQLTVVVGSVKNPMWRGPQRLPVPVNELPHGWKAWGK SYFVRAAKTNNSFVVDGDTLKECPLKHRAWNSFLVEDHGFGVFHTSVWLKVREDYSLE CDPAVIGTAVKGKEAVHSDLGYWIESEKNDTWRLKRAHLIEMKTCEWPKSHTLWTDGI EESDLIIPKSLAGPLSHHNTREGYRTQMKGPWHSEELEIRFEECPGTKVHVEETCGTR GPSLRSTTASGRVIEEWCCRECTMPPLSFRAKDGCWYGMEIRPRKEPESNLVRSVVTA GSTDHMDHFSLGVLVILLMVQEGLKKRMTTKIIISTSMAVLVAMILGGFSMSDLAKLA ILMGATFAEMNTGGDVAHLALIAAFKVRPALLVSFIFRANWTPRESMLLALASCLLQT AISALEGDLMVLINGFALAWLAIRAMVVPRTDNITLAILAALTPLARGTLLVAWRAGL ATCGGFMLLSLKGKGSVKKNLPFVMALGLTAVRLVDPINVVGLLLLTRSGKRSWPPSE VLTAVGLICALAGGFAKADIEMAGPMAAVGLLIVSYVVSGKSVDMYIERAGDITWEKD AEVTGNSPRLDVALDESGDFSLVEDDGPPMREIILKVVLMTICGMNPIAIPFAAGAWY VYVKTGKRSGALWDVPAPKEVKKGETTDGVYRVMTRRLLGSTQVGVGVMQEGVFHTMW HVTKGSALRSGEGRLDPYWGDVKQDLVSYCGPWKLDAAWDGHSEVQLLAVPPGERARN IQTLPGIFKTKDGDIGAVALDYPAGTSGSPILDKCGRVIGLYGNGVVIKNGSYVSAIT QGRREEETPVECFEPSMLKKKQLTVLDLHPGAGKTRRVLPEIVREAIKTRLRTVILAP TRVVAAEMEEALRGLPVRYMTTAVNVTHSGTEIVDLMCHATFTSRLLQPIRVPNYNLY IMDEAHFTDPSSIAARGYISTRVEMGEAAAIFMTATPPGTRDAFPDSNSPIMDTEVEV PERAWSSGFDWVTDHSGKTVWFVPSVRNGNEIAACLTKAGKRVIQLSRKTFETEFQKT KHQEWDFVVTTDISEMGANFKADRVIDSRRCLKPVILDGERVILAGPMPVTHASAAQR RGRIGRNPNKPGDEYLYGGGCAETDEDHAHWLEARMLLDNIYLQDGLIASLYRPEADK VAAIEGEFKLRTEQRKTFVELMKRGDLPVWLAYQVASAGITYTDRRWCFDGTTNNTIM EDSVPAEVWTRHGEKRVLKPRWMDARVCSDHAALKSFKEFAAGKRGAAFGVMEALGTL PGHMTERFQEAIDNLAVLMRAETGSRPYKAAAAQLPETLETIMLLGLLGTVSLGIFFV LMRNKGIGKMGFGMVTLGASAWLMWLSEIEPARIACVLIVVFLLLVVLIPEPEKQRSP QDNQMAIIIMVAVGLLGLITANELGWLERTKSDLSHLMGRREEGATIGFSMDIDLRPA SAWAIYAALTTFITPAVQHAVTTSYNNYSLMAMATQAGVLFGMGKGMPFYAWDFGVPL LMIGCYSQLTPLTLIVAIILLVAHYMYLIPGLQAAAARAAQKRTAAGIMKNPVVDGIV VTDIDTMTIDPQVEKKMGQVLLIAVAVSSAILSRTAWGWGEAGALITAATSTLWEGSP NKYWNSSTATSLCNIFRGSYLAGASLIYTVTRNAGLVKRRGGGTGETLGEKWKARLNQ MSALEFYSYKKSGITEVCREEARRALKDGVATGGHAVSRGSAKLRWLVERGYLQPYGK VIDLGCGRGGWSYYAATIRKVQEVKGYTKGGPGHEEPVLVQSYGWNIVRLKSGVDVFH MAAEPCDTLLCDIGESSSSPEVEEARTLRVLSMVGDWLEKRPGAFCIKVLCPYTSTMM ETLERLQRRYGGGLVRVPLSRNSTHEMYWVSGAKSNTIKSVSTTSQLLLGRMDGPRRP VKYEEDVNLGSGTRAVVSCAEAPNMKIIGNRIERIRSEHAETWFFDENHPYRTWAYHG SYEAPTQGSASSLVNGVVRLLSKPWDVVTGVTGIAMTDTTPYGQQRVFKEKVDTRVPD PQEGTRQVMSMVSSWLWKELGKHKRPRVCTKEEFINKVRSNAALGAIFEEEKEWKTAV EAVNDPRFWALVDKEREHHLRGECQSCVYNMMGKREKKQGEFGKAKGSRAIWYMWLGA RFLEFEALGFLNEDHWMGRENSGGGVEGLGLQRLGYVLEEMSRIPGGRMYADDTAGWD TRISRFDLENEALITNQMEKGHRALALAIIKYTYQNKVVKVLRPAEKGKTVMDIISRQ DQRGSGQVVTYALNTFTNLVVQLIRNMEAEEVLEMQDLWLLRRSEKVTNWLQSNGWDR LKRMAVSGDDCVVKPIDDRFAHALRFLNDMGKVRKDTQEWKPSTGWDNWEEVPFCSHH FNKLHLKDGRSIVVPCRHQDELIGRARVSPGAGWSIRETACLAKSYAQMWQLLYFHRR DLRLMANAICSSVPVDWVPTGRTTWSIHGKGEWMTIEDMLVVWNRVWIEENDHMEDKT PVTKWTDIPYLGKREDLWCGSLIGHRPRTTWAENIKNTVNMVRRIIGEEEKYMDYLST QVRYLGEEGSTPGVL" 3'UTR 10361..10659 ORIGIN 1 tcagactgcg acagttcgag tttgaagcga aagctagcaa cagtatcaac aggttttatt 61 ttggatttgg aaacgagagt ttctggtcat gaaaaaccca aaaaagaaat ccggaggatt 121 ccggattgtc aatatgctaa aacgcggagt agcccgtgtg agcccctttg ggggcttgaa 181 gaggctgcca gccggacttc tgctgggtca tgggcccatc aggatggtct tggcgattct 241 agcctttttg agattcacgg caatcaagcc atcactgggt ctcatcaata gatggggttc 301 agtggggaaa aaagaggcta tggaaataat aaagaagttc aagaaagatc tggctgccat 361 gctgagaata atcaatgcta ggaaggagaa gaagagacga ggcgcagata ctagtgtcgg 421 aattgttggc ctcctgctga ccacagctat ggcagcggag gtcactagac gtgggagtgc 481 atactacatg tacttggaca gaaacgatgc tggggaggcc atatcttttc caaccacatt 541 ggggatgaat aagtgttata tacagatcat ggatcttgga cacatgtgtg atgccaccat 601 gagctatgaa tgccctatgc tggatgaggg ggtggaacca gatgacgtcg attgttggtg 661 caacacgacg tcaacttggg ttgtgtacgg aacctgccat cacaaaaaag gtgaagcacg 721 gagatctaga agagctgtga cgctcccctc ccattccact aggaagctgc aaacgcggtc 781 gcaaacctgg ttggaatcaa gagaatacac aaagcacttg attagagtcg aaaattggat 841 attcaggaac cctggcttcg cgttagcagc agctgccatc gcttggcttt tgggaagctc 901 aacgagccaa aaagtcatat acttggtcat gatactgctg attgccccgg catacagcat 961 caggtgcata ggagtcagca atagggactt tgtggaaggt atgtcaggtg ggacttgggt 1021 tgatgttgtc ttggaacatg gaggttgtgt caccgtaatg gcacaggaca aaccgactgt 1081 cgacatagag ctggttacaa caacagtcag caacatggcg gaggtgagat cctactgcta 1141 tgaggcatca atatcagaca tggcttcgga cagccgctgc ccaacacaag gtgaagccta 1201 ccttgacaag caatcagaca ctcaatatgt ctgcaaaaga acgttagtgg acagaggctg 1261 gggaaatgga tgtggacttt ttggcaaagg gagcctggtg acatgcgcta agtttgcatg 1321 ctccaagaaa atgaccggga agagcatcca gccagagaat ctggagtacc ggataatgct 1381 gtcagttcat ggctcccagc acagtgggat gatcgttaat gacacaggac atgaaactga 1441 tgagaataga gcgaaggttg agataacgcc caattcacca agagccgaag ccaccctggg 1501 gggttttgga agcctaggac ttgattgtga accgaggaca ggccttgact tttcagattt 1561 gtattacttg actatgaata acaagcactg gttggttcac aaggagtggt tccacgacat 1621 tccattacct tggcacgctg gggcagacac cggaactcca cactggaaca acaaagaagc 1681 actggtagag ttcaaggacg cacatgccaa aaggcaaact gtcgtggttc tagggagtca 1741 agaaggagca gttcacacgg cccttgctgg agctctggag gctgagatgg atggtgcaaa 1801 gggaaggctg tcctctggcc acttgaaatg tcgcctgaaa atggataaac ttagattgaa 1861 gggcgtgtca tactccttgt gtaccgcagc gttcacattc accaagatcc cggctgaaac 1921 actgcacggg acagtcacag tggaggtaca gtacgcaggg acagatggac cttgcaaggt 1981 tccagctcag atggcggtgg acatgcaaac tctgacccca gttgggaggt tgataaccgc 2041 caaccccgta atcactgaaa gcactgagaa ctctaagatg atgctggaac ttgatccacc 2101 atttggggac tcttacattg tcataggagt cggggagaag aagatcaccc accactggca 2161 caggagtggc agcaccattg gaaaagcatt tgaagccact gtgagaggtg ccaagagaat 2221 ggcagtcttg ggagacacag cctgggactt tggatcagtt ggaggcgctc tcaactcatt 2281 gggcaagggc atccatcaaa tttttggagc agctttcaaa tcattgtttg gaggaatgtc 2341 ctggttctca caaatcctca ttggaacgtt gctgatgtgg ttgggtctga acacaaagaa 2401 tggatctatt tccctcatgt gcttggcctt agggggagtg ttgatcttct tatccacagc 2461 cgtctctgct gatgtggggt gctcggtgga cttctcaaag aaggagacga gatgcggtac 2521 aggggtgttc gtctataacg acgttgaagc ctggagggac aggtacaagt accatcctga 2581 ctccccccgt agattggcag cagcagtcaa gcaagcctgg gaagatggta tctgcgggat 2641 ctcctctgtt tcaagaatgg aaaacatcat gtggagatca gtagaagggg agctcaatgc 2701 aatcctggaa gagaatggag ttcaactgac ggtcgttgtg ggatctgtaa aaaaccccat 2761 gtggagaggt ccacagagat tgcccgtgcc tgtgaacgag ctgccccacg gctggaaggc 2821 ttgggggaaa tcgtacttcg ttagagcagc aaagacaaat aacagctttg tcgtggatgg 2881 tgacacactg aaggaatgcc cactcaaaca tagagcatgg aacagctttc ttgtggagga 2941 tcatgggttc ggggtatttc acactagtgt ctggctcaag gttagagaag attattcatt 3001 agagtgtgat ccagccgtta ttggaacagc tgttaaggga aaggaggctg tacacagtga 3061 tctaggctac tggattgaga gtgagaagaa tgacacatgg aggctgaaga gggcccatct 3121 gatcgagatg aaaacatgtg aatggccaaa gtcccacaca ttgtggacag atggaataga 3181 agagagtgat ctgatcatac ccaagtcttt agctgggcca ctcagccatc acaataccag 3241 agagggctac aggacccaaa tgaaagggcc atggcacagt gaagagcttg aaattcggtt 3301 tgaggaatgc ccaggcacta aggtccacgt ggaggaaaca tgtggaacaa gaggaccatc 3361 tctgagatca accactgcaa gcggaagggt gatcgaggaa tggtgctgca gggagtgcac 3421 aatgccccca ctgtcgttcc gggctaaaga tggctgttgg tatggaatgg agataaggcc 3481 caggaaagaa ccagaaagca acttagtaag gtcagtggtg actgcaggat caactgatca 3541 catggatcac ttctcccttg gagtgcttgt gattctgctc atggtgcagg aagggctgaa 3601 gaagagaatg accacaaaga tcatcataag cacatcaatg gcagtgctgg tagctatgat 3661 cctgggagga ttttcaatga gcgacctggc taagcttgca attttgatgg gcgccacctt 3721 cgcggaaatg aacactggag gagatgtagc tcatctggcg ctgatagcgg cattcaaagt 3781 cagaccagcg ttgctggtat ctttcatctt cagagctaat tggacacccc gtgaaagcat 3841 gctgctggcc ttggcctcgt gtcttttgca aactgcgatc tccgccttgg aaggcgacct 3901 gatggttctc atcaatggtt ttgctttggc ctggttggca atacgagcga tggttgttcc 3961 acgcactgat aacatcacct tggcaatcct ggctgctctg acaccactgg cccggggcac 4021 actgcttgtg gcgtggagag caggccttgc tacttgcggg gggtttatgc tcctctctct 4081 gaagggaaaa ggcagtgtga agaagaactt accatttgtc atggccctgg gactaaccgc 4141 tgtgaggctg gtcgacccca tcaacgtggt gggactgctg ttgctcacaa ggagtgggaa 4201 gcggagctgg ccccctagcg aagtactcac agctgttggc ctgatatgcg cattggctgg 4261 agggttcgcc aaggcagata tagagatggc tgggcccatg gccgcggtcg gtctgctaat 4321 tgtcagttac gtggtctcag gaaagagtgt ggacatgtac attgaaagag caggtgacat 4381 cacatgggaa aaagatgcgg aagtcactgg aaacagtccc cggctcgatg tggcgctaga 4441 tgagagtggt gatttctccc tggtggagga tgacggtccc cccatgagag agatcatact 4501 caaggtggtc ctgatgacca tctgtggcat gaacccaata gccataccct ttgcagctgg 4561 agcgtggtac gtatacgtga agactggaaa aaggagtggt gctctatggg atgtgcctgc 4621 tcccaaggaa gtaaaaaagg gggagaccac agatggagtg tacagagtaa tgactcgtag 4681 actgctaggt tcaacacaag ttggagtggg agttatgcaa gagggggtct ttcacactat 4741 gtggcacgtc acaaaaggat ccgcgctgag aagcggtgaa gggagacttg atccatactg 4801 gggagatgtc aagcaggatc tggtgtcata ctgtggtcca tggaagctag atgccgcctg 4861 ggacgggcac agcgaggtgc agctcttggc cgtgcccccc ggagagagag cgaggaacat 4921 ccagactctg cccggaatat ttaagacaaa ggatggggac attggagcgg ttgcgctgga 4981 ttacccagca ggaacttcag gatctccaat cctagacaag tgtgggagag tgataggact 5041 ttatggcaat ggggtcgtga tcaaaaatgg gagttatgtt agtgccatca cccaagggag 5101 gagggaggaa gagactcctg ttgagtgctt cgagccttcg atgctgaaga agaagcagct 5161 aactgtctta gacttgcatc ctggagctgg gaaaaccagg agagttcttc ctgaaatagt 5221 ccgtgaagct ataaaaacaa gactccgtac tgtgatctta gctccaacca gggttgtcgc 5281 tgctgaaatg gaggaagccc ttagagggct tccagtgcgt tatatgacaa cagcagtcaa 5341 tgtcacccat tctggaacag aaatcgtcga cttaatgtgc catgccacct tcacttcacg 5401 tctactacag ccaatcagag tccccaacta taatctgtat attatggatg aggcccactt 5461 cacagatccc tcaagtatag cagcaagagg atacatttca acaagggttg agatgggcga 5521 ggcggctgcc atcttcatga ccgccacgcc accaggaacc cgtgacgcat ttccggactc 5581 caactcacca attatggaca ccgaagtgga agtcccagag agagcctgga gctcaggctt 5641 tgattgggtg acggatcatt ctggaaaaac agtttggttt gttccaagcg tgaggaacgg 5701 caatgagatc gcagcttgtc tgacaaaggc tggaaaacgg gtcatacagc tcagcagaaa 5761 gacttttgag acagagttcc agaaaacaaa acatcaagag tgggactttg tcgtgacaac 5821 cgacatttca gagatgggcg ccaactttaa agctgaccgt gtcatagatt ccaggagatg 5881 cctaaagccg gtcatacttg atggcgagag agtcattctg gctggaccca tgcctgtcac 5941 acatgccagc gctgcccaga ggagggggcg cataggcagg aatcccaaca aacctggaga 6001 tgagtatctg tatggaggtg ggtgcgcaga gactgacgaa gaccatgcac actggcttga 6061 agcaagaatg ctccttgaca acatttacct ccaagatggc ctcatagcct cgctctatcg 6121 acctgaggcc gacaaagtag cagccattga gggagagttc aagcttagga cggagcaaag 6181 gaagaccttt gtggaactca tgaaaagagg agatcttcct gtttggctgg cctatcaggt 6241 tgcatctgcc ggaataactt acacagatag aagatggtgc tttgatggca cgaccaacaa 6301 caccataatg gaagacagtg tgccggcaga ggtgtggacc agacacggag agaaaagagt 6361 gctcaaaccg aggtggatgg acgccagagt ttgttcagat catgcggccc tgaagtcatt 6421 caaggagttt gccgctggga aaagaggagc ggcttttgga gtgatggaag ccctgggaac 6481 actgccagga cacatgacag agagattcca ggaagccatt gacaacctcg ctgtgctcat 6541 gcgggcagag actggaagca ggccttacaa agccgcggcg gcccaattgc cggagaccct 6601 agagaccatt atgcttttgg ggttgctggg aacagtctcg ctgggaatct ttttcgtctt 6661 gatgaggaac aagggcatag ggaagatggg ctttggaatg gtgactcttg gggccagcgc 6721 atggctcatg tggctctcgg aaattgagcc agccagaatt gcatgtgtcc tcattgttgt 6781 gttcctattg ctggtggtgc tcatacctga gccagaaaag caaagatctc cccaggacaa 6841 ccaaatggca atcatcatca tggtagcagt aggtcttctg ggcttgatca ccgccaatga 6901 actcggatgg ttggagagaa caaagagtga cctaagccat ctaatgggaa ggagagagga 6961 gggagcaacc ataggattct caatggacat tgacctgcgg ccagcctcag cttgggccat 7021 ctatgctgcc ttgacaactt tcattacccc agccgtccaa catgcagtga ccacttcata 7081 caacaactac tccttaatgg cgatggccac gcaagctgga gtgttgtttg gtatgggcaa 7141 agggatgcca ttctacgcat gggactttgg agtcccgctg ctaatgatag gttgctattc 7201 acaattaaca cccctgaccc taatagtggc catcattttg ctcgtggcgc actacatgta 7261 cttgatccca gggctgcagg ctgcagctgc gcgtgctgcc cagaagagaa cggcagctgg 7321 catcatgaag aaccctgttg tggatggaat agtggtgact gacattgaca caatgacaat 7381 tgacccccaa gtggagaaaa agatgggaca ggtgctactc atagcagtag ccgtctccag 7441 cgccatactg tcgcggaccg cctgggggtg gggggaggct ggggccctga tcacagccgc 7501 aacttccact ttgtgggaag gctctccgaa caagtactgg aactcctcta cagccacttc 7561 actgtgtaac atttttaggg gaagttactt ggctggagct tctctaatct acacagtaac 7621 aagaaacgct ggcttggtca agagacgtgg gggtggaaca ggagagaccc tgggagagaa 7681 atggaaggcc cgcttgaacc agatgtcggc cctggagttc tactcctaca aaaagtcagg 7741 catcaccgag gtgtgcagag aagaggcccg ccgcgccctc aaggacggtg tggcaacggg 7801 aggccatgct gtgtcccgag gaagtgcaaa gctgagatgg ttggtggagc ggggatacct 7861 gcagccctat ggaaaggtca ttgatcttgg atgtggcaga gggggctgga gttactacgc 7921 cgccaccatc cgcaaagttc aagaagtgaa aggatacaca aaaggaggcc ctggtcatga 7981 agaacccgtg ttggtgcaaa gctatgggtg gaacatagtc cgtctcaaga gtggggtgga 8041 cgtctttcat atggcggctg agccgtgtga cacgttgctg tgtgacatag gtgagtcatc 8101 atctagtcct gaagtggaag aagcacggac gctcagagtc ctctccatgg tgggggattg 8161 gcttgaaaaa agaccaggag ccttttgtat aaaagtgttg tgcccataca ccagcactat 8221 gatggaaacc ctggagcgac tgcagcgtag gtatggggga ggactggtca gagtgccact 8281 ctcccgcaac tctacacatg agatgtactg ggtctctgga gcgaaaagca acaccataaa 8341 aagtgtgtcc accacgagcc agctcctctt ggggcgcatg gacgggccta ggaggccagt 8401 gaaatatgag gaggatgtga atctcggctc tggcacgcgg gctgtggtaa gctgcgctga 8461 agctcccaac atgaagatca ttggtaaccg cattgaaagg atccgcagtg agcacgcgga 8521 aacgtggttc tttgacgaga accacccata taggacatgg gcttaccatg gaagctatga 8581 ggcccccaca caagggtcag catcctctct agtaaacggg gttgtcaggc tcctgtcaaa 8641 accctgggat gtggtgactg gagtcacagg aatagccatg accgacacca caccgtatgg 8701 tcagcaaaga gttttcaagg aaaaagtgga cactagggtg ccagaccccc aagaaggcac 8761 tcgtcaggtt atgagcatgg tctcttcctg gttgtggaaa gagctaggca aacacaaacg 8821 gccacgagtc tgtaccaaag aagagttcat caacaaggtt cgtagcaatg cagcattagg 8881 ggcaatattt gaagaggaaa aagagtggaa gactgcagtg gaagctgtga acgatccaag 8941 gttctgggct ctagtggaca aggaaagaga gcaccacctg agaggagagt gccagagttg 9001 tgtgtacaac atgatgggaa aaagagaaaa gaaacaaggg gaatttggaa aggccaaggg 9061 cagccgcgcc atctggtata tgtggctagg ggctagattt ctagagttcg aagcccttgg 9121 attcttgaac gaggatcact ggatggggag agagaactca ggaggtggtg ttgaagggct 9181 gggattacaa agactcggat atgtcctaga agagatgagt cgcataccag gaggaaggat 9241 gtatgcagat gacactgctg gctgggatac ccgcatcagc aggtttgatc tagagaatga 9301 agctctaatc accaaccaaa tggagaaagg gcacagggcc ttggcattgg ccataatcaa 9361 gtacacatac caaaacaaag tggtaaaggt ccttagacca gctgaaaaag ggaaaacagt 9421 tatggacatt atttcgagac aagaccaaag ggggagcgga caagttgtca cttacgctct 9481 taacacattt accaacctag tggtgcaact cattcggaat atggaggctg aggaagttct 9541 agagatgcaa gacttgtggc tgctgcggag gtcagagaaa gtgaccaact ggttgcagag 9601 caacggatgg gataggctca aacgaatggc agtcagtgga gatgattgcg ttgtgaagcc 9661 aattgatgat aggtttgcac atgccctcag gttcttgaat gatatgggaa aagttaggaa 9721 ggacacacaa gagtggaaac cctcaactgg atgggacaac tgggaagaag ttccgttttg 9781 ctcccaccac ttcaacaagc tccatctcaa ggacgggagg tccattgtgg ttccctgccg 9841 ccaccaagat gaactgattg gccgggcccg cgtctctcca ggggcgggat ggagcatccg 9901 ggagactgct tgcctagcaa aatcatatgc gcaaatgtgg cagctccttt atttccacag 9961 aagggacctc cgactgatgg ccaatgccat ttgttcatct gtgccagttg actgggttcc 10021 aactgggaga actacctggt caatccatgg aaagggagaa tggatgacca ttgaagacat 10081 gcttgtggtg tggaacagag tgtggattga ggagaacgac cacatggaag acaagacccc 10141 agttacgaaa tggacagaca ttccctattt gggaaaaagg gaagacttgt ggtgtggatc 10201 tctcataggg cacagaccgc gcaccacctg ggctgagaac attaaaaaca cagtcaacat 10261 ggtgcgcagg atcataggtg aggaagaaaa gtacatggac tacctatcca cccaagttcg 10321 ctacttgggt gaagaagggt ctacacctgg agtgctgtaa gcaccaatct taatgttgtc 10381 aggcctgcta gtcagccaca gctcggggaa agctgtgcag cctgtgaccc ccccaggaga 10441 agctgggaaa ccaagcctat agtcaggccg agaacgccat ggcacggaag aagccatgct 10501 gcctgtgagc ccctcagagg acactgagtc aaaaaacccc acgcgcttgg aggcgcagga 10561 tgggaaaaga aggtggcgac cttccccacc cttcaatctg gggcctgaac tggagatcag 10621 ctgtggatct ccagaagagg gactagtggt tagaggaga Link to comment Share on other sites More sharing options...
niman Posted October 24, 2016 Author Report Share Posted October 24, 2016 Sequences producing significant alignments: Select:AllNone Selected:0 AlignmentsDownloadGenBankGraphicsDistance tree of resultsShow/hide columns of the table presenting sequences producing significant alignments Sequences producing significant alignments: Select for downloading or viewing reports Description Max score Total score Query cover E value Ident Accession Select seq gb|KY014300.1| Zika virus isolate Zika virus/H.sapiens-wt/DOM/2016/BB-0208-SER polyprotein gene, complete cds 18525 18525 100% 0.0 100% KY014300.1 Select seq gb|KY014304.1| Zika virus isolate Zika virus/H.sapiens-wt/DOM/2016/BB-0180-SER polyprotein gene, complete cds 18471 18471 100% 0.0 99% KY014304.1 Select seq gb|KU853013.1| Zika virus isolate Dominican Republic/2016/PD2, complete genome 18465 18465 100% 0.0 99% KU853013.1 Select seq gb|KU853012.1| Zika virus isolate Dominican Republic/2016/PD1, complete genome 18465 18465 100% 0.0 99% KU853012.1 Select seq gb|KY014295.1| Zika virus isolate Zika virus/H.sapiens-wt/USA/2016/FL-010-URI polyprotein gene, complete cds 18462 18462 100% 0.0 99% KY014295.1 Select seq dbj|LC190723.1| Zika virus genomic RNA, complete genome, strain: ZIKV/Hu/Yokohama/1/2016 18462 18462 100% 0.0 99% LC190723.1 Select seq gb|KX842449.2| Zika virus isolate ZIKV/Homo_sapiens/USA/2016/FL010U polyprotein gene, complete cds 18462 18462 100% 0.0 99% KX842449.2 Select seq gb|KY014321.1| Zika virus isolate Zika virus/H.sapiens-wt/DOM/2016/BB-0115-SER polyprotein gene, complete cds 18456 18456 100% 0.0 99% KY014321.1 Select seq gb|KX673530.1| Zika virus isolate PHE_semen_Guadeloupe, complete genome 18453 18453 100% 0.0 99% KX673530.1 Select seq gb|KY014323.1| Zika virus isolate Zika virus/A.aegypti-wt/USA/2016/FL-02-MOS polyprotein gene, complete cds 18447 18447 100% 0.0 99% KY014323.1 Select seq gb|KX922703.1| Zika virus isolate ZIKV/Homo_sapiens/USA/2016/FL021U polyprotein gene, complete cds 18447 18447 100% 0.0 99% KX922703.1 Select seq gb|KX838905.2| Zika virus isolate ZIKV/Aedes_aegypti/USA/2016/FL02M polyprotein gene, complete cds 18447 18447 100% 0.0 99% KX838905.2 Select seq gb|KX832731.1| Zika virus isolate ZIKV/Homo_sapiens/USA//2016/Hu0015SA polyprotein gene, complete cds 18447 18447 100% 0.0 99% KX832731.1 Select seq gb|KX922706.1| Zika virus isolate ZIKV/Homo_sapiens/USA/2016/FL038U polyprotein gene, complete cds 18446 18446 100% 0.0 99% KX922706.1 Select seq gb|KX922707.1| Zika virus isolate ZIKV/Homo_sapiens/USA/2016/FL039U polyprotein gene, complete cds 18444 18444 100% 0.0 99% KX922707.1 Select seq gb|KX922704.1| Zika virus isolate ZIKV/Homo_sapiens/USA/2016/FL030U polyprotein gene, complete cds 18444 18444 100% 0.0 99% KX922704.1 Select seq gb|KY014322.1| Zika virus isolate Zika virus/A.aegypti-wt/USA/2016/FL-03-MOS polyprotein gene, complete cds 18438 18438 100% 0.0 99% KY014322.1 Select seq gb|KY014314.1| Zika virus isolate Zika virus/H.sapiens-wt/DOM/2016/BB-0436-SER polyprotein gene, complete cds 18438 18438 100% 0.0 99% KY014314.1 Select seq gb|KX838906.2| Zika virus isolate ZIKV/Aedes_aegypti/USA/2016/FL03M polyprotein gene, complete cds 18438 18438 100% 0.0 99% KX838906.2 Select seq gb|KX838904.2| Zika virus isolate ZIKV/Aedes_aegypti/USA/2016/FL01M polyprotein gene, complete cds 18438 18438 100% 0.0 99% KX838904.2 Select seq gb|KY014324.1| Zika virus isolate Zika virus/A.aegypti-wt/USA/2016/FL-01-MOS polyprotein gene, complete cds 18435 18435 100% 0.0 99% KY014324.1 Select seq gb|KY014316.1| Zika virus isolate Zika virus/H.sapiens-wt/USA/2016/FL-039-URI polyprotein gene, complete cds 18435 18435 100% 0.0 99% KY014316.1 Select seq gb|KX922705.1| Zika virus isolate ZIKV/Homo_sapiens/USA/2016/FL032U polyprotein gene, complete cds 18428 18428 100% 0.0 99% KX922705.1 Select seq gb|KX922708.1| Zika virus isolate ZIKV/Aedes_aegypti/USA/2016/FL04M polyprotein gene, complete cds 18420 18420 100% 0.0 99% KX922708.1 Select seq gb|KY014299.1| Zika virus isolate Zika virus/A.aegypti-wt/USA/2016/FL-04-MOS polyprotein gene, complete cds 18417 18417 100% 0.0 99% KY014299.1 Select seq gb|KX447510.1| Zika virus isolate 1_0049_PF polyprotein gene, complete cds 18375 18375 100% 0.0 99% KX447510.1 Select seq gb|KX280026.1| Zika virus isolate Paraiba_01, complete genome 18372 18372 100% 0.0 99% KX280026.1 Select seq gb|KX447512.1| Zika virus isolate 1_0181_PF polyprotein gene, complete cds 18366 18366 100% 0.0 99% KX447512.1 Select seq gb|KX369547.1| Zika virus strain PF13/251013-18, complete genome 18366 18366 100% 0.0 99% KX369547.1 Select seq gb|KU509998.3| Zika virus strain Haiti/1225/2014, complete genome 18366 18366 100% 0.0 99% KU509998.3 Select seq gb|KJ776791.2| Zika virus strain H/PF/2013, complete genome 18363 18363 100% 0.0 99% KJ776791.2 Select seq gb|KX447509.1| Zika virus isolate 1_0087_PF polyprotein gene, complete cds 18363 18363 100% 0.0 99% KX447509.1 Select seq gb|KU991811.1| Zika virus isolate Brazil/2016/INMI1 polyprotein gene, complete cds 18363 18363 100% 0.0 99% KU991811.1 Select seq gb|KU729217.2| Zika virus isolate BeH823339 polyprotein gene, complete cds 18363 18363 100% 0.0 99% KU729217.2 Select seq gb|KX447513.1| Zika virus isolate 1_0134_PF polyprotein gene, complete cds 18357 18357 100% 0.0 99% KX447513.1 Select seq gb|KX051563.1| Zika virus isolate Haiti/1/2016, complete genome 18357 18357 100% 0.0 99% KX051563.1 Select seq gb|KX811222.1| Zika virus isolate Brazil_2015_MG, complete genome 18354 18354 100% 0.0 99% KX811222.1 Select seq gb|KX197205.1| Zika virus isolate 9, complete genome 18354 18354 100% 0.0 99% KX197205.1 Select seq gb|KX447515.1| Zika virus isolate 1_0030_PF polyprotein gene, complete cds 18354 18354 100% 0.0 99% KX447515.1 Select seq gb|KX447511.1| Zika virus isolate 1_0015_PF polyprotein gene, complete cds 18354 18354 100% 0.0 99% KX447511.1 Select seq gb|KU527068.1| Zika virus strain Natal RGN, complete genome 18354 18354 100% 0.0 99% KU527068.1 Select seq gb|KU321639.1| Zika virus strain ZikaSPH2015, complete genome 18354 18354 100% 0.0 99% KU321639.1 Select seq gb|KX879604.1| Zika virus isolate SN089, complete genome 18348 18348 100% 0.0 99% KX879604.1 Select seq gb|KX447514.1| Zika virus isolate 1_0035_PF polyprotein gene, complete cds 18348 18348 100% 0.0 99% KX447514.1 Select seq gb|KX447516.1| Zika virus isolate 1_0111_PF polyprotein gene, complete cds 18345 18345 100% 0.0 99% KX447516.1 Select seq gb|KU729218.1| Zika virus isolate BeH828305 polyprotein gene, complete cds 18345 18345 100% 0.0 99% KU729218.1 Select seq gb|KU707826.1| Zika virus isolate SSABR1, complete genome 18345 18345 100% 0.0 99% KU707826.1 Select seq gb|KU365779.1| Zika virus strain BeH819966 polyprotein gene, complete cds 18345 18345 100% 0.0 99% KU365779.1 Select seq gb|KX879603.1| Zika virus isolate SN062, complete genome 18339 18339 100% 0.0 99% KX879603.1 Select seq gb|KX262887.1| Zika virus isolate 103451, complete genome 18339 18339 100% 0.0 99% KX262887.1 Select seq gb|KX197192.1| Zika virus isolate ZIKV/H.sapiens/Brazil/PE243/2015, complete genome 18339 18339 100% 0.0 99% KX197192.1 Select seq gb|KU926310.1| Zika virus isolate Rio-S1, complete genome 18339 18339 100% 0.0 99% KU926310.1 Select seq gb|KU926309.1| Zika virus isolate Rio-U1, complete genome 18339 18339 100% 0.0 99% KU926309.1 Select seq gb|KU940228.1| Zika virus isolate Bahia07, partial genome 18336 18336 100% 0.0 99% KU940228.1 Select seq gb|KX694534.1| Zika virus strain ZIKV/Homo sapiens/HND/R103451/2015, complete genome 18330 18330 100% 0.0 99% KX694534.1 Select seq gb|KX198135.1| Zika virus strain ZIKV/Homo sapiens/PAN/BEI-259634_V4/2016, complete genome 18330 18330 100% 0.0 99% KX198135.1 Select seq gb|KU501217.1| Zika virus strain 8375 polyprotein gene, complete cds 18330 18330 100% 0.0 99% KU501217.1 Select seq gb|KU365780.1| Zika virus strain BeH815744 polyprotein gene, complete cds 18330 18330 100% 0.0 99% KU365780.1 Select seq gb|KY014303.1| Zika virus isolate Zika virus/H.sapiens-wt/DOM/2016/BB-0127-SER polyprotein gene, complete cds 18327 18327 100% 0.0 99% KY014303.1 Select seq gb|KU647676.1| Zika virus strain MRS_OPY_Martinique_PaRi_2015 polyprotein gene, complete cds 18327 18327 100% 0.0 99% KU647676.1 Select seq gb|KU501216.1| Zika virus strain 103344 polyprotein gene, complete cds 18327 18327 100% 0.0 99% KU501216.1 Select seq gb|KU365777.1| Zika virus strain BeH818995 polyprotein gene, complete cds 18327 18327 100% 0.0 99% KU365777.1 Select seq gb|KY014297.1| Zika virus isolate Zika virus/H.sapiens-wt/BRA/2016/FC-6864-URI polyprotein gene, complete cds 18321 18321 100% 0.0 99% KY014297.1 Select seq gb|KX447517.1| Zika virus isolate 1_0038_PF polyprotein gene, complete cds 18321 18321 100% 0.0 99% KX447517.1 Select seq gb|KU758877.1| Zika virus isolate 17271 polyprotein gene, complete cds 18321 18321 100% 0.0 99% KU758877.1 Select seq gb|KX247646.1| Zika virus isolate Zika virus/Homo sapiens/COL/UF-1/2016, complete genome 18321 18321 100% 0.0 99% KX247646.1 Select seq gb|KX156776.1| Zika virus strain ZIKV/Homo sapiens/PAN/CDC-259364_V1-V2/2015, complete genome 18321 18321 100% 0.0 99% KX156776.1 Select seq gb|KX520666.1| Zika virus isolate HS-2015-BA-01 polyprotein gene, complete cds 18318 18318 100% 0.0 99% KX520666.1 Select seq gb|KX156774.1| Zika virus strain ZIKV/Homo sapiens/PAN/CDC-259359_V1-V3/2015, complete genome 18318 18318 100% 0.0 99% KX156774.1 Select seq gb|KU497555.1| Zika virus isolate Brazil-ZKV2015, complete genome 18318 18318 99% 0.0 99% KU497555.1 Select seq gb|KY014327.1| Zika virus isolate Zika virus/H.sapiens-wt/HND/2016/HU-ME167-PLA polyprotein gene, complete cds 18314 18314 100% 0.0 99% KY014327.1 Select seq gb|KU820897.5| Zika virus isolate FLR polyprotein gene, complete cds 18312 18312 100% 0.0 99% KU820897.5 Select seq gb|KX247632.1| Zika virus isolate MEX_I_7 polyprotein gene, complete cds 18312 18312 100% 0.0 99% KX247632.1 Select seq gb|KX156775.1| Zika virus strain ZIKV/Homo sapiens/PAN/CDC-259249_V1-V3/2015, complete genome 18312 18312 100% 0.0 99% KX156775.1 Select seq gb|KX087102.1| Zika virus strain ZIKV/Homo sapiens/COL/FLR/2015, complete genome 18312 18312 100% 0.0 99% KX087102.1 Select seq gb|KU365778.1| Zika virus strain BeH819015 polyprotein gene, complete cds 18312 18312 100% 0.0 99% KU365778.1 Select seq gb|KU312312.1| Zika virus isolate Z1106033 polyprotein gene, complete cds 18312 18312 100% 0.0 99% KU312312.1 Select seq gb|KY014315.1| Zika virus isolate Zika virus/H.sapiens-wt/HND/2016/HU-ME152-SER polyprotein gene, complete cds 18309 18309 100% 0.0 99% KY014315.1 Select seq gb|KU922960.1| Zika virus isolate MEX/InDRE/Sm/2016, complete genome 18309 18309 100% 0.0 99% KU922960.1 Select seq gb|KY014296.1| Zika virus isolate Zika virus/H.sapiens-wt/BRA/2016/FC-DQ131D1-URI polyprotein gene, complete cds 18303 18303 100% 0.0 99% KY014296.1 Select seq gb|KX806557.2| Zika virus isolate TS17-2016, complete genome 18303 18303 100% 0.0 99% KX806557.2 Select seq gb|KX856011.1| Zika virus strain ZIKV/Aedes sp./MEX_I-44/2016, complete genome 18303 18303 100% 0.0 99% KX856011.1 Select seq gb|KX548902.1| Zika virus isolate ZIKV/COL/FCC00093/2015 polyprotein gene, complete cds 18303 18303 100% 0.0 99% KX548902.1 Select seq gb|KX446951.1| Zika virus strain ZIKV/Aedes.sp/MEX/MEX_I-7/2016, complete genome 18303 18303 100% 0.0 99% KX446951.1 Select seq gb|KU937936.1| Zika virus isolate ZIKVNL00013 polyprotein gene, complete cds 18303 18303 100% 0.0 99% KU937936.1 Select seq gb|KU922923.1| Zika virus isolate MEX/InDRE/Lm/2016, complete genome 18303 18303 100% 0.0 99% KU922923.1 Select seq gb|KU501215.1| Zika virus strain PRVABC59, complete genome 18303 18303 100% 0.0 99% KU501215.1 Select seq gb|KX601168.1| Zika virus strain ZIKV/Homo Sapiens/PRI/PRVABC59/2015, complete genome 18300 18300 100% 0.0 99% KX601168.1 Select seq gb|KX446950.1| Zika virus strain ZIKV/Aedes.sp/MEX/MEX_2-81/2016, complete genome 18300 18300 100% 0.0 99% KX446950.1 Select seq gb|KX087101.2| Zika virus strain ZIKV/Homo sapiens/PRI/PRVABC59/2015, complete genome 18300 18300 100% 0.0 99% KX087101.2 Select seq gb|KU870645.1| Zika virus isolate FB-GWUH-2016, complete genome 18300 18300 100% 0.0 99% KU870645.1 Select seq gb|KX893855.1| Zika virus strain Zika virus/Homo sapiens/VEN/UF-2/2016, complete genome 18298 18298 100% 0.0 99% KX893855.1 Select seq gb|KX702400.1| Zika virus strain Zika virus/Homo sapiens/VEN/UF-1/2016, complete genome 18294 18294 100% 0.0 99% KX702400.1 Select seq gb|KX377337.1| Zika virus strain PRVABC-59, complete genome 18294 18294 100% 0.0 99% KX377337.1 Select seq gb|KX766029.1| Zika virus isolate R116265, complete genome 18285 18285 100% 0.0 99% KX766029.1 Select seq gb|KU820898.1| Zika virus isolate GZ01 polyprotein gene, complete cds 18285 18285 100% 0.0 99% KU820898.1 Select seq gb|KX056898.1| Zika virus isolate Zika virus/GZ02/2016 polyprotein gene, complete cds 18282 18282 100% 0.0 99% KX056898.1 Select seq gb|KU955590.1| Zika virus isolate Z16019 polyprotein gene, complete cds 18282 18282 100% 0.0 99% KU955590.1 Select seq gb|KX766028.1| Zika virus isolate R114916, complete genome 18278 18278 100% 0.0 99% KX766028.1 Select seq gb|KU740184.2| Zika virus isolate GD01 polyprotein gene, complete cds 18276 18276 100% 0.0 99% KU740184.2 Link to comment Share on other sites More sharing options...
niman Posted October 24, 2016 Author Report Share Posted October 24, 2016 LOCUS KY014303 10658 bp RNA linear VRL 21-OCT-2016 DEFINITION Zika virus isolate Zika virus/H.sapiens-wt/DOM/2016/BB-0127-SER polyprotein gene, complete cds. ACCESSION KY014303 VERSION KY014303.1 DBLINK BioProject: PRJNA344504 BioSample: SAMN05861583 KEYWORDS . SOURCE Zika virus ORGANISM Zika virus Viruses; ssRNA viruses; ssRNA positive-strand viruses, no DNA stage; Flaviviridae; Flavivirus. REFERENCE 1 (bases 1 to 10658) AUTHORS Baniecki,M.L., Barnes,K.G., Bosch,I., Freije,C.A., Gehrke,L., Gladden-Young,A.D., Gnirke,A., Luo,C.Y., MacInnis,B., Matranga,C.B., Metsky,H.C., Park,D.J., Qu,J., Sabeti,P.C., Tomkins-Tinch,C.H., West,K.L., Winnicki,S., Wohl,S. and Yozwiak,N.L. TITLE Direct Submission JOURNAL Submitted (20-OCT-2016) Viral Genomics, Infectious Disease Program (Infectious Disease Initiative), Broad Institute, 75 Ames St, Cambridge, MA 02142, USA COMMENT ##Assembly-Data-START## Assembly Method :: github.com/broadinstitute/viral-ngs v. v1.12.0-51-g8588fdb Assembly Name :: DOM_2016_BB-0127_SER-1 Coverage :: 231x Sequencing Technology :: Illumina; Swift LC ##Assembly-Data-END## FEATURES Location/Qualifiers source 1..10658 /organism="Zika virus" /mol_type="genomic RNA" /isolate="Zika virus/H.sapiens-wt/DOM/2016/BB-0127-SER" /isolation_source="serum" /host="Homo sapiens" /db_xref="taxon:64320" /country="Dominican Republic: Santo Domingo" /collection_date="11-Apr-2016" /collected_by="Hospital General de la Plaza de la Salud, Santo Domingo, Dominican Republic" 5'UTR <1..88 CDS 89..10360 /note="contains structural and nonstructural proteins" /codon_start=1 /product="polyprotein" /protein_id="AOY08524.1" /translation="MKNPKKKSGGFRIVNMLKRGVARVSPFGGLKRLPAGLLLGHGPI RMVLAILAFLRFTAIKPSLGLINRWGSVGKKEAMEIIKKFKKDLAAMLRIINARKEKK RRGAETSVGIVGLLLTTAMAAEVTRRGSAYYMYLDRNDAGEAISFPTTLGMNKCYIQI MDLGHMCDATMSYECPMLDEGVEPDDVDCWCNTTSTWVVYGTCHHKKGEARRSRRAVT LPSHSTRKLQTRSQTWLESREYTKHLIRVENWIFRNPGFALAAAAIAWLLGSSTSQKV IYLVMILLIAPAYSIRCIGVSNRDFVEGMSGGTWVDVVLEHGGCVTVMAQDKPTVDIE LVTTTVSNMAEVRSYCYEASISDMASDSRCPTQGEAYLDKQSDTQYVCKRTLVDRGWG NGCGLFGKGSLVTCAKFACSKKMTGKSIQPENLEYRIMLSVHGSQHSGMIVNDTGHET DENRAKVEITPNSPRAEATLGGFGSLGLDCEPRTGLDFSDLYYLTMNNKHWLVHKEWF HDIPLPWHAGADTGTPHWNNKEALVEFKDAHAKRQTVVVLGSQEGAVHTALAGALEAE MDGAKGRLSSGHLKCRLKMDKLRLKGVSYSLCTAAFTFTKIPAETLHGTVTVEVQYAG TDGPCKVPAQMAVDMQTLTPVGRLITANPVITESTENSKMMLELDPPFGDSYIVIGVG EKKITHHWHRSGSTIGKAFEATVRGAKRMAVLGDTAWDFGSVGGALNSLGKGIHQIFG AAFKSLFGGMSWFSQILIGTLLMWLGLNTKNGSISLMCLALGGVLIFLSTAVSADVGC SVDFSKKETRCGTGVFVYNDVEAWRDRYKYHPDSPRRLAAAVKQAWEDGICGISSVSR MENIMWRSVEGELNAILEENGVQLTVVVGSVKNPMWRGPQRLPVPVNELPHGWKAWGK SYFVRAAKTNNSFVVDGDTLKECPLKHRAWNSFLVEDHGFGVFHTSVWLKVREDYSLE CDPAVIGTAVKGKEAVHSDLGYWIESEKNDTWRLKRAHLIEMKTCEWPKSHTLWTDGI EESDLIIPKSLAGPLSHHNTREGYRTQMKGPWHSEELEIRFEECPGTKVHVEETCGTR GPSLRSTTASGRVIEEWCCRECTMPPLSFWAKDGCWYGMEIRPRKEPESNLVRSMVTA GSTDHMDHFSLGVLVILLMVQEGLKKRMTTKIIISTSMAVLVAMILGGFSMSDLAKLA ILMGATFAEMNTGGDVAHLALIAAFKVRPALLVSFIFRANWTPRESMLLALASCLLQT AISALEGDLMVLINGFALAWLAIRAMVVPRTDNITLAILAALTPLTRGTLLVAWRAGL ATCGGFMLLSLKGKGSVKKNLPFVMALGLTAVRLVDPINVVGLLLLTRSGKRSWPPSE VLTAVGLICALAGGFAKADIEMAGPMAAVGLLIVSYVVSGKSVDMYIERAGDITWEKD AEVTGNSPRLDVALDESGDFSLVEDDGPPMREIILKVVLMTICGMNPIAIPFAAGAWY VYVKTGKRSGALWDVPAPKEVKKGETTDGVYRVMTRRLLGSTQVGVGVMQEGVFHTMW HVTKGSALRSGEGRLDPYWGDVKQDLVSYCGPWKLDAAWDGHSEVQLLAVPPGERARN IQTLPGIFKTKDGDIGAVALDYPAGTSGSPILDKCGRVIGLYGNGVVIKNGSYVSAIT QGRREEETPVECFEPSMLKKKQLTVLDLHPGAGKTRRVLPEIVREAIKTRLRTVILAP TRVVAAEMEEALRGLPVRYMTTAVNVTHSGTEIVDLMCHATFTSRLLQPIRVPNYNLY IMDEAHFTDPSSIAARGYISTRVEMGEAAAIFMTATPPGTRDAFPDSNSPIMDTEVEV PERAWSSGFDWVTDHSGKTVWFVPSVRNGNEIAACLTKAGKRVIQLSRKTFETEFQKT KHQEWDFVVTTDISEMGANFKADRVIDSRRCLKPVILDGERVILAGPMPVTHASAAQR RGRIGRNPNKPGDEYLYGGGCAETDEDHAHWLEARMLLDNIYLQDGLIASLYRPEADK VAAIEGEFKLRTEQRKTFVELMKRGDLPVWLAYQVASAGITYTDRRWCFDGTTNNTIM EDSVPAEVWTRHGEKRVLKPRWMDARVCSDHAALKSFKEFAAGKRGAAFGVMEALGTL PGHMTERFQEAIDNLAVLMRAETGSRPYKAAAAQLPETLETIMLLGLLGTVSLGIFFV LMRNKGIGKMGFGMVTLGASAWLMWLSEIEPARIACVLIVVFLLLVVLIPEPEKQRSP QDNQMAIIIMVAVGLLGLITANELGWLERTKSDLSHLMGRREEGATIGFSMDIDLRPA SAWAIYAALTTFITPAVQHAVTTSYNNYSLMAMATQAGVLFGMGKGMPFYAWDFGVPL LMIGCYSQLTPLTLIVAIILLVAHYMYLIPGLQAAAARAAQKRTAAGIMKNPVVDGIV VTDIDTMTIDPQVEKKMGQVLLIAVAVSSAILSRTAWGWGEAGALITAATSTLWEGSP NKYWNSSTATSLCNIFRGSYLAGASLIYTVTRNAGLVKRRGGGTGETLGEKWKARLNQ MSALEFYSYKKSGITEVCREEARRALKDGVATGGHAVSRGSAKLRWLVERGYLQPYGK VIDLGCGRGGWSYYAATIRKVQEVKGYTKGGPGHEEPVLVQSYGWNIVRLKSGVDVFH MAAEPCDTLLCDIGESSSSPEVEEARTLRVLSMVGDWLEKRPGAFCIKVLCPYTSTMM ETLERLQRRYGGGLVRVPLSRNSTHEMYWVSGAKSNTIKSVSTTSQLLLGRMDGPRRP VKYEEDVNLGSGTRAVVSCAEAPNMKIIGNRIERIRSEHAETWFFDENHPYRTWAYHG SYEAPTQGSASSLINGVVRLLSKPWDVVTGVTGIAMTDTTPYGQQRVFKEKVDTRVPD PQEGTRQVMSMVSSWLWKELGKHKRPRVCTKEEFINKVRSNAALGAIFEEEKEWKTAV EAVNDPRFWALVDKEREHHLRGECQSCVYNMMGKREKKQGEFGKAKGSRAIWYMWLGA RFLEFEALGFLNEDHWMGRENSGGGVEGLGLQRLGYVLEEMSRIPGGRMYADDTAGWD TRISRFDLENEALITNQMEKGHRALALAIIKYTYQNKVVKVLRPAEKGKTVMDIISRQ DQRGSGQVVTYALNTFTNLVVQLIRNMEAEEVLEMQDLWLLRRSEKVTNWLQSNGWDR LKRMAVSGDDCVVKPIDDRFAHALRFLNDMGKVRKDTQEWKPSTGWDNWEEVPFCSHH FNKLHLKDGRSIVVPCRHQDELIGRARVSPGAGWSIRETACLAKSYAQMWQLLYFHRR DLRLMANAICSSVPVDWVPTGRTTWSIHGKGEWMTTEDMLVVWNRVWIEENDHMEDKT PVAKWTDIPYLGKREDLWCGSLIGHRPRTTWAENIKNTVNMVRRIIGDEEKYMDYLST QVRYLGEEGSTPGVL" 3'UTR 10361..>10658 ORIGIN 1 tcagactgcg acagttcgag tttgaagcga aagctagcaa cagtatcaac aggttttatt 61 ttggatttgg aaacgagagt ttctggtcat gaaaaaccca aaaaagaaat ccggaggatt 121 ccggattgtc aatatgctaa aacgcggagt agcccgtgtg agcccctttg ggggcttgaa 181 gaggctgcca gccggacttc tgctgggtca tgggcccatc aggatggtct tggcgattct 241 agcctttttg agattcacgg caatcaagcc atcactgggt ctcatcaata gatggggttc 301 agtggggaaa aaagaggcta tggaaataat aaagaagttc aagaaagatc tggctgccat 361 gctgagaata atcaatgcta ggaaggagaa gaagagacga ggcgcagaaa ctagtgtcgg 421 aattgttggc ctcctgctga ccacagctat ggcagcggag gtcactagac gtgggagtgc 481 atactatatg tacttggaca gaaacgatgc tggggaggcc atatcttttc caaccacatt 541 ggggatgaat aagtgttata tacagatcat ggatcttgga cacatgtgtg atgccaccat 601 gagctatgaa tgccctatgc tggatgaggg ggtggaacca gatgacgtcg attgttggtg 661 caacacgacg tcaacttggg ttgtgtacgg aacctgccat cacaaaaaag gtgaagcacg 721 gagatctaga agagccgtga cgctcccctc ccattccact aggaagctgc aaacgcggtc 781 gcaaacctgg ttggaatcaa gagaatacac aaagcacttg attagagtcg aaaattggat 841 attcaggaac cctggtttcg ctttagcagc agctgccatc gcttggcttt tgggaagctc 901 aacgagccaa aaagtcatat acttggtcat gatactgctg attgccccgg catacagcat 961 caggtgcata ggagtcagca atagggactt tgtggaaggt atgtcaggtg ggacttgggt 1021 tgatgttgtc ttggaacatg gaggttgtgt caccgtaatg gcacaggaca aaccgactgt 1081 cgacatagag ctggttacaa caacagtcag caacatggcg gaggtaagat cctactgcta 1141 tgaggcatca atatcagaca tggcttcgga cagccgctgc ccaacacaag gtgaagccta 1201 ccttgacaag caatcagaca ctcaatatgt ctgcaaaaga acgttagtgg acagaggctg 1261 gggaaatgga tgtggacttt ttggcaaagg gagcctggtg acatgcgcta agtttgcatg 1321 ctccaagaaa atgaccggga agagcatcca gccagagaat ctggagtacc ggataatgtt 1381 gtcagttcat ggctcccagc acagtgggat gatcgttaat gacacaggac atgaaactga 1441 tgagaataga gcgaaggttg agataacgcc caattcacca agagccgaag ccaccctggg 1501 gggttttgga agcctaggac ttgattgtga accgaggaca ggccttgact tttcagattt 1561 gtattacttg acaatgaata acaagcactg gttggttcac aaggagtggt tccacgacat 1621 tccattacct tggcacgctg gggcagacac cggaactcca cactggaaca acaaagaagc 1681 actggtagag ttcaaggacg cacatgccaa aaggcaaact gtcgtggttc tagggagtca 1741 agaaggagca gttcacacgg cccttgctgg agctctggag gctgagatgg atggtgcaaa 1801 gggaaggctg tcctctggcc acttgaaatg tcgcctgaaa atggataaac ttagattgaa 1861 gggcgtgtca tactccttgt gtaccgcagc gttcacattc accaagatcc cggctgaaac 1921 actgcacggg acagtcacag tggaggtaca gtacgcaggg acagatggac cttgcaaggt 1981 tccagctcag atggcggtgg acatgcaaac tctgacccca gttgggaggt tgataaccgc 2041 taaccccgta atcactgaaa gcactgagaa ctctaagatg atgctggaac ttgatccacc 2101 atttggggac tcttacattg tcataggagt cggggagaag aagatcaccc accactggca 2161 caggagtggc agcaccattg gaaaagcatt tgaagccact gtgagaggtg ccaagagaat 2221 ggcagtcttg ggagacacag cctgggactt tggatcagtt ggaggcgctc tcaactcatt 2281 gggcaagggc atccatcaaa tttttggagc agctttcaaa tcattgtttg gaggaatgtc 2341 ctggttctca caaattctca ttggaacgtt gctgatgtgg ttgggtctga acacaaagaa 2401 tggatctatt tcccttatgt gcttggcctt agggggagtg ttgatcttct tatccacagc 2461 cgtctctgct gatgtggggt gctcggtgga cttctcaaag aaggagacga gatgtggtac 2521 aggggtgttc gtctataacg acgttgaagc ctggagggac aggtacaagt accatcctga 2581 ctccccccgt agattggcag cagcagtcaa gcaagcctgg gaagatggta tctgcgggat 2641 ctcctctgtt tcaagaatgg aaaacatcat gtggagatca gtagaagggg agctcaacgc 2701 aatcctggaa gagaatggag ttcaactgac ggtcgttgtg ggatctgtaa aaaaccccat 2761 gtggagaggt ccacagagat tgcccgtgcc tgtgaacgag ctgccccacg gctggaaggc 2821 ttgggggaaa tcgtacttcg tcagagcagc aaagacaaat aacagctttg tcgtggatgg 2881 tgacacactg aaggaatgcc cactcaaaca tagagcatgg aacagctttc ttgtggagga 2941 tcatgggttc ggggtatttc acactagtgt ctggctcaag gttagagaag attattcatt 3001 agagtgtgat ccagccgtta ttggaacagc tgttaaggga aaggaggctg tacacagtga 3061 tctaggctac tggattgaga gtgagaagaa tgacacatgg aggctgaaga gggcccatct 3121 gatcgagatg aaaacatgtg aatggccaaa gtcccacaca ttgtggacag atggaataga 3181 agagagtgat ctgatcatac ccaagtcttt agctgggcca ctcagccatc acaataccag 3241 agagggctac aggacccaaa tgaaagggcc atggcacagt gaagagcttg aaattcggtt 3301 tgaggaatgc ccaggcacta aggtccacgt ggaggaaaca tgtggaacaa gaggaccatc 3361 tctgagatca accactgcaa gcggaagggt gatcgaggaa tggtgctgca gggagtgcac 3421 aatgccccca ctgtcgttct gggctaaaga tggctgttgg tatggaatgg agataaggcc 3481 caggaaagaa ccagaaagca acttagtaag gtcaatggtg actgcaggat caactgatca 3541 catggatcac ttctcccttg gagtgcttgt gattctgctc atggtgcagg aagggctgaa 3601 gaagagaatg accacaaaga tcatcataag cacatcaatg gcagtgctgg tagctatgat 3661 cctgggagga ttttcaatga gtgacctggc taagcttgca attttgatgg gtgccacctt 3721 cgcggaaatg aacactggag gagatgtagc tcatctggcg ctgatagcgg cattcaaagt 3781 cagaccagcg ttgctggtat ctttcatctt cagagctaat tggacacccc gtgaaagcat 3841 gctgctggcc ttggcctcgt gtcttttgca aactgcgatc tccgccttgg agggcgacct 3901 gatggttctc atcaatggtt ttgctttggc ctggttggca atacgagcga tggttgttcc 3961 acgcactgac aacatcacct tggcaatcct ggctgctctg acaccactga cccggggcac 4021 actgcttgtg gcgtggagag caggccttgc tacttgcggg gggtttatgc tcctctctct 4081 gaagggaaaa ggcagtgtga agaagaactt accatttgtc atggccctgg gactaaccgc 4141 tgtgaggctg gtcgacccca tcaacgtggt gggactgctg ttgctcacaa ggagtgggaa 4201 gcggagctgg ccccctagcg aagtactcac agctgttggc ctgatatgcg cattggctgg 4261 agggttcgcc aaggcagata tagagatggc tgggcccatg gccgcggtcg gtctgctaat 4321 tgtcagttac gtggtctcag gaaagagtgt ggacatgtac attgaaagag caggtgacat 4381 cacatgggaa aaagatgcgg aagtcactgg aaacagtccc cggctcgacg tggcgctaga 4441 tgagagtggt gatttctccc tggtggagga tgacggtccc cccatgagag agatcatact 4501 caaggtggtc ctgatgacca tctgtggcat gaacccaata gccataccct ttgcagctgg 4561 agcgtggtac gtatacgtga agactggaaa aaggagtggt gctctatggg atgtgcctgc 4621 tcccaaggaa gtaaaaaagg gggagaccac agatggagtg tacagagtaa tgactcgtag 4681 actgctaggt tcaacacaag ttggagtggg agttatgcaa gagggggtct ttcacactat 4741 gtggcacgtc acaaaaggat ccgcgctgag aagcggtgaa gggagacttg atccatactg 4801 gggagatgtc aagcaggatc tggtgtcata ctgtggtcca tggaagctag atgccgcctg 4861 ggacgggcac agcgaggtgc agctcttggc cgtgcccccc ggagagagag cgaggaacat 4921 ccagactctg cccggaatat ttaagacaaa ggatggggac attggagcgg ttgcgctgga 4981 ttacccagca ggaacttcag gatctccaat cctagacaag tgtgggagag tgataggact 5041 ttatggcaat ggggtcgtga tcaaaaatgg gagttatgtt agtgccatca cccaagggag 5101 gagggaggaa gagactcctg ttgagtgctt cgagccttcg atgctgaaga agaagcagct 5161 aactgtctta gacttgcatc ctggagctgg gaaaaccagg agagttcttc ctgaaatagt 5221 ccgtgaagcc ataaaaacaa gactccgtac tgtgatctta gctccaacca gggttgtcgc 5281 tgctgaaatg gaggaagccc ttagagggct tccagtgcgt tatatgacaa cagcagtcaa 5341 tgtcacccac tctggaacag aaatcgtcga cttaatgtgc catgccacct tcacttcacg 5401 tctactgcag ccaatcagag tccccaacta taatctgtat attatggatg aggcccactt 5461 cacagatccc tcaagtatag cagcaagagg atacatttca acaagggttg agatgggcga 5521 ggcggctgcc atcttcatga ccgccacgcc accaggaacc cgtgacgcat ttccggactc 5581 caactcacca attatggaca ccgaagtgga agtcccagag agagcctgga gctcaggctt 5641 tgattgggtg acggatcatt ctggaaaaac agtttggttt gttccaagcg tgaggaacgg 5701 caatgagatc gcagcttgtc tgacaaaggc tggaaaacgg gtcatacagc tcagcagaaa 5761 gacttttgag acagagttcc agaaaacaaa acatcaagag tgggactttg tcgtgacaac 5821 tgacatttca gagatgggcg ccaactttaa agctgaccgt gtcatagatt ccaggagatg 5881 cctaaagccg gtcatacttg atggcgagag agtcattctg gctggaccca tgcctgtcac 5941 acatgccagc gctgcccaga ggagggggcg cataggcagg aatcccaata aacctggaga 6001 tgagtatctg tatggaggtg ggtgcgcaga gactgacgaa gaccatgcac actggcttga 6061 agcaagaatg ctccttgaca atatttacct ccaagatggc ctcatagcct cgctctatcg 6121 acctgaggcc gacaaagtag cagccattga gggagagttc aagcttagga cggagcaaag 6181 gaagaccttt gtggaactca tgaaaagagg agatcttcct gtttggctgg cctatcaggt 6241 tgcatctgcc ggaataacct acacagatag aagatggtgc tttgatggca cgaccaacaa 6301 caccataatg gaagacagtg tgccggcaga ggtgtggacc agacacggag agaaaagagt 6361 gctcaaaccg aggtggatgg acgccagagt ttgttcagat catgcggccc tgaagtcatt 6421 caaggagttt gccgctggga aaagaggagc ggcttttgga gtgatggaag ccctgggaac 6481 actgccagga cacatgacag agagattcca ggaagccatt gacaacctcg ctgtgctcat 6541 gcgggcagag actggaagca ggccttacaa agccgcggcg gcccaattgc cggagaccct 6601 agagaccatt atgcttttgg ggttgctggg aacagtctcg ctgggaatct ttttcgtctt 6661 gatgaggaac aagggcatag ggaagatggg ctttggaatg gtgactcttg gggccagcgc 6721 atggctcatg tggctctcgg aaattgagcc agccagaatt gcatgtgtcc tcattgttgt 6781 gttcctattg ctggtggtgc tcatacctga gccagaaaag caaagatctc cccaggacaa 6841 ccaaatggca atcatcatca tggtagcagt aggtcttctg ggcttgatta ccgccaatga 6901 actcggatgg ttggagagaa caaagagtga cctaagccat ctaatgggaa ggagagagga 6961 gggggcaacc ataggattct caatggacat tgacttgcgg ccagcctcag cttgggccat 7021 ctatgctgcc ttgacaactt tcattacccc agccgtccaa catgcagtga ccacttcata 7081 caacaactac tccttaatgg cgatggccac gcaagctgga gtgttgtttg gtatgggcaa 7141 agggatgcca ttctacgcat gggactttgg agtcccgctg ctaatgatag gttgctactc 7201 acaattaaca cccctgaccc taatagtggc catcattttg ctcgtggcgc actacatgta 7261 cttgatccca gggctgcagg cagcagctgc gcgtgctgcc cagaagagaa cggcagctgg 7321 catcatgaag aaccctgttg tggatggaat agtggtgact gacattgaca caatgacaat 7381 tgacccccaa gtggagaaaa agatgggaca ggtgctactc atagcagtag ccgtctccag 7441 cgccatactg tcgcggaccg cctgggggtg gggggaggct ggggccctga tcacagccgc 7501 aacttccact ttgtgggaag gctctccgaa caagtactgg aactcctcta cagccacttc 7561 actgtgtaac atttttaggg gaagttactt ggctggagct tctctaatct acacagtaac 7621 aagaaacgct ggcttggtca agagacgtgg gggtggaaca ggagagaccc tgggagagaa 7681 atggaaggcc cgcttgaacc agatgtcggc cctggagttc tactcctaca aaaagtcagg 7741 catcaccgag gtgtgcagag aagaggcccg ccgcgccctc aaggacggtg tggcaacggg 7801 aggccatgct gtgtcccgag gaagtgcaaa gctgagatgg ttggtggagc ggggatacct 7861 gcagccctat ggaaaggtca ttgatcttgg atgtggcaga gggggctgga gttactacgc 7921 cgccaccatc cgcaaagttc aagaagtgaa aggatacaca aaaggaggcc ctggtcatga 7981 agaacccgtg ttggtgcaaa gctatgggtg gaacatagtc cgtcttaaga gtggggtgga 8041 cgtctttcat atggcggctg agccgtgtga cacgttgctg tgtgacatag gtgagtcatc 8101 atctagtcct gaagtggaag aagcacggac gctcagagtc ctctccatgg tgggggattg 8161 gcttgaaaaa agaccaggag ccttttgtat aaaagtgttg tgcccataca ccagcactat 8221 gatggaaacc ctggagcgac tgcagcgtag gtatggggga ggactggtca gagtgccact 8281 ctcccgcaac tctacacatg agatgtactg ggtctctgga gcgaaaagca acaccataaa 8341 aagtgtgtcc accacgagcc agctcctctt ggggcgcatg gacgggccta ggaggccagt 8401 gaaatatgag gaggatgtga atctcggctc tggcacgcgg gctgtggtaa gctgcgctga 8461 agctcccaac atgaagatca ttggtaaccg cattgaaagg atccgcagtg agcacgcgga 8521 aacgtggttc tttgacgaga accacccata taggacatgg gcttaccatg gaagctatga 8581 ggcccccaca caagggtcag cgtcctctct aataaacggg gttgtcaggc tcctgtcaaa 8641 accctgggat gtggtgactg gagtcacagg aatagccatg accgacacca caccgtatgg 8701 tcagcaaaga gttttcaagg aaaaagtgga cactagggtg ccagaccccc aagaaggcac 8761 tcgtcaggtt atgagcatgg tctcttcctg gttgtggaaa gagctaggca aacacaaacg 8821 gccacgagtc tgtaccaaag aagagttcat caacaaggtt cgtagcaatg cagcattagg 8881 ggcaatattt gaagaggaaa aagagtggaa gactgcagtg gaagctgtga acgatccaag 8941 gttctgggct ctagtggaca aggaaagaga gcaccacctg agaggagagt gccagagttg 9001 tgtgtacaac atgatgggaa aaagagaaaa gaaacaaggg gaatttggaa aggccaaggg 9061 cagccgcgcc atctggtata tgtggctagg ggctagattt ctagagttcg aagcccttgg 9121 attcttgaac gaggatcact ggatggggag agagaactca ggaggtggtg ttgaagggct 9181 gggattacaa agactcggat atgtcctaga agagatgagt cgcataccag gaggaaggat 9241 gtatgcagat gacactgctg gctgggacac ccgcattagc aggtttgatc tggagaatga 9301 agctctaatc accaaccaaa tggagaaagg gcacagggcc ttggcattgg ccataatcaa 9361 gtacacatac caaaataaag tggtaaaggt ccttagacca gctgaaaaag ggaaaacagt 9421 tatggacatt atttcgagac aagaccaaag ggggagcgga caagttgtca cttacgctct 9481 taacacattt accaacctag tggtgcaact cattcggaat atggaggctg aggaagttct 9541 agagatgcaa gacttgtggc tgctgcggag gtcagagaaa gtgaccaact ggttgcagag 9601 caacggatgg gataggctca aacgaatggc agtcagtgga gatgattgcg ttgtgaagcc 9661 aattgatgat aggtttgcac atgccctcag gttcttgaat gatatgggaa aagttaggaa 9721 ggacacacaa gagtggaaac cctcaactgg atgggacaac tgggaagaag ttccgttttg 9781 ctcccaccac ttcaacaagc tccatctcaa ggacgggagg tccattgtgg ttccctgccg 9841 ccaccaagat gaactgattg gccgggcccg cgtctctcca ggggcgggat ggagcatccg 9901 ggagactgct tgcctagcaa aatcatatgc gcaaatgtgg cagctccttt atttccacag 9961 aagggacctc cgactgatgg ccaatgccat ttgttcatct gtgccagttg actgggttcc 10021 aactgggaga actacctggt caatccatgg aaagggagaa tggatgacca ctgaagacat 10081 gcttgtggtg tggaacagag tgtggattga ggagaacgac cacatggaag acaagacccc 10141 agttgcgaaa tggacagaca ttccctattt gggaaaaagg gaagacttgt ggtgtggatc 10201 tctcataggg cacagaccgc gcaccacctg ggctgagaac attaaaaaca cagtcaacat 10261 ggtgcgcagg atcataggtg atgaagaaaa gtacatggac tacctatcca cccaagttcg 10321 ctacttgggt gaagaagggt ctacacctgg agtgctgtaa gcaccaatct taatgttgtc 10381 aggcctgcta gtcagccaca gcttggggaa agctgtgcag cctgtgaccc ccccaggaga 10441 agctgggaaa ccaagcctat agtcaggccg agaacgccat ggcacggaag aagccatgct 10501 gcctgtgagc ccctcagagg acactgagtc aaaaaacccc acgcgcttgg aggcgcagga 10561 tgggaaaaga aggtggcgac cttccccacc cttcaatctg gggcctgaac tggagatcag 10621 ctgtggatct ccagaagagg gactagtggt tagaggag Link to comment Share on other sites More sharing options...
niman Posted October 24, 2016 Author Report Share Posted October 24, 2016 Sequences producing significant alignments: Select:AllNone Selected:0 AlignmentsDownloadGenBankGraphicsDistance tree of resultsShow/hide columns of the table presenting sequences producing significant alignments Sequences producing significant alignments: Select for downloading or viewing reports Description Max score Total score Query cover E value Ident Accession Select seq gb|KY014303.1| Zika virus isolate Zika virus/H.sapiens-wt/DOM/2016/BB-0127-SER polyprotein gene, complete cds 18525 18525 100% 0.0 100% KY014303.1 Select seq gb|KX198135.1| Zika virus strain ZIKV/Homo sapiens/PAN/BEI-259634_V4/2016, complete genome 18474 18474 100% 0.0 99% KX198135.1 Select seq gb|KX156776.1| Zika virus strain ZIKV/Homo sapiens/PAN/CDC-259364_V1-V2/2015, complete genome 18465 18465 100% 0.0 99% KX156776.1 Select seq gb|KX156774.1| Zika virus strain ZIKV/Homo sapiens/PAN/CDC-259359_V1-V3/2015, complete genome 18462 18462 100% 0.0 99% KX156774.1 Select seq gb|KU647676.1| Zika virus strain MRS_OPY_Martinique_PaRi_2015 polyprotein gene, complete cds 18462 18462 100% 0.0 99% KU647676.1 Select seq gb|KU820897.5| Zika virus isolate FLR polyprotein gene, complete cds 18456 18456 100% 0.0 99% KU820897.5 Select seq gb|KX247646.1| Zika virus isolate Zika virus/Homo sapiens/COL/UF-1/2016, complete genome 18456 18456 100% 0.0 99% KX247646.1 Select seq gb|KX156775.1| Zika virus strain ZIKV/Homo sapiens/PAN/CDC-259249_V1-V3/2015, complete genome 18456 18456 100% 0.0 99% KX156775.1 Select seq gb|KX087102.1| Zika virus strain ZIKV/Homo sapiens/COL/FLR/2015, complete genome 18456 18456 100% 0.0 99% KX087102.1 Select seq gb|KU922960.1| Zika virus isolate MEX/InDRE/Sm/2016, complete genome 18444 18444 100% 0.0 99% KU922960.1 Select seq gb|KU922923.1| Zika virus isolate MEX/InDRE/Lm/2016, complete genome 18438 18438 100% 0.0 99% KU922923.1 Select seq gb|KX893855.1| Zika virus strain Zika virus/Homo sapiens/VEN/UF-2/2016, complete genome 18433 18433 100% 0.0 99% KX893855.1 Select seq gb|KX702400.1| Zika virus strain Zika virus/Homo sapiens/VEN/UF-1/2016, complete genome 18429 18429 100% 0.0 99% KX702400.1 Select seq gb|KX548902.1| Zika virus isolate ZIKV/COL/FCC00093/2015 polyprotein gene, complete cds 18429 18429 100% 0.0 99% KX548902.1 Select seq gb|KX447510.1| Zika virus isolate 1_0049_PF polyprotein gene, complete cds 18411 18411 100% 0.0 99% KX447510.1 Select seq gb|KU991811.1| Zika virus isolate Brazil/2016/INMI1 polyprotein gene, complete cds 18408 18408 100% 0.0 99% KU991811.1 Select seq gb|KX447512.1| Zika virus isolate 1_0181_PF polyprotein gene, complete cds 18402 18402 100% 0.0 99% KX447512.1 Select seq gb|KX369547.1| Zika virus strain PF13/251013-18, complete genome 18402 18402 100% 0.0 99% KX369547.1 Select seq gb|KU509998.3| Zika virus strain Haiti/1225/2014, complete genome 18402 18402 100% 0.0 99% KU509998.3 Select seq gb|KJ776791.2| Zika virus strain H/PF/2013, complete genome 18399 18399 100% 0.0 99% KJ776791.2 Select seq gb|KX447509.1| Zika virus isolate 1_0087_PF polyprotein gene, complete cds 18399 18399 100% 0.0 99% KX447509.1 Select seq gb|KX447513.1| Zika virus isolate 1_0134_PF polyprotein gene, complete cds 18393 18393 100% 0.0 99% KX447513.1 Select seq gb|KX197205.1| Zika virus isolate 9, complete genome 18390 18390 100% 0.0 99% KX197205.1 Select seq gb|KX447515.1| Zika virus isolate 1_0030_PF polyprotein gene, complete cds 18390 18390 100% 0.0 99% KX447515.1 Select seq gb|KX447511.1| Zika virus isolate 1_0015_PF polyprotein gene, complete cds 18390 18390 100% 0.0 99% KX447511.1 Select seq gb|KX280026.1| Zika virus isolate Paraiba_01, complete genome 18390 18390 100% 0.0 99% KX280026.1 Select seq gb|KU321639.1| Zika virus strain ZikaSPH2015, complete genome 18390 18390 100% 0.0 99% KU321639.1 Select seq gb|KX447514.1| Zika virus isolate 1_0035_PF polyprotein gene, complete cds 18384 18384 100% 0.0 99% KX447514.1 Select seq gb|KX051563.1| Zika virus isolate Haiti/1/2016, complete genome 18384 18384 100% 0.0 99% KX051563.1 Select seq gb|KX447516.1| Zika virus isolate 1_0111_PF polyprotein gene, complete cds 18381 18381 100% 0.0 99% KX447516.1 Select seq gb|KU729218.1| Zika virus isolate BeH828305 polyprotein gene, complete cds 18381 18381 100% 0.0 99% KU729218.1 Select seq gb|KU707826.1| Zika virus isolate SSABR1, complete genome 18381 18381 100% 0.0 99% KU707826.1 Select seq gb|KU365779.1| Zika virus strain BeH819966 polyprotein gene, complete cds 18381 18381 100% 0.0 99% KU365779.1 Select seq gb|KX262887.1| Zika virus isolate 103451, complete genome 18375 18375 100% 0.0 99% KX262887.1 Select seq gb|KX197192.1| Zika virus isolate ZIKV/H.sapiens/Brazil/PE243/2015, complete genome 18375 18375 100% 0.0 99% KX197192.1 Select seq gb|KX811222.1| Zika virus isolate Brazil_2015_MG, complete genome 18372 18372 100% 0.0 99% KX811222.1 Select seq gb|KU497555.1| Zika virus isolate Brazil-ZKV2015, complete genome 18372 18372 99% 0.0 99% KU497555.1 Select seq gb|KY014297.1| Zika virus isolate Zika virus/H.sapiens-wt/BRA/2016/FC-6864-URI polyprotein gene, complete cds 18366 18366 100% 0.0 99% KY014297.1 Select seq gb|KX879604.1| Zika virus isolate SN089, complete genome 18366 18366 100% 0.0 99% KX879604.1 Select seq gb|KX694534.1| Zika virus strain ZIKV/Homo sapiens/HND/R103451/2015, complete genome 18366 18366 100% 0.0 99% KX694534.1 Select seq gb|KU758877.1| Zika virus isolate 17271 polyprotein gene, complete cds 18366 18366 100% 0.0 99% KU758877.1 Select seq gb|KU926309.1| Zika virus isolate Rio-U1, complete genome 18366 18366 100% 0.0 99% KU926309.1 Select seq gb|KU365780.1| Zika virus strain BeH815744 polyprotein gene, complete cds 18366 18366 100% 0.0 99% KU365780.1 Select seq gb|KU940228.1| Zika virus isolate Bahia07, partial genome 18363 18363 100% 0.0 99% KU940228.1 Select seq gb|KU365777.1| Zika virus strain BeH818995 polyprotein gene, complete cds 18363 18363 100% 0.0 99% KU365777.1 Select seq gb|KX879603.1| Zika virus isolate SN062, complete genome 18357 18357 100% 0.0 99% KX879603.1 Select seq gb|KU501217.1| Zika virus strain 8375 polyprotein gene, complete cds 18357 18357 100% 0.0 99% KU501217.1 Select seq gb|KU365778.1| Zika virus strain BeH819015 polyprotein gene, complete cds 18357 18357 100% 0.0 99% KU365778.1 Select seq gb|KU312312.1| Zika virus isolate Z1106033 polyprotein gene, complete cds 18357 18357 100% 0.0 99% KU312312.1 Select seq gb|KY014304.1| Zika virus isolate Zika virus/H.sapiens-wt/DOM/2016/BB-0180-SER polyprotein gene, complete cds 18354 18354 100% 0.0 99% KY014304.1 Select seq gb|KU729217.2| Zika virus isolate BeH823339 polyprotein gene, complete cds 18354 18354 100% 0.0 99% KU729217.2 Select seq gb|KU527068.1| Zika virus strain Natal RGN, complete genome 18354 18354 100% 0.0 99% KU527068.1 Select seq gb|KU501216.1| Zika virus strain 103344 polyprotein gene, complete cds 18354 18354 100% 0.0 99% KU501216.1 Select seq gb|KY014327.1| Zika virus isolate Zika virus/H.sapiens-wt/HND/2016/HU-ME167-PLA polyprotein gene, complete cds 18350 18350 100% 0.0 99% KY014327.1 Select seq gb|KX447517.1| Zika virus isolate 1_0038_PF polyprotein gene, complete cds 18348 18348 100% 0.0 99% KX447517.1 Select seq gb|KX247632.1| Zika virus isolate MEX_I_7 polyprotein gene, complete cds 18348 18348 100% 0.0 99% KX247632.1 Select seq gb|KU937936.1| Zika virus isolate ZIKVNL00013 polyprotein gene, complete cds 18348 18348 100% 0.0 99% KU937936.1 Select seq gb|KU501215.1| Zika virus strain PRVABC59, complete genome 18348 18348 100% 0.0 99% KU501215.1 Select seq gb|KY014315.1| Zika virus isolate Zika virus/H.sapiens-wt/HND/2016/HU-ME152-SER polyprotein gene, complete cds 18345 18345 100% 0.0 99% KY014315.1 Select seq gb|KX601168.1| Zika virus strain ZIKV/Homo Sapiens/PRI/PRVABC59/2015, complete genome 18345 18345 100% 0.0 99% KX601168.1 Select seq gb|KX520666.1| Zika virus isolate HS-2015-BA-01 polyprotein gene, complete cds 18345 18345 100% 0.0 99% KX520666.1 Select seq gb|KX087101.2| Zika virus strain ZIKV/Homo sapiens/PRI/PRVABC59/2015, complete genome 18345 18345 100% 0.0 99% KX087101.2 Select seq gb|KY014296.1| Zika virus isolate Zika virus/H.sapiens-wt/BRA/2016/FC-DQ131D1-URI polyprotein gene, complete cds 18339 18339 100% 0.0 99% KY014296.1 Select seq gb|KX806557.2| Zika virus isolate TS17-2016, complete genome 18339 18339 100% 0.0 99% KX806557.2 Select seq gb|KX856011.1| Zika virus strain ZIKV/Aedes sp./MEX_I-44/2016, complete genome 18339 18339 100% 0.0 99% KX856011.1 Select seq gb|KX446951.1| Zika virus strain ZIKV/Aedes.sp/MEX/MEX_I-7/2016, complete genome 18339 18339 100% 0.0 99% KX446951.1 Select seq gb|KX377337.1| Zika virus strain PRVABC-59, complete genome 18339 18339 100% 0.0 99% KX377337.1 Select seq gb|KU926310.1| Zika virus isolate Rio-S1, complete genome 18339 18339 100% 0.0 99% KU926310.1 Select seq gb|KU820898.1| Zika virus isolate GZ01 polyprotein gene, complete cds 18339 18339 100% 0.0 99% KU820898.1 Select seq gb|KX446950.1| Zika virus strain ZIKV/Aedes.sp/MEX/MEX_2-81/2016, complete genome 18336 18336 100% 0.0 99% KX446950.1 Select seq gb|KU870645.1| Zika virus isolate FB-GWUH-2016, complete genome 18336 18336 100% 0.0 99% KU870645.1 Select seq gb|KU853013.1| Zika virus isolate Dominican Republic/2016/PD2, complete genome 18330 18330 100% 0.0 99% KU853013.1 Select seq gb|KU853012.1| Zika virus isolate Dominican Republic/2016/PD1, complete genome 18330 18330 100% 0.0 99% KU853012.1 Select seq gb|KY014300.1| Zika virus isolate Zika virus/H.sapiens-wt/DOM/2016/BB-0208-SER polyprotein gene, complete cds 18327 18327 100% 0.0 99% KY014300.1 Select seq dbj|LC190723.1| Zika virus genomic RNA, complete genome, strain: ZIKV/Hu/Yokohama/1/2016 18327 18327 100% 0.0 99% LC190723.1 Select seq gb|KX056898.1| Zika virus isolate Zika virus/GZ02/2016 polyprotein gene, complete cds 18327 18327 100% 0.0 99% KX056898.1 Select seq gb|KU955590.1| Zika virus isolate Z16019 polyprotein gene, complete cds 18327 18327 100% 0.0 99% KU955590.1 Select seq gb|KX766028.1| Zika virus isolate R114916, complete genome 18323 18323 100% 0.0 99% KX766028.1 Select seq gb|KY014321.1| Zika virus isolate Zika virus/H.sapiens-wt/DOM/2016/BB-0115-SER polyprotein gene, complete cds 18321 18321 100% 0.0 99% KY014321.1 Select seq gb|KU740184.2| Zika virus isolate GD01 polyprotein gene, complete cds 18321 18321 100% 0.0 99% KU740184.2 Select seq gb|KU761564.1| Zika virus isolate GDZ16001 polyprotein gene, complete cds 18321 18321 100% 0.0 99% KU761564.1 Select seq gb|KY014295.1| Zika virus isolate Zika virus/H.sapiens-wt/USA/2016/FL-010-URI polyprotein gene, complete cds 18318 18318 100% 0.0 99% KY014295.1 Select seq gb|KX842449.2| Zika virus isolate ZIKV/Homo_sapiens/USA/2016/FL010U polyprotein gene, complete cds 18318 18318 100% 0.0 99% KX842449.2 Select seq gb|KX766029.1| Zika virus isolate R116265, complete genome 18312 18312 100% 0.0 99% KX766029.1 Select seq gb|KX922707.1| Zika virus isolate ZIKV/Homo_sapiens/USA/2016/FL039U polyprotein gene, complete cds 18309 18309 100% 0.0 99% KX922707.1 Select seq gb|KX673530.1| Zika virus isolate PHE_semen_Guadeloupe, complete genome 18309 18309 100% 0.0 99% KX673530.1 Select seq gb|KX117076.1| Zika virus isolate Zhejiang04, complete genome 18309 18309 100% 0.0 99% KX117076.1 Select seq gb|KY014320.1| Zika virus isolate Zika virus/H.sapiens-wt/BRA/2016/FC-DQ42D1-URI polyprotein gene, complete cds 18305 18305 100% 0.0 99% KY014320.1 Select seq gb|KX922706.1| Zika virus isolate ZIKV/Homo_sapiens/USA/2016/FL038U polyprotein gene, complete cds 18305 18305 100% 0.0 99% KX922706.1 Select seq gb|KY014323.1| Zika virus isolate Zika virus/A.aegypti-wt/USA/2016/FL-02-MOS polyprotein gene, complete cds 18303 18303 100% 0.0 99% KY014323.1 Select seq gb|KY014322.1| Zika virus isolate Zika virus/A.aegypti-wt/USA/2016/FL-03-MOS polyprotein gene, complete cds 18303 18303 100% 0.0 99% KY014322.1 Select seq gb|KY014314.1| Zika virus isolate Zika virus/H.sapiens-wt/DOM/2016/BB-0436-SER polyprotein gene, complete cds 18303 18303 100% 0.0 99% KY014314.1 Select seq gb|KX922703.1| Zika virus isolate ZIKV/Homo_sapiens/USA/2016/FL021U polyprotein gene, complete cds 18303 18303 100% 0.0 99% KX922703.1 Select seq gb|KX838906.2| Zika virus isolate ZIKV/Aedes_aegypti/USA/2016/FL03M polyprotein gene, complete cds 18303 18303 100% 0.0 99% KX838906.2 Select seq gb|KX838905.2| Zika virus isolate ZIKV/Aedes_aegypti/USA/2016/FL02M polyprotein gene, complete cds 18303 18303 100% 0.0 99% KX838905.2 Select seq gb|KX832731.1| Zika virus isolate ZIKV/Homo_sapiens/USA//2016/Hu0015SA polyprotein gene, complete cds 18303 18303 100% 0.0 99% KX832731.1 Select seq gb|KY014316.1| Zika virus isolate Zika virus/H.sapiens-wt/USA/2016/FL-039-URI polyprotein gene, complete cds 18300 18300 100% 0.0 99% KY014316.1 Select seq gb|KX922704.1| Zika virus isolate ZIKV/Homo_sapiens/USA/2016/FL030U polyprotein gene, complete cds 18300 18300 100% 0.0 99% KX922704.1 Select seq gb|KX185891.1| Zika virus isolate Zika virus/CN/SZ02/2016 polyprotein gene, complete cds 18300 18300 100% 0.0 99% KX185891.1 Select seq gb|KU963796.1| Zika virus isolate SZ-WIV01 polyprotein gene, complete cds 18300 18300 100% 0.0 99% KU963796.1 Link to comment Share on other sites More sharing options...
niman Posted October 24, 2016 Author Report Share Posted October 24, 2016 LOCUS KY014304 10659 bp RNA linear VRL 21-OCT-2016 DEFINITION Zika virus isolate Zika virus/H.sapiens-wt/DOM/2016/BB-0180-SER polyprotein gene, complete cds. ACCESSION KY014304 VERSION KY014304.1 DBLINK BioProject: PRJNA344504 BioSample: SAMN05844989 KEYWORDS . SOURCE Zika virus ORGANISM Zika virus Viruses; ssRNA viruses; ssRNA positive-strand viruses, no DNA stage; Flaviviridae; Flavivirus. REFERENCE 1 (bases 1 to 10659) AUTHORS Baniecki,M.L., Barnes,K.G., Bosch,I., Freije,C.A., Gehrke,L., Gladden-Young,A.D., Gnirke,A., Luo,C.Y., MacInnis,B., Matranga,C.B., Metsky,H.C., Park,D.J., Qu,J., Sabeti,P.C., Tomkins-Tinch,C.H., West,K.L., Winnicki,S., Wohl,S. and Yozwiak,N.L. TITLE Direct Submission JOURNAL Submitted (20-OCT-2016) Viral Genomics, Infectious Disease Program (Infectious Disease Initiative), Broad Institute, 75 Ames St, Cambridge, MA 02142, USA COMMENT ##Assembly-Data-START## Assembly Method :: github.com/broadinstitute/viral-ngs v. v1.12.0-51-g8588fdb Assembly Name :: DOM_2016_BB-0180_SER-1 Coverage :: 250x Sequencing Technology :: Illumina; Swift LC ##Assembly-Data-END## FEATURES Location/Qualifiers source 1..10659 /organism="Zika virus" /mol_type="genomic RNA" /isolate="Zika virus/H.sapiens-wt/DOM/2016/BB-0180-SER" /isolation_source="serum" /host="Homo sapiens" /db_xref="taxon:64320" /country="Dominican Republic: Santo Domingo" /collection_date="18-Apr-2016" /collected_by="Hospital General de la Plaza de la Salud, Santo Domingo, Dominican Republic" 5'UTR <1..88 CDS 89..10360 /note="contains structural and nonstructural proteins" /codon_start=1 /product="polyprotein" /protein_id="AOY08525.1" /translation="MKNPKKKSGGFRIVNMLKRGVARVSPFGGLKRLPAGLLLGHGPI RMVLAILAFLRFTAIKPSLGLINRWGSVGKKEAMEIIKKFKKDLAAMLRIINARKEKK RRGADTSVGIVGLLLTTAMAAEVTRRGSAYYMYLDRNDAGEAISFPTTLGMNKCYIQI MDLGHMCDATMSYECPMLDEGVEPDDVDCWCNTTSTWVVYGTCHHKKGEARRSRRAVT LPSHSTRKLQTRSQTWLESREYTKHLIRVENWIFRNPGFALAAAAIAWLLGSSTSQKV IYLVMILLIAPAYSIRCIGVSNRDFVEGMSGGTWVDVVLEHGGCVTVMAQDKPTVDIE LVTTTVSNMAEVRSYCYEASISDMASDSRCPTQGEAYLDKQSDTQYVCKRTLVDRGWG NGCGLFGKGSLVTCAKFACSKKMTGKSIQPENLEYRIMLSVHGSQHSGMIVNDTGHET DENRAKVEITPNSPRAEATLGGFGSLGLDCEPRTGLDFSDLYYLTMNNKHWLVHKEWF HDIPLPWHAGADTGTPHWNNKEALVEFKDAHAKRQTVVVLGSQEGAVHTALAGALEAE MDGAKGRLSSGHLKCRLKMDKLRLKGVSYSLCTAAFTFTKIPAETLHGTVTVEVQYAG TDGPCKVPAQMAVDMQTLTPVGRLITANPVITESTENSKMMLELDPPFGDSYIVIGVG EKKITHHWHRSGSTIGKAFEATVRGAKRMAVLGDTAWDFGSVGGALNSLGKGIHQIFG AAFKSLFGGMSWFSQILIGTLLMWLGLNTKNGSISLMCLALGGVLIFLSTAVSADVGC SVDFSKKETRCGTGVFVYNDVEAWRDRYKYHPDSPRRLAAAVKQAWEDGICGISSVSR MENIMWRSVEGELNAILEENGVQLTVVVGSVKNPMWRGPQRLPVPVNELPHGWKAWGK SYFVRAAKTNNSFVVDGDTLKECPLKHRAWNSFLVEDHGFGVFHTSVWLKVREDYSLE CDPAVIGTAVKGKEAVHSDLGYWIESEKNDTWRLKRAHLIEMKTCEWPKSHTLWTDGI EESDLIIPKSLAGPLSHHNTREGYRTQMKGPWHSEELEIRFEECPGTKVHVEETCGTR GPSLRSTTASGRVIEEWCCRECTMPPLSFRAKDGCWYGMEIRPRKEPESNLVRSVVTA GSTDHMDHFSLGVLVILLMVQEGLKKRMTTKIIISTSMAVLVAMILGGFSMSDLAKLA ILMGATFAEMNTGGDVAHLALIAAFKVRPALLVSFIFRANWTPRESMLLALASCLLQT AISALEGDLMVLINGFALAWLAIRAMVVPRTDNITLAILAALTPLARGTLLVAWRAGL ATCGGFMLLSLKGKGSVKKNLPFVMALGLTAVRLVDPINVVGLLLLTRSGKRSWPPSE VLTAVGLICALAGGFAKADIEMAGPMAAVGLLIVSYVVSGKSVDMYIERAGDITWEKD AEVTGNSPRLDVALDESGDFSLVEDDGPPMREIILKVVLMTICGMNPIAIPFAAGAWY VYVKTGKRSGALWDVPAPKEVKKGETTDGVYRVMTRRLLGSTQVGVGVMQEGVFHTMW HVTKGSALRSGEGRLDPYWGDVKQDLVSYCGPWKLDAAWDGHSEVQLLAVPPGERARN IQTLPGIFKTKDGDIGAVALDYPAGTSGSPILDKCGRVIGLYGNGVVIKNGSYVSAIT QGRREEETPVECFEPSMLKKKQLTVLDLHPGAGKTRRVLPEIVREAIKTRLRTVILAP TRVVAAEMEEALRGLPVRYMTTAVNVTHSGTEIVDLMCHATFTSRLLQPIRVPNYNLY IMDEAHFTDPSSIAARGYISTRVEMGEAAAIFMTATPPGTRDAFPDSNSPIMDTEVEV PERAWSSGFDWVTDHSGKTVWFVPSVRNGNEIAACLTKAGKRVIQLSRKTFETEFQKT KHQEWDFVVTTDISEMGANFKADRVIDSRRCLKPVILDGERVILAGPMPVTHASAAQR RGRIGRNPNKPGDEYLYGGGCAETDEDHAHWLEARMLLDNIYLQDGLIASLYRPEADK VAAIEGEFKLRTEQRKTFVELMKRGDLPVWLAYQVASAGITYTDRRWCFDGTTNNTIM EDSVPAEVWTRHGEKRVLKPRWMDARVCSDHAALKSFKEFAAGKRGAAFGVMEALGTL PGHMTERFQEAIDNLAVLMRAETGSRPYKAAAAQLPETLETIMLLGLLGTVSLGIFFV LMRNKGIGKMGFGMVTLGASAWLMWLSEIEPARIACVLIVVFLLLVVLIPEPEKQRSP QDNQMAIIIMVAVGLLGLITANELGWLERTKSDLSHLMGRREEGATIGFSMDIDLRPA SAWAIYAALTTFITPAVQHAVTTSYNNYSLMAMATQAGVLFGMGKGMPFYAWDFGVPL LMIGCYSQLTPLTLIVAIILLVAHYMYLIPGLQAAAARAAQKRTAAGIMKNPVVDGIV VTDIDTMTIDPQVEKKMGQVLLIAVAVSSAILSRTAWGWGEAGALITAATSTLWEGSP NKYWNSSTATSLCNIFRGSYLAGASLIYTVTRNAGLVKRRGGGTGETLGEKWKARLNQ MSALEFYSYKKSGITEVCREEARRALKDGVATGGHAVSRGSAKLRWLVERGYLQPYGK VIDLGCGRGGWSYYAATIRKVQEVKGYTKGGPGHEEPVLVQSYGWNIVRLKSGVDVFH MAAEPCDTLLCDIGESSSSPEVEEARTLRVLSMVGDWLEKRPGAFCIKVLCPYTSTMM ETLERLQRRYGGGLVRVPLSRNSTHEMYWVSGAKSNTIKSVSTTSQLLLGRMDGPRRP VKYEEDVNLGSGTRAVVSCAEAPNMKIIGNRIERIRSEHAETWFVDENHPYRTWAYHG SYEAPTQGSASSLVNGVVRLLSKPWDVVTGVTGIAMTDTTPYGQQRVFKEKVDTRVPD PQEGTRQVMSMVSSWLWKELGKHKRPRVCTKEEFINKVRSNAALGAIFEEEKEWKTAV EAVNDPRFWALVDKEREHHLRGECQSCVYNMMGKREKKQGEFGKAKGSRAIWYMWLGA RFLEFEALGFLNEDHWMGRENSGGGVEGLGLQRLGYVLEEMSRIPGGRMYADDTAGWD TRISRFDLENEALITNQMEKGHRALALAIIKYTYQNKVVKVLRPAEKGKTVMDIISRQ DQRGSGQVVTYALNTFTNLVVQLIRNMEAEEVLEMQDLWLLRRSEKVTNWLQSNGWDR LKRMAVSGDDCVVKPIDDRFAHALRFLNDMGKVRKDTQEWKPSTGWDNWEEVPFCSHH FNKLHLKDGRSIVVPCRHQDELIGRARVSPGAGWSIRETACLAKSYAQMWQLLYFHRR DLRLMANAICSSVPVDWVPTGRTTWSIHGKGEWMTTEDMLVVWNRVWIEENDHMEDKT PVTKWTDIPYLGKREDLWCGSLIGHRPRTTWAENIKNTVNMVRRIIGEEEKYMDYLST QVRYLGEEGSTPGVL" 3'UTR 10361..10659 ORIGIN 1 tcagactgcg acagttcgag tttgaagcga aagctagcaa cagtatcaac aggttttatt 61 ttggatttgg aaacgagagt ttctggtcat gaaaaaccca aaaaagaaat ccggaggatt 121 ccggattgtc aatatgctaa aacgcggagt agcccgtgtg agcccctttg ggggcttgaa 181 gaggctgcca gccggacttc tgctgggtca tgggcccatc aggatggtct tggcgattct 241 agcctttttg agattcacgg caatcaagcc atcactgggt ctcatcaata gatggggttc 301 agtggggaaa aaagaggcta tggaaataat aaagaagttc aagaaagatc tggctgccat 361 gctgagaata atcaatgcta ggaaggagaa gaagagacga ggcgcagata ctagtgtcgg 421 aattgttggc ctcctgctga ccacagctat ggcagcggag gtcactagac gtgggagtgc 481 atactacatg tacttggaca gaaacgatgc tggggaggcc atatcttttc caaccacatt 541 ggggatgaat aagtgttata tacagatcat ggatcttgga cacatgtgtg atgccaccat 601 gagctatgaa tgccctatgc tggatgaggg ggtggaacca gatgacgtcg attgttggtg 661 caacacgacg tcaacttggg ttgtgtacgg aacctgccat cacaaaaaag gtgaagcacg 721 gagatctaga agagctgtga cgctcccctc ccattccact aggaagctgc aaacgcggtc 781 gcaaacctgg ttggaatcaa gagaatacac aaagcacttg attagagtcg aaaattggat 841 attcaggaac cctggcttcg cgttagcagc agctgccatc gcttggcttt tgggaagctc 901 aacgagccaa aaagtcatat acttggtcat gatactgctg attgccccgg catacagcat 961 caggtgcata ggagtcagca atagggactt tgtggaaggt atgtcaggtg ggacttgggt 1021 tgatgttgtc ttggaacatg gaggttgtgt caccgtaatg gcacaggaca aaccgactgt 1081 cgacatagag ctggttacaa caacagtcag caacatggcg gaggtaagat cctactgcta 1141 tgaggcatca atatcagaca tggcttcgga cagccgctgc ccaacacaag gtgaagccta 1201 ccttgacaag caatcagaca ctcaatatgt ctgcaaaaga acgttagtgg acagaggctg 1261 gggaaatgga tgtggacttt ttggcaaagg gagcctggtg acatgcgcta agtttgcatg 1321 ctccaagaaa atgaccggga agagcatcca gccagagaat ctggagtacc ggataatgct 1381 gtcagttcat ggctcccagc acagtgggat gatcgttaat gacacaggac atgaaactga 1441 tgagaataga gcgaaggttg agataacgcc caattcacca agagccgaag ccaccctggg 1501 gggttttgga agcctaggac ttgattgtga accgaggaca ggccttgact tttcagattt 1561 gtattacttg actatgaata acaagcactg gttggttcac aaggagtggt tccacgacat 1621 tccattacct tggcacgctg gggcagacac cggaactcca cactggaaca acaaagaagc 1681 actggtagag ttcaaggacg cacatgccaa aaggcaaact gtcgtggttc tagggagtca 1741 agaaggagca gttcacacgg cccttgctgg agctctggag gctgagatgg atggtgcaaa 1801 gggaaggctg tcctctggcc acttgaaatg tcgcctgaaa atggataaac ttagattgaa 1861 gggcgtgtca tactccttgt gtaccgcagc gttcacattc accaagatcc cggctgaaac 1921 actgcacggg acagtcacag tggaggtaca gtacgcaggg acagatggac cttgcaaggt 1981 tccagctcag atggcggtgg acatgcaaac tctgacccca gttgggaggt tgataaccgc 2041 caaccccgta atcactgaaa gcactgagaa ctctaagatg atgctggaac ttgatccacc 2101 atttggggac tcttacattg tcataggagt cggggagaag aagatcaccc accactggca 2161 caggagtggc agcaccattg gaaaagcatt tgaagccact gtgagaggtg ccaagagaat 2221 ggcagtcttg ggagacacag cctgggactt tggatcagtt ggaggcgctc tcaactcatt 2281 gggcaagggc atccatcaaa tttttggagc agctttcaaa tcattgtttg gaggaatgtc 2341 ctggttctca caaatcctca ttggaacgtt gctgatgtgg ttgggtctga acacaaagaa 2401 tggatctatt tccctcatgt gcttggcctt agggggagtg ttgatcttct tatccacagc 2461 cgtctctgct gatgtggggt gctcggtgga cttctcaaag aaggagacga gatgcggtac 2521 aggggtgttc gtctataacg acgttgaagc ctggagggac aggtacaagt accatcctga 2581 ctccccccgt agattggcag cagcagtcaa gcaagcctgg gaagatggta tctgcgggat 2641 ctcctctgtt tcaagaatgg aaaacatcat gtggagatca gtagaagggg agctcaatgc 2701 aatcctggaa gagaatggag ttcaactgac ggtcgttgtg ggatctgtaa aaaaccccat 2761 gtggagaggt ccacagagat tgcccgtgcc tgtgaacgag ctgccccacg gctggaaggc 2821 ttgggggaaa tcgtacttcg ttagagcagc aaagacaaat aacagctttg tcgtggatgg 2881 tgacacactg aaggaatgcc cactcaaaca tagagcatgg aacagctttc ttgtggagga 2941 tcatgggttc ggggtatttc acactagtgt ctggctcaag gttagagaag attattcatt 3001 agagtgtgat ccagccgtta ttggaacagc tgttaaggga aaggaggctg tacacagtga 3061 tctaggctac tggattgaga gtgagaagaa tgacacatgg aggctgaaga gggcccatct 3121 gatcgagatg aaaacatgtg aatggccaaa gtcccacaca ttgtggacag atggaataga 3181 agagagtgat ctgatcatac ccaagtcttt agctgggcca ctcagccatc acaataccag 3241 agagggctac aggacccaaa tgaaagggcc atggcacagt gaagagcttg aaattcggtt 3301 tgaggaatgc ccaggcacta aggtccacgt ggaggaaaca tgtggaacaa gaggaccatc 3361 tctgagatca accactgcaa gcggaagggt gatcgaggaa tggtgctgca gggagtgcac 3421 aatgccccca ctgtcgttcc gggctaaaga tggctgttgg tatggaatgg agataaggcc 3481 caggaaagaa ccagaaagca acttggtaag gtcagtggtg actgcaggat caactgatca 3541 catggatcac ttctcccttg gagtgcttgt gattctgctc atggtgcagg aagggctgaa 3601 gaagagaatg accacaaaga tcatcataag cacatcaatg gcagtgctgg tagctatgat 3661 cctgggagga ttttcaatga gtgacctggc taagcttgca attttgatgg gtgccacctt 3721 cgcggaaatg aacactggag gagatgtagc tcatctggcg ctgatagcgg cattcaaagt 3781 cagaccagcg ttgctggtat ctttcatctt cagagctaat tggacacccc gtgaaagcat 3841 gctgctggcc ttggcctcgt gtcttctgca aactgcgatc tccgccttgg aaggcgacct 3901 gatggttctc atcaatggtt ttgctttggc ctggttggca atacgagcga tggttgttcc 3961 acgcactgat aacatcacct tggcaatcct ggctgctctg acaccactgg cccggggcac 4021 actgcttgtg gcgtggagag caggccttgc tacttgcggg gggtttatgc tcctctctct 4081 gaagggaaaa ggcagtgtga agaagaactt accatttgtc atggccctgg gactaaccgc 4141 tgtgaggctg gtcgacccca tcaacgtggt gggactgctg ttgctcacaa ggagtgggaa 4201 gcggagctgg ccccctagcg aagtactcac agctgttggc ctgatatgcg cattggctgg 4261 agggttcgcc aaggcagata tagagatggc tgggcccatg gccgcggtcg gtctgctaat 4321 tgtcagttac gtggtctcag gaaagagtgt ggacatgtac attgaaagag caggtgacat 4381 cacatgggaa aaagatgcgg aagtcactgg aaacagtccc cggctcgatg tggcgctaga 4441 tgagagtggt gatttctccc tggtggagga tgacggtccc cccatgagag agatcatact 4501 caaggtggtc ctgatgacca tctgtggcat gaacccaata gccataccct ttgcagctgg 4561 agcgtggtac gtatacgtga agactggaaa aaggagtggt gctctatggg atgtgcctgc 4621 tcccaaggaa gtaaaaaagg gggagaccac agatggagtg tacagagtaa tgactcgtag 4681 actgctaggt tcaacacaag ttggagtggg agttatgcaa gagggggtct ttcacactat 4741 gtggcacgtc acaaaaggat ccgcgctgag aagcggtgaa gggagacttg atccatactg 4801 gggagatgtc aagcaggatc tggtgtcata ctgtggtcca tggaagctag atgccgcctg 4861 ggacgggcac agcgaggtgc agctcttggc cgtgcccccc ggagagagag cgaggaacat 4921 ccagactctg cccggaatat ttaagacaaa ggatggggac attggagcgg ttgcgctgga 4981 ttacccagca ggaacttcag gatctccaat cctagacaag tgtgggagag tgataggact 5041 ttatggcaat ggggtcgtga tcaaaaatgg gagttatgtt agtgccatca cccaagggag 5101 gagggaggaa gagactcctg ttgagtgctt cgagccttcg atgctgaaga agaagcagct 5161 aactgtctta gacttgcatc ctggagctgg gaaaaccagg agagttcttc ctgaaatagt 5221 ccgtgaagct ataaaaacaa gactccgtac tgtgatctta gctccaacca gggttgtcgc 5281 tgctgaaatg gaggaagccc ttagagggct tccagtgcgt tatatgacaa cagcagtcaa 5341 tgtcacccat tctggaacag aaatcgtcga cttaatgtgc catgccacct tcacttcacg 5401 tctactacag ccaatcagag tccccaacta taatctgtat attatggatg aggcccactt 5461 cacagatccc tcaagtatag cagcaagagg atacatttca acaagggttg agatgggcga 5521 ggcggctgcc atcttcatga ccgccacgcc accaggaacc cgtgacgcat ttccggactc 5581 caactcacca attatggaca ccgaagtgga agtcccagag agagcctgga gctcaggctt 5641 tgattgggtg acggatcatt ctggaaaaac agtttggttt gttccaagcg tgaggaacgg 5701 caatgagatc gcagcttgtc tgacaaaggc tggaaaacgg gtcatacagc tcagcagaaa 5761 gacttttgag acagagttcc agaaaacaaa acatcaagag tgggactttg tcgtgacaac 5821 cgacatttca gagatgggcg ccaactttaa agctgaccgt gtcatagatt ccaggagatg 5881 cctaaagccg gtcatacttg atggcgagag agtcattctg gctggaccca tgcctgtcac 5941 acatgccagc gctgcccaga ggagggggcg cataggcagg aatcccaaca aacctggaga 6001 tgagtatctg tatggaggtg ggtgcgcaga gactgacgaa gaccatgcac actggcttga 6061 agcaagaatg ctccttgaca atatttacct ccaagatggc ctcatagcct cgctctatcg 6121 acctgaggcc gacaaagtag cagccattga gggagagttc aagcttagga cggagcaaag 6181 gaagaccttt gtggaactca tgaaaagagg agatcttcct gtttggctgg cctatcaggt 6241 tgcatctgcc ggaataacct acacagatag aagatggtgc tttgatggca cgaccaacaa 6301 caccataatg gaagacagtg tgccggcaga ggtgtggacc agacacggag agaaaagagt 6361 gctcaaaccg aggtggatgg acgccagagt ttgttcagat catgcggccc tgaagtcatt 6421 caaggagttt gccgctggga aaagaggagc ggcttttgga gtgatggaag ccctgggaac 6481 actgccagga cacatgacag agagattcca ggaagccatt gacaacctcg ctgtgctcat 6541 gcgggcagag actggaagca ggccttacaa agccgcggcg gcccaattgc cggagaccct 6601 agagaccatt atgcttttgg ggttgctggg aacagtctcg ctgggaatct ttttcgtctt 6661 gatgaggaac aagggcatag ggaagatggg ctttggaatg gtgactcttg gggccagcgc 6721 atggctcatg tggctctcgg aaattgagcc agccagaatt gcatgtgtcc tcattgttgt 6781 gttcctattg ctggtggtgc tcatacctga gccagaaaag caaagatctc cccaggacaa 6841 ccaaatggca atcatcatca tggtagcagt aggtcttctg ggcttgatca ccgccaatga 6901 actcggatgg ttggagagaa caaagagtga cctaagccat ctaatgggaa ggagagagga 6961 gggagcaacc ataggattct caatggacat tgacctgcgg ccagcctcag cttgggccat 7021 ctatgctgcc ttgacaactt tcattacccc agccgtccaa catgcagtga ccacttcata 7081 caacaactac tccttaatgg cgatggccac gcaagctgga gtgttgtttg gtatgggcaa 7141 agggatgcca ttctacgcat gggactttgg agtcccgctg ctaatgatag gttgctactc 7201 acaattaaca cccctgaccc taatagtggc catcattttg ctcgtggcgc actacatgta 7261 cttgatccca gggctgcagg cagcagctgc gcgtgctgcc cagaagagaa cggcagctgg 7321 catcatgaag aaccctgttg tggatggaat agtggtgact gacattgaca caatgacaat 7381 tgacccccaa gtggagaaaa agatgggaca ggtgctactc atagcagtag ccgtctccag 7441 cgccatactg tcgcggaccg cctgggggtg gggggaggct ggggccctga tcacagccgc 7501 aacttccact ttgtgggaag gctctccgaa caagtactgg aactcctcta cagccacttc 7561 actgtgtaac atttttaggg gaagttactt ggctggagct tctctaatct acacagtaac 7621 aagaaacgct ggcttggtca agagacgtgg gggtggaaca ggagagaccc tgggagagaa 7681 atggaaggcc cgcttgaacc agatgtcggc cctggagttc tactcctaca aaaagtcagg 7741 catcaccgag gtgtgcagag aagaggcccg ccgcgccctc aaggacggtg tggcaacggg 7801 aggccatgct gtgtcccgag gaagtgcaaa gctgagatgg ttggtggagc ggggatacct 7861 gcagccctat ggaaaggtca ttgatcttgg atgtggcaga gggggctgga gttactacgc 7921 cgccaccatc cgcaaagttc aagaagtgaa aggatacaca aaaggaggcc ctggtcatga 7981 agaacccgtg ttggtgcaaa gctatgggtg gaacatagtc cgtctcaaga gtggggtgga 8041 cgtctttcat atggcggctg agccgtgtga cacgttgctg tgtgacatag gtgagtcatc 8101 atctagtcct gaagtggaag aagcacggac gctcagagtc ctctccatgg tgggggattg 8161 gcttgaaaaa agaccaggag ccttttgtat aaaagtgttg tgcccataca ccagcactat 8221 gatggaaacc ctggagcgac tgcagcgtag gtatggggga ggactggtca gagtgccact 8281 ctcccgcaac tctacacatg agatgtactg ggtctctgga gcgaaaagca acaccataaa 8341 aagtgtgtcc accacgagcc agctcctctt ggggcgcatg gacgggccta ggaggccagt 8401 gaaatatgag gaggatgtga atctcggctc tggcacgcgg gctgtggtaa gctgcgctga 8461 agctcccaac atgaagatca ttggtaaccg cattgaaagg atccgcagtg agcacgcgga 8521 aacgtggttc gttgacgaga accacccata taggacatgg gcttaccatg gaagctatga 8581 ggcccccaca caagggtcag catcctctct agtaaacggg gttgtcaggc tcctgtcaaa 8641 accctgggat gtggtgactg gagtcacagg aatagccatg accgacacca caccgtatgg 8701 tcagcaaaga gttttcaagg aaaaagtgga cactagggtg ccagaccccc aagaaggcac 8761 tcgtcaggtt atgagcatgg tctcttcctg gttgtggaaa gagctaggca aacacaaacg 8821 gccacgagtc tgtaccaaag aagagttcat caacaaggtt cgtagcaatg cagcattagg 8881 ggcaatattt gaagaggaaa aagagtggaa gactgcagtg gaagctgtga acgatccaag 8941 gttctgggct ctagtggaca aggaaagaga gcaccacctg agaggagagt gccagagttg 9001 tgtgtacaac atgatgggaa aaagagaaaa gaaacaaggg gaatttggaa aggccaaggg 9061 cagccgcgcc atctggtata tgtggctagg ggctagattt ctagagttcg aagcccttgg 9121 attcttgaac gaggatcact ggatggggag agagaactca ggaggtggtg ttgaagggct 9181 gggattacaa agactcggat atgtcctaga agagatgagt cgcataccag gaggaaggat 9241 gtatgcagat gacactgctg gctgggatac ccgcatcagc aggtttgatc tagagaatga 9301 agctctaatc accaaccaaa tggagaaagg gcacagggcc ttggcattgg ccataatcaa 9361 gtacacatac caaaataaag tggtaaaggt ccttagacca gctgaaaaag ggaaaacagt 9421 tatggacatt atttcgagac aagaccaaag ggggagcgga caagttgtca cttacgctct 9481 taacacattt accaacctag tggtgcaact cattcggaat atggaggctg aggaagttct 9541 agagatgcaa gacttgtggc tgctgcggag gtcagagaaa gtgaccaact ggttgcagag 9601 caacggatgg gataggctca aacgaatggc agtcagtgga gatgattgcg ttgtgaagcc 9661 aattgatgat aggtttgcac atgccctcag gttcttgaat gatatgggaa aagttaggaa 9721 ggacacacaa gagtggaaac cctcaactgg atgggacaac tgggaagaag ttccgttttg 9781 ctcccaccac ttcaacaagc tccatctcaa ggacgggagg tccattgtgg ttccctgccg 9841 ccaccaagat gaactgattg gccgggcccg cgtctctcca ggggcgggat ggagcatccg 9901 ggagactgct tgcctagcaa aatcatatgc gcaaatgtgg cagctccttt atttccacag 9961 aagggacctc cgactgatgg ccaatgccat ttgttcatct gtgccagttg actgggttcc 10021 aactgggaga actacctggt caatccatgg aaagggagaa tggatgacca ctgaagacat 10081 gcttgtggtg tggaacagag tgtggattga ggagaacgac cacatggaag acaagacccc 10141 agttacgaaa tggacagaca ttccctattt gggaaaaagg gaagacttgt ggtgtggatc 10201 tctcataggg cacagaccgc gcaccacctg ggctgagaac attaaaaaca cagtcaacat 10261 ggtgcgcagg atcataggtg aggaagaaaa gtacatggac tacctatcca cccaagttcg 10321 ctacttgggt gaagaagggt ctacacctgg agtgctgtaa gcaccaatct taatgttgtc 10381 aggcctgcta gtcagccaca gcttggggaa agctgtgcag cctgtggccc ccccaggaga 10441 agctgggaaa ccaagcctat agtcaggccg agaacgccat ggcacggaag aagccatgct 10501 gcctgtgagc ccctcagagg acactgagtc aaaaaacccc acgcgcttgg aggcgcagga 10561 tgggaaaaga aggtggcgac cttccccacc cttcaatctg gggcctgaac tggagatcag 10621 ctgtggatct ccagaagagg gactagtggt tagaggaga Link to comment Share on other sites More sharing options...
niman Posted October 24, 2016 Author Report Share Posted October 24, 2016 Sequences producing significant alignments: Select:AllNone Selected:0 AlignmentsDownloadGenBankGraphicsDistance tree of resultsShow/hide columns of the table presenting sequences producing significant alignments Sequences producing significant alignments: Select for downloading or viewing reports Description Max score Total score Query cover E value Ident Accession Select seq gb|KY014304.1| Zika virus isolate Zika virus/H.sapiens-wt/DOM/2016/BB-0180-SER polyprotein gene, complete cds 18525 18525 100% 0.0 100% KY014304.1 Select seq gb|KU853013.1| Zika virus isolate Dominican Republic/2016/PD2, complete genome 18474 18474 100% 0.0 99% KU853013.1 Select seq gb|KU853012.1| Zika virus isolate Dominican Republic/2016/PD1, complete genome 18474 18474 100% 0.0 99% KU853012.1 Select seq gb|KY014300.1| Zika virus isolate Zika virus/H.sapiens-wt/DOM/2016/BB-0208-SER polyprotein gene, complete cds 18471 18471 100% 0.0 99% KY014300.1 Select seq dbj|LC190723.1| Zika virus genomic RNA, complete genome, strain: ZIKV/Hu/Yokohama/1/2016 18471 18471 100% 0.0 99% LC190723.1 Select seq gb|KY014321.1| Zika virus isolate Zika virus/H.sapiens-wt/DOM/2016/BB-0115-SER polyprotein gene, complete cds 18465 18465 100% 0.0 99% KY014321.1 Select seq gb|KY014295.1| Zika virus isolate Zika virus/H.sapiens-wt/USA/2016/FL-010-URI polyprotein gene, complete cds 18462 18462 100% 0.0 99% KY014295.1 Select seq gb|KX842449.2| Zika virus isolate ZIKV/Homo_sapiens/USA/2016/FL010U polyprotein gene, complete cds 18462 18462 100% 0.0 99% KX842449.2 Select seq gb|KX922707.1| Zika virus isolate ZIKV/Homo_sapiens/USA/2016/FL039U polyprotein gene, complete cds 18453 18453 100% 0.0 99% KX922707.1 Select seq gb|KY014323.1| Zika virus isolate Zika virus/A.aegypti-wt/USA/2016/FL-02-MOS polyprotein gene, complete cds 18447 18447 100% 0.0 99% KY014323.1 Select seq gb|KY014322.1| Zika virus isolate Zika virus/A.aegypti-wt/USA/2016/FL-03-MOS polyprotein gene, complete cds 18447 18447 100% 0.0 99% KY014322.1 Select seq gb|KY014314.1| Zika virus isolate Zika virus/H.sapiens-wt/DOM/2016/BB-0436-SER polyprotein gene, complete cds 18447 18447 100% 0.0 99% KY014314.1 Select seq gb|KX922703.1| Zika virus isolate ZIKV/Homo_sapiens/USA/2016/FL021U polyprotein gene, complete cds 18447 18447 100% 0.0 99% KX922703.1 Select seq gb|KX838906.2| Zika virus isolate ZIKV/Aedes_aegypti/USA/2016/FL03M polyprotein gene, complete cds 18447 18447 100% 0.0 99% KX838906.2 Select seq gb|KX838905.2| Zika virus isolate ZIKV/Aedes_aegypti/USA/2016/FL02M polyprotein gene, complete cds 18447 18447 100% 0.0 99% KX838905.2 Select seq gb|KX832731.1| Zika virus isolate ZIKV/Homo_sapiens/USA//2016/Hu0015SA polyprotein gene, complete cds 18447 18447 100% 0.0 99% KX832731.1 Select seq gb|KX922706.1| Zika virus isolate ZIKV/Homo_sapiens/USA/2016/FL038U polyprotein gene, complete cds 18446 18446 100% 0.0 99% KX922706.1 Select seq gb|KY014316.1| Zika virus isolate Zika virus/H.sapiens-wt/USA/2016/FL-039-URI polyprotein gene, complete cds 18444 18444 100% 0.0 99% KY014316.1 Select seq gb|KX922704.1| Zika virus isolate ZIKV/Homo_sapiens/USA/2016/FL030U polyprotein gene, complete cds 18444 18444 100% 0.0 99% KX922704.1 Select seq gb|KX673530.1| Zika virus isolate PHE_semen_Guadeloupe, complete genome 18444 18444 100% 0.0 99% KX673530.1 Select seq gb|KX838904.2| Zika virus isolate ZIKV/Aedes_aegypti/USA/2016/FL01M polyprotein gene, complete cds 18438 18438 100% 0.0 99% KX838904.2 Select seq gb|KX922705.1| Zika virus isolate ZIKV/Homo_sapiens/USA/2016/FL032U polyprotein gene, complete cds 18437 18437 100% 0.0 99% KX922705.1 Select seq gb|KY014324.1| Zika virus isolate Zika virus/A.aegypti-wt/USA/2016/FL-01-MOS polyprotein gene, complete cds 18435 18435 100% 0.0 99% KY014324.1 Select seq gb|KX922708.1| Zika virus isolate ZIKV/Aedes_aegypti/USA/2016/FL04M polyprotein gene, complete cds 18429 18429 100% 0.0 99% KX922708.1 Select seq gb|KY014299.1| Zika virus isolate Zika virus/A.aegypti-wt/USA/2016/FL-04-MOS polyprotein gene, complete cds 18426 18426 100% 0.0 99% KY014299.1 Select seq gb|KX447510.1| Zika virus isolate 1_0049_PF polyprotein gene, complete cds 18393 18393 100% 0.0 99% KX447510.1 Select seq gb|KX280026.1| Zika virus isolate Paraiba_01, complete genome 18390 18390 100% 0.0 99% KX280026.1 Select seq gb|KX447512.1| Zika virus isolate 1_0181_PF polyprotein gene, complete cds 18384 18384 100% 0.0 99% KX447512.1 Select seq gb|KX369547.1| Zika virus strain PF13/251013-18, complete genome 18384 18384 100% 0.0 99% KX369547.1 Select seq gb|KU509998.3| Zika virus strain Haiti/1225/2014, complete genome 18384 18384 100% 0.0 99% KU509998.3 Select seq gb|KJ776791.2| Zika virus strain H/PF/2013, complete genome 18381 18381 100% 0.0 99% KJ776791.2 Select seq gb|KX447509.1| Zika virus isolate 1_0087_PF polyprotein gene, complete cds 18381 18381 100% 0.0 99% KX447509.1 Select seq gb|KU991811.1| Zika virus isolate Brazil/2016/INMI1 polyprotein gene, complete cds 18381 18381 100% 0.0 99% KU991811.1 Select seq gb|KU729217.2| Zika virus isolate BeH823339 polyprotein gene, complete cds 18381 18381 100% 0.0 99% KU729217.2 Select seq gb|KX447513.1| Zika virus isolate 1_0134_PF polyprotein gene, complete cds 18375 18375 100% 0.0 99% KX447513.1 Select seq gb|KX811222.1| Zika virus isolate Brazil_2015_MG, complete genome 18372 18372 100% 0.0 99% KX811222.1 Select seq gb|KX197205.1| Zika virus isolate 9, complete genome 18372 18372 100% 0.0 99% KX197205.1 Select seq gb|KX447515.1| Zika virus isolate 1_0030_PF polyprotein gene, complete cds 18372 18372 100% 0.0 99% KX447515.1 Select seq gb|KX447511.1| Zika virus isolate 1_0015_PF polyprotein gene, complete cds 18372 18372 100% 0.0 99% KX447511.1 Select seq gb|KU321639.1| Zika virus strain ZikaSPH2015, complete genome 18372 18372 100% 0.0 99% KU321639.1 Select seq gb|KX879604.1| Zika virus isolate SN089, complete genome 18366 18366 100% 0.0 99% KX879604.1 Select seq gb|KX447514.1| Zika virus isolate 1_0035_PF polyprotein gene, complete cds 18366 18366 100% 0.0 99% KX447514.1 Select seq gb|KX051563.1| Zika virus isolate Haiti/1/2016, complete genome 18366 18366 100% 0.0 99% KX051563.1 Select seq gb|KX447516.1| Zika virus isolate 1_0111_PF polyprotein gene, complete cds 18363 18363 100% 0.0 99% KX447516.1 Select seq gb|KU729218.1| Zika virus isolate BeH828305 polyprotein gene, complete cds 18363 18363 100% 0.0 99% KU729218.1 Select seq gb|KU707826.1| Zika virus isolate SSABR1, complete genome 18363 18363 100% 0.0 99% KU707826.1 Select seq gb|KU527068.1| Zika virus strain Natal RGN, complete genome 18363 18363 100% 0.0 99% KU527068.1 Select seq gb|KU365779.1| Zika virus strain BeH819966 polyprotein gene, complete cds 18363 18363 100% 0.0 99% KU365779.1 Select seq gb|KX879603.1| Zika virus isolate SN062, complete genome 18357 18357 100% 0.0 99% KX879603.1 Select seq gb|KX262887.1| Zika virus isolate 103451, complete genome 18357 18357 100% 0.0 99% KX262887.1 Select seq gb|KX197192.1| Zika virus isolate ZIKV/H.sapiens/Brazil/PE243/2015, complete genome 18357 18357 100% 0.0 99% KX197192.1 Select seq gb|KU926310.1| Zika virus isolate Rio-S1, complete genome 18357 18357 100% 0.0 99% KU926310.1 Select seq gb|KU926309.1| Zika virus isolate Rio-U1, complete genome 18357 18357 100% 0.0 99% KU926309.1 Select seq gb|KY014303.1| Zika virus isolate Zika virus/H.sapiens-wt/DOM/2016/BB-0127-SER polyprotein gene, complete cds 18354 18354 100% 0.0 99% KY014303.1 Select seq gb|KU940228.1| Zika virus isolate Bahia07, partial genome 18354 18354 100% 0.0 99% KU940228.1 Select seq gb|KX694534.1| Zika virus strain ZIKV/Homo sapiens/HND/R103451/2015, complete genome 18348 18348 100% 0.0 99% KX694534.1 Select seq gb|KX198135.1| Zika virus strain ZIKV/Homo sapiens/PAN/BEI-259634_V4/2016, complete genome 18348 18348 100% 0.0 99% KX198135.1 Select seq gb|KU501217.1| Zika virus strain 8375 polyprotein gene, complete cds 18348 18348 100% 0.0 99% KU501217.1 Select seq gb|KU365780.1| Zika virus strain BeH815744 polyprotein gene, complete cds 18348 18348 100% 0.0 99% KU365780.1 Select seq gb|KU647676.1| Zika virus strain MRS_OPY_Martinique_PaRi_2015 polyprotein gene, complete cds 18345 18345 100% 0.0 99% KU647676.1 Select seq gb|KU501216.1| Zika virus strain 103344 polyprotein gene, complete cds 18345 18345 100% 0.0 99% KU501216.1 Select seq gb|KU365777.1| Zika virus strain BeH818995 polyprotein gene, complete cds 18345 18345 100% 0.0 99% KU365777.1 Select seq gb|KY014297.1| Zika virus isolate Zika virus/H.sapiens-wt/BRA/2016/FC-6864-URI polyprotein gene, complete cds 18339 18339 100% 0.0 99% KY014297.1 Select seq gb|KX447517.1| Zika virus isolate 1_0038_PF polyprotein gene, complete cds 18339 18339 100% 0.0 99% KX447517.1 Select seq gb|KU758877.1| Zika virus isolate 17271 polyprotein gene, complete cds 18339 18339 100% 0.0 99% KU758877.1 Select seq gb|KX247646.1| Zika virus isolate Zika virus/Homo sapiens/COL/UF-1/2016, complete genome 18339 18339 100% 0.0 99% KX247646.1 Select seq gb|KX156776.1| Zika virus strain ZIKV/Homo sapiens/PAN/CDC-259364_V1-V2/2015, complete genome 18339 18339 100% 0.0 99% KX156776.1 Select seq gb|KX520666.1| Zika virus isolate HS-2015-BA-01 polyprotein gene, complete cds 18336 18336 100% 0.0 99% KX520666.1 Select seq gb|KX156774.1| Zika virus strain ZIKV/Homo sapiens/PAN/CDC-259359_V1-V3/2015, complete genome 18336 18336 100% 0.0 99% KX156774.1 Select seq gb|KU497555.1| Zika virus isolate Brazil-ZKV2015, complete genome 18336 18336 99% 0.0 99% KU497555.1 Select seq gb|KY014327.1| Zika virus isolate Zika virus/H.sapiens-wt/HND/2016/HU-ME167-PLA polyprotein gene, complete cds 18332 18332 100% 0.0 99% KY014327.1 Select seq gb|KU820897.5| Zika virus isolate FLR polyprotein gene, complete cds 18330 18330 100% 0.0 99% KU820897.5 Select seq gb|KX247632.1| Zika virus isolate MEX_I_7 polyprotein gene, complete cds 18330 18330 100% 0.0 99% KX247632.1 Select seq gb|KX156775.1| Zika virus strain ZIKV/Homo sapiens/PAN/CDC-259249_V1-V3/2015, complete genome 18330 18330 100% 0.0 99% KX156775.1 Select seq gb|KX087102.1| Zika virus strain ZIKV/Homo sapiens/COL/FLR/2015, complete genome 18330 18330 100% 0.0 99% KX087102.1 Select seq gb|KU365778.1| Zika virus strain BeH819015 polyprotein gene, complete cds 18330 18330 100% 0.0 99% KU365778.1 Select seq gb|KU312312.1| Zika virus isolate Z1106033 polyprotein gene, complete cds 18330 18330 100% 0.0 99% KU312312.1 Select seq gb|KY014315.1| Zika virus isolate Zika virus/H.sapiens-wt/HND/2016/HU-ME152-SER polyprotein gene, complete cds 18327 18327 100% 0.0 99% KY014315.1 Select seq gb|KU922960.1| Zika virus isolate MEX/InDRE/Sm/2016, complete genome 18327 18327 100% 0.0 99% KU922960.1 Select seq gb|KY014296.1| Zika virus isolate Zika virus/H.sapiens-wt/BRA/2016/FC-DQ131D1-URI polyprotein gene, complete cds 18321 18321 100% 0.0 99% KY014296.1 Select seq gb|KX806557.2| Zika virus isolate TS17-2016, complete genome 18321 18321 100% 0.0 99% KX806557.2 Select seq gb|KX856011.1| Zika virus strain ZIKV/Aedes sp./MEX_I-44/2016, complete genome 18321 18321 100% 0.0 99% KX856011.1 Select seq gb|KX548902.1| Zika virus isolate ZIKV/COL/FCC00093/2015 polyprotein gene, complete cds 18321 18321 100% 0.0 99% KX548902.1 Select seq gb|KX446951.1| Zika virus strain ZIKV/Aedes.sp/MEX/MEX_I-7/2016, complete genome 18321 18321 100% 0.0 99% KX446951.1 Select seq gb|KU937936.1| Zika virus isolate ZIKVNL00013 polyprotein gene, complete cds 18321 18321 100% 0.0 99% KU937936.1 Select seq gb|KU922923.1| Zika virus isolate MEX/InDRE/Lm/2016, complete genome 18321 18321 100% 0.0 99% KU922923.1 Select seq gb|KU501215.1| Zika virus strain PRVABC59, complete genome 18321 18321 100% 0.0 99% KU501215.1 Select seq gb|KX601168.1| Zika virus strain ZIKV/Homo Sapiens/PRI/PRVABC59/2015, complete genome 18318 18318 100% 0.0 99% KX601168.1 Select seq gb|KX446950.1| Zika virus strain ZIKV/Aedes.sp/MEX/MEX_2-81/2016, complete genome 18318 18318 100% 0.0 99% KX446950.1 Select seq gb|KX087101.2| Zika virus strain ZIKV/Homo sapiens/PRI/PRVABC59/2015, complete genome 18318 18318 100% 0.0 99% KX087101.2 Select seq gb|KU870645.1| Zika virus isolate FB-GWUH-2016, complete genome 18318 18318 100% 0.0 99% KU870645.1 Select seq gb|KX893855.1| Zika virus strain Zika virus/Homo sapiens/VEN/UF-2/2016, complete genome 18316 18316 100% 0.0 99% KX893855.1 Select seq gb|KX702400.1| Zika virus strain Zika virus/Homo sapiens/VEN/UF-1/2016, complete genome 18312 18312 100% 0.0 99% KX702400.1 Select seq gb|KX377337.1| Zika virus strain PRVABC-59, complete genome 18312 18312 100% 0.0 99% KX377337.1 Select seq gb|KX766029.1| Zika virus isolate R116265, complete genome 18303 18303 100% 0.0 99% KX766029.1 Select seq gb|KU820898.1| Zika virus isolate GZ01 polyprotein gene, complete cds 18303 18303 100% 0.0 99% KU820898.1 Select seq gb|KX056898.1| Zika virus isolate Zika virus/GZ02/2016 polyprotein gene, complete cds 18300 18300 100% 0.0 99% KX056898.1 Select seq gb|KU955590.1| Zika virus isolate Z16019 polyprotein gene, complete cds 18300 18300 100% 0.0 99% KU955590.1 Select seq gb|KX766028.1| Zika virus isolate R114916, complete genome 18296 18296 100% 0.0 99% KX766028.1 Select seq gb|KU740184.2| Zika virus isolate GD01 polyprotein gene, complete cds 18294 18294 100% 0.0 99% KU740184.2 Link to comment Share on other sites More sharing options...
niman Posted October 24, 2016 Author Report Share Posted October 24, 2016 LOCUS KY014321 10643 bp RNA linear VRL 21-OCT-2016 DEFINITION Zika virus isolate Zika virus/H.sapiens-wt/DOM/2016/BB-0115-SER polyprotein gene, complete cds. ACCESSION KY014321 VERSION KY014321.1 DBLINK BioProject: PRJNA344504 BioSample: SAMN05844988 KEYWORDS . SOURCE Zika virus ORGANISM Zika virus Viruses; ssRNA viruses; ssRNA positive-strand viruses, no DNA stage; Flaviviridae; Flavivirus. REFERENCE 1 (bases 1 to 10643) AUTHORS Baniecki,M.L., Barnes,K.G., Bosch,I., Freije,C.A., Gehrke,L., Gladden-Young,A.D., Gnirke,A., Luo,C.Y., MacInnis,B., Matranga,C.B., Metsky,H.C., Park,D.J., Qu,J., Sabeti,P.C., Tomkins-Tinch,C.H., West,K.L., Winnicki,S., Wohl,S. and Yozwiak,N.L. TITLE Direct Submission JOURNAL Submitted (20-OCT-2016) Viral Genomics, Infectious Disease Program (Infectious Disease Initiative), Broad Institute, 75 Ames St, Cambridge, MA 02142, USA COMMENT ##Assembly-Data-START## Assembly Method :: github.com/broadinstitute/viral-ngs v. v1.12.0-51-g8588fdb Assembly Name :: DOM_2016_BB-0115_SER-1 Coverage :: 58x Sequencing Technology :: Illumina; Swift LC ##Assembly-Data-END## FEATURES Location/Qualifiers source 1..10643 /organism="Zika virus" /mol_type="genomic RNA" /isolate="Zika virus/H.sapiens-wt/DOM/2016/BB-0115-SER" /isolation_source="serum" /host="Homo sapiens" /db_xref="taxon:64320" /country="Dominican Republic: Santo Domingo" /collection_date="11-Apr-2016" /collected_by="Hospital General de la Plaza de la Salud, Santo Domingo, Dominican Republic" 5'UTR <1..88 CDS 89..10360 /note="contains structural and nonstructural proteins" /codon_start=1 /product="polyprotein" /protein_id="AOY08542.1" /translation="MKNPKKKSGGFRIVNMLKRGVARVSPFGGLKRLPAGLLLGHGPI RMVLAILAFLRFTAIKPSLGLINRWGSVGKKEAMEIIKKFKKDLAAMLRIINARKEKK RRGADTSVGIVGLLLTTAMAAEVTRRGSAYYMYLDRNDAGEAISFPTTLGMNKCYIQI MDLGHMCDATMSYECPMLDEGVEPDDVDCWCNTTSTWVVYGTCHHKKGEARRSRRAVT LPSHSTRKLQTRSQTWLESREYTKHLIRVENWIFRNPGFALAAAAIAWLLGSSTSQKV IYLVMILLIAPAYSIRCIGVSNRDFVEGMSGGTWVDVVLEHGGCVTVMAQDKPTVDIE LVTTTVSNMAEVRSYCYEASISDMASDSRCPTQGEAYLDKQSDTQYVCKRTLVDRGWG NGCGLFGKGSLVTCAKFACSKKMTGKSIQPENLEYRIMLSVHGSQHSGMIVNDTGHET DENRAKVEITPNSPRAEATLGGFGSLGLDCEPRTGLDFSDLYYLTMNNKHWLVHKEWF HDIPLPWHAGADTGTPHWNNKEALVEFKDAHAKRQTVVVLGSQEGAVHTALAGALEAE MDGAKGRLSSGHLKCRLKMDKLRLKGVSYSLCTAAFTFTKIPAETLHGTVTVEVQYAG TDGPCKVPAQMAVDMQTLTPVGRLITANPVITESTENSKMMLELDPPFGDSYIVIGVG EKKITHHWHRSGSTIGKAFEATVRGAKRMAVLGDTAWDFGSVGGALNSLGKGIHQIFG AAFKSLFGGMSWFSQILIGTLLMWLGLNTKNGSISLMCLALGGVLIFLSTAVSADVGC SVDFSKKETRCGTGVFVYNDVEAWRDRYKYHPDSPRRLAAAVKQAWEDGICGISSVSR MENIMWRSVEGELNAILEENGVQLTVVVGSVKNPMWRGPQRLPVPVNELPHGWKAWGK SYFVRAAKTNNSFVVDGDTLKECPLKHRAWNSFLVEDHGFGVFHTSVWLKVREDYSLE CDPAVIGTAVKGKEAVHSDLGYWIESEKNDTWRLKRAHLIEMKTCEWPKSHTLWTDGI EESDLIIPKSLAGPLSHHNTREGYRTQMKGPWHSEELEIRFEECPGTKVHVEETCGTR GPSLRSTTASGRVIEEWCCRECTMPPLSFRAKDGCWYGMEIRPRKEPESNLVRSVVTA GSTDHMDHFSLGVLVILLMVQEGLKKRMTTKIIISTSMAVLVAMILGGFSMSDLAKLA ILMGATFAEMNTGGDVAHLALIAAFKVRPALLVSFIFRANWTPRESMLLALASCLLQT AISALEGDLMVLINGFALAWLAIRAMVVPRTDNITLAILAALTPLARGTLLVAWRAGL ATCGGFMLLSLKGKGSVKKNLPFVMALGLTAVRLVDPINVVGLLLLTRSGKRSWPPSE VLTAVGLICALAGGFAKADIEMAGPMAAVGLLIVSYVVSGKSVDMYIERAGDITWEKD AEVTGNSPRLDVALDESGDFSLVEDDGPPMREIILKVVLMTICGMNPIAIPFAAGAWY VYVKTGKRSGALWDVPAPKEAKKGETTDGVYRVMTRRLLGSTQVGVGVMQEGVFHTMW HVTKGSALRSGEGRLDPYWGDVKQDLVSYCGPWKLDAAWDGHSEVQLLAVPPGERARN IQTLPGIFKTKDGDIGAVALDYPAGTSGSPILDKCGRVIGLYGNGVVIKNGSYVSAIT QGRREEETPVECFEPSMLKKKQLTVLDLHPGAGKTRRVLPEIVREAIKTRLRTVILAP TRVVAAEMEEALRGLPVRYMTTAVNVTHSGTEIVDLMCHATFTSRLLQPIRVPNYNLY IMDEAHFTDPSSIAARGYISTRVEMGEAAAIFMTATPPGTRDAFPDSNSPIMDTEVEV PERAWSSGFDWVTDHSGKTVWFVPSVRNGNEIAACLTKAGKRVIQLSRKTFETEFQKT KHQEWDFVVTTDISEMGANFKADRVIDSRRCLKPVILDGERVILAGPMPVTHASAAQR RGRIGRNPNKPGDEYLYGGGCAETDEDHAHWLEARMLLDNIYLQDGLIASLYRPEADK VAAIEGEFKLRTEQRKTFVELMKRGDLPVWLAYQVASAGITYTDRRWCFDGTTNNTIM EDSVPAEVWTRHGEKRVLKPRWMDARVCSDHAALKSFKEFAAGKRGAAFGVMEALGTL PGHMTERFQEAIDNLAVLMRAETGSRPYKAAAAQLPETLETIMLLGLLGTVSLGIFFV LMRNKGIGKMGFGMVTLGASAWLMWLSEIEPARIACVLIVVFLLLVVLIPEPEKQRSP QDNQMAIIIMVAVGLLGLITANELGWLERTKSDLSHLMGRREEGATIGFSMDIDLRPA SAWAIYAALTTFITPAVQHAVTTSYNNYSLMAMATQAGVLFGMGKGMPFYAWDFGVPL LMIGCYSQLTPLTLIVAIILLVAHYMYLIPGLQAAAARAAQKRTAAGIMKNPVVDGIV VTDIDTMTIDPQVEKKMGQVLLIAVAVSSAILSRTAWGWGEAGALITAATSTLWEGSP NKYWNSSTATSLCNIFRGSYLAGASLIYTVTRNAGLVKRRGGGTGETLGEKWKARLNQ MSALEFYSYKKSGITEVCREEARRALKDGVATGGHAVSRGSAKLRWLVERGYLQPYGK VIDLGCGRGGWSYYAATIRKVQEVKGYTKGGPGHEEPVLVQSYGWNIVRLKSGVDVFH MAAEPCDTLLCDIGESSSSPEVEEARTLRVLSMVGDWLEKRPGAFCIKVLCPYTSTMM ETLERLQRRYGGGLVRVPLSRNSTHEMYWVSGAKSNTIKSVSTTSQLLLGRMDGPRRP VKYEEDVNLGSGTRAVVSCAEAPNMKIIGNRIERIRSEHAETWFFDENHPYRTWAYHG SYEAPTQGSASSLVNGVVRLLSKPWDVVTGVTGIAMTDTTPYGQQRVFKEKVDTRVPD PQEGTRQVMSMVSSWLWQELGKHKRPRVCTKEEFINKVRSNAALGAIFEEEKEWKTAV EAVNDPRFWALVDKEREHHLRGECQSCVYNMMGKREKKQGEFGKAKGSRAIWYMWLGA RFLEFEALGFLNEDHWMGRENSGGGVEGLGLQRLGYVLEEMSRIPGGRMYADDTAGWD TRISRFDLENEALITNQMEKGHRALALAIIKYTYQNKVVKVLRPAEKGKTVMDIISRQ DQRGSGQVVTYALNTFTNLVVQLIRNMEAEEVLEMQDLWLLRRSEKVTNWLQSNGWDR LKRMAVSGDDCVVKPIDDRFAHALRFLNDMGKVRKDTQEWKPSTGWDNWEEVPFCSHH FNKLHLKDGRSIVVPCRHQDELIGRARVSPGAGWSIRETACLAKSYAQMWQLLYFHRR DLRLMANAICSSVPVDWVPTGRTTWSIHGKGEWMTTEDMLVVWNRVWIEENDHMEDKT PVTKWTDIPYLGKREDLWCGSLIGHRPRTTWAENIKNTVNMVRRIIGEEEKYMDYLST QVRYLGEEGSTPGVL" 3'UTR 10361..>10643 ORIGIN 1 tcagactgcg acagttcgag tttgaagcga aagctagcaa cagtatcaac aggttttatt 61 ttggatttgg aaacgagagt ttctggtcat gaaaaaccca aaaaagaaat ccggaggatt 121 ccggattgtc aatatgctaa aacgcggagt agcccgtgtg agcccctttg ggggcttgaa 181 gaggctgcca gccggacttc tgctgggtca tgggcccatc aggatggtct tggcgattct 241 agcctttttg agattcacgg caatcaagcc atcactgggt ctcatcaata gatggggttc 301 agtggggaaa aaagaggcta tggaaataat aaagaagttc aagaaagatc tggctgccat 361 gctgagaata atcaatgcta ggaaggagaa gaagagacga ggcgcagata ctagtgtcgg 421 aattgttggc ctcctgctga ccacagctat ggcagcggag gtcactagac gtgggagtgc 481 atactacatg tacttggaca gaaacgatgc tggggaggcc atatcttttc caaccacatt 541 ggggatgaat aagtgttata tacagatcat ggatcttgga cacatgtgtg atgccaccat 601 gagctatgaa tgccctatgc tggatgaggg ggtggaacca gatgacgtcg attgttggtg 661 caacacgacg tcaacttggg ttgtgtacgg aacctgccat cacaaaaaag gtgaagcacg 721 gagatctaga agagctgtga cgctcccctc ccattccact aggaagctgc aaacgcggtc 781 gcaaacctgg ttggaatcaa gagaatacac aaagcacttg attagagtcg aaaattggat 841 attcaggaac cctggcttcg cgttagcagc agctgccatc gcttggcttt tgggaagctc 901 aacgagccaa aaagtcatat acttggtcat gatactgctg attgccccgg catacagcat 961 caggtgcata ggagtcagca atagggactt tgtggaaggt atgtcaggtg ggacttgggt 1021 tgatgttgtc ttggaacatg gaggttgtgt caccgtaatg gcacaggaca aaccgactgt 1081 cgacatagag ctggttacaa caacagtcag caacatggcg gaggtaagat cctactgcta 1141 tgaggcatca atatcagaca tggcttcgga cagccgctgc ccaacacaag gtgaagccta 1201 ccttgacaag caatcagaca ctcaatatgt ctgcaaaaga acgttagtgg acagaggctg 1261 gggaaatgga tgtggacttt ttggcaaagg gagcctggtg acatgcgcta agtttgcatg 1321 ctccaagaaa atgaccggga agagcatcca gccagagaat ctggagtacc ggataatgct 1381 gtcagttcat ggctcccagc acagtggaat gatcgttaat gacacaggac atgaaactga 1441 tgagaataga gcgaaggttg agataacgcc caattcacca agagccgaag ccaccctggg 1501 gggttttgga agcctaggac ttgattgtga accgaggaca ggccttgact tttcagattt 1561 gtattacttg actatgaata acaagcactg gttggttcac aaggagtggt tccacgacat 1621 tccattacct tggcacgctg gggcagacac cggaactcca cactggaaca acaaagaagc 1681 actggtagag ttcaaggacg cacatgccaa aaggcaaact gtcgtggttc tagggagtca 1741 agaaggagca gttcacacgg cccttgctgg agctctggag gctgagatgg atggtgcaaa 1801 gggaaggctg tcctctggcc acttgaaatg tcgcctgaaa atggataaac ttagattgaa 1861 gggcgtgtca tactccttgt gtaccgcagc gttcacattc accaagatcc cggctgaaac 1921 actgcacggg acagtcacag tggaggtaca gtacgcaggg acagatggac cttgcaaggt 1981 tccagctcag atggcggtgg acatgcaaac tctgacccca gttgggaggt tgataaccgc 2041 caaccccgta atcactgaaa gcactgagaa ctctaagatg atgctggaac ttgatccacc 2101 atttggggac tcttacattg tcataggagt cggggagaag aagatcaccc accactggca 2161 caggagtggc agcaccattg gaaaagcatt tgaagccact gtgagaggtg ccaagagaat 2221 ggcagtcttg ggagacacag cctgggactt tggatcagtt ggaggcgctc tcaactcatt 2281 gggcaagggc atccatcaaa tttttggagc agctttcaaa tcattgtttg gaggaatgtc 2341 ctggttctca caaatcctca ttggaacgtt gctgatgtgg ttgggtctga acacaaagaa 2401 tggatctatt tccctcatgt gcttggcctt agggggagtg ttgatcttct tatccacagc 2461 cgtctctgct gatgtggggt gctcggtgga cttctcaaag aaggagacga gatgcggtac 2521 aggggtgttc gtctataacg acgttgaagc ctggagggac aggtacaagt accatcctga 2581 ctccccccgt agattggcag cagcagtcaa gcaagcctgg gaagatggta tctgcgggat 2641 ctcctctgtt tcaagaatgg aaaacatcat gtggagatca gtagaagggg agctcaatgc 2701 aatcctggaa gagaatggag ttcaactgac ggtcgttgtg ggatctgtaa aaaaccccat 2761 gtggagaggt ccacagagat tgcccgtgcc tgtgaacgag ctgccccacg gctggaaggc 2821 ttgggggaaa tcgtacttcg ttagagcagc aaagacaaat aacagctttg tcgtggatgg 2881 tgacacactg aaggaatgcc cactcaaaca tagagcatgg aacagctttc ttgtggagga 2941 tcatgggttc ggggtatttc acactagtgt ctggctcaag gttagagaag attattcatt 3001 agagtgtgat ccagccgtta ttggaacagc tgttaaggga aaggaggctg tacacagtga 3061 tctaggctac tggattgaga gtgagaagaa tgacacatgg aggctgaaga gggcccatct 3121 gatcgagatg aaaacatgtg aatggccaaa gtcccacaca ttgtggacag atggaataga 3181 agagagtgat ctgatcatac ccaagtcttt agctgggcca ctcagccatc acaataccag 3241 agagggctac aggacccaaa tgaaagggcc atggcacagt gaagagcttg aaattcggtt 3301 tgaggaatgc ccaggcacta aggtccacgt ggaggaaaca tgtggaacaa gaggaccatc 3361 tctgagatca accactgcaa gcggaagggt gatcgaggaa tggtgctgca gggagtgcac 3421 aatgccccca ctgtcgttcc gggctaaaga tggctgttgg tatggaatgg agataaggcc 3481 caggaaagaa ccagaaagca acttagtaag gtcagtggtg actgcaggat caactgatca 3541 catggatcac ttctcccttg gagtgcttgt gattctgctc atggtgcagg aagggctgaa 3601 gaagagaatg accacaaaga tcatcataag cacatcaatg gcagtgctgg tagctatgat 3661 cctgggagga ttttcaatga gtgacctggc taagcttgca attttgatgg gtgccacctt 3721 cgcggaaatg aacactggag gagatgtagc tcatctggcg ctgatagcgg cattcaaagt 3781 cagaccagcg ttgctggtat ctttcatctt cagagctaat tggacacccc gtgaaagcat 3841 gctgctggcc ttggcctcgt gtcttttgca aactgcgatc tccgccttgg aaggcgacct 3901 gatggttctc atcaatggtt ttgctttggc ctggttggca atacgagcga tggttgttcc 3961 acgcactgat aacatcaccc tggcaatcct ggctgctctg acaccactgg cccggggcac 4021 actgcttgtg gcgtggagag caggccttgc tacttgcggg gggtttatgc tcctctctct 4081 gaagggaaaa ggcagtgtga agaagaactt accatttgtc atggccctgg gactaaccgc 4141 tgtgaggctg gtcgacccca tcaacgtggt gggactgctg ttgctcacaa ggagtgggaa 4201 gcggagctgg ccccctagcg aagtactcac agctgttggc ctgatatgcg cattggctgg 4261 agggttcgcc aaggcagata tagagatggc tgggcccatg gccgcggtcg gtctgctaat 4321 tgtcagttac gtggtctcag gaaagagtgt ggacatgtac attgaaagag caggtgacat 4381 cacatgggaa aaagatgcgg aagtcactgg aaacagtccc cggctcgatg tggcgctaga 4441 tgagagtggt gatttctccc tggtggagga tgacggtccc cccatgagag agatcatact 4501 caaggtggtc ctgatgacca tctgtggcat gaacccaata gccataccct ttgcagctgg 4561 agcgtggtac gtatacgtga agactggaaa aaggagtggt gctctatggg atgtgcctgc 4621 tcccaaggaa gcaaaaaagg gggagaccac agatggagtg tacagagtaa tgactcgtag 4681 actgctaggt tcaacacaag ttggagtggg agttatgcaa gaaggggtct ttcacactat 4741 gtggcacgtc acaaaaggat ccgcgctgag aagcggtgaa gggagacttg atccatactg 4801 gggagatgtc aagcaggatc tggtgtcata ctgtggtcca tggaagctag atgccgcctg 4861 ggacgggcac agcgaggtgc agctcttggc cgtgcccccc ggagagagag cgaggaacat 4921 ccagactctg cccggaatat ttaagacaaa ggatggggac attggagcgg ttgcgctgga 4981 ttacccagca ggaacttcag gatctccaat cctagacaag tgtgggagag tgataggact 5041 ttatggcaat ggggtcgtga tcaaaaatgg gagttatgtt agtgccatca cccaagggag 5101 gagggaggaa gagactcctg ttgagtgctt cgagccttcg atgctgaaga agaagcagct 5161 aactgtctta gacttgcatc ctggagctgg gaaaaccagg agagttcttc ctgaaatagt 5221 ccgtgaagct ataaaaacaa gactccgtac tgtgatctta gctccaacca gggttgtcgc 5281 tgctgaaatg gaggaagccc ttagagggct tccagtgcgt tatatgacaa cagcagtcaa 5341 tgtcacccat tctggaacag aaatcgtcga cttaatgtgc catgccacct tcacttcacg 5401 tctactacag ccaatcagag tccccaacta taatctgtat attatggatg aggcccactt 5461 cacagatccc tcaagtatag cagcaagagg atacatttca acaagggttg agatgggcga 5521 ggcggctgcc atcttcatga ccgccacgcc accaggaacc cgtgacgcat ttccggactc 5581 caactcacca attatggaca ccgaagtgga agtcccagag agagcctgga gctcaggctt 5641 tgattgggtg acggatcatt ctggaaaaac agtttggttt gttccaagcg tgaggaatgg 5701 caatgagatc gcagcttgtc tgacaaaggc tggaaaacgg gtcatacagc tcagcagaaa 5761 gacttttgag acagagttcc agaaaacaaa acatcaagag tgggactttg tcgtgacaac 5821 cgacatttca gagatgggcg ccaactttaa agctgaccgt gtcatagatt ccaggagatg 5881 cctaaagccg gtcatacttg atggcgagag agtcattctg gctggaccca tgcctgtcac 5941 acatgccagc gctgcccaga ggagggggcg cataggcagg aatcccaaca aacctggaga 6001 tgagtatctg tatggaggtg ggtgcgcaga gactgacgaa gaccatgcac actggcttga 6061 agcaagaatg ctccttgaca atatttacct ccaagatggc ctcatagcct cgctctatcg 6121 acctgaggcc gacaaagtag cagccattga gggagagttc aagcttagga cggagcaaag 6181 gaagaccttt gtggaactca tgaaaagagg agatcttcct gtttggctgg cctatcaggt 6241 tgcatctgcc ggaataactt acacagatag aagatggtgc tttgatggca cgaccaacaa 6301 caccataatg gaagacagtg tgccggcaga ggtgtggacc agacacggag agaaaagagt 6361 gctcaaaccg aggtggatgg acgccagagt ttgttcagat catgcggccc tgaagtcatt 6421 caaggagttt gccgctggga aaagaggagc ggcttttgga gtgatggaag ccctgggaac 6481 actgccagga cacatgacag agagattcca ggaagccatt gacaacctcg ctgtgctcat 6541 gcgggcagag actggaagca ggccttacaa agccgcggcg gcccaattgc cggagaccct 6601 agagaccatt atgcttttgg ggttgctggg aacagtctcg ctgggaatct ttttcgtctt 6661 gatgaggaac aagggcatag ggaagatggg ctttggaatg gtgactcttg gggccagcgc 6721 atggctcatg tggctctcgg aaattgagcc agccagaatt gcatgtgtcc tcattgttgt 6781 gttcctattg ctggtggtgc tcatacctga gccagaaaag caaagatctc cccaggacaa 6841 ccaaatggca atcatcatca tggtagcagt aggtcttctg ggcttgatca ccgccaatga 6901 actcggatgg ttggagagaa caaagagtga cctaagccat ctaatgggaa ggagagagga 6961 gggagcaacc ataggattct caatggacat tgacctgcgg ccagcctcag cttgggccat 7021 ctatgctgcc ttgacaactt tcattacccc agccgtccaa catgcagtga ccacttcata 7081 caacaactac tccttaatgg cgatggccac gcaagctgga gtgttgtttg gtatgggcaa 7141 agggatgcca ttctacgcat gggactttgg agtcccgctg ctaatgatag gttgctactc 7201 acaattaaca cccctgaccc taatagtggc catcattttg ctcgtggcgc actacatgta 7261 cttgatccca gggctgcagg cagcagctgc gcgtgctgcc cagaagagaa cggcagctgg 7321 catcatgaag aaccctgttg tggatggaat agtggtgact gacattgaca caatgacaat 7381 tgacccccaa gtggagaaaa agatgggaca ggtgctactc atagcagtag ccgtctccag 7441 cgccatactg tcgcggaccg cctgggggtg gggggaggct ggggccctga tcacagccgc 7501 aacttccact ttgtgggaag gctctccgaa caagtactgg aactcctcta cagccacttc 7561 actgtgtaac atttttaggg gaagttactt ggctggagct tctctaatct acacagtaac 7621 aagaaacgct ggcttggtca agagacgtgg gggtggaaca ggagagaccc tgggagagaa 7681 atggaaggcc cgcttgaacc agatgtcggc cctggagttc tactcctaca aaaagtcagg 7741 catcaccgag gtgtgcagag aagaggcccg ccgcgccctc aaggacggtg tggcaacggg 7801 aggccatgct gtgtcccgag gaagtgcaaa gctgagatgg ttggtggagc ggggatacct 7861 gcagccctat ggaaaggtca ttgatcttgg atgtggcaga gggggctgga gttactacgc 7921 cgccaccatc cgcaaagttc aagaagtgaa aggatacaca aaaggaggcc ctggtcatga 7981 agaacccgtg ttggtgcaaa gctatgggtg gaacatagtc cgtctcaaga gtggggtgga 8041 cgtctttcat atggcggctg agccgtgtga cacgttgctg tgtgacatag gtgagtcatc 8101 atctagtcct gaagtggaag aagcacggac gctcagagtc ctctccatgg tgggggattg 8161 gcttgaaaaa agaccaggag ccttttgtat aaaagtgttg tgcccataca ccagcactat 8221 gatggaaacc ctggagcgac tgcagcgtag gtatggggga ggactggtca gagtgccact 8281 ctcccgcaac tctacacatg agatgtactg ggtctctgga gcgaaaagca acaccataaa 8341 aagtgtgtcc accacgagcc agctcctctt ggggcgcatg gacgggccta ggaggccagt 8401 gaaatatgag gaggatgtga atctcggctc tggcacgcgg gctgtggtaa gctgcgctga 8461 agctcccaac atgaagatca ttggtaaccg cattgaaagg atccgcagtg agcacgcgga 8521 aacgtggttc tttgacgaga accacccata taggacatgg gcttaccatg gaagctatga 8581 ggcccccaca caagggtcag catcctctct agtaaacggg gttgtcaggc tcctgtcaaa 8641 accctgggat gtggtgactg gagtcacagg aatagccatg accgacacca caccgtatgg 8701 tcagcaaaga gttttcaagg aaaaagtgga cactagggtg ccagaccccc aagaaggcac 8761 tcgtcaggtt atgagcatgg tctcttcctg gttgtggcaa gagctaggca aacacaaacg 8821 gccacgagtc tgtaccaaag aagagttcat caacaaggtt cgtagcaatg cagcattagg 8881 ggcaatattt gaagaggaaa aagagtggaa gactgcagtg gaagctgtga acgatccaag 8941 gttctgggct ctagtggaca aggaaagaga gcaccacctg agaggagagt gccagagttg 9001 tgtgtacaac atgatgggaa aaagagaaaa gaaacaaggg gaatttggaa aggccaaggg 9061 cagccgcgcc atctggtata tgtggctagg ggctagattt ctagagttcg aagcccttgg 9121 attcttgaac gaggatcact ggatggggag agagaactca ggaggtggtg ttgaagggct 9181 gggattacaa agactcggat atgtcctaga agagatgagt cgcataccag gaggaaggat 9241 gtatgcagat gacactgctg gctgggatac ccgcatcagc aggtttgatc tagagaatga 9301 agctctaatc accaaccaaa tggagaaagg gcacagggcc ttggcattgg ccataatcaa 9361 gtacacatac caaaacaaag tggtaaaggt ccttagacca gctgaaaaag ggaaaacagt 9421 tatggacatt atttcgagac aagaccaaag ggggagcgga caagttgtca cttacgctct 9481 caacacattt accaacctag tggtgcaact cattcggaat atggaggctg aggaagttct 9541 agagatgcaa gacttgtggc tgctgcggag gtcagagaaa gtgaccaact ggttgcagag 9601 caacggatgg gataggctca aacgaatggc agtcagtgga gatgattgcg ttgtgaagcc 9661 aattgatgat aggtttgcac atgccctcag gttcttgaat gatatgggaa aagttaggaa 9721 ggacacacaa gagtggaaac cctcaactgg atgggacaac tgggaagaag ttccgttttg 9781 ctcccaccac ttcaacaagc tccatctcaa ggacgggagg tccattgtgg ttccctgccg 9841 ccaccaagat gaactgattg gccgggcccg cgtctctcca ggggcgggat ggagcatccg 9901 ggagactgct tgcctagcaa aatcatatgc gcaaatgtgg cagctccttt atttccacag 9961 aagggacctc cgattgatgg ccaatgccat ttgttcatct gtgccagttg actgggttcc 10021 aactgggaga actacctggt caatccatgg aaagggagaa tggatgacca ctgaagacat 10081 gcttgtggtg tggaacagag tgtggattga ggagaacgac cacatggaag acaagacccc 10141 agttacgaaa tggacagaca ttccctattt gggaaaaagg gaagacttgt ggtgtggatc 10201 tctcataggg cacagaccgc gcaccacctg ggctgagaac attaaaaaca cagtcaacat 10261 ggtgcgcagg atcataggtg aggaagaaaa gtacatggac tacctatcca cccaagttcg 10321 ctacttgggt gaagaagggt ctacacctgg agtgctgtaa gcaccaatct taatgttgtc 10381 aggcctgcta gtcagccaca gcttggggaa agctgtgcag cctgtgaccc ccccaggaga 10441 agctgggaaa ccaagcctat ggtcaggccg agaacgccat ggcacggaag aagccatgct 10501 gcctgtgagc ccctcagagg acactgagtc aaaaaacccc acgcgcttgg aggcgcagga 10561 tgggaaaaga aggtggcgac cttccccacc cttcaatctg gggcctgaac tggagatcag 10621 ctgtggatct ccagaagagg gac Link to comment Share on other sites More sharing options...
Recommended Posts
Please sign in to comment
You will be able to leave a comment after signing in
Sign In Now