Entry information : DpePxd01
Entry ID 7653
Creation 2010-10-22 (Marcel Zamocky)
Last sequence changes 2016-02-17 (Christophe Dunand)
Sequence status complete
Reviewer Achraf Jemmat
Last annotation changes 2016-02-17 (Achraf Jemmat)
Peroxidase information: DpePxd01
Name DpePxd01
Class Peroxidasin    [Orthogroup: Pxd001]
Taxonomy Eukaryota Metazoa Arthropoda Insecta Drosophilidae Drosophila
Organism Drosophila persimilis    [TaxId: 7234 ]
Cellular localisation N/D
Tissue type N/D
Inducer N/D
Repressor N/D
Best BLASTp hits
Perox score E-value DpePxd01
start..stop
S start..stop
DpspsPxd01 3164 0 1..1534 1..1529
DmPxd-A 2699 0 15..1530 18..1527
DerPxd01 2692 0 19..1530 21..1526
DyaPxd01 2679 0 19..1530 21..1528
Gene structure Fichierperl './assets/cgi-bin/draw_exon.pl' '7653' 'join(1342060..1342220,1360876..1361091,1361203..1361274,1361353..1361594,1361672..1362343,1363964..1364173,1364293..1364471,1364612..1364793,1368755..1371425)' Exons
ExonStart..EndSize ExonStart..EndSize ExonStart..EndSize ExonStart..EndSize
N° 1 1342060..1342220 159 N° 2 1360876..1361091 214 N° 3 1361203..1361274 70 N° 4 1361353..1361594 240
N° 5 1361672..1362343 670 N° 6 1363964..1364173 208 N° 7 1364293..1364471 177 N° 8 1364612..1364793 180
N° 9 1368755..1371425 2669  
join(1342060..1342220,1360876..1361091,1361203..1361274,1361353..1361594,1361672 ..1362343,1363964..1364173,1364293..1364471,1364612..1364793,1368755..1371425)


exon

Literature and cross-references DpePxd01
Literature Drosophila 12 genomes consortium (2007) Evolution of genes and genomes on the Drosophila phylogeny. Nature 450:203-218.
Protein ref. UniProtKB:   B4H1J9
DNA ref. GenBank:   CH479202.1 (1342060..1371425)
mRNA ref. GenBank:   XM_002024692.1
Protein sequence: DpePxd01
Sequence Properties
first value : protein
second value (mature protein)
Length (aa):   %s   1534 (1514)
PWM (Da):   %s   171492 (169078.8)  
PI (pH):   %s   6.18 (6.15) Peptide Signal:   %s   cut: 21 range:21-1534
Sequence
Send to BLAST
Send to Peroxiscan
.........1.........2.........3.........4.........5.........6.........7.........8.........9.........0.........1.........2
MWWRGVLLFHLFLLAGWSEAAYCPTGCNCYERTVRCIRAKRTTTP
QVPYDTQV
LDLRFNHFEEVPADAFRGMGQLSTLFLNENELAHLQDGAFQGLLALRFLYLNNNRLSRLPAAIFQGLPRVEAIYLENNDIFQLPAGVFDNLPRLNRLFLYNNKLTQLPVEGFNKLNSLKR
LRLDGNAIDCNCGVYSLWRRWHLDAQRQLVTISLTCAEPQALQRQSFASLQEQHFK
AKPNLLVAPQDLQTFAGESVQLDCEVTGLPKPQITWMHNTNEVGEDQVNREILLSGSLLIRSVA
TTDMGIYQCLARNEMGEVRSQPIRLVVSSSSSSNRNPLDNPHIDPRSNQVWADADGNANADAGGATPTPPSFTHQPHDQIVALHGAGHVLLDCAASGWPQPDIQWFVNGRQLAQSTASLQ
LQANGSLLLLQPTQLTAGTYRCEASNRLGTVQATARVEVK
DLPEILMAPQNQTIKLGKAFVLECDADGNPLPTIDWQFNGSPLASTPAGDLLLENENTELVVSAARQDHAGVYRCTARNE
NGETSAEATIKVERSQSPPRVAIEPSNLVAITGTTIELPCQAEQPEVGL
QISWRRDGRLIDPNVQLTEKYQISGAGSLFVKNVTILDGGRYECQLKNEFGRASASALVTRNNVDLAPGDR
YVRIAFAEAAKEIDLAINNTLDTLFSNRSSTGPPNYGELLRVFRFPTGEARQLARAAEIYERTLVNIRKHVQRGDNLSMSSEEYEFRDLLSREHLHLVAELSGCMEHREMPNCTDMCYHS
RYRSIDGTCNNLMHPTWGASLTAFRRLAPPIYENGFSMPVGWTKGQLYAGHPKPSARLVSTSVVATKEITPDSRITHMVMQWGQFLDHDLDHAIPSVSSESWDGIDCKKSCEMAPPCYPI
EVPPNDPRVRNRRCIDVVRSSAICGSGMTSLFFDSVQHREQINQLTSYIDASQVYGYSTPFAQELRNLTADEGLLRVGVHFPKQKDMLPFAAPQDGMDCRRNLDENTMSCFVSGDIRVNE
QVGLLAMHTIWMREHNRLATKLREINPHWDGDTLYQEARKIVGAQMQHITFKQWLPLIIGESGMQLLGEYKGYNPQLNPSIANEFATAALRFGHTIINPILHRLNETFQPIPQGHLLLHK
AFFAPWRLAYEGGVDPLLRGMLAVPAKLKTPDQNLNTELTEKLFQATHAVALDLAAINIQRGRDHGIPGYNVYRKFCNLSVAEDFEDLSDISNAGIRQKMKELYGHPDNVDVWLGGILED
QVEGGKVGPLFQCLLVEQFRRLRDGDRLYYENPGVFLPEQLVQIKQANFGRVLCDVGDNFDQVTENVFILAKHQGGYKKCEDIPGINLYLWQDCGNCNSMPTIFDSYIPQTYTKRSSRQK
RDLRQPKEKEQEEVPATESYDSPLEALYDVNEERVSGLEELIGIFQKELKKLHKKLRKLEDSCNAVDAEPVAQVVQLAPAPAPVAPKPRRSHCVDDKGTTRLNNEVWSPDVCTKCNCFHG
QVNCLREKCGEVSCPPGIDPLTPPEACCPHCPMLKGELP

Retrieve as FASTA  
Remarks Complete sequence from genomic (8 introns). no EST.
DNA
Send to BLAST
.........1.........2.........3.........4.........5.........6.........7.........8.........9.........0.........1.........2
ATGTGGTGGCGAGGAGTGCTCCTGTTCCACCTGTTCCTGCTGGCGGGCTGGTCGGAGGCCGCCTACTGTCCGACGGGATGCAACTGCTACGAGCGCACCGTGCGCTGCATTCGCGCGAAG
CGCACGACCACTCCGCAAGTGCCCTACGACACCCAAGTTCT
CTGTGAGTATTCCTCTTCATCCTTTTCCGGGGGATTGGGGCTCTGTTCTGCATAAATCATTCTGGCTGTTGGCTTTCCA
AATTCCAAATTCCATAAATTAGTTTTTAATGTTTTCCGTTGGCCCTCTGCAGGCGGCAGTGGCGTTAATTTGATGCCCTGCCCCATCTGTGCCAGCATCATCTTTATCTCCATGCATCCA
ACCTAACCTCCCACCACCATCAACACCCCCACCCCCACCATCGTGTTGACTCAATTAAACGGATGTCTAGAGAGAGAGCTGCCTGGGTTCTGGGCTCTGGGTTCTGGATTCTGTGTTCTG
TGCTCTGTGCTCTGTGCCCTGGGATTTGTATATTTATCGCACGCCATGCCATGCCCCACACAAAGCGATTGGAGGCGACGCAAACACAACGCGACTACGCGACTCGCAATATTTGCATGC
ACGCGGAATACTTCTGACGTTGCATTTATTTTCCGAAATGTCAGCAGCTTTCGCATGAATTTCGGGAGCCACACCACCAGAATCCGAATCCTCGAAGCGGATTTGTGGGTGGATGGAGCG
GCGGATAACTTGGCAAATGTTTGTGGAGCATATGCATATTTATATAGGCCGCGGCATGCTTTCTCCTCCTGCCCCTCCATTTGGGGCAGCATCTGTGCTGTGCTCTGCTGTGCTGTGCAA
CGGTAGCAGCTGCCACAGCCGGCGGCAAGTTGTCAGACGACAGATAGAAAGAGAGACAGCTCTGCTCTGGAGGTGGAATTTCTCCTGGTATTTATGGGGGAATTCAATTTTCATTGGGTT
AAGTTGTTGAGCGAACGATGGAGGATGTACACGAATAAACTGACAGATATTCACACATTCTGTCAGCCAGAGAATGGGCTTCTTCTGTACGGTGCTGCAGTGTGCTGCACAGTGGCTACA
GAGACAGATTTTACCGGATGAAAGATCTATACGGATCCAAGTGAAAGAAGAAGTTCTGAGCTTGAGGAAAAACCCAGTGATGGCCGATTTCAAATTGAAAACAAGGGGCAGGTCCACATT
CACCATCACCAGTGCAACATAGAGATACATATGTATCTACCAAATTCATCGAGAACCTCCTGCATTTCGGCAGTCATTCCCAGCCTTATCCCATAAGCTCCACAGACCTGGCATAAGCAA
AAAGAATTCTTTGTTTATCCATCATCCGAAGTCGCATACTGCAAATCCGCACTCCATATCATTCCTCTGATCCTCTACCATTTGTTGGCTCGCCGCACAACAATTTCTCAAAGAAGTGAA
ACCCCAAACCACACCCACCCCCACAAAAAAGGAGTAGCCACAGAGAGCACAGACGGAGGGAGGCACTTTTGGCAGTACTTAAATTAATCTTCATTTTAAATAAGGTGATGGCTGCATAAT
TTATCATTGCCCCGAGAGCTGAATGCACACAAAACACTCACACAACAATAGGAAAAATTCCAAAAGAAGGAGAAGGAGAGGGAGCAACAGAGACGGGGCGAGTGGTGGTGTGGATGCATT
CAGCAACTTTTCTGATTCCATTCTGATTCGCATTTCATTTATTCCGCATCCGACCCGTTGCACACTTTTAATTAATTTGTTTCATTTATAACGAGTACGAGTACGAGTACGACTACTGCA
TGTGTCTAAAAAAGTTTCCCTGGTTTTTCCTCCAGCTTGTTCTTCCCCTTTGCCCCCTTCCCCCTTCCATGTGGTGTGTGCTTTCCATGGGCAGCAGCCCACATGTTTATGTGCGGCGAG
TCGACTCGAGTCGGGTCAGATTTTCTCCATTTATATTTCATATTTTCTCAAAATGTTCACTCGAATGCCTCCTAAAATAAAATACAAAGAAAAAAAAAAACAAAATGCTCGCATGTACTC
GTACTAGTCTCCAGGCAGAGGCAGAGGCAGAGGCAGAGGCAGTGACAGACATTATATTTTTGTATTTGTTTGTGCTCTCCGATGCGCATGGAAATGTTTTTGCCTTAATTTTGTTAATGG
CCAAAATGTTGTTTATAATTTTCGTTGCTGGAAGATGCTTCGGGTCGGGGCTCTCGAGATAGATGTCGGACAGAAAATGAAATGTAAAGAAGCAAAGTCAAATAAAATAGTTTATTCAAA
CGAATTCCTTAATATTCATCTCAAAAATGTGTTTTATTTGAACTCCAGAAACCTCTTGGATCCACTATTGCCAAATCAAAGAATTCTTGGCTCAAAAGGCTACACGAAAATGTATCTTAA
GCCAGAAATTCTCCTGAAATTTGTTGTAGAAGAGATTGAGAGCCAGATATCAAATCTTTCGAAGCTTTGGGGTTCTATTTTTGCATATCCTGCAATGGTCTCACAGATAAACTGTTAAGA
TTTTAAGAGTATATTCAAATGAATGAGAGCTTCCATGTTATAGTCCTACTTCTAATGTGATATCTTGCCTGCAAGTCTACACTTGCTTCGGGGAAAATATCCTGTATATTGCTATCCGCA
CTTCATTGAAAAAGCCTTCCTCTCCAGCTGCTGCTGCTGCTGCAGCGCCAGCACCAACGCCAACGCCAGCTCAGGGCCTAAGCCACATTTCACTTCCTTGAAACTGTCCATCCATCAAGC
ATAAAGTCCTCTGCCTTTTTTTCAGCTTTTGCCTCCATTGAAAGTTCCGCTTTCAATGGCTGCTGGGTCGCAAAAAAGAACTGCAACTCACTCTGCCTGCCTCTGCCCCATCTCAGCCCC
TTCCCTTGCCCTCCGTGCCCTTCGGGCCCCAACAAAAGGCCAAACATTTTGGGTCTTTCCGTGTGCATTTTGAACTTGCAACTAAATGGAAGTCCGAGCCGAGACTCTGTAACATATGCG
AGCGGCGGCCGGCAACTTGCATCATTCATCTTGCCACACGCCACTCCGCCCACCCACACACATGGCGCTTGGCGCTTGACGCTGCAAAGGGGGCGAAGGGGAGGCTGCAGTTTTTTTTGC
TCCCTTTCCATCGGAGCTAAATAAATGCACTTGATTTCATTTCATGATGATGCATTTTCCGCTGGCCTCCCATAAACCACACACATTTCTTGCCTCCTCGGAGGCAGTTGCTTTGTCATT
ATACGAGCGGGGCATGCCACACTCGAAGCCGTGCCTCCGCCCCTACTTTCTTTGCATAGAAGAGCGAACCACATAAAATACAGATGGAGCCCGTGCAGCAGATGAACAAAATCTATTATG
CACTGAGAAAAATGGTACTATCTTTGCACTCCAAAAGGAACTTTAAACGTGAAACCCCCTCAAATATATATACCTTCCTTTTGCCGGTCAAATATGGGCATTGAAAACCGCATTATTCCA
CACGATTCATTAGCTCTGGGATGGGGTTCATCTGCGAGGCCATAAATTATCTCTATTGGTTATTTTTCTAATTGAATTAAACACCGGTCGTGCCATATCGTATCACCTCATATTTTCCCA
TAAATCACTTCCCCAAAAGGACAATTAGTTGGCTTTTTGCAACGACTTTGAGGCAACAGTTTGGCGTACAGGAAAACAAGACCAAAAAAAAAGGGAACTTTAACTTGAGAGGCTTTCAGA
TCAAAGTTAATGATATAATATGCCAAGAGCTTGATTTAGTTAAGTTTCATTCAATTTGGAAAATAATATTGGAGCAAAGGCATATGGAAATGGGTTTAATATTAGCTTGGTCTCGGCTAA
CTTTTCCATGGAACTTTTCCATATATGCATTGACGTATCGAATATTTTTCAAACATTTGCATTGTTCTTTTTCTTTTTATTGTTTTTGCGAGTGTGGCCTCTGGTCTTTGCCTCTTGGAG
GCTCGGCAGCGGCGGCAGCGCCTTGTGCAATCTAATAACTTCGGGCATTATGCACATATGAAATATATCTCATCTAATGTCTGCCCCTCCCCTCTCTGTCCGCCTGCCTGTGTGGGTGGC
TTTGCTGTTGGAACAAAAATCGCCTCACGTTGCCATATGCCTGGCATATTAAAAACTTTGATTAGTGTCCAGTTCGCTGGCTCTTGCCACGCCCACGCCCACACCACACACCACACACAT
ATGCATTGTTCTGCATAATATATGGCCACTACATTGCGATGTACATTGTACATATGTACATATAGTATTTGCTTGGATTTTTGTTTGTGAGTTTTGTGAATCAACTATTTGGGAAAAGTA
TTTTTTGTTTGCCCAAAAATTGTTTCAATCTGCCCTCCCGCTCCAATACGCTGGCTGGCTGGCTGGCTGGCTGGCTGGTTGGGTGGCTGTTTTCTTTCTCCCTCTTTGGCTGCTCGCTGT
TTTTCTTCTCTTTCTATTTCGCTGTGTGCCCCCCTCTCTCTTTCTCCCATTCTCTCTGTGAATTTCTCATTTCGTTTTTGGTGCATTTATGCGTGACAAATGCCTTGTATTTTGTGTAGC
GCTCTACGGGTCGTCGTCTCCATATCCGGGCTCCTGGTTCCTGGTTCCAGGATCTCCTCCTCCTCCTCCTCCTTCTGCCATTCCCCCTGTTCCCTCTGTTATTAAATATGCATAAATTTC
CTTTTTGCACATAGCATTCGTTGCTGCTGCTGTTGTTTTTAGTTGCTGGAATTTTGCATTTACATTTGCTGTTTATTTGTCGGTGGAAATATTGAAATTGTTGCCAATGGATGTCCTAGC
AGCAGCGGCAGCAGCAGCAGCAGCTTCTGGGTCCTGCCTGCATTTCCTGGGGGACTCGAAGTGAAATGATATCCAGGGTTTTCCGTTCAGGTGTGGTTCTTGTTGGAATAATTACAGCAG
AAGCAGAATCCGAATCAGAATCAGCAGCAGCTGTGCCTCTGGCTGTGGCTGTATCTGTATCTGTGGCTGTGGCTGGGGCTGTCTTTGTGGAATATGCCCAAATATTGTATCTTCTGGATG
GCAGGAAGCTGTTGCAAATTGTCTAAAGATTGCCTAATTTGCTTGTAATATTTGATTTGATTTAATATGCATCCCTCAAATGCATTTCATATGGATTTAATTAGTTTTTAGCTTCAATTG
ATGCGAGCATCAAATGTTAATTGAATGATGAATGCTTATGGGATTATTATTAGGAAATACTCAAAGAATACCAGATGTTCATTTAACAAATTAGTTTACACACTCGAAGAACTGCTCCCC
ATCTCCCTCTGTTTCTCCCTCTCCCTCTCACTCTCTGTCTTCCTGTGGCAGATCCATCCATCTCTGTTTGCGTTTAGTCCCTGGCAATAAATCCCTCTAAATGCAATTGCCAATTCAATT
ACCAATCAATCAAAGAGCATTCAAACTTCGTCCTTTGGAGCTAGCAAATCCCCCTCCCCTGACACACACACACACGCACTCCTCGTCTGGCTCCTCCTCTCCTCCCCCCCGTTTCGTGTG
CAATACAATATCATTGCTGTAAGAGCTTTTTTCGGCTGCAGTTGCAGTGTTTGGTTTTTGTGGCATTCAGTCTGACTTTTCGATTGTTAAATATTTGCAGGGCGGGCAACAAAAGCCCGA
CACGGCACTGAAAACAAAATGAATCCGATTCTGTTGGCCAGAATGTGCCACTCAATCGGTAATATGATTTGTAATTATGCAAATGAAGCCGTCCCCCAGTGCACCACCAATAGGGAAAGG
GCAAAAAGAAAAGCAGAGAGGCAGAGACGGAGAGAGAGTCAAATGGGAGTTACATTTTTGGATGATTAAAGCTGAATGAGAAAAGTTGTCCGCCTCAAAGGGAAAGGCAGAAGCAGGGCA
GGGCAGAGGCAGGGCAGGGCTGGGGAAAGGGTCCCCAGCGGCAGGGCAAAGTTTTTGCCAATCAAAGCGTATGCATTTTTAATTCACTGCCAAATGCAGATGGAATGAAGGAGATGGCTA
TCGAAGAGAGACAGAGAGAGAGAGAGAGAGACCAACGTGAAGGAAGCCGAATCTCGAGTTGCATCGAGAGAGCGGGGGCCCCAATGCGATTCTGTCAAGGAGCCTGTCCGACTGTCTATC
TCTCTGTCTCTGTGCCTGCCTGCCAGGCTGTTCGCTTCACTTTGACATAGAAAAATCTTCTATTGATTTCGCCTTGCCACATGCAGCATGCAGCATGCAGCATGGCTGCCCCCAATCGTA
TCAGCCATCCCTCTTTTGGCTCCCTCTTCAGCCCCACCTTCTCCTTCTCTCCCCCACTCCCCCACGTCTCGTCACGCTGTGCAATTTGCATTTAAAGTTTAGCTTTTTTCTTTTTGCCAT
TCTCCCTTTTGCATTCCTGCCAATTCCTTTGCTTTAATGCTACAAAGTAGCACTTGCTTCATTGCCATCGTCCACGCCCCCAGTCCCAGTCCCAGTCCCAGTCGCAGCCCCCAGCCCTCA
GCCTCTGGCCAACGCCCACACGCTTTGTCATTTCGCATGCTCGTTAGTGCAGAGCAAAGAATGCAAAATAGCAAGATACTGGCGGGATAGTAGGTGGCAGGAGGTGGCAGGAGATAACAT
GCAACACCACCAACATCATCGCTACACAAACACATGCCACAATTTATAAGCAAAGTTATTTTGCCCCAGCAGAAAAAAAACACGAGGTGGGAAAGGGCTGGGGCAGGGGCTGGGGCTGGG
GCAGAGCTCGGTTGCCCCGATAGCAACCGGCAACATGTCAAAGTGGAGCAATCGGCATGTGCAACGGACACGGAGCAGAACGGACACAGACACGGACACGGATTCGGACTTGGAGTCGGA
GTCGGAATCGGACTCTGTCGGAGGCAACTGTCGGTCAAGCATAAGAAACAAGCAGTAGAAGCGGTCCATAAGTATGCAATGGCAAAAGAGAGAAAAACACAGAGACCCAGAGAGACGGAG
AGACGGAGAGACAGAGAGTGTATATACGAATAGGGGATACCTTAAAAACAAAGGGAAACAGTATCCGTGATAGCTTTGGTAAATACCAGCCTTATGTATCGCCATATCCTTAAAAGTGTA
TAAGGGTCAAAGACGAAGTAACCCGATAGAAGAGAAGCAGAGACCTTTGGATAAGTTCAAAAGAGCCAGTATGGATCTTAAATATTTGAAGACTCCTCCGGCTGCTGCCTCCTGCCTGTG
GCATACCCCAAGCAGCACTCCGGAATTACCTTACCCGTTTTAAGAGAAAGGTAGAATTCTGGAGGAGCCCAAAAGCGGGCGAATGAAAAGCATTGAAAAGGCAAAGGCCTGGAATGCTGA
AACCTGCGCTGAGAGATGCCGCTGCCGATGCCGTGGCCGATGCTGGCCATCATGTGGAGCCGACCAGGTAAAGTCAACAACAGAGTCAGAGTGGCAGCAGGGCGAGTGGAAAAAATGTAA
GAAAAGATGCCACTGCCACTGGTAGGGCTACTGCCCTGCCGCTCACTGCTCGACGAGGCGGCAAACTAACCCCACCCCCGTCCAGAGCAGGTCCAGGGCCAAGTGGCAGACCACGTAGCC
AAGGCAGCGGACAGATGCATAGCCGTAATAATGTTAATGTAAATGTTGCAAGTGGAGCAGGTACAGGGGCACAGGGGCACACGGGGCAAGTCGAGTGGCCATCAGTTGGCCCGAGCTGCT
GCTGCTGTTGAAGATTCTGGCAGATATTAACATGGCGTATACGTAATGGTTCCGAATTAGTGGACTGAATTATCTTTAATATTTATGCTATCTATAATATAGTGGTTATTTATAAGGATA
GCGGGGGGTTCTTTGGGGTTTCAACTAGCACTCTCGACTTACCTCTACTCCGCGAGAAGAAGTCGAGCAAAAGGGGATTGGAAGATCACGCGTTGCGACGGTCCACAAGGAAATTGTACA
AGGTGGTCCGCTAACTCCCGTTCGGGTCTTATTTCACGAAAGAGTGAGCGCACTAGAGAGAGCGCTGATGAGAAAGAGAAGAACGAAGCCTGCTGCTGTACGACCCGAAAGGAGAGCGCG
TGGCGAGAGCGGACCACCTTCCATACCACCTGCGGGTAGTGGCGTTTTGGTGCTTAAGCGGAAGAATACAATGATGAGCAAAGGAATAGCGAAAGGTATAGCTCAAATGACTATGAACTA
TAGAAGGCTCAAATAAGCGGCTAAACGATCCTCAAAAGCTTATTTGGCTCAGGTTCAATACCACATACCACGTACTCGCTCAAATGCATTGCAATGGGGGAAATTGCCTTTAAAAGAATT
GGAAATGAGTTTGAAAATAGCCGAAATACACAATAAAAGAACTTAAATTTAATATCCTCCCCCCCACCAATGCCACGCCCTAATCTAATACCCAGAACACACACACAGTGCAAAAAATAC
AATATATGTAGGAGGACTGGCATAATATTCGGCTTTTATCCGCATATTAAACCATTTCCACTTGAATGGAGAACTGATTTCATTGCGTTTTTATGCTCGTACGATCCATATATCGTACGC
AATGCTCCTGTATATGTACATATATCCACTCCACTTCACGTGCATTGTGTGTGTGCTTCCCTTTTTGGCTCTTCAAATATATAAAATATATCCTAGTGCCAGATGGGAGAAGGCGGAAGG
CAGAGGAGACAAGGGAGGGGCATATAAATCTGCTGTATAATTGAATTTCACACGCTGCCAAAATATCAAATAAAAAATAAATAAAACCCAAAAAGCAACGAAAGTTTAAATAAAATTTAG
TTCAACAAAAATACCCAAGCGGTGAAGGGTGGGGAGGGGGGGTGGGAATGGCGGAGTGTGAGCTGCCTGAAAATGTGAGAATTATAACCGAACTGAACCGAACCACCTCCCCACTCTCCC
CCAATGGGCACCCTTCTTCTGGTAGCCACTTTTAACCATGGAGGTGTTGACCGAACTGCAAGCAGCTTAAGCAGCATGCCACAGCAGTCACCAGTCACCACGCAACAGAGCAACCCGTCG
GTGGAGGGGGGTACGAGGAGTATGAAATCCTGTGGAGAATGCAGATGGGAATGAAAGTCCTGAGGAGATTGCAGATGGGAATTGAAGGTCCTCAGGAGGAGGGGGACGGGACTTGCTCTG
GGAATGGGAATGGGAATACTCGCTTGTATGCTCGCTCCTTTAGTCGGACGGCAATTGCATGTGTGGCAAGTGCTCTTGTGAGACATTTACACCTGTAAAAACATTTGGACCAGAGGAGGG
TAGGAGGAAGAGGAAGAAGCAGCGGCAGCAGCAGCAGCAGCAGCAGCCGGGGGCTGGAGGAACAGGAAGAGTATGGAGCTAAGGGGAGCATAATATTGGGAGTAGAAGAAATAAGAGCAC
GGAAAGGAATGGAGAAGGAGAATCATTAGCAGGATAGGCAGCAGTGCCTGCGGGGGGGGGACCACCAGTAGGGGAAGCAGTACCTGGAGACCAGAGAATGGTGATGGCTGATGGCACAAC
ACATTAGCAGCCCAAGAAAACAGAAAATAGAAAACAGAAAAGTGACGCTCGAGCTGTGACAAGAGCAGTCCTCGTGTGTAAAGAACAGAGCAGAACAGAACAGAACAGAACAGAACAGAA
CAGAACAGAACACTCTATAAATAGAACTAAGGACACCTAAGGACGATCTAAGGGGAGAGCTAAGAGAGCTGCAGGCACCCAAGGAGTGCAGCACGAGGAGAGTGCACTGCCTCAGATGAC
CAGCAATTTGGAAAGGCTTCGAAATTTGTTGCTTGGGAATTTCCTTGCTTTATCTCCAGCTGGTTTTCTGGTTTCAGGTTCGACTTTTCCTACTGCTTTTCTTTGGGAGTTGCCCTCCGC
TGCTCCCGCCGCTGCTCCTGCCCTGCTCCTCCTCCTGCTGTTGGGGAGTTCTTCTTTTTGCGAGTTTTTTCAGTGGCAGCCCAAAAGTATGCAACGAAAGTTTAGATTCCACCCCTCCCT
CACGCTCGCTCGCTCGCTCGCTCCGACTCTCGGAAGTGAATTTTCCGCACGTGGCCTAACTTTTGCTGCCTGGCTCCGTGGCCCCTCCCAGGACGCACCATTCTGGCGTGTCATGCAGTC
AGCAATCTTTATCCAACCCCCAGCCCCCAAACCCCCCTCGCACTCCCACTCGACTGCACGCCCCTTTGAGTGGGCGCCTTTCATTCTTTTCTTGCGGATTTTCCTCGAGGATTGCATTTA
ACGCAGAAACAGGAAGGAAGGAAATGCCATGTCGACGCTAAAACTGGGTTAGCTTACATAACGACAACATTAACGACAACGTTTTCCTTCCTCTCAAACAGGGGGTCGGCGGGGCAGGGC
AGGGCAGTGCAGGGCAGGGCAGATGGCGGAGGAGGAGCTGAATGAAGCCATGTAGGAGGGCGTTCGCTTCACGCTTCGTCGCTTATTGGACATAATTACTGAATATAACAATTCTGTGGT
GTGTGCGGCATGGGGCATGGACTCCGCCTGCCATAGCCACTTGTGCGTAACCGCAGCAGCGACAGATAACTGCAACAAAAGCAAAGTCCGGGGCATGCAGCCAGCCAGCGATGGTGTGCT
GGGGCCAAAGGACCCACATATCAATTCCGTGTGAGCGGACATCGTGCAACTTCTTGTTGCAACAGTTCCCGGCCGGACAGGAGCACCGCCCCGGTTCCTGGTCAGTGGAGAAGAGCAGGC
ATCCGTTGCCTTTATCGCTTACACTTGGCCAAAGCCCCGCTTTTATGCTTCTTTTTTTATACCCGATACTCAAAATGAGTATTGGGGTATATTAGATTTGTGGTAAAAGTGGATGTGTGT
AACGTCCAGAAGGAATCGTCCGTCTGTCCGTCTGTCCGTCCCCTTCAGCGCCTAATGCTCAAAGACTATAAGAGCCAGAGCACCGATGTTTTGGAACCAGACTTCTGTGATATGTCACTG
CTACAAAAATATTTCAAAACTTTGCCCCGCCCACTTCCGCCCCCACAAAGGGCGAAAATCTGTGGCATCCACAATTTTAAAGATAAGATAAAACCAAAAACGCAGAATCGTAGAGAATGA
CCATATCTTATAGACTTATAATCTGAATTGGATCGTATTATTATTATAGCCAGCATCAAGAAAACAATTTCATTTTTTCTCGCCCTATCTCTCTCTAACACACACGTAGCATAGGCGGCT
TTGCTTAGAGTAAAACATTAGCGCCTAGATCTCAGAGACTATAAAAGCTAGAGCAGCCAAATTTGGTATCCACACTCCTAATATATCGGACCGAGACGAGTTTGTTTCAAAATTTCGCCA
CACCCCCTTCCGCCCCCGCAAAGAATGCAAATCTGGGGATATTCACAAATCTCAGAGACTATTAAAGCTAGAGTAACCAAATTTGGTATCCGCACTTCTGTTAGATCTCACTATAAAACG
TATATCTCAGAATTTCGCCCCACCCCCTTCCGCCCCCACAAAGGACGAAAATCTGTTGCATCCACAATATTGCAGATTCGAGAAAACTTAAAACGCAGAATCATAGATAGCGACCATATC
TATCAGATTGCTGAATCTGGATCAGATCAGATCATTTTTATAGCGAAAGGAAACAAATCAATTTGCACTGGCTACGCAGCGCCCGACGTCACGCTAAGACTGATTTTCTGTCTCTCTCGC
ACGCACTCTTTGTCGTGTCGATTAATATAAGCGGCGTCTGCCGGAGGAGAGCCATACTGACTTAGTATCGGGTATAACTGTAGAGTTGCGGTGTCCGCAGCAACTCACAACGTTCCTCCT
CGTTTTTTTTGTGTTTGGCATTGATTCAACGGAATCCTTAAAGTTGAACAGTTTTAGAGTTGGATTTGGGGTTTGGGTTTCGGGTTTCGGGTCTCCGATTTCCGATTTCCGATTTCGGAT
ATTGGGTCTGGGCTTGGAGCGTAATGCCAACGAAACTTTTGGCCCTTTTGTCACTTTTGGAACAATGTTAATTGCTTTAGAGTTTACTTTTCGACTTTAGCTTCCCGGGATAAAGGGGGG
CGGGGGCGATGGGCGGTGGGCGGTGGGCGGATGTCCCAAGGCATGCATTAAATTGTGCTGATCTGCTGCATGCCATGGTTTGGCTCTTTTGTTGCTTTGTTCGAATGGAAATCGGAGCTA
AACAAAGCGTTTTATGCATGCACACTCCAAGCCAATTACGCAAAGTGCTCTAAGTATCTCAAACAGCAGCAGCAGCAGCAGCAGCAGCCGGGGGCAGCAACCGACCCAAAACTGGACACA
ATTACATTAACAGTTTAGCACACACACACCGATAGAGGGAGAGAGGGAGAGAGTGTGTGTGTGTGTGTGCAAGGATTGAGGGCAAAACAAACAGAATGAAATTAAATCAAAGGCTCGAAA
ATAAGCTCTGGAAAAGGAGTGCTGCCTGCTGCTGCTGCAGTAACCAACAGTAACCAACAGTAACTAACTGGGCTAAAAGCTGTTGGCTTGTCTTGGCCCCTTGTGCCTCTCTGCCGCTGC
AATCGTCCCTCCCTCAGCCTCTGCCACTTGCTGAAGCTGTCAGACTGACAAAGTGACTGCCTGACTGACAGCTGAACTGTTACATACACCAACACACACACACACACACACACAGAGAAC
AACACAAATAGACGACTGCACACAAATAAACAATGTCAAGAGTTAAAAGCGGTAGGGTAGGGGGTAGGGGTGGAGCTGGGGGTGGCACTGTGCACAGCTTTTGTTTAACAATAATAAAAT
GGCACTACTATAGAAAAAACAAAAAATAAGAACAGCCTGCAGGCGAGTGGCTCGACCAAGAGGATACCCCAGGGCCCAGCATGGATGCAACAGCCCCTACTGATCTTGCATCGTCCATTA
TCGGCACTCCAAAGCCCAGTCCTATGGCTCCATAGGATGGATCTCGACTATGGATGATAATCCAGATTGTGTACCCTTTCTGGGATCGTCCTGAACACAACATACCCTTGGTAGCCCTGG
AAATGCCCGCCAGAACAAAGGCAAAGGGCAGGAGGATCCCCTCAGTGCAATTGTGATCTGTGGCATAGGCTGAAGGATTGCATTACCAAGTGTTTTACACACACACACACAGCACAGAGA
CAAGCAGAAAGACAAAGACAATTGTCATGGGTCCTTCGGACTATCGTCAGGTGTAAGGGGGGGAGGAGACCATCACCTGGGCAGGGCAACCCCAGCCAACAGAAGGACACACGGGTATCC
GCAGATGCAAGGATGAGCTCCGCCATTGTAGGGCACTTTTATCCATTGTTGTTGTTGCGGCTGCCACTTTTTGTTGCAGCTTCCTTGCTGCGTGTTTGTAAGCCATGTTATTACAATAAT
GTGTCTCCCAGTGTGTGTGTGTGTGCGTGCGTGCGTCTGTGTGCATGCGTGTGTGTTTGCGGCATCCTTCCATCCGTCTGTGTATCTGTGCATCTGTGCATCTGTGTGGGTTGTGGAGTG
TGGAGTGTGTGGCAGGCCAAAGTTAAATCATGTTTTTATTGTCACATTTGCCATAAAGGCAGCCCAGGGACCAGAGCCAGAGCCAAAGCACACCAGCAGCAGCAGCAGCAGACCGTAAAA
GGATTTGGATCGGCTTGCAGGAATGGGAGCAGGGGCACAAGGCGTCATAAGATGCTGCCTAAGCTGTAGGCCCACCTGAGAGGAGCACAGAGACGGAGGGACAGAGAGGCAGGGGAGGCA
GGGGAGGTACATATAGGATGTAAGCCAGACAAATCATAATCAGAGCCAACAACCAGGCCACGCAAGGCGAAAGCAAAAGGCAAGTCCCAACAAAGCACAATTTCACCTTAACAACGAGGC
ATGGCCGAGAGGGGAGAGTGGGACTAAAGGGGGGGTTGATCCCACTGATAAGGCAGCAGCGGCAGTAGCTGCACCCTCCGACGCAGCCACCCCACGTTTATAGCCTTGTTTATGTTGCAT
ACTTTATGGCGCTTCAAGGGGTTTCGAACGGGAACAGAGGCAGAAAGATACTCCGGGGAGGGGGGGAGGCAGGAGCGTGTGGCGCGCGCGGAAAAAATGGCAGAGAGTCTAATGAAAATG
CAATGAACTAGAAAATAAACTGACTCAATGGATAAAAGCCAACTAACGTTGAAAGCGGAACAGCGCGCAGCGAACAGCAAACCAATTTCAATGCAGGAAGCAGGATGCAGGAGGCAGGAG
CCATAGTTACGAGCATAAAAATTAAACATCTGCTCTGTTTATTGCCGGTCTTATATTGTCTGATGGCAGGGGCGTTATCTTCCGATATAACAATAGCCACAAATGGCAAATGGAGAGTCC
TTAGAGGCTCTTTTCATGAGCCACAAACGAAGAAGGAATCGAAGGACATAGAGCTGTTGCTCCGACCTAGCTTTTCAGTGAAGAAAAGCCCAACACTCCTTCACACTCCTGCCCTAGGAA
GGGACTTTCCACTCCAAAACTCTTCCGAAATGGGGCTTTTAGTCCCTAGAGAAACTTTGAACAATCGAAATGGCAATTCGAGTGCATGGAAGACCACAGTTCTGGCCAAAAACGACAATG
CAATTGAGATCTCTAAGCGAATGCCACAAACATTTACTGTCAAATGTGCGCCGCCACAAACAGTTACTGCCACACCTCCCCCACACGCCCGCCACACGCACAGTCTTGCCGCACTCTTGT
CGCTGCACTGCATGTTGCATGTGGAGGGCCGCTCTCTAATTTACTTGCTGTGGTTTTTCCTTCACTAATTGTGACAACTAATTGAAATCAAATTAGTGGAAGCCACTAAACAGCTTAACG
CTCGCTGAGCATGTATTTGATTGTGTCCCTCCCGATCTCCCCCCTTGTTGCATGCTGCATACTGCATGTTGCATGTTGCATGGGGTGCCAATTGTTTTCATTCAATTTACTGTCGGAGGT
GGCTTTAAAATATCGCTCTTGGCCTCAGTTTTGGCTAGCCATAAATTTTCCTCAACGGACATCAAAGCGTATACGTAATGGCCCACCCAGTGTGCGTGTGTGTGCTTATGAGAGTGTGTG
CGTGTGCAGGCATGCGTGAATAGACAAACAAGCGAGAGAGAGAGCGAGAGAGAGCAGGCAAATGCCTTTGTCAATAATATGCATTAATTAAGTGCAGACATGGACACCGGGGACACAGTC
CACAGTTCACAGTCCACAGTCCGGCGTCCACAAGCCAGCAAGGAGTCGACGGAGCAATGGAGCCACACAGCAGGGTTGCAGCACGGTTACGGATACGGCTACGGATACGGATACGGGTAG
GGGCACGGGCACGGGCACCGACACAGAGGCACCGACACAGAGGCAGGTGCCGTACCGATTCGAACCGTGCCGCACCGGACACAGCTTGGCACTCGCTTTATGCATTTCAAATTTTAATTG
AAAATATCGTCCGAATTTAATGCAAAAATGCAAATGTCATTATCCAAACAAAAACAAAAAAAAATAGCCATTTTCCAAAAGCTGCAGCAGTGGCGGAAGTGTCACAGCATTGAATAGTAG
TTTCAAGATTCAATTTCCTTTTCCGCCTGCCACGCACACATGGACACATGGACACATGGACACATGGATGGGGCAGGTGGGCCTGCAGGTGGGCTTAGGGCGTGGTCTGCCTGGAAGAAT
GAAAGTTATTTGTGCCGCCTTTGAGCATCTGTTCGGTTGGGACTCGCTTCAGTTCTCTCTTTTTGCATGGCCGCTGCTCGACAAATAGTTGCGCCACTGACATGACGCCAACTATGCGGC
CTGCCACTGCCACTTGCCACTGGCAGTACCTCTGCCGCAGAACAGGAAGAGGCATAGGGAACATCTGCAGCTGCTAGGCTCGTCGCCTGTCGACCGTCGGCTTTGTGGACTCTGGACTCC
ACTCCGGACAGGATGGCGTATGAGTGTGTGTGCACCATTGAAGTGGACCATCGTTTGTCACACGCGTCATGGGTGGGCCTTCCCCTGACGCGTGCAGAAGCAGAGGCAGAGGCAGAACAG
TGGTGGCACAGTGGCATGGAAAAGGGTCCGGTAGCACTTTGCCTGAACAAATAAAACAGAGATGATAGCCGGTGTCTTCCAGGGATAAAAACGAGCTCAGAATGTTGATTCAAATTAGTC
AGATAGTTATGGTGCCACACTCCACACCCCCCCCCACAGCCCCTCCCACTCGCTTGACCCAATGTGGCAGAGGACTGCCGCTGCCCCGTGTGGATTGTTGGGTGGTTGGGGCTAATGGAG
TTTGCAAGAAGTGGGCGCAAAATGTGGATCAAGCAGCCGCATTCGCACTCGCACAAAGGGTTGCTCTTCGTCTTCTGCTGCTGCTGCCCCTCCTGCAACATAGTGCGGCTCCTGCTCCTG
CTCCTTCTTAGAGAGAGAGAGAAAGAAATGCGAGCCTTTGCAAATTGTTGCTGCACCCGCATCCCAGTCGCCATCGGTCTGGACCACCGACAGCCTCTGCCTCTGCCTCTACCACTGCCT
CGCCCCACTCCACAGCCACCGCAACGTGCATGTGGCTGCCACACAAGAGAGATGTGGATAGAGAGTGTAAGAGAGAGGTTACCACATGGATACCAGTGGGTCGACTAGCCTCAGGACTTG
GGTCTTTCATCAGGCCAGAGCCGGAGCCGTAGCCGGAGCCAGAGCCAAGACCGAAACCAGATAGCAGGAGGCAGGAGGCGGCAGGCAGGAGCCAGGAGCCAGGGAGTCGCCGTGGCAGCG
GCAGTGGCGGCTTTGTTTATGTGTGGCTCCATTTGGGCGTATGCATGTCCTTCTCCTACGGCTTCTCTCCCTTGGATTTACTCCATGGTGGTTGCATGCCCCCTGCCCCTACCCCTGTCC
CTGCCCCTGCCTCAAGCTCTACGTGCGAGTGGTTTTTGTGTTGTTCTGGATTTTTAAGTTGTTTTCGTTGTCGTTGGTTGTTTGTGATGTTGTTTTTTGTTGTTTTTGCTTTTTTTTTGT
GTGCGTTCATCTCCTTTTTTGTGTGTGCCTTTCCTTGTGTTGCATCCACGTCTTTAGTTGCAATTTGAGCATACCAACAAACATATGCACTTAACACACACGAACATGGACACGAGCACG
GACACGGAGCACTCACACGGAGAAAGGATGAACGCTGACACCAATTAGTGCCTCCTGCTGCTGCTGCTGCTGCTGCTGCCGCTGCTGTTGCAAGTTACTGCTGCTTCTGCTGCTGCTGCA
AGTTAGTTAGTTGCTGGGTTGTTTGTGCGTTGCATGTTTACACTAGCCACAAAACATAATTTCTGTTTAGACAACTTTAGCGGCAGCCCCACTTTCCGGTTTTCCACTTTCCGTGCCGTT
TGTGGCCACAGGAACGGCCCCTGTCCGTGCAGCACCTTCGCCCTCTGCTCCTGCTGCTGCTGCCCCTCGACAAAGGGGAAACGTGGCAGGAAGGATACCCTGGACGAGCCAATCATTGCT
TTGCTTCAGACTACCAATTGTTCCTGTGGCAGCTATATGATATTGCGATCCCATCTGATCCTCACTGAAAGCCAACATGAGGCAAGCATCAGAATCCTATTACCCCGCCCCTAGGGCATG
GGTATGGGCCCATAATTATCGCAACAACGAAGTTGGCCAAAAAGGCAGCAGGCAGGAGCAGGCAGCAGGCAGCAGGCAGCAGTTTGCATTTTTGGTATAATTTTCACTTTTGCCAAGGAA
ACTTTTTCGCTGCCGCATTGAATTTGCAACCAACTAAAACCCGCCACACCCGCCACACCCGCCACACCCGCCACACCCGCCCGACCCAGACCAGCGGTTGCCGCTGTTGCAGCTTGCAAA
CTTCCTGAGTTGATTAGAAAGTTGTTTTTTTAGCCGCTGTTGCTGCCACATCCTCCCTGCAGTTGCTAATGGCAGCTTAGAGCAAAAGGAGCAGGGCATGGCAGCGCAGGGCAGGGCAGG
GCAGGGCAGGGCAGGAGGATGTTGCAAGGCCCCCTCGCAGCTCGCGAGCATATGCGAGGAGTGAGACCTCAATCTGTGGTTGAGTTCCTCGTCGTTTGGTTGCTTGGTGACTTGGGATTG
TGGCATGGTTGCTACTAGGGCGCGCAAAGCTTATCGCTTCGGTGGCTTGTCGTCAGGCAGCAGGCAGCAGGCAGCAGGCAGCGGCAGCAACATAAACTATGCGAGCATCAACTTTTTTTG
TAATTAATTTTTAATGTGTGTGCAAAATAAAATGTTTACGTACATAAATGTTCTTGTAGTTTCTGCCTTCCCCTGCCATATCAAGGGGATGTGGCTGGGAGCGAAAGGGTTTATCAAGCG
CTCTAAGAGGCGGATAAGCAGCGCTTAAGTGGGTGCCACAGAGGAAGAGGAAGGGGAAGAGGAAGAGGGTAGCTATGATTTAAGATTGTGTTACGCCCAGAGGAAAATAATTGGATTGCC
TTAAACAGCGTTGGCTGTTGGATACAACAAAAAAGAACGATGCCTGACAAATGGGTTTCTCTTGTGCGGAATATACACGCCTTTCTATCAATCCCTAATTGCATATGACAATTGCTATTG
CACCCACTATCCCAAAGAGAGAGAGAGATAGAAGAAGAAGAAGAAGAAGAAGAGCACATGACAGACAAGACGGGAAGGGAAGGGACAGGACGGGACGGGACAGGACAGACACTGGCAACT
TGCTGCAATTTGCTTTGGAACCAAAAATTGATTTACCCTCGACTGCTAAACGTGCCACATCCTCCTGCCAACCATCCATCCTTCCATCTACGTCCCCATGTCATCTGCCAGATAAACGAT
TTGCTCCATACAGCTACAGCTACATCTACTGCTACATCTACTGCTACTGCTATCATGATTTCTCACTTTGTTCCTCCTCTTTTCCGTACATTTTGCAGAGATTTGCGTTTCAATCACTTC
GAGGAGGTGCCGGCAGACGCTTTCCGCGGCATGGGGCAACTGTCGACGCTGTTCCTGAACGAGAACGAGCTGGCCCACCTGCAGGATGGGGCCTTCCAGGGGCTGCTCGCCCTGAGGTTC
CTCTATCTGAACAATAATCGGCTAAGCCGCTTACCGGCGGCCATCTTCCAGGGATTGCCGCGCGTGGAGGCGAT
ATGTAAGTGAAACGCGGCACAGCCCTTAATTAAGGATATGCAGAAA
GGCATAAATCGCCTTAACGACGCCCACTTTTCCGCTTTTCCCCCTTTCTCCCGCCCTCCACTGTTAGATATCTGGAGAACAATGACATTTTCCAGCTGCCTGCCGGAGTTTTTGACAATT
TGCCACGTCTGAATCGCCT
CTGTAAGTTGGATGGATGGATGGATGGATGGATACGAGAGGGAGATTAACTTTTTTGGTGGGTACATCTCCATTTTCCAGCTTCCTTTACAACAACAAGCT
CACCCAACTGCCGGTGGAGGGATTCAACAAACTGAACAGCCTGAAGCGTCTGCGATTGGATGGGAATGCCATCGACTGCAACTGCGGTGTGTACTCCTTATGGAGACGCTGGCATCTGGA
TGCCCAGCGTCAGCTGGTGACCATCTCCTTGACCTGTGCCGAGCCGCAAGCCCTGCAGCGCCAGAGCTTCGCCAGCCTCCAGGAGCAGCACTTCAAGTGTG
TGGTAAGTGCGAACAGAAC
AGAGCTGAGAGCTGAGAACGGAGAACTGAGAACTGCACTTCAATCTTTCACCCGTTGCAGCCAAGCCCAATCTCCTGGTGGCCCCACAGGACTTGCAGACCTTCGCCGGGGAGTCCGTTC
AACTCGACTGCGAGGTCACGGGTCTGCCCAAGCCCCAGATCACGTGGATGCACAACACGAACGAGGTCGGCGAGGATCAGGTCAACCGGGAAATCCTGCTAAGCGGCAGCCTGCTCATCC
GCAGCGTGGCAACCACTGACATGGGCATCTATCAGTGCCTGGCCCGCAACGAGATGGGGGAGGTACGCTCCCAGCCCATCCGCCTCGTTGTCAGCAGCAGCAGCAGCAGCAACCGGAACC
CACTGGACAACCCCCACATCGACCCTCGCAGCAATCAGGTATGGGCGGATGCGGATGGGAATGCGAATGCGGATGCAGGAGGTGCAACACCCACGCCACCGAGCTTCACCCACCAGCCGC
ATGACCAAATTGTGGCCCTTCATGGCGCGGGACACGTGCTCCTCGATTGCGCCGCCTCCGGCTGGCCACAGCCGGACATACAATGGTTCGTCAATGGTCGCCAGCTCGCCCAGTCCACCG
CCAGCCTCCAGCTGCAGGCCAACGGCAGCCTGCTCCTCCTGCAGCCCACCCAGCTGACAGCCGGCACGTATCGGTGCGAGGCGAGCAATCGCCTGGGCACTGTCCAGGCCACCGCCCGCG
TGGAGGTGAAGG
GGGTGAGTGTGACACAACAACTTCCAGAAGTAAGGGGGGCTGGGGGGGCACGGCACTCGTTTCGGGATTGCATTTCAGTCACTGCCAGACGACATACCATCGTCCGCA
GTCCGCAGTCCGCAGTCCGCAGTCCACTGGCAACACAGTCACAGTCACAACTGTCGCGTTTTACTCAATTTGCAGTCGATTTAATTTGTTTTCGCCTGAGATGAGGATGAGGATGAGGAT
GAGCATGGACCATGAAGCATGCAGCATGAAGCATGCAGCATGGACTCTGCGACTAGACACTGAGATTGAACATGGAGTCTGCTTTGGATCGGGGTGTTCGGGACTATGCGGAAGAGCTTG
TCGAGGGGTTTTCATAGATGATTGGATTATGTCAGATGTGGCGGCAAAACGGTGGCAGAAGGGTGCTCCTCCGTCAAATCAGAGTACAGAAGGACGGGACGGCACGGGTACTGCTGTACG
AAAAACCTAACAAAAGAAGAGCATTCAAACTTTTGGTTTAACCACGAATAAATGTATGAAAACAATGTTTCGCTTGAGCATTTTTATGTTCATTCGTTATTGCTGCAATTCAAGCGTTTG
TACGTGACTGTACTTTATTGCGCAATAGAAATGGACGGGGGGCCAGCAGGGGAAAACGCCCTGATATGTCCATAAGGGAAGGGGTTCCCAGACCGACCGACCGACCAACAGACCGACAGA
CCAACCGAATGCCAGAGGGAATGATAGAGTGGCAGAGTGGCAGCGCCTTGATGCAACAATGGCTCATTATAATTTGTGGTTGTCGGCACAAAATGAAGAGCAGACAGCCGGAGCAAAATT
TATGCATTATTTTGGAAAATAAGTGCAAAAATTATAAACTATTATGTAGAGGGGGGGGCGGGGGGTACGGTACGCACAAATTGCGAAGCTGCGGTTATCCCGAAAGTGCAGTGCTGCAGA
GGTGCAGAGGTGCAGAGGTGCAGAGAGGTGCAGGGGACGTGCGAGTGTATGGACTGTTTAAGCAGTCAGGCGATGCGCTTGGTCAGCATAATTAACCCGGCAAACACACTCACATACCCA
TAAACAGATGCACAGAGAGAGAAAAGCTCGAATGTTTTCCGAGTCCATTTCCAGTTGTACAAAATATAGATCAAAGAGGCATTTTCCAAGATAGTGTTTTCATTCTTCTGTCGATGTGCA
GATAAAATATATGGGTTTGGTGCAGATTTTCTCGGAGTGTAGGGGCAAATGCATGCACTGGCGCACACACACATTCACGAGCATATGAATAAGCAAATAAATATGCTGGCACCGCTCAAA
AGTATGCAATGCAAAAATTTGTATTCCAAACGCCCCAGGGGGCAGGCGGCAGGAGGCAAAGCAAAGCTGTGATCAAAGCGAATGGAGGGGCGTGCAGTGGGAGGGGGGAGAGGGAGAGGG
ATATCGAGTGGGATACAGAGTGGAACGAAGAAAGAGTTTATGGACAAAAGGCGGAATAGAAAACGAACCGCAGGCGGCGTCGGCGTCGCCGTCGGCGTCGACGTCGGCTATGACCATGTG
CCTTTTTTGTTGCTGCCATTTGGAGCGATAGAATTGCTCGAAAATCGCTCATCCGCGATTCATACCACTTGCAGATTTGCCCGAAATTTTAATGGCACCCCAAAACCAAACAATCAAACT
GGGCAAAGCCTTTGTGCTGGAATGCGATGCCGATGGCAATCCACTGCCCACCATCGACTGGCAGTTCAATGGATCCCCGCTCGCCAGCACCCCGGCCGGAGACCTGCTGCTGGAGAACGA
GAACACAGAGCTGGTGGTTAGTGCTGCCCGCCAGGACCATGCTG
TGGTAAGTGGTTCAGTGCGGGGCGGGACCTCTCGGGAAAGTGATTCCAAATGGCTTACATTTCATTTCATTTCTCT
CTCCCCTCCCCACCCCTCCGCTCACCGTGCACGTCCCTGTTGCAGGTGTCTACCGGTGCACGGCGCGCAACGAGAATGGCGAGACGAGTGCCGAGGCAACGATTAAAGTGGAACGCTCCC
AGTCCCCGCCTCGGGTGGCCATCGAACCGAGCAATTTGGTGGCCATTACGGGCACCACCATTGAGCTGCCCTGCCAGGCCGAGCAGCCGGAGGTGGGACTGCAG
AGGTAAAGTTTCCCGA
ACGGAATATCCCAAAAAACTTTTCTTTTTTATTTGCCCCGCCACTTGTTGACGCTTCCTTTGGCTTTCCCACCCCCCCGCCACATCCTCTCCCCCTCTGTCTATCTCTCCCTCTCTTTAC
ATACAGATTTCGTGGCGCCGTGATGGTCGCCTCATCGATCCCAATGTCCAGCTGACGGAAAAGTATCAAATAAGCGGCGCCGGCAGTCTGTTCGTGAAGAATGTGACCATCCTCGATGGC
GGTCGCTACGAGTGCCAGCTGAAGAACGAATTCGGAAGGGCCTCCGCCTCGGCGCTGGTTACCATCAG
AGGTGAGTGGCGGTGGCGAAAAAACCGGGATGCGGAATCAAATAGTTCAACT
TTGAGTCTCAGACCGCAACTTTAATTGCGCTCCAGAGTGGGACGGGATGGGATGGGATGGGGCCGGCATGGCAGGGGGCGGTCTCCATAAAACTCCATTAAAATCAAAAAGGGAAAACTT
TTTCTCAAAGAGAGAGAGCAGAGCAGAGCAAAGCAGAGAGGACTTACCAATAAAAGGCCTCGACAGGCAGACGACGACGACGACGACGACGACGACGACGACGATGCAGTTAAAATGCGT
TTAAGTGCGCGGTAAACTTGTTTAATTAATTGAAAAATACTCGTAGTTGTGCCACAGCAGAAAAAATATATACACAGCAGCAGTCACACAACAGTGGGAGCTACTGCTTCGGGAAGGCCT
GCAAATGCCGCCAAGAAAAACGCAAAAGCACGATCCACGATGAAATTATGCAAGGTGGTCCGCTCTCGCAACGCGCTCTCATTTTGGGTCTTACTTTGCCCAGTACAACAGCAGAGCGCT
CTCAGTAGAGCGTAAAATTAGACCCGAAAGAGAGCTGATCTTGAGCTTAAGAAAACACAATATGCAAAGACAGTAAAGAGAAAAAGCGAAAAGGTGTCGCCTCTCCCTTACGTTTTCCTT
TCCTGCACACAAATGAGGAGCCGACGCTTCGCGATTACGCCAAGCGCGTGACGATTTGGAACGAACGAACAAGCAAAAGGAAATCCAATAATTCACAGCAAATTCAATGAGTGCGTAGAT
TTTTTTGGGTAGATTAATGTAGGGTGAGGCCCGACAGGACTGAAGTAACGAGAGAGCGTGAGATCACCAAGACTTGAATGGCTGACCGACCTCACCTTTGCAATGAGACATCTTTGGTAC
ATAAGCAAGTAATTACAAAATTTTCAAGAAATATAATCCTGCCATGAAGCTGATTTGAATGCACTTCCAAAAATAGTTTTCAAGGCAGGAAGCCCTGCGAGTGCACCGACAAATGCCAGC
CTCAAATGATTTTTTAATCCGGCTGTTGCTCCACTGTGCTAAATTGCTTGGAAACCACTCACACTCCTGCAAAGACCCGTTGTCCGGCTGCGAGTCCGGGCCCGGGTCCGCAAAAAAGGC
GGCTATGAAAAGAACATAAAACCATAAAATAAAGCCAGCAGCTAAACTCCAGCACAGTGGCTAGAAATACATAGGAATCTTGAAAATTAAAACAGCCAAACAGTCGCGTCAGGGTGCTTT
TCAGGCAGTTAAAAATTTACATAACTTGGCCAATAGCTTGGGACACTTTCCGCCAAATCAGAGCCCACTGTGCCGCTGGCTAAACTGCATATAAAATATGCCCCAAAAAAGAACATCAAA
AGGCGGCAACGAAAGCACAAAAGTTGTCGTTGCCGAAAACGTGCTAAGCCCCGGACAAGCTACGTGCCACATACCCCACCGTACACACAGATAGAGAGAGAAAGGCGAAACAGACGGGGA
CAGAGGCAAAGATAGACAGAGAATCTCTCACAGAGGCAGCCGCACAAGTTATCCAAATTGCCCAAATTGAAATTGAGTTGACAAGTTGCCGGCTGGAACACGAAAGGAGTACAAGAACTA
GAACTAGAAGTGGAAGTGGAAGTGGTAGTGGTGGTGGAAATGGGACTGGAATCTTCTCCAGCTTTCATCTTCCATGTGCATCTGCATCTGCATCTGCATCTCCCACTTCAACTTTAACTT
TCTTTTGGGTTCACATACATATATAGTGGTAGTACTCCTCGAAAATAGAACTTTTCCAAATTGTTTGTTCCGCAAATATGGCAAAGCCAACAAAAAAACACCCAGAAGATGGAAAAGAGA
CAAGGAATTTATTGAGCACAGGCTCCATCCACATCCACGGCTGGCTCCCGCCTTGGGTCGTAAATCAATCTGAGGCAAGCAGAGGGAAGCAGTGGCAGGGACAGTGGCATATCTGGCCAT
TAACCGAGGGGTAAACGTTGGACCATTGATGCCCCACAAAGAGAGAACCGAACCGAACCGCATCGCAGCGGCTTAGTTTTTGCTTACACAGACAGGCAGCCTTTTCCAAATGCATAATGC
AGCTAAAGGGAAAACCAGCAGGAGGAGGAGGGGGATCGGAAAATAAGGATTTCCATGTACATGTATGTGGTGCACGCACCATTTAGTGCGGCAGAGCCCGGAAGCCGAATCGGCTTACAG
GGGAGCCGTCGGCATTAGAAACTTTACTCATTACGGTGCCAGCCGCAGTCGCCAGACCGCAGCAAAGGACCAAGAATTAAAAGAAGAAAAACAAAAAAAAGGCATAGCATAGCAGAAGAG
TAAGTAGTCCGTAGTCCGCTGTCCGTGGTCCGGAGAGAGTGTGAGAATGCAACGCTAAAAGTGAAAAACTATGTTGAAAGACGGATAGGAAGTGCAGTGCAGTGCAGTGCAGCGCAGGGC
AAGGACAGGGACAAAGATCCGGTGTGTAGGGGAGGGAGGAAGCGTGGAAGCGTCTCTCTAACNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNN
NNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNTT
ACTGCCCCTGTTGCTGCAGCTCCTCCTGGGGCTGCTGCTGGCGAAAAAGGCTAACCACGGAAATACATTGCCATACGCCGCACCGCACAGCATGTGGCAGAGCCAGAGGGCGGCAGGGCA
GGGCAGGGCACGGCGGGGCAGGGAGACCAAAACCAACAAAAAACTTTAGCAAAATGAGGCGACTGCAAGTGGATGGAGACTAAAGGTATACCCTGACTAACATTCCCGAATGAGATTAAT
TACATTGATGGAATGAGATATCAGAGGAGCCCTTAAAAAGAACTAGAATCAGAAGAAGACTTATCATAGAAAAGTGATCCATTCTCGAGCATTTGAGAGCATATAGATTCTAGAAAGATT
CGCTTGACCTCCTCACAGGATTAGAAATTCCTCACATACCCTACACACAAAAACAGAAAAGCAAAGGAGCAACCACAGAATGGCAGAGAGAAAATATATGCACTTGGGTCCTGGCCTGGC
AGTGGCCCAGGAGCAGTGGCTACCCACTAGAAGCACTACCAAACTATCCCACGCAGCAGAGCCACTGATAGTGGCAGACTTTTGGCACGGCACAGGACGGCACAGCAGGGCACTACTTGT
GGTGCAACAGGGTGTATTATGGTCGTTAAGTTATTACCTAGAACGTCGCCTCCGTCGCCCGTTGTAGTCGCTGCCATCGTCACCCCGCACAATGCCAAAGTTTACACAGGCGAAAGTTTT
CCGAGGATTCTGAGCATTCCTTGAACAGCCACGCTGCATGGCAGACAGAGAATGTGAGGCATGAATTGTTGTTAATGCTGCCTCCCCACCCCTCCCTTTGCAGCAGTCAGAGTTTGAGAG
CGCGGATTTTCCTTTTGGCGCACAAAAAAGTGCAGCAAGATTTCCAGGGGGTGAGGGGGGAGGACTGTCCGAAGGACACACAGACGGAGAGACGGACAGACATCTAGGCGAAGAAAGGGA
AATAAACTGGCTGCCATTTTTATTAGACAACGCGGCCATTTTAACGACTCCTCGGAGGCGTCTCTGGCGCTGGCGCTGGCGCTGGCATTATGGTAATTAAAATGTGGAAATTCCTTGAAA
ATATCTTGATTGTTTCTTCGCTTTGTGCCGCGTCGTCGCTTTGTGGTGGAATATCTGCTCGAGGAATCTCGAAAGTCTAGCCCTAGGATTGTAGTCCATTGTTGTAAATTAATTGGTCCA
TTTTTGGGGAATATCTTCTCCTTGAATCATTACCAGAATCCAGAATCCCCTGTAAAATCCTTTCTTTGCAGAAACAATGTGGATCTGGCACCGGGAGATCGTTACGTGCGCATTGCCTTC
GCGGAGGCGGCCAAGGAGATTGATCTGGCCATCAACAACACCCTGGACACCCTCTTCTCGAACCGCTCGTCCACGGGGCCACCGAACTATGGGGAGCTGCTGCGGGTGTTCCGCTTCCCC
ACGGGCGAGGCAAGGCAGCTGGCGCGTGCCGCCGAGATATACGAGCGGACCCTGGTCAACATCCGAAAGCACGTGCAGCGGGGAGACAACCTGAGCATGAGCAGCGAGGAGTACGAGTTC
AGGGACCTGCTCTCCAGGGAGCATCTGCATCTGGTGGCGGAGCTGTCGGGCTGCATGGAGCACCGGGAGATGCCGAACTGCACGGACATGTGCTACCACTCGCGCTATCGCAGCATCGAC
GGCACGTGCAACAATCTGATGCATCCCACATGGGGTGCCTCCCTTACGGCCTTCCGCCGCCTGGCGCCGCCTATCTACGAGAACGGATTCAGCATGCCCGTGGGCTGGACAAAGGGCCAG
CTGTATGCCGGCCATCCGAAGCCCAGTGCCAGGCTGGTGTCCACCTCGGTGGTGGCCACCAAGGAGATAACCCCCGACAGCCGGATCACACACATGGTGATGCAGTGGGGCCAGTTCCTG
GACCACGATCTGGACCACGCCATACCCTCGGTGAGCTCGGAGAGCTGGGACGGCATCGACTGCAAGAAGAGCTGCGAGATGGCCCCGCCCTGCTACCCCATCGAAGTGCCACCGAATGAC
CCGCGTGTGAGGAACCGCCGCTGCATCGATGTGGTGCGCTCCAGCGCCATCTGCGGCTCGGGCATGACCTCGCTCTTCTTTGACAGCGTCCAGCACCGCGAGCAGATCAACCAGCTGACA
TCCTACATCGATGCCTCGCAGGTGTACGGCTACAGCACGCCCTTCGCCCAGGAGCTGCGCAATCTGACCGCCGACGAGGGTCTGCTCCGCGTGGGCGTCCACTTCCCCAAGCAGAAGGAC
ATGCTGCCCTTTGCGGCCCCGCAGGACGGCATGGACTGCCGCCGCAATCTCGACGAGAACACCATGAGCTGCTTTGTGTCCGGCGACATACGGGTTAACGAGCAGGTCGGCCTCCTGGCC
ATGCACACCATCTGGATGCGGGAGCACAATCGGCTGGCCACCAAGCTGCGCGAGATCAATCCCCATTGGGACGGCGATACGCTGTACCAGGAGGCCCGCAAGATCGTCGGCGCCCAGATG
CAGCATATCACCTTCAAGCAGTGGCTGCCCCTGATCATCGGCGAGAGTGGCATGCAGCTGCTCGGAGAGTACAAGGGCTACAATCCGCAGCTGAATCCGAGCATTGCCAATGAGTTTGCC
ACAGCTGCCCTGCGCTTCGGCCACACCATCATCAATCCCATCCTGCACCGCCTGAACGAGACCTTCCAGCCCATTCCGCAGGGCCATCTGCTACTCCACAAGGCCTTCTTTGCCCCCTGG
CGCCTGGCCTACGAGGGCGGAGTGGATCCCCTGCTGAGAGGCATGCTGGCGGTGCCCGCGAAGCTGAAGACCCCCGACCAGAACCTCAACACGGAGCTCACGGAGAAGCTGTTCCAGGCG
ACGCATGCGGTGGCCCTGGACCTGGCCGCCATCAACATTCAGCGGGGCCGTGATCACGGCATTCCCGGCTACAATGTCTACAGGAAGTTCTGCAACCTCAGCGTGGCCGAGGACTTTGAG
GATCTCTCGGACATTAGCAATGCGGGAATTCGGCAGAAGATGAAGGAGCTGTATGGTCATCCGGACAACGTGGACGTTTGGTTGGGCGGCATTCTGGAGGATCAGGTGGAGGGCGGCAAG
GTGGGTCCGCTGTTCCAGTGTCTGCTCGTCGAGCAGTTCCGTCGCCTGCGCGACGGCGATCGCTTGTACTACGAGAATCCGGGCGTGTTCCTGCCCGAGCAACTCGTCCAGATCAAGCAG
GCCAACTTCGGACGCGTCCTGTGCGATGTGGGTGACAATTTCGACCAGGTCACAGAGAATGTGTTCATCCTGGCCAAGCATCAGGGCGGCTACAAGAAGTGCGAGGACATTCCCGGCATC
AACCTCTATCTGTGGCAGGACTGCGGCAACTGCAACAGCATGCCCACCATCTTTGACTCCTACATTCCACAGACGTACACCAAGAGGAGCAGTCGCCAGAAGAGAGACCTCCGACAGCCC
AAGGAGAAGGAGCAGGAGGAGGTCCCAGCCACCGAGAGTTACGACAGTCCCTTGGAAGCCCTCTACGATGTCAACGAGGAGCGCGTTAGTGGCCTGGAGGAGCTGATTGGAATCTTCCAG
AAAGAGCTAAAGAAACTGCACAAGAAGCTGCGCAAACTCGAGGACTCCTGCAATGCCGTAGATGCCGAGCCAGTGGCTCAGGTGGTGCAGCTCGCACCAGCACCGGCCCCCGTTGCCCCG
AAGCCCAGGCGCAGTCACTGCGTGGATGACAAGGGAACAACGCGGCTGAACAACGAGGTCTGGTCTCCGGACGTGTGCACCAAGTGCAACTGCTTCCACGGCCAGGTAAACTGTTTGCGG
GAGAAGTGTGGCGAGGTGAGCTGCCCGCCCGGAATCGATCCTCTGACGCCGCCAGAGGCCTGCTGCCCGCACTGCCCGATGCTCAAAGGAGAGCTGCCGTAG

Retrieve as FASTA  
CDS
Send to BLAST
.........1.........2.........3.........4.........5.........6.........7.........8.........9.........0.........1.........2
ATGTGGTGGCGAGGAGTGCTCCTGTTCCACCTGTTCCTGCTGGCGGGCTGGTCGGAGGCCGCCTACTGTCCGACGGGATGCAACTGCTACGAGCGCACCGTGCGCTGCATTCGCGCGAAG
CGCACGACCACTCCGCAAGTGCCCTACGACACCCAAGTTCT
AGATTTGCGTTTCAATCACTTCGAGGAGGTGCCGGCAGACGCTTTCCGCGGCATGGGGCAACTGTCGACGCTGTTCCTG
AACGAGAACGAGCTGGCCCACCTGCAGGATGGGGCCTTCCAGGGGCTGCTCGCCCTGAGGTTCCTCTATCTGAACAATAATCGGCTAAGCCGCTTACCGGCGGCCATCTTCCAGGGATTG
CCGCGCGTGGAGGCGAT
ATATCTGGAGAACAATGACATTTTCCAGCTGCCTGCCGGAGTTTTTGACAATTTGCCACGTCTGAATCGCCTCTTCCTTTACAACAACAAGCTCACCCAACTG
CCGGTGGAGGGATTCAACAAACTGAACAGCCTGAAGCGTCTGCGATTGGATGGGAATGCCATCGACTGCAACTGCGGTGTGTACTCCTTATGGAGACGCTGGCATCTGGATGCCCAGCGT
CAGCTGGTGACCATCTCCTTGACCTGTGCCGAGCCGCAAGCCCTGCAGCGCCAGAGCTTCGCCAGCCTCCAGGAGCAGCACTTCAAGTGTG
CCAAGCCCAATCTCCTGGTGGCCCCACAG
GACTTGCAGACCTTCGCCGGGGAGTCCGTTCAACTCGACTGCGAGGTCACGGGTCTGCCCAAGCCCCAGATCACGTGGATGCACAACACGAACGAGGTCGGCGAGGATCAGGTCAACCGG
GAAATCCTGCTAAGCGGCAGCCTGCTCATCCGCAGCGTGGCAACCACTGACATGGGCATCTATCAGTGCCTGGCCCGCAACGAGATGGGGGAGGTACGCTCCCAGCCCATCCGCCTCGTT
GTCAGCAGCAGCAGCAGCAGCAACCGGAACCCACTGGACAACCCCCACATCGACCCTCGCAGCAATCAGGTATGGGCGGATGCGGATGGGAATGCGAATGCGGATGCAGGAGGTGCAACA
CCCACGCCACCGAGCTTCACCCACCAGCCGCATGACCAAATTGTGGCCCTTCATGGCGCGGGACACGTGCTCCTCGATTGCGCCGCCTCCGGCTGGCCACAGCCGGACATACAATGGTTC
GTCAATGGTCGCCAGCTCGCCCAGTCCACCGCCAGCCTCCAGCTGCAGGCCAACGGCAGCCTGCTCCTCCTGCAGCCCACCCAGCTGACAGCCGGCACGTATCGGTGCGAGGCGAGCAAT
CGCCTGGGCACTGTCCAGGCCACCGCCCGCGTGGAGGTGAAGG
ATTTGCCCGAAATTTTAATGGCACCCCAAAACCAAACAATCAAACTGGGCAAAGCCTTTGTGCTGGAATGCGATGCC
GATGGCAATCCACTGCCCACCATCGACTGGCAGTTCAATGGATCCCCGCTCGCCAGCACCCCGGCCGGAGACCTGCTGCTGGAGAACGAGAACACAGAGCTGGTGGTTAGTGCTGCCCGC
CAGGACCATGCTG
GTGTCTACCGGTGCACGGCGCGCAACGAGAATGGCGAGACGAGTGCCGAGGCAACGATTAAAGTGGAACGCTCCCAGTCCCCGCCTCGGGTGGCCATCGAACCGAGC
AATTTGGTGGCCATTACGGGCACCACCATTGAGCTGCCCTGCCAGGCCGAGCAGCCGGAGGTGGGACTGCAG
ATTTCGTGGCGCCGTGATGGTCGCCTCATCGATCCCAATGTCCAGCTG
ACGGAAAAGTATCAAATAAGCGGCGCCGGCAGTCTGTTCGTGAAGAATGTGACCATCCTCGATGGCGGTCGCTACGAGTGCCAGCTGAAGAACGAATTCGGAAGGGCCTCCGCCTCGGCG
CTGGTTACCATCAG
AAACAATGTGGATCTGGCACCGGGAGATCGTTACGTGCGCATTGCCTTCGCGGAGGCGGCCAAGGAGATTGATCTGGCCATCAACAACACCCTGGACACCCTCTTC
TCGAACCGCTCGTCCACGGGGCCACCGAACTATGGGGAGCTGCTGCGGGTGTTCCGCTTCCCCACGGGCGAGGCAAGGCAGCTGGCGCGTGCCGCCGAGATATACGAGCGGACCCTGGTC
AACATCCGAAAGCACGTGCAGCGGGGAGACAACCTGAGCATGAGCAGCGAGGAGTACGAGTTCAGGGACCTGCTCTCCAGGGAGCATCTGCATCTGGTGGCGGAGCTGTCGGGCTGCATG
GAGCACCGGGAGATGCCGAACTGCACGGACATGTGCTACCACTCGCGCTATCGCAGCATCGACGGCACGTGCAACAATCTGATGCATCCCACATGGGGTGCCTCCCTTACGGCCTTCCGC
CGCCTGGCGCCGCCTATCTACGAGAACGGATTCAGCATGCCCGTGGGCTGGACAAAGGGCCAGCTGTATGCCGGCCATCCGAAGCCCAGTGCCAGGCTGGTGTCCACCTCGGTGGTGGCC
ACCAAGGAGATAACCCCCGACAGCCGGATCACACACATGGTGATGCAGTGGGGCCAGTTCCTGGACCACGATCTGGACCACGCCATACCCTCGGTGAGCTCGGAGAGCTGGGACGGCATC
GACTGCAAGAAGAGCTGCGAGATGGCCCCGCCCTGCTACCCCATCGAAGTGCCACCGAATGACCCGCGTGTGAGGAACCGCCGCTGCATCGATGTGGTGCGCTCCAGCGCCATCTGCGGC
TCGGGCATGACCTCGCTCTTCTTTGACAGCGTCCAGCACCGCGAGCAGATCAACCAGCTGACATCCTACATCGATGCCTCGCAGGTGTACGGCTACAGCACGCCCTTCGCCCAGGAGCTG
CGCAATCTGACCGCCGACGAGGGTCTGCTCCGCGTGGGCGTCCACTTCCCCAAGCAGAAGGACATGCTGCCCTTTGCGGCCCCGCAGGACGGCATGGACTGCCGCCGCAATCTCGACGAG
AACACCATGAGCTGCTTTGTGTCCGGCGACATACGGGTTAACGAGCAGGTCGGCCTCCTGGCCATGCACACCATCTGGATGCGGGAGCACAATCGGCTGGCCACCAAGCTGCGCGAGATC
AATCCCCATTGGGACGGCGATACGCTGTACCAGGAGGCCCGCAAGATCGTCGGCGCCCAGATGCAGCATATCACCTTCAAGCAGTGGCTGCCCCTGATCATCGGCGAGAGTGGCATGCAG
CTGCTCGGAGAGTACAAGGGCTACAATCCGCAGCTGAATCCGAGCATTGCCAATGAGTTTGCCACAGCTGCCCTGCGCTTCGGCCACACCATCATCAATCCCATCCTGCACCGCCTGAAC
GAGACCTTCCAGCCCATTCCGCAGGGCCATCTGCTACTCCACAAGGCCTTCTTTGCCCCCTGGCGCCTGGCCTACGAGGGCGGAGTGGATCCCCTGCTGAGAGGCATGCTGGCGGTGCCC
GCGAAGCTGAAGACCCCCGACCAGAACCTCAACACGGAGCTCACGGAGAAGCTGTTCCAGGCGACGCATGCGGTGGCCCTGGACCTGGCCGCCATCAACATTCAGCGGGGCCGTGATCAC
GGCATTCCCGGCTACAATGTCTACAGGAAGTTCTGCAACCTCAGCGTGGCCGAGGACTTTGAGGATCTCTCGGACATTAGCAATGCGGGAATTCGGCAGAAGATGAAGGAGCTGTATGGT
CATCCGGACAACGTGGACGTTTGGTTGGGCGGCATTCTGGAGGATCAGGTGGAGGGCGGCAAGGTGGGTCCGCTGTTCCAGTGTCTGCTCGTCGAGCAGTTCCGTCGCCTGCGCGACGGC
GATCGCTTGTACTACGAGAATCCGGGCGTGTTCCTGCCCGAGCAACTCGTCCAGATCAAGCAGGCCAACTTCGGACGCGTCCTGTGCGATGTGGGTGACAATTTCGACCAGGTCACAGAG
AATGTGTTCATCCTGGCCAAGCATCAGGGCGGCTACAAGAAGTGCGAGGACATTCCCGGCATCAACCTCTATCTGTGGCAGGACTGCGGCAACTGCAACAGCATGCCCACCATCTTTGAC
TCCTACATTCCACAGACGTACACCAAGAGGAGCAGTCGCCAGAAGAGAGACCTCCGACAGCCCAAGGAGAAGGAGCAGGAGGAGGTCCCAGCCACCGAGAGTTACGACAGTCCCTTGGAA
GCCCTCTACGATGTCAACGAGGAGCGCGTTAGTGGCCTGGAGGAGCTGATTGGAATCTTCCAGAAAGAGCTAAAGAAACTGCACAAGAAGCTGCGCAAACTCGAGGACTCCTGCAATGCC
GTAGATGCCGAGCCAGTGGCTCAGGTGGTGCAGCTCGCACCAGCACCGGCCCCCGTTGCCCCGAAGCCCAGGCGCAGTCACTGCGTGGATGACAAGGGAACAACGCGGCTGAACAACGAG
GTCTGGTCTCCGGACGTGTGCACCAAGTGCAACTGCTTCCACGGCCAGGTAAACTGTTTGCGGGAGAAGTGTGGCGAGGTGAGCTGCCCGCCCGGAATCGATCCTCTGACGCCGCCAGAG
GCCTGCTGCCCGCACTGCCCGATGCTCAAAGGAGAGCTGCCGTAG

Retrieve as FASTA