Entry information : DpspsPxd01
Entry ID 7654
Creation 2010-10-25 (Marcel Zamocky)
Last sequence changes 2016-02-17 (Christophe Dunand)
Sequence status complete
Reviewer Achraf Jemmat
Last annotation changes 2016-02-17 (Achraf Jemmat)
Peroxidase information: DpspsPxd01
Name DpspsPxd01
Class Peroxidasin    [Orthogroup: Pxd001]
Taxonomy Fungi/Metazoa; Metazoa; Bilateria; Ecdysozoa
Organism Drosophila pseudoobscura pseudoobscura    [TaxId: 46245 ]
Cellular localisation N/D
Tissue type N/D
Inducer N/D
Repressor N/D
Best BLASTp hits
Perox score E-value DpspsPxd01
start..stop
S start..stop
DpePxd01 3164 0 1..1529 1..1534
DmPxd-A 2698 0 15..1525 18..1527
DerPxd01 2686 0 19..1525 21..1526
DyaPxd01 2677 0 19..1525 21..1528
Gene structure Fichierperl './assets/cgi-bin/draw_exon.pl' '7654' 'join(1311184..1311344,1328721..1328936,1329048..1329119,1329190..1329431,1329502..1330158,1331745..1331954,1332074..1332252,1332393..1332574,1336518..1339188)' Exons
ExonStart..EndSize ExonStart..EndSize ExonStart..EndSize ExonStart..EndSize
N° 1 1311184..1311344 159 N° 2 1328721..1328936 214 N° 3 1329048..1329119 70 N° 4 1329190..1329431 240
N° 5 1329502..1330158 655 N° 6 1331745..1331954 208 N° 7 1332074..1332252 177 N° 8 1332393..1332574 180
N° 9 1336518..1339188 2669  
join(1311184..1311344,1328721..1328936,1329048..1329119,1329190..1329431,1329502 ..1330158,1331745..1331954,1332074..1332252,1332393..1332574,1336518..1339188)


exon

Literature and cross-references DpspsPxd01
Literature Drosophila 12 genomes consortium (2007) Evolution of genes and genomes on the Drosophila phylogeny. Nature 450:203-218.
Protein ref. UniProtKB:   Q29FB4
DNA ref. GenBank:   CH379067.3 (1311184..1339188)
mRNA ref. GenBank:   XM_001354252.2
Protein sequence: DpspsPxd01
Sequence Properties
first value : protein
second value (mature protein)
Length (aa):   %s   1529 (1509)
PWM (Da):   %s   170971.72 (168558.5)  
PI (pH):   %s   6.18 (6.15) Peptide Signal:   %s   cut: 21 range:21-1529
Sequence
Send to BLAST
Send to Peroxiscan
.........1.........2.........3.........4.........5.........6.........7.........8.........9.........0.........1.........2
MWWRGVLLFHLFLLAGWSEAAYCPTGCNCYERTVRCIRAKRTTTP
QVPYDTQV
LDLRFNHFEEVPADAFRGMGQLSTLFLNENELAHLQDGAFQGLLALRFLYLNNNRLSRLPAAIFQGLPRVEAIYLENNDIFQLPVGVFDNLPRLNRLFLYNNKLTQLPVEGFNKLNSLKR
LRLDGNAIDCNCGVYSLWRRWHLDAQRQLVTISLTCAEPQALQRQSFASLQEQHFK
AKPNLLVAPQDLQTFAGESVQLDCEVTGLPKPQITWMHNTNEVGEDQVNREILLSGSLLIRSVA
TTDMGIYQCLARNEMGEVRSQPIRLVVSSSSSSSNRNPLDNPHIDPSSNQVWADADAGGATPTPPSFTHQPHDQIVALHGAGHVLLDCAASGWPQPDIQWFVNGRQLAQSTASLQLQANG
SLLLLQPTQLTAGTYRCEASNRLGTVQATARVEVK
DLPEILMAPQNQTIKLGKAFVLECDADGNPLPTIDWQFNGSPLASTPSGDLLLENENTELVVSAARQDHAGVYRCTARNENGETS
AEATIKVERSQSPPRVAIEPSNLVAITGTTIELPCQAEQPEVGL
QISWRRDGRLIDPNVQLTEKYQISGAGSLFVKNVTILDGGRYECQLKNEFGRASASALVTRNNVDLAPGDRYVRIA
FAEAAKEIDLAINNTLDTLFSNRSSTGPPNYGELLRVFRFPTGEARQLARAAEIYERTLVNIRKHVQRGDNLSMSSEEYEFRDLLSREHLHLVAELSGCMEHREMPNCTDMCYHSRYRSI
DGTCNNLMHPTWGASLTAFRRLAPPIYENGFSMPVGWTKGQLYAGHPKPSARLVSTSVVATKEITPDSRITHMVMQWGQFLDHDLDHAIPSVSSESWDGIDCKKSCEMAPPCYPIEVPPN
DPRVRNRRCIDVVRSSAICGSGMTSLFFDSVQHREQINQLTSYIDASQVYGYSTPFAQELRNLTADEGLLRVGVHFPKQKDMLPFAAPQDGMDCRRNLDENTMSCFVSGDIRVNEQVGLL
AMHTIWMREHNRLATKLREINPHWDGDTLYQEARKIVGAQMQHITFKQWLPLIIGDSGMQLLGEYKGYNPQLNPSIANEFATAALRFGHTIINPILHRLNETFQPIPQGHLLLHKAFFAP
WRLAYEGGVDPLLRGMLAVPAKLKTPDQNLNTELTEKLFQATHAVALDLAAINIQRGRDHGIPGYNVYRKFCNLSVAEDFEDLSDISNAGIRQKMKELYGHPDNVDVWLGGILEDQVEGG
KVGPLFQCLLVEQFRRLRDGDRLYYENPGVFLPEQLVQIKQANFGRVLCDVGDNFDQVTENVFILAKHQGGYKKCEDIPGINLYLWQDCGNCNSMPTIFDSYIPQTYTKRSSRQKRDLRQ
PKEKEQEEVPATESYDSPLEALYDVNEERVSGLEELIGSFQKELKKLHKKLRKLEDSCNAVDAEPVAQVVQLAPAPAPVAPKPRRSHCVDDKGTTRLNNEVWSPDVCTKCNCFHGQVNCL
REKCGEVSCPPGIDPLTPPEACCPHCPMLKGELP

Retrieve as FASTA  
Remarks complete sequence from genomic (Chromo XR, 8 introns). No EST.
DNA
Send to BLAST
.........1.........2.........3.........4.........5.........6.........7.........8.........9.........0.........1.........2
ATGTGGTGGCGAGGAGTGCTCCTGTTCCACCTGTTCCTGCTGGCGGGCTGGTCGGAGGCCGCCTACTGTCCGACGGGATGCAACTGCTACGAGCGCACCGTGCGCTGCATTCGCGCGAAG
CGCACGACCACTCCGCAAGTGCCCTACGACACCCAAGTTCT
CTGTGAGTATTCCTCTTCATCCTTTTCCGGGGGATTGGGGCTCTGTTCTGCATAAATCATTCTGGCTGTTGGCTTTCCA
AATTCCAAATTCCATAAATTAGTTTTTAATGTTTTCCGTTGGCCCTCTGCAGGCGGCAGTGGCGTTAATTTGATGCCCTGCCCCATCTGTGCCAGCATCATCTTTATCTCCATGCATCCA
ACCTAACCTCCAACCACCATCAACACCCCCACTCCCACCCCCACCATCGTGTTGACTCAATTAAACGGATGTCTAGAGAGAGAGCTGCCTGGGTTCTGGGCTCTGGGTTCTGGATTCTGT
GTTCTGTGTTCTGTGCTCTGTGCCCTGGGATTTGTATATTTATCGCACGCCATGCCATGCCCCACACAAAGCGATTGGAGGCGACGCAAACACAACGCGACAACGCGACTACGCGACTCG
CAATATTTGCATGCACGCGGAATACTTCTGACGTTGCATTTATTTTCCGAAATGTCAGCAGCTTTCGCATGAATTTCGGGAGCCACACCACCAGAATCCGAATCCTCGAAGCGGATTTGT
GGGTGGATGGAGCGGCGGATAACTTGGCAAATGTTTGTGGAGCATATGCATATTTATATAGGCCGCGGCATGCTTTCTCCTCCTGCCCCTCCATTTGGGGCAGCATCTGTGCTGTGCTCT
GCTGTGCAACGGTAGCAGCTGCCACAGCCGGCGGCAAGTTGTCAGACGACAGATAGAAAGAGAGACAGCTCTGCTCTGGAGGTGGAATTTCTCCTGGTATTTATGGGGGAATTCAATTTT
CATTGGGTTAAGTTGTTGAGCGAACGATGGAGGATGTACACGAATAAACTGACAGATATTCGCACATTCTGTCAGCCAGAGAATGGGCTCCTTCTGTACGGTGCTGCAGTGTGCTGCACA
GTGGCTACAGAGACAGGTTTTACCGGATGAAAGATCTATACAGATCCAAGAGAAAGAAGAAGTTCTGAGCTTGAGGAAAAACCCAGTGATGGCAGATTTCAAATTGAAAACAAGGGGCAG
GTCCATATTCACCATCACCATACAACATGGAGATATGTATCTACCAAATTCATCGGGAACCTGCTGCATTTCGGCAGTCATTCCCAGCCCCATCCCATAAGCTCCACATAACTGGCATAA
GCAAAAAGAATTCTTTGTTTATCCATCATCCGAAGTCGCATACTGCAAATCCGCACTCCATATCATTCCTCTGATCCTCTACCATTTGTTGGCTCGCCGCACAACAATTTCTCAAAGAAG
TGAAACCCGAAACCACACCCACCCCCACAAAAAAGGAGTAGCCACAGAGAGCACAGACGGAGAGAGGCACTTTTGGCAGTACTTAAATTAATCTTCATTTTAAATAAGGTGATGGCTGCA
TAATTTATCATTGCCCCGAGAGCTGAATGCACACAAAACACTCACACAACAATAGGAAAAATTCCAAAAGAAGGAGAAGGAGAGGGAGCAACAGAGACGGGGCGAGTGGTGGTGTGGATG
CATTCAGCAACTTTTCTGATTCCATTCTGATTCGCATTTCATTTATTCCGCATCCGACCCGTTGCACACTTTTAATTAATTTGTTTCATTTATAACGAGTACGAGTACGAGTACGACTAC
TGCATGTGTCTAAAAAAGTTTCCCTGGTTTTTCCTCCAGCTTGTTCTTCCCCTTTGCCCCCCTTTCATGTGGTGTGTGCTTTCCATGGGCAGCAGCCCACATGTTTATGTGCGGCGAGTC
GACTCGAGTCGGGTCAGATTTTCTCCATTTATATTTCATATTTTCTCAAAATGTTCACTCGAATGCCTCCTAAAATAAAATACAAAAAAAAAACAACCAAATGCTCGCATGTACTCGTAC
TAGTCTCCAGGCAGAGGCAGTGATAGACATTATATTTTTGCATTTGTTTGTGCTATCCGATGCGCATGGAAATGTTTTTGCCTTAATTTTGTTAATGGCCAAAATGTTGTTTATAATTTT
CGTTGCTGGAAGATGCTTTCGGGTCGGGGCTCTCGAGATAGATGTCTGACAGAAAATAAAATGTAAAGACACAAAGTCAAATAAAATAGTTTATTCAAACGAATTCCTCAAAAATGTGTT
TTATTTGAACTCCAGAAACCTCTTGGATCCACCATTGCCAAAGCAAAGAATTCTTGGCTCAAAAGGCTACAGGAAAATGTATCTTAAGCCAGAAATTCTCCTGAAATTTGTTGTAGAAGA
GATTGAGAGCCAGATTTGATATCTTTCGAAGCTTTGGGGTTCTATTTTTGCATATCCTGCAATGTTCGGATTTTAAGAGTATATTCAAATGAATGAGAGCTTCCATGTTATAGTCCTACT
TCTAATGTGATATCTTGCTTGCAAGTCTACACTTGCTTCGGGGAAAATATCCTGTATATTGCTATCCGCACTTCATTGAAAAAGCCTTCCTCTCCAGCTGCTGCTGCTGCTGCAGCACCA
ACGCCAGCTCAGGGCCTAAGCCACATTTCACTTCCTTGAAACTGTCCATCCATCAAGCATAAAGTCCTCTGCCTTTTTTTCAGCTTTTGCCTCCATTGAAAGTTCCGCTTTCAATGGCTG
CTGGGTCGTAAAAAAGAACTGCAACTCACTCTGCCTGCCTCTGCCCCATCTCAGCCCCTTCCCTTGCCCTCCGTGCCCTTCGGGCCCCAACAAAAGGCCAAACATTTTGGCTCTTTCCGT
GTGCATTTTGAACTTGCAACTAAATGGAAGTCCGAGCCGAGACTCTGTAACATATGCGAGCGGCGGCCGGCAACTTGCATCATTCATCTTGCCACACGCCACTCCGCCCACCCACACACA
TGGCGCTTGGCGCTGCGAAGAGGAGGCTGCCGTTTTTTTTTGCTCCCTTTCCATCGGAGCTAAATAAATGCACTTGATTTCATTTCATGATGATGCATTTTCCGCTGACCTCCCATAAAC
CACACACATTTCTTGCCTCCTCGGAGGCAGTTGCTTTGTCATTATACGAGCGGGGCAAGCCACACTCGAAGCCGTGCCTCCGCCCCTACTTTCTTTGCATAGAAGAGCGAACCACATAAA
ATACAGATGGAGCCCGTGCGGCAGATGAACAAAATCTATTATGCACTGAGAGAAACGGTACTATCTTTGCATTCCAAAAGGAACTTTAAACGTGAAACCCCCTCAAATATATATACCTTC
CTTTTGCCAGTCAGATATGGGCATTTCCATTTCATTAGCTCTGGGATGGGGTTCATCTGCGAGGCCATAAATTATCTCTATTGGTTATTTTTATAATTGAATTAAACACCGGTCGTGCCA
TATCATATCACCTCATAGTTTCCCATAAATCACTTCCAAAAAGGAAACTTAGTTGGCTTTTTGCAAGGACTTTGAGGCAACAGTTTGGCGTAGAGGAAAACACGACCAAAAAAAGGGAAC
TTTTGCTTGAGAGGCTTTCAGATCAAAGTTAATGATATAATATGCCAAGTGCTTGATTTAGTTAAGTTTCATTCAATTTGGAAAATAATATTGGAGCAAAGGCATATGGAAATGGGTTTA
ATATTAGCTTGGTCTCGGCTAACTTTTCCATGGAACTTTTCCATATATGCATTGACGTATCGAATATTTTTCAAACATTTGCATTGTTCTTTTTCTTTTTATTGTTTTTGCAAGTGTGGC
CTCTGGTCTTTGCCTCTTGGAGGCTCGGCAGCGGCGGCAGCGCCTTGTGCAATCTAATAACTTCGGGCATTATGCACATATGAAATATATCTCATCTAATGTCTGCCCCTCCCCCTCCCG
TTCCCTCTCTGTCTGCCTGCCTGTGTGGGTGGCTTTGCTGTTGGAACAAAAATCGCCTCACGTTGCCATATGCCGGGCATATTAAAAACTTTGATTAGTGTCCAGTTCGCTGGCTCTTGC
CACGCCCACGCCCACGCCCATGCCCACACCACACACCACACACACATGCATTGTTCTGCATAATATATGGCCACTACATTGCGATGTACATTGTACATACATATAGTATTTGCTTGGATT
TTTGTTTGTGAGTTTTGTGAATCAACTATTTGGGAAAAGTATTTTTTGTTTGCCCAAAAATTGTTTCAATCTGCCCTCCCGCTCCAATACGCTGGCTGGCTGGCTGGCTGGTTGGGTGGC
TGTTTTCTTTCTCCCTCTTCGGCTGCTCGCTGTTTTTCTTCTCTTTCTATTTCGCTGTGTGCCCCCCTCTCTCTTTCTCCCATTCTCTCTGTGAATTCCTCATTTCGTTTTTGGTGCATT
TATGCGTGACAAATGCCTTGTATTTTGTGTAGCGCTCTACGGGTCGTCGTCTCCATATCCGGGCTCCTGGTTCCTGGTTCCAGGATCTCCTCCTCCTCCTCCTCCTTCTGCCATTCCCCC
TGTTCCCTCTGTTATTAAATATGCATAAATTTCCTTTTTGCACATAGCATTCGTTGCTGCTGCTGTTGTTTTTAGTTGTTGGAATTTTGCATTTACATTTGCTGTTTATTTGTCGGTGGA
AATATTGAAATTGTTGCCAATGGATGTCCTAGCAGCAGCGGCAGCAGCAGCTTCTGGCTCCTGCCTGCATTTCCTGGGGGACTCGAAGTGAAATGATATCCAGGGTTTTCCGTTCAGGTG
TGGTTCTTGTTGGAATAATTACAGCAGAATTCGAATCCGAATCAGAATCAGCAGCAGCTGTGCCTCTGGCTGTGGCTGTATCTGTATCTGTGGCTGTGGCTGGGGCTGTCTTTGTGGAAT
ATGCTCAAATATTGTATCTTCTGGATGGCAGGAAGCTGTTGCAAATTGTCTAAAGATTGCCTAATTTGCTTGTAATATTTGATTTGATTTAATATGCATCCCTCAAATGCATTTCATACG
GATTTAATTAGTTTTTAGCTTCAATTGATGCGAGCATCAAATGTTAATTGAATGATGAATGCTTATGGGATTATTATTAGGAAATACTCAAAGAATACCAGATGTTCATTTAACAAATTA
GTTTACACACTCGAAGAACTGCTCCCCATCTCCCTCTGTTTCTCCCTCTCCCTCTCACTCTCTGTCTTCCTGTGGCAGATCCATCCATCTCTGTTTGCGTTTAGTCCCTGGCAATAAATC
CCTCTAAATGCAATTGCCAATTCAATTACCAATCAATCAAAGAGCATTCAAACTTCGTCCTTTGGAGCTAGCAATCCCCCTCCCCTGACACACACACACACGCACTCCTTGTCTGGCTCC
TCCTCTCCTCTCCCTCTCCTCGCCCCCGTTTCGTTTGCAATACAATATCATTGCTGTAAGAGCTTTTTTCGGCTGCAGTTGCAGTGTTTGGTTTTTGTGGCATTCAGTCTGACTTTTCGA
TTGTTAAATATTTGCAGGGCGGGCAACAAAAGCCCGACACGGCACTGAAAACAAAATGAATCCGATTCTGTTGGCCAGAATGTGCCACTCAATCGGTAATATGATTTGTAATTATGCAAA
TGAAGCCGTCCCCCAGTGCCCCACCAATAGGGAAAGGGCAACAAGAAAAGCAGAGACGGAGAGAGAGTCAAATGGGAGTTACATTTTTGGATGATTAAAGCTGAATGAGAAAAGTTGTCC
GCCTCAAAGGGAAAGGCAGAGGCAGGGCAGGGCAGGGCAGAGGCAGGGCAGGGCAGGGCAGGGGGAAAGGGTCCCCAGCGGCAGGGCAAAGTTTTTGCCAATCAAAGCGTATGCATTTTT
AATTCACTGCCAAATGCAGATGGAATGAAGGAGATGGCTATCGAAGAGAGATAGAGAGAGGGAGACCAACGTGAAGGATGCCGAATCTCGAGTTGCATCGAGAGAGCGGGAGCCCCAATG
CGATTCTGTCAAGGAGCCTGTCCGACTGTCTATCTCTCTGTCTCTGTGCCTGCCTGCCAGGCTGTTCGCTTCACTTTGACATAGAAAAATCTTCTATTGATTTCGCCTTGCCACATGCAG
CATGCAGCATGGCTGCCCCCAATCGTATCAGCCATCCCTCTTTTGGCTCCCTCTTCAGCCCCGCCTTCTCCCCCACTCCCCCACGTCTCGTCACGCTGTGCAATTTGCATTTAAAGTTTA
GCTTCTTTCTTTTTGCCATTCTCCCTTTTGCATTCCTGCCAATTCCTTTGCTTTAATGCTACAAAGTAGCACTTGCTTCATTGCCATCGTCCACGCCCCCAGCCCCCAGCCCCCAGCCCT
CAGCCCTCAGCCTCTGGCCAACGCCCACACGCTTTGTCATTTCGCATGCTCGTTAGTGCAGAGCAAAGAATGCAAAATAGCAAGATACTGGCGGGATAGTAGGTGGCAGGAGGTGGCAGG
AGATAACATGCAACACCACCAACATCATCGCTACACAAACACATGCCACAATTTATAAGCAAAGTTATTTTGCCCCAGCAGAAAAAAACACGAGGTGGGAAAGGGCTGGGCCAGGGGCTG
GGGCAGAGCTCCGTTGCCCCGATAGCAACCGGCAACATGTCAAAGTGGAGCAATCGGCATGTGCAACGGACACGGAGCAGAACGGACACAGACACGGACACGGATTCGGACTTGGAGTCG
GAGTCGGAATCGGACTCTGTCGGAGGCAACTGTCGGTCAAGCATAAGAAACAAGCAGTAGAAGCGGTCCATAAGTATGCAATGGCAAAAGAGAGAAAAACACAGAGACCCAGAGAGACGG
AGAGACAGAGAGTGTATATACGACTAGGGGATACCTTAAAAACAAAGGGAAACAGTATCTGTGATAGCTTTGGTAAATACCAGCCTTATGTATCACCATATCCTTAAAAGTGTATAAAGG
TCAAAGACGAAGTAACCCGATAGAAGAGATGCAGAGACCTTTGGATAAGTTCAAAAGAGCCAGTATGGATCTTAAATATTTGTAGACCCGTACGGCTGCTGCCTCCTGCCTGTGGCATAC
CCCAAGCAGCACTCCGGAATTACCTTACCCGTTTTAAGAGAAAGGTAGAATTCTGGAGGAGCCCAAAAGCGGGCGAATGAAAAGCATTGAAAAAGCAAAGTCCTGGAATGCTGAAACCTG
CGCTGAGAGATGCCGCTGCCGATGCCGTGGCCGATGCTGGCCATCATGTGGAGCCGACCAGGTAAAGTCAACAACAGAGTCAGAGTGGCAGCAGGGCGAGTGGAAAAAATGTAAGAAAAG
ATGCCACTGCCACTGGTAGGGCTACTGCCCTGCCGCTCACTGCTCGACGAGGCGGCAAACTAACCCCACCCCCGTCCAGAGCAGGTCCAGGGCCAAGTGGCAGACCACGTAGCCAAGGCA
CCGGACAGATGCATAGCCGTAATAATGTTAATGTAAATGTTGCAAGTGGAGCAGGTACAGGGGCACAGGGGTACAGGGGCACACGGGGCAAGTCGAGTGGCCATCAGTTGGTCCGAGCTG
CTGCTGCTGTTGAAGATTCTGGCAGATATTAACATGGCGTATACGTAATGGTTCCGAATTAGTGGACTGAATTATCTTTAATACTTATGCTATCTATAATATAGTGGTTATTTATAAGTA
TAGCGGGGGGTTCTTTGGAGTTTCAACTAGCATTCTCGACTTACCTCTGCTCCGTGAGAAGAAGTCGAGCAAAAGGGGATGGGAAGATCACGCTTTGCGACGGTCCACAAGAAAATTGTA
AAAGGTGGTCCGCTAACTCCCGTTCGGGACTTATTTCACGCAAGAGTGTTCGCACTAGGGAGAGCGCTGATGAGAAAGAGAAGAACGAAGCCTGTTGCTGTACGACCCGAAAGGAGAGCG
CGTGGCGAGAGCGGACCACCTTCTATACCACCTGCGGGTAGTGGCGTTTTGGTGCTTAAGCGGAAGAATACAATGATGAGCAAAGGAATAGCGAAAGGTATGGCTCAAATGACTATGAAC
TATAGAAGGCTCAAATAAGGGGCTAAACGATCCTCAAAAGCTTATTTGGCTCAGGTTCAATACCACATACCCCGTACTCGCTCAAATGCATTGCAATGGGGGAAATTGCCTTTAAAAGAA
TTGGAAAAGAGTTTGAAAATAGCCGAAATACACAATAAAAGAACTTAAATTTAATATCCCCCCCCACCAATGCCACGCCCTAATCTAATACCCAGAACACACACACAGTGCAAAAAAATA
CAATATATGTAGGAGGACTGGCATAATATTCGGCTTTTATCCGCATATTAAACCATTTCCACTTGAATGGAGAACTGATTTCATTGCGTTTTTCTGCTCGTACGATCCATATATCGTACG
CAATGCTCCTGTACATGTACATACATATATCCACTCCACTTCACGTGCATTGTGTGTGTGCTTCCCTTTTTGGCTCTTCGAATATATAAAATATATCCTAGTGCCAGATGGGAGAAGGCG
GAAGGCAGAGGAGACAAGGGAGGGGCATATAAATCTGCTGTATAATTGAATTTCACACGCTGCCAAAATATCAAATAAAAAATAAATAAAACCCAAAAGCAACGAAAGTTTAAATAAAAT
TTAGTTCAACAAAAATACCCAAGCGGTGAAGGGTGGGGAGGGGGGGATGGGAATGGCGGAGTGTGTGCTGCCTGAAAATGTGAGAATTATAACCGAACTGAACCGAACCACCTCCCCACT
CTCCCCCAACGGGCACCCTTCTTCTGGTAGCCACTTTTAACCATGGAGGTGTTGACCGAACTGCAAGCAGCTTAAGCAGCATGCCACAGCAGTCACCAGTCACCAGGCAACAGAGCAACC
CGTCGGTGGAGGGGGGTACGAGGAGTATGAAATCCTGTGGAGAATGCAGATGGGAATGAAAGTCCTGAGGAGATTGCAGATGGGAATTGAAGGTCCTCAGGAGGAGGGGGACGGGACTTG
CTCTGGAAATGGGAATGGGAATACTCGCCTGTATGCTCGCTCCTTTAGTCGGACGGCAATTGCATGTGTGGCAAGTGCTCTTGTGAGACATTTACACCTGTAAAAACATTTGGACCAGAG
GAGGGTAGGAGGAAGAGGAAGAAGCAGCGGCAGCAGCAGCAGCAGCCGGGGGCTGGAGGAACAGGAAGAGCATGGAGCTAAGGGGAGCATAAGTAATATTGGGAGTAGAAGAAATAAGAG
CACGGAATGGAATGGAGAAGGAGAATCATTAGCAGGATAGGCAGCAGTGCCTGCGGGGGGGGGGGGAGCACCAGTAAGGGAAGCAGTACCTGGAGACCAGAGAATGGTGATGGCTGATGG
CACAACACATTAGCAGCCCAAGAAAACAGAAAAGTGACGCTCGAGCTGTGACAAGAGCAGTCCTCGTGTGTAAAGAACAGAACAGAGCAGAACAGAACAGAACAGAACAGAACAGAACAC
TCTATAAATAGAACTAAGGACACCTAAGGACGATCTAAGGGGAGAACTAGGAGAGCTGCAGACACCCAAGGAGTGCAGCACGAGGAGAGTGCACTGCCTCAGATGACCAGCAATTTGGAA
AGGCTTCGAAATTTGTTGCTTGGGAATTTCCTTGCTTTATCTCCAGCTGGTTTTCTGGTTTCAGGTTCGACTTTTCCTACTGCTTTTCTTTGGGAGTTGCCCTCCGCTGCTCCCGCCCTG
CTCCTGCCCTGCTCCTCCTCCTGCTGTTGGGGAGTTCTTCTTTTTTCGAGTTTTTTCAGTGGCAGCCCAAAAGTATGCAACGAAAGTTTAGATTCCACCCCTCCCCCCCACCGCACGCTC
GCTCCGACTCTCGGAAGTGAATTTTCCGCACGTGGCCTAACTTTTGCTGCCTGGCTCCGTGGCCCCTCCCAGGACGCACCATTCTGGCGTGTCATGCAGTCAGCAATCTTTATCCAACCC
CAAACCCCCAAACCCCCCTCGCACTCCCACTCGACTGCACGCCCCTTTGAGTGGGCGCCTTTCATTCTTTTCTTGCGGATTTTCCTCGAGGATTGCATTTAACGCAGAAACAGGAAGGAA
GGAAATGCCATGTCGACGCTAAAACTGGGTTAGCTTACATAACGACAACATTAACGACAACGTTTTCCTTCCTCTCAAACAGGGGGTCGGCGGGGGCAGGGCAGGGCAGATGGAGGGGGT
GGAGCTGAATGAAGCCATGTAGGAGGGCGTTCGCTTCACGCTTCGTCGCTTATTGGACATAATTACTGAATATAACAATTCTGTGGTGTGTGCGGCATGGGGCATGGACTCCGCCTGCCA
TAGCCACTTGTGCGTAACCGCAGCAGCGACAGATAACTGCAACAAAAGCAAAGTCCGGGGCATGCGGCCAGCCAGCGATGGTGTGCTGGGGCCAAAGGACCCACATATCAATTCCGTGTG
AGCGGACATCGTGCAACTTCTTGTTGCAGCAGTTCCCGGCAGGACAGGAGCACCGCCCCGGTTCCTGGTCAGTGGAGAAGAGCAGGCATCCGTTGCCTTTATCGCTTACACTTGGCCAAA
GCCCCGCTTTTATGCTTCGTTTTTTTTTTGTGTTTGGCATTGATTCAACGGAATCCTTAAAGTTGAACAGTTTTAGAGTTGGATTTGGGGTTTGGGTTTCGGGTTTCGGGTCTCCGATTT
CCGATTTCGAATTTTGGGTCTGGGCTGGGAGCGTAATGCCAGCGAAACTTTTGGCCCTTTTGTCACTTTTGGAACAATGTTAATTGCTTTAGAGTTTACTTTTCGACTTTAGCTTCCCGG
GATAAAGGGGGGCGGGGGCGATGGGCGATGGGCGGTGGGCGGTGGGCGGATGTCCCAAGGCATGCATTAAATTGTGCTGATCTGCTGCATGCCATGGTTTGGCTCTTTTGTTGCTCTGTT
CGAATGGAAATGGGAGCTAAACAAAGCGTTTTATGCATGCACACTCCAAGCCAATTACGCAAAGTGCTCTAAGTATCTCAAACAGCAGCAGCAGCAGCAGCCGGGGGCAGCAACCGACCC
AAAACTGGACACAATTACATTAACAGTTTAGCACACACACACCGATAGAGGGAGAGAGGGAGTGTGTGTGTGTGTGCAAGGATTGAGGGCAAAACAAACAGAATGAAATTAAATCAAAGG
CTCGAAAATAAGCTCTGGAAAAGGAGTGCTGCTGCTGCCTGCCTGCTGCAGTAACCAACAGTAACCAACAGTAACCAACGGTAACTAACTGGGCTAAAAGCTGTTGGCTTGTCTTGGCCC
CTTGTGCCTCTCTGCCGCTGCTATCGTCCCTCAGCCTCTGCCTCTGCCACTTGCTGAAGCTGTCAGACTGACAAAGTGACTGCCTGACTGACAGCTGAACTGTTACACACACCAACACAC
CAACACACACACACAGAGAACAACACAAATAGACGACTGCACACAAATAAACAATGTCAAGAGTTAAAAGCGGTAGGGTAGGGCGTGGAGGTGGGGGTGGCACTGTGCACAGCTTTTGTT
TAACAATAATAAAATGGCACTACTATAGAAAAAACAAAAAATAAGAACAGCCTGCAGGCGAGTGGCTCGACCAAGAGGATACCCCAGGGCCCAACATGGATGATCTTGCATCGTACATTA
TCGGCACTCCAAAGCCGAGTCCTATGGCTCCATAGGATGGATCACGACTATGGTTAATAATCCAGATTGTATACCCTTTCTGGGATCGTCTTGAACACAACATACCCTTGGTAGCCCTGG
AAATACCCGCCAGAGGCAAAGGCAAAGGGCAGCAGGATCCGCTCAGTGCAATTGGGATCTGTGGTATAGGCTGAAGGATTGCATTACCAAGTGTTTTACACACACACACACACAACACAG
AGACAAGCAGAAAGACAAAGACAATTGTCATGGGTCCTTCGGACTATCGTCAGGTGTAAGGGGGGGGAGGAGGCCATCACCTGGGCAGGGCAACCCCAGCCAACAGAAGGACACACGGGT
ATCCGCAGATGCAAGGATGAGCTCCGCCATTGTAGGGCACTTTTATCCATTGTTGTTGTTGCGGCTGCCACTTTTTGTTGCAGCTTCCTTGCTGCGTGTTTGTAAGCCATGTTATTACAA
TAATGTGTCTCCAAGTGTGTGTGCGTGTGCGTGCATGCGTGTGTGTTTGCGGCATCCTTCCATCCGTCTGTGTATCTGTGCATCTGTGTGGGTTGTGGAGTGTGGAGTGCGGAGTGTGTG
GCAGGCCAAAGTTAAATCATGTTTTTATTGTCACATTTGCCATAAAGGCAGCCCAGGGACCAGAGCCAGAGCCAAAGCACACCAGCAGCAGCAGCAACAGCAGCAGCAGACCGTAAAAGG
ATTTGGATCGGCTTGCAGGAATGGGAGCAGGGGCACAAGGCGTCATAAGATGCTGCCTAAGCTGAAGGCCCACCTGAGAGACGGAGAGACGGAGAGGCAGGGGAGGTACATATAGGATGT
AAGCCAGACAAATCATAATCAGAGCCAACAACCAGGCCACGCAAGGCGAAAGCAAAAGGCAAGTCCCAACAAAGCACAATTTCACCTTAACAACGAGGCATGGCCGAGAGGGGAGAGAGG
GACTAAAGGGGGGTTGATCCCACTGATAAGGCAGCAGCGGCAGTAGTTGCACCCTCCGACGCAGCCGCCCCACGTTTATAGCCTTGTTTATGTTGCATACTTTATGGCGCTTCAAGGGGT
TTCGAACGGGAACAGAGGCAGAAAGATACTCCGGGGGGAGGCAGGAGCGTGTGGCGCACGCGGAAAAAATGGCAGAGAGTCTAATGAAAATGCAATGAACTAGAAAATAAACTGACTCAA
TGGATAAAAGCCAACTAACGTTGAAAGCGGAACAGCGCGCAGCGAACAGCAAACCAATTTCAATGCAGGATGCAGGAGCCATAGTTACGAGCATAAAAATTAAACATCTGCTCTGTTTAT
TGCCGGTCTTATATTGTATGATGGCAGGGGCGTTATCTTTCGATATTACAATAGCCACAAATGGCAAATGGAGAGTCCTTAGAGGCTCTTTTCATGAGCCACAAACTAAGAAGGAATCGA
AGGACATAGAGCTGTTGCTCCGACGTAGCTTTTCAGTGAAGAAAAGCCCAACACTACTTCACACTCCTGCCCTAGGAAGGGAGTTTCCACTCGAAAACTCTTCCAAAATGGGGCTTTTAG
TCCCTAGAGAAACTTTGAACAATCGAAATGCCAATTCGAGTGCATGGAAGACCACAGTTCTGGCCAAAAACGACAATACAATTGAGATCTCTAAGCGAATGCCACAAACATTTACTGTCA
AATGTGCGCCGCCACAAACAGTTAATGCCACACCTCCCCCACACGCCCGCCACACGCACAGTCTTGCCGCACTCTTGTCGCTGCATGAAACATGCAATGTGGAGGGCCGCTCTCTAATTT
ACTTGCTGTGGTTTTTCCTTCACTAATTGTGACAACTAATTGAAATCAAATTAGTGGAAGCCACTAAACAGCTTAACGCTCGCTGGGCATGTATTTGATTGTGTCTCTCCCGATCTCCCC
CTTGTTGCATGCTGCATGTTGCATGGAGTGCCAATTGTTTTCATTCAATTTACTGTCGGAGGTGGCTTTAAAATATCGCTCTTGGCTTCAGTTTTGGCTAGCCATAAATTTTCCTCAACG
GACATCAAAGCGTATACGTAATGGCCCACCCAGTGTGCGTGTGTGTGCTTATGAGAGTGTGTGCGTGTGCAGGCATGCGTGAATAGACAAACATGCGAGAGAGAGAGCGAGAGAGAGAGA
GAGCAGGCAAATGCCTTTGTCAATAATATGCATTAATTAAGTGCAGACATGGACACCGGGGACACAGTCCACAGTTCACAGTCCACAGTCCGGCGTCCACAAGCCAGCAAGGAGTCGACG
GAGCAATGGAGCCACACAGCAGGGTTGCAGCACGGTTACGGATACGGCTACGGATACGGATACGGGTAGGGGCACGGGCACGGGCACCGACACAGAGGCACCGACACAGAGGCAGGTGCC
GTACCGATTCGAACCGTGCCGCACCGGACACAGCTTGGCACTCGCTTTATGCATTTCAAATTTTAATTGAAAATATCGTCCGAATTTAATGCAAAAATGCAAATGTCATTATCCAAACAA
AAACAAAAAAATAGCCATTTTCCAAAAGCTGCAGCAGTGGCGGAAGTGTCACAGCATTGAATAGTAGTTTCAAGATTCAATTTCCTTTTCCGCCTGCCACGCACACATGGACACATAGAC
ACACGGACACACATGGATGGGGCAGGTGGGCCTGCTGGTGGGCTTAGGGCGTGGTCTGCCTGGAAGAATGAAAGTTATTTGTGCCGCCTTTGAGCATCTGTTCGGTTGGGACTCGCTTCA
GTTCTCTCTTTTTGCATGGCCGCTACTCGACAAATAGTTGCGCCACTGACACGACGCCAACTATGCGGCCTGCCACTGCCACTTGCCACTGGCAGTACCTCTGCCGCAGAACAGGAAGAG
GCATAGGGAACATCTGCAGCTGCTAGGCTCGTCGCCTGTCGACCGTCGACCGTCGGCTTTGTGGACTCTGGACTCCACTCCGGACAGGATGGCGTATGAGTGTGTGTGCACCATTGAAGT
GGACCATCGTTTGTCACACGCGTCATGGGTGGGCAGGGGTGGCAGAGACAGAGGCAGAACAGTGGTGGCACAGTGGCATGGAAAAGGGTCCGGTTGCACTTTGCCTGAACAAATAAAACA
GAGATGATAGCCGGTGTCTTCCAGGGATAAAAACGAGCTCAGAATGTTGATTCAAATTAGTCAGATAGTTATGGTGCCACACTCCACATCCCCCCCACAGCCCCACCCACTCGCTTGACC
CAATGTGGCAGAGGACTGCCGCTGCTGCTGCCTCGTGTGGATTGTTGGGTGGTTGGGGCTAATGGAGTTTGCAAGAAGTGGGCGCAAAATGTGGATCAAGCAGCCGCATTCGCACTCGCA
CAAAGGGTTGCTCTTCGCCTTCTGCTGCTGCTGCTGCCACTCCTGCAACATAGTGCGGCTCCTGCTCCTGCTCCTTCTTAGAGAGAGAAAGAGAGAGAGAGAGAAAGAAATGCGAGCCTT
TGCAAATTGTTGCTGCACCCGCATCCCAGTCGCCATCGGTCTGGACTGCCTCTGCCACTGCCTCGCCCCACTCCACAGCCACCGCAACGTGCATGTGGCTGCCACACAAGAGAGATGTGG
ATAGAGAGTGTAAGAGAGAGGTTACCACATGGATACCAGTGGGTCGACTAGCCTCAGGACTTGGGTCTTTCATCAGGCCAGAGCCGGAGCCGGAGCCAGAGCCAGAGCCAAGACCGAAAC
CAGATAGCAGGAGGCGGAAGGCAGGAGGCAGGAGCCAGGAGCCAGGGAGTCGCCGTGGCAGCGGCAGTGGCGGCTTTGTTTATGTGTGGCTCCATTTGGGCGTATGCATGTCCTTCTCCT
ACGGCTTCTCTCCCTTGGATTTACTCCATGGTGGTTGCATGCCCCCTGCCCCTGCCCCTGTCCCTGCCCCTGTCCCTGCCCCTGCCTCAAGCTCTACGTGCGAGTGGTTTTTGTGTTGTT
CTGGATTTTTAAGTTGTTTTCGTTGTCGTTGGTTGTTTGTGATGTTGTTTTTTGTTGTTTTTGCTTTTTTTTGTGTGCGTTCATCTCCTTTTTTGTGTGTGCCTTTCCTTGTGTTGCATC
CACGTCTTTAGTTGCAATTTGAGCATACCAACAAACATATGCACTTAACACACACGAACATGGACACGAGCACGGACACGGAGCACTCACACGGAGAAAGGATGAACGCTGACACCAATT
AGTGCCTCCTGCTGCTGCTGCTGCTGCCGCTGCTGTTGCAAGTTACTGCTGCTGCTGCAAGTTAGTTAGTTGCTGGGTTGTTTGTGCGTTGCATGTTTACACTAGCCACAAAACATAATT
TCTGTTTAGACAACTTTAGCGGCAGCCCCACTTTCCAGTTTTCCACTTTCCGTGCCGTTTGTGGCCACAGGAACGGCCCCTGTCCGTGCAGCACCTTCGCCCTCTGCTCCTGCTCCTGCT
GCTGCTGCCCCTCGACAAAGGGGAAACGTGGCAGGAAGGATACCCTGGACGAGCCAATCATTGCTTTGCTTCAGACTACCGATTGTTCCTGTGCCAGCTATATGATATTGTGATCCCATC
TGATCCTCACTGAGAGCCAACGTGAGGCAAGCATCAGAATCCTATTACCCCGCCCCTAGGGCTTCGGTATGGGCCCATAATTATCGCAACAACAATGACACGAGTCCCCGCATCCCACAT
CCGGCAGCGGTAAGCGAAGTTGGCCAAAAAGGCAGCAGGCAGGAGCAGGCAGCAGGCAGCAGTTTGCATTTTTGGTATAATTTTCACTTTTGCCAAGGAAACTTTTTCGCTGCCGCATTG
AATTTGCAACCAACTAAAACCCGCCACACCCAGACCAGCGGTTGCCTCTGTTGCAGCTTGAAAACTTCCTGAGTTGATTAGAAAGTTGTTTTTTTAGCCGCTGTTGCTGCCACATCCTCC
CTGCAGTTGCTAATGGCAGCTTAGAGCAAAAGGAGCCGGGCATGGCAGCGCAGGGCAGGGCAGGGCAGGGCAGGAGGATGTTGCAAGGCCCCCTCGCAGCTCGCGAGCATATGCGAGGAG
TGAGACCTCAATCTGTGGTTGAGTTCCTCGTCGTTTGGTTGCTTGGTGACTTGGGATTGTGGCATGGTTGCTACTAGGGCGCGCAAAGCTTATCGCTTCGGTGGCTTGTCGTCAGGCAGC
AGGCAGCGGCAGCAACATAAACTATGCGAGCATCAACTTTTTTTGTAATTAATTTTTAATGTGTGTGCAAAAAAAAATGTTTACGTACATAAATGTTCTTGTAGTTTCTGCCTCCCCCTG
GCTCCCCCTGCCATATCAAGGGGATGTGGCTGGGAGCGAAAGGGTTTATCAAGCGCTCTAAGAGGCGGATAAGCAGCGCTTAAGTGGGTGCCACAGAGGAAGAGGAAGGGGAAGAGGAAG
AGGGTAGCTATGATTTAAGATTGTGTTACGCCCAGAGGAAAATAATTGGATTGCCTTAAACAGCGTTGGCTGTTGGATACAACAAAAAAGAACGATGCCTGACAAATGGGTTTCCCTTGT
GCGGAATATACACGCCTTTCTATCAATCCCTAATTGCATATGACAATTGCTATTGCACCCATACCCAAAGAGAGAGAGAGAGATAGAAGAAGAAGAAGAGCACATGACAGACCAGACGGG
ACGGGAAGGGAAGGGACAGGATAGGACAGGACAGACACTGGCAACTTGCTGCAATTTGCTTTGGAACCAAAAATTGATTTACCCCCCGACTGCTAAACGTGCCACATCCTCCTGCCAACC
ATCCATCCTTCCATCTACGTCCCCATGTCATCTGCCAGATAAACGATTTGCTCCATACAGCTACATCTACTGCTACATCCACTGCTACTGCTATCATGATTTCTCACTTTGTTCCTCCTC
TTTTCCGTACATTTTGCAGAGATTTGCGTTTCAATCACTTCGAGGAGGTGCCGGCAGACGCTTTCCGCGGCATGGGGCAACTGTCGACGCTGTTCCTGAACGAGAACGAGCTGGCCCACC
TGCAGGATGGGGCCTTCCAGGGGCTGCTCGCCCTGAGGTTCCTCTATCTGAACAATAATCGGCTAAGCCGCTTACCGGCGGCCATCTTCCAGGGATTGCCGCGCGTGGAGGCGAT
ATGTA
AGTGAAACGCGGCACAGCCCTTAATTAAGGATATGCAAAAAGGCATAAATCGCCTTAACGACGCCCACTTTTCCGCTTTTCCCCCTTTCTCCCGCCATCCACTGTTAGATATCTGGAGAA
CAATGACATTTTCCAGCTGCCTGTCGGAGTTTTTGACAATTTGCCACGTCTGAATCGCCT
CTGTAAGTTGGATGGATGGACGGATACGAGAGGGAGATTAACTTTTTTGGTGGGTGCATC
TCCATTTTCCAGCTTCCTTTACAATAACAAGCTCACCCAACTGCCGGTGGAGGGATTCAACAAACTGAACAGCCTGAAGCGTCTGCGATTGGATGGGAATGCCATCGACTGCAACTGCGG
TGTGTACTCCTTATGGAGACGCTGGCATCTGGATGCCCAGCGTCAGCTGGTGACCATCTCCTTGACCTGTGCCGAGCCGCAAGCCCTGCAGCGCCAGAGCTTCGCCAGCCTCCAGGAGCA
GCACTTCAAGTGTG
TGGTAAGTGCGAACAGAACAGAGCTGAGAACGGAGAACTGAGAGCTGCCCTTCAATCTTTCACCCGTTGCAGCCAAGCCCAATCTCCTGGTGGCCCCACAGGACTT
GCAGACCTTCGCCGGGGAGTCCGTTCAACTCGACTGCGAGGTCACGGGTCTGCCCAAGCCCCAGATCACGTGGATGCACAACACGAACGAGGTCGGCGAGGATCAGGTCAACCGGGAAAT
CCTGCTAAGCGGCAGCCTGCTCATCCGCAGCGTGGCAACCACTGACATGGGCATCTATCAGTGCCTGGCCCGCAACGAGATGGGGGAGGTACGCTCCCAGCCCATCCGCCTCGTTGTCAG
CAGCAGCAGCAGCAGCAGCAACCGGAACCCACTGGACAACCCCCACATCGACCCTAGCAGCAATCAGGTATGGGCGGATGCGGATGCAGGAGGTGCAACACCCACGCCACCGAGCTTCAC
CCACCAGCCGCATGACCAAATTGTGGCCCTTCATGGCGCGGGACACGTGCTCCTCGATTGCGCCGCCTCCGGCTGGCCACAGCCGGACATACAATGGTTCGTCAATGGTCGCCAGCTCGC
CCAGTCCACCGCCAGCCTCCAGCTGCAGGCCAACGGCAGCCTGCTCCTCCTGCAGCCCACCCAGCTGACAGCCGGCACGTATCGGTGCGAGGCGAGCAATCGCCTGGGCACTGTCCAGGC
CACCGCCCGCGTGGAGGTGAAGG
GGGTGAGTGTGACACAACAACTTCCAGAAGCAAGGGGGGCTGGGGGGGCACGGCACTCGTTTCGGGATTGCATTTCAGTCACTGCCAGACGACATAC
CATCGTCCACAGTCCAGAGTCCGCAGTCCGCAGTCCGCAGTCCACTGGCAACACAGTCACAGTCACAACTGTCGCGTTTTACTCAATTTGCAGTCGATTTAATTTGTTTTCGCCTGAGAT
GAGGATGAGGATGAACATGGAGCATGAAGCATGCAGCATGAAGCATGCAGCATGGACTCTGCGACTAGACACTGAGATTGAACATGGAGTCTGCTTTGGATCGGGGTGTTCGGGACTATG
CGGAAGAGCTTGTCGAGGGGTTTTCATAGATGATTGGATTATGTCAGATGTGGCGGCAAAACGGTGGCAGAAGGGTGCTCCTCCGTCAAATCAGAGTACAAAAGGACGGGACGGCACGGG
TACTGCTGTACGAAAAACCTAACAAAAGAAGAGCATTCAAACTTTTGGTTTAACCACGAATAAATGTATGAAAACAATGTTTCGCTTGAGCATTTTTATGTTCATTCGTTATTGCTGCAA
TTCAAGCGTTTGTACGTGACTGTACTTTATTGCGCAATAGAAATGGACGGGGGGCCAGCAGGGGAAAACGCCCTGATATGTCCATAAGGGAAGGGGTTCCCAGACCGACCGACCGACAGA
CCAACCGAATGCCAGAGGGAATGATAGAGTGGCAGAGTGGCAGCGCCTTGATGCAACAATGGCTCATTATAATTTGTGGTTGTCGGCACAAAATGAAGAGCAGACAGCCGGAGCAAAATT
TATGCATTATTTTGGAAAATAAGTGCAAAAATTATAAACTATTATGTAGAAGGGGGGGCGGGTGGGGGGTATGGTACGCACAAATTGCGAAGCTGCGGTTATCCCGAAAGTGCAGAGTGC
AGTGCTGCAGAGATGTGCAGGGGACGTGCGAGTGTATGGACTGTTTAAGCAGTCAGGCGATGCGCTTGGTCAGCATAATTAACCCGGCAAACACAGTCACATACCCATAAACCGATGCAC
AGAGAGAGAAAAGTTCGATTATTTTCCGAGTCCATTTCCAGTTGTGCGATATAGATCAAAGAGGCATTTTCCAAGATAGTGTTTTCATTCTTCTGTCGATGTACGGATAAAATATATGGG
TTTGGTGTAGATTTTCTCGGAGTGTAGGGGCAAATGCATGCACTGGCGCACACACACATTCACGAGCATATGAATAAGCAAATAAATATGCTGGCACCGCTCAAAAGTATGCAATGCAAA
AATTTGTATTCCATACGCCCCAGGGGGCAGGCGGCAGGAGGCAAAGCAAAACCGTGATCAAAGCGAAAGGAGGGGCGTGCAGTGGGTAGGGAGAGGGAGAGGGATATCGAGTGGGATACA
GAGTGGAACGAAGAAAGAGTTTATGGACAAAAGGCGGAATAGAAAACGAACCGCAGGCGGCGTCGGCGTCGCCGTCGACGTCGGCTATGACCATGTGCCTTTTTTGTTGCTGCCATTTGG
AGCGATAGAATTGCTCGAAAATCGCTCATCCGCGATTCATACCACTTGCAGATTTGCCCGAAATTTTAATGGCACCCCAAAACCAAACAATCAAACTGGGCAAAGCCTTTGTGCTGGAAT
GCGATGCCGATGGCAATCCACTGCCCACCATCGACTGGCAGTTCAATGGATCCCCGCTCGCCAGCACCCCGTCCGGAGACCTGCTGCTGGAGAACGAGAACACAGAGCTGGTGGTTAGTG
CTGCCCGCCAGGACCATGCTG
TGGTAAGTGGTTCAGTGCGGGGCGGGACCTCTCGGGAAAGTGATTCCAAATGGCTTACATTTCATTTCATTTCTCTCTCCCCTCCCCTCCCCTCCGCTC
CCCGTGCACGTTCCTGTTGCAGGTGTCTACCGGTGCACGGCGCGCAACGAGAATGGCGAGACGAGTGCCGAGGCAACGATTAAAGTGGAACGCTCCCAGTCCCCGCCTCGGGTGGCCATC
GAACCGAGCAATTTGGTGGCCATTACGGGCACCACCATTGAGCTGCCCTGCCAGGCCGAGCAGCCGGAGGTGGGACTGCAG
AGGTAAAGTTTCCCGAACGGAATATCCCAAAAAACTTTT
CTTTTTTATTTGCCCCGCCACTTGTTGACGCTTCCTTTGGCTTTCCCACCCCCCGGCCACATCCTCTCCCCCTCTGTCTATCTCTCCCTCTCTTTCCATACAGATTTCGTGGCGCCGTGA
TGGTCGCCTCATCGATCCCAATGTCCAGCTCACGGAAAAGTATCAAATAAGCGGCGCCGGCAGTCTGTTCGTGAAGAATGTGACCATCCTCGATGGCGGTCGCTACGAGTGCCAGCTGAA
GAACGAATTCGGAAGGGCCTCCGCCTCGGCGCTGGTTACCATCAG
AGGTGAGTGGCGGTGGCGAAAAAACCGGGATGCGGAATCAAATAGTTCAACTTTGAGTCTCAAACCGCAACTTTA
ATTGCGCTCCAGAGTGGGATGGGATGGGACGGGATGGGGGCGGCATGGCAGGGGGCGGTCTCCATAAAACTCCATTAAAATCAAAAAGGGAAAACTTTTTCTCAAAGAGAGAGAGCAGAG
CAGAGCAAAGCAGAGAGGACTCACCAAGAAAAGGCCTCGACAGGCAGACGACGACGACGACGACGACGACGACGACGCCGACGACGATGCAGTTAAAATGCGTTTAAGTGCGCGGTAAAC
TTGTTTAATTAATTGAAAAATACTCGTAGTTGTGCCACAGCAGAAAAAATATGTACACAGCAGCAGTCACACAACAGTGGGAGCTACTGCTTCGGGAAGGCCTGCAAATGCCGCCAAGAA
AAACGCAAAAGCACGATCCCCGATGAAATTATGCAAGGTGGTCCGCTCTCGCAACGCGCTCTCTTTTTGGGTCTTACTTTGCCCAGTACAACAGCAGAGCGCTCTCAGTAGAGCGTAAAA
TTAGACCCGAAAGAGAGCTGATCTTGGGCTGAAGAAAACACAATATGCAAAGACAGTAAAGACAACAAGCGAAAAGGTGTCGCCTCTCCCTTACGTTTTCCTTTCCGGCACACAAATGAG
GAGCCGACGATTCGCGGTTACGTCAAGCGCGTGACGATTTAGAACGAACGAAGAAGCAAAAGGAAATCCAATAATTCACAGCAAATTCAATGAGTGCGTAGATTTTTTTTGTAGATTAAT
GTAAGGTGAGGACCGACAGAACTGAAGTAACGAGAGAGCGTGAGATCACCAAGACTTGGAATGGCTGGCCTCACCTTTGCAATGAGACATCGTTGGTACATATGTAGGCCCGTAATTACA
AAATTTTCAAGATATATAATCCTGTCATGAAGCTGATTTGAATGCACTTCCAAAAATAGTTTTCAAGGCAGGAAGCCCTGCGAGTGCACCGACAAATGCCAGCCTCAAATGATTTTTTAA
TCCGGCTGTTGCTCCACTGTGCTAAATTGCTTGGAAAGCACTCACACTCCTGCAAAGACCCGTTGTCCGGCTGCGAGTCCGGGTCCGGGTCCGCAAAAAAGGCGGCTATGAAAAGAACAT
AAAACCATAAAATAAAGCCAGCAGCTAAACTCCAGCACAGTGGCTAGAAATACATAGGAATCTTGAAAATTGAAACAGCCAAACAGTCGCGTCAGTGTGCTTTTCAGGCAGTTAAAAATT
GACATAACTTGGCTAATAGCTTGGGACACTTTCCGCCAAATCAGAGCCCACTGTGCCGCTGGCTAAACTGCATATAAAATATGCCCCAAAAAAGAACATCAAAAGGCGGCAACGAAAGCA
CAAAAGTTGTCGTTGCCGAAAACGTGCTAAGCCCCGGACAAGCTACGTGCCACATACCCCACCGTACACACAGATAGAAAGAGAAAGGCGAAAGAGACGGGGACGGGGACAGAGGCAAAG
ATAGACAGAGAATCTCTCACAGAGGCAGCCGCACAAGTTATCCAAATTGCCCAAATTGAAATTGAGTTGACAAGTTGCCGGCTGGAACACGAAAGGAGTACAAGAACTAGAACTAGAAGT
GGAAGTGGTAGTGGTAGTGGTAGTGGTAGTCGTGGTGGAAATGGGACTGGAATCTTCTCCAGCTTTCATCTTCCATGTGCATCTGCGTCTGCATCTCCCACTTCAACTTTAACTTTCTTT
TGGGTTCACATACATATATATTGGTAGTACTCCTCGAAAATAGAACTTTTCCAAATTGTTTGTTCCGCAAATATGGCAAAGCCAACAAAAAAACAGCCAGAAGATGGAAAAGAGACAAGG
AATTTATTGAGCACAGGCTCCATCCACATCCACGGCTGGCTCCCGCCTTGGGTCGTAAATCAATCTGAGGCAAGCAGAGGGAAGCAGTGGCAGGGACAGTGGCATATCTGGCCATTAACC
GAGGGGTAAACGTTGGACCATTGATGGCCCACAAAGAGAGAACCGAACCGAACCGCATCGCAGCGGCTTAGTTTTTGCTTACACAGACAGGCAGCATTTTCCAAATGCATAATGCAGCTA
AAGGGAAAACCAGAAGGAGGAGGAGGGGGATCGGAAAATAAGGATTTCCATGTACATGTATGTGGTGCACGCACCATTTAGTGCGGCAGAGCCCGGAAGCCGAATCGGCTTACAGGGGAG
CCGTCGGCATTAGAAACTTTACTCATTACGGTGCCAGCCGCAGTCGCCAGACCGCAGCAAAGGACCAAGAATTAAAAGAAGAAAAACAAAAAAAAGGCATAGCATAGCAGAAGAGTAAGT
AGTCCGTAGTCCGCTGTCCGTGGTCCGGAGAGAGTGTGAGAAAGCAACGCTAAAAGTGAAAAACTATGTTGAAAGACGAAAAGGAAGTGCAGTGCAGTGCAGCGCAGGGCAAGGACAGGG
ACAAAGATCCGGTGTGTAGGGAAGGGAGGAACCGTGGAAGCGTCTCTCTAACAAAAAGTGAGTGAAAACCGCTGAAAGATTCCCTCCGCTCCCCTCCCCTCAGCACCTACCCACCGCCAT
CCAGAAGGCAGCACACTGAAATCCGCCTAAGTTGTGGGTAATGCCGGCGTAAAACAAAAGCTTATCCCGTTACTGCCACCGTTACTGCCCCTGTTGCTGCAGCTCCTCCTGGGGCTGCTG
CTGGCGAAAAAGGCTAACCACGGAAATACATTGCCATACGCCGCACCGCACAGCATGTGGCAGAGCCAGAGGGCGGCAGGGCAGGGCAGGGCACGGCGGGGCAGGGAGACCAAAACCAAC
AAAAAACTTTAGCAAAATGAGGCGACTGCAAGTGGATGGAGACTAAAGGTATACCCTGACTAACATTCCCGAATCAGATCAATTACATTGATGGAATGAGGTATCAGAGGAGCCCTTAAA
AAGAACTAGAATCAGGAGAAGACTTATCATAGAAATGAGATCCATTCTCTAGCATTTGAGAGCATATAGATTCTAGAAAGATTCGCTTGACCTCCTCACAGGATTAGAAATTCCTCACAT
ACCCTACACACAAAAACAGAAAAGCAAAGGAGCAACCACAGAATGGCAGAGGGAAAATATATGCACTCGGGTCCTGGCCTGGCAGTGGCCCAGGAGCAGTGGCTACCCACTAGAAGCACT
ACCAAACTATCCCACGCAGCAGAGCCACTGATAGTGGCAGACTTTTGGCACGGCACAAGACGGCACAGCAGGGCACTACTTGTGGTGCAACAGGGTGTATTATGGTCGTTAAGTTATTAC
CTAGAACGTCGCCTCCGTCGCTCGTTGTAGCCGCTGCCATCGTCACCCCGCACAATGCCAAAGTTTACACAGGCGAAAGTTTTCCGAGGATTCTGAGCATTCCTTGAACAGCCACGCTGC
ATGGCAGACAGAGAATGTGAGGCATGAATTGTTGTTAATGCTGCCTCCCCTCCCTTTGCAGCAGTCAGAGTTTGAGAGCGCGGATTTTCCTTTTGGCGCACAAAAAAGTGCAGCAAGATT
TCCAGGGGGTGACAGGGGGAGGACTGCCCGAAGGACACACAGACGGAGAGACGGACAGACATCTAGGCGAAGAAAGGGAAATAAACTGGCTGCCATTTTTATTAGACAACGCGGCCATTT
TAACGACTCCTCGGAGGCGTCTCTGGCGCTGGCGCTGGCGCTGGCATTATGGTAATTAAAATGTGGAAATTCCTTGAAAATATCTTGATTGTTTCTTCGCTTTGTGCCGCGTCGTCGCTT
TGTGGTGGAATATCTGCTCTAGGAATCTCGAAAGTCTAGCCCTAGGATTGTAGTCCATTGTTGTAAATTAATTGGTCCATTTTTGGGGAATATCTTCTCCTTGAATCATTACCAGAATCC
AGATTCCCCTGTGAAATCCTTTCTTTGCAGAAACAATGTGGATCTGGCACCGGGAGATCGTTACGTGCGCATTGCCTTCGCGGAGGCGGCCAAGGAGATTGATCTGGCCATCAACAACAC
CCTGGACACCCTCTTCTCGAACCGCTCGTCCACGGGGCCACCGAACTATGGGGAGCTGCTGCGGGTGTTCCGCTTCCCCACGGGCGAGGCAAGGCAGCTGGCGCGTGCCGCCGAGATATA
CGAGCGGACCCTGGTCAACATCCGAAAGCACGTGCAGCGGGGAGACAACCTGAGCATGAGCAGCGAGGAGTACGAGTTCAGGGACCTGCTCTCCAGGGAGCATCTGCATCTGGTGGCGGA
GCTGTCGGGCTGCATGGAGCACCGGGAGATGCCGAACTGCACGGACATGTGCTACCACTCGCGCTATCGCAGCATCGACGGCACGTGCAACAATCTGATGCATCCCACATGGGGTGCTTC
CCTTACGGCCTTCCGCCGCCTGGCGCCGCCTATCTACGAGAACGGATTCAGCATGCCCGTGGGCTGGACAAAGGGCCAGCTGTATGCCGGCCATCCGAAGCCCAGTGCCAGGCTGGTGTC
CACCTCGGTGGTGGCCACCAAGGAGATAACCCCGGACAGCCGGATCACACACATGGTGATGCAGTGGGGCCAGTTCCTGGACCACGATCTGGACCACGCCATACCCTCGGTGAGCTCGGA
GAGCTGGGACGGCATCGACTGCAAGAAGAGCTGCGAGATGGCCCCGCCCTGCTACCCCATCGAAGTGCCGCCGAATGACCCGCGTGTGAGGAACCGCCGCTGCATCGATGTGGTGCGCTC
CAGCGCCATCTGCGGCTCGGGCATGACCTCGCTCTTCTTTGACAGCGTCCAGCACCGCGAGCAGATCAACCAGCTGACATCCTACATCGATGCCTCGCAGGTGTACGGCTACAGCACGCC
CTTCGCCCAGGAGCTGCGCAATCTGACCGCCGACGAGGGTCTGCTCCGCGTGGGCGTCCACTTCCCCAAGCAGAAGGACATGCTGCCCTTTGCGGCCCCGCAGGACGGCATGGACTGCCG
CCGCAATCTCGACGAGAACACCATGAGCTGCTTTGTGTCCGGGGACATACGGGTTAACGAGCAGGTCGGCCTCCTGGCCATGCACACCATCTGGATGCGAGAGCACAATCGGCTGGCCAC
CAAGCTGCGCGAGATCAATCCCCATTGGGACGGCGATACGCTGTACCAGGAGGCCCGCAAGATCGTCGGCGCCCAGATGCAGCATATCACCTTCAAGCAGTGGCTGCCCCTGATCATCGG
CGATAGTGGCATGCAGCTGCTCGGAGAGTACAAGGGCTACAATCCGCAGCTGAATCCGAGCATTGCCAATGAGTTTGCCACAGCTGCCCTGCGCTTCGGCCACACCATCATCAATCCCAT
CCTGCACCGTCTGAACGAGACCTTCCAGCCCATTCCGCAGGGCCATCTGCTACTCCACAAGGCCTTCTTTGCCCCCTGGCGCCTGGCCTACGAGGGCGGAGTGGATCCCCTGCTGAGAGG
CATGCTGGCGGTGCCCGCGAAGCTGAAGACCCCCGACCAGAACCTCAACACGGAGCTCACGGAGAAGCTGTTCCAGGCGACGCATGCGGTGGCCCTGGACCTGGCCGCCATCAACATTCA
GCGGGGCCGTGATCACGGCATTCCCGGCTACAATGTCTACAGGAAGTTCTGCAACCTCAGCGTGGCCGAGGACTTTGAGGATCTCTCGGACATCAGCAACGCGGGAATTCGGCAGAAGAT
GAAGGAGCTGTATGGTCATCCGGACAACGTGGACGTTTGGTTGGGCGGCATTCTGGAGGATCAGGTGGAGGGCGGCAAGGTGGGTCCGCTGTTCCAGTGTCTGCTCGTCGAGCAGTTCCG
TCGCCTGCGCGACGGCGATCGCTTGTACTACGAGAATCCGGGCGTGTTCCTGCCCGAGCAACTCGTCCAGATCAAGCAGGCCAACTTCGGACGCGTCCTGTGCGATGTGGGTGACAATTT
CGACCAGGTCACCGAGAATGTGTTCATCCTGGCCAAGCATCAGGGCGGCTACAAGAAGTGCGAGGACATTCCCGGCATCAACCTCTATCTGTGGCAGGACTGCGGCAACTGCAACAGCAT
GCCCACCATCTTTGACTCCTACATTCCACAGACGTACACCAAGAGGAGCAGTCGCCAGAAGAGAGACCTCCGACAGCCCAAGGAGAAGGAGCAGGAGGAGGTCCCAGCCACCGAGAGTTA
CGACAGTCCCTTGGAAGCCCTCTACGATGTGAACGAGGAGCGCGTTAGTGGCCTGGAGGAGCTGATTGGAAGCTTCCAGAAGGAGCTAAAGAAACTGCACAAGAAGCTGCGCAAGCTCGA
GGACTCCTGCAATGCCGTAGATGCCGAGCCAGTGGCCCAGGTGGTGCAGCTCGCACCAGCACCGGCCCCCGTTGCCCCGAAGCCCAGGCGCAGTCACTGCGTGGATGACAAGGGAACAAC
GCGGCTGAACAACGAGGTCTGGTCTCCGGACGTGTGCACCAAGTGCAACTGCTTCCACGGCCAGGTAAACTGTTTGCGGGAGAAGTGTGGCGAGGTGAGCTGCCCGCCCGGAATCGATCC
TCTGACGCCGCCAGAGGCCTGCTGCCCGCACTGCCCGATGCTCAAAGGAGAGCTGCCGTAG

Retrieve as FASTA  
CDS
Send to BLAST
.........1.........2.........3.........4.........5.........6.........7.........8.........9.........0.........1.........2
ATGTGGTGGCGAGGAGTGCTCCTGTTCCACCTGTTCCTGCTGGCGGGCTGGTCGGAGGCCGCCTACTGTCCGACGGGATGCAACTGCTACGAGCGCACCGTGCGCTGCATTCGCGCGAAG
CGCACGACCACTCCGCAAGTGCCCTACGACACCCAAGTTCT
AGATTTGCGTTTCAATCACTTCGAGGAGGTGCCGGCAGACGCTTTCCGCGGCATGGGGCAACTGTCGACGCTGTTCCTG
AACGAGAACGAGCTGGCCCACCTGCAGGATGGGGCCTTCCAGGGGCTGCTCGCCCTGAGGTTCCTCTATCTGAACAATAATCGGCTAAGCCGCTTACCGGCGGCCATCTTCCAGGGATTG
CCGCGCGTGGAGGCGAT
ATATCTGGAGAACAATGACATTTTCCAGCTGCCTGTCGGAGTTTTTGACAATTTGCCACGTCTGAATCGCCTCTTCCTTTACAATAACAAGCTCACCCAACTG
CCGGTGGAGGGATTCAACAAACTGAACAGCCTGAAGCGTCTGCGATTGGATGGGAATGCCATCGACTGCAACTGCGGTGTGTACTCCTTATGGAGACGCTGGCATCTGGATGCCCAGCGT
CAGCTGGTGACCATCTCCTTGACCTGTGCCGAGCCGCAAGCCCTGCAGCGCCAGAGCTTCGCCAGCCTCCAGGAGCAGCACTTCAAGTGTG
CCAAGCCCAATCTCCTGGTGGCCCCACAG
GACTTGCAGACCTTCGCCGGGGAGTCCGTTCAACTCGACTGCGAGGTCACGGGTCTGCCCAAGCCCCAGATCACGTGGATGCACAACACGAACGAGGTCGGCGAGGATCAGGTCAACCGG
GAAATCCTGCTAAGCGGCAGCCTGCTCATCCGCAGCGTGGCAACCACTGACATGGGCATCTATCAGTGCCTGGCCCGCAACGAGATGGGGGAGGTACGCTCCCAGCCCATCCGCCTCGTT
GTCAGCAGCAGCAGCAGCAGCAGCAACCGGAACCCACTGGACAACCCCCACATCGACCCTAGCAGCAATCAGGTATGGGCGGATGCGGATGCAGGAGGTGCAACACCCACGCCACCGAGC
TTCACCCACCAGCCGCATGACCAAATTGTGGCCCTTCATGGCGCGGGACACGTGCTCCTCGATTGCGCCGCCTCCGGCTGGCCACAGCCGGACATACAATGGTTCGTCAATGGTCGCCAG
CTCGCCCAGTCCACCGCCAGCCTCCAGCTGCAGGCCAACGGCAGCCTGCTCCTCCTGCAGCCCACCCAGCTGACAGCCGGCACGTATCGGTGCGAGGCGAGCAATCGCCTGGGCACTGTC
CAGGCCACCGCCCGCGTGGAGGTGAAGG
ATTTGCCCGAAATTTTAATGGCACCCCAAAACCAAACAATCAAACTGGGCAAAGCCTTTGTGCTGGAATGCGATGCCGATGGCAATCCACTG
CCCACCATCGACTGGCAGTTCAATGGATCCCCGCTCGCCAGCACCCCGTCCGGAGACCTGCTGCTGGAGAACGAGAACACAGAGCTGGTGGTTAGTGCTGCCCGCCAGGACCATGCTG
GT
GTCTACCGGTGCACGGCGCGCAACGAGAATGGCGAGACGAGTGCCGAGGCAACGATTAAAGTGGAACGCTCCCAGTCCCCGCCTCGGGTGGCCATCGAACCGAGCAATTTGGTGGCCATT
ACGGGCACCACCATTGAGCTGCCCTGCCAGGCCGAGCAGCCGGAGGTGGGACTGCAG
ATTTCGTGGCGCCGTGATGGTCGCCTCATCGATCCCAATGTCCAGCTCACGGAAAAGTATCAA
ATAAGCGGCGCCGGCAGTCTGTTCGTGAAGAATGTGACCATCCTCGATGGCGGTCGCTACGAGTGCCAGCTGAAGAACGAATTCGGAAGGGCCTCCGCCTCGGCGCTGGTTACCATCAG
A
AACAATGTGGATCTGGCACCGGGAGATCGTTACGTGCGCATTGCCTTCGCGGAGGCGGCCAAGGAGATTGATCTGGCCATCAACAACACCCTGGACACCCTCTTCTCGAACCGCTCGTCC
ACGGGGCCACCGAACTATGGGGAGCTGCTGCGGGTGTTCCGCTTCCCCACGGGCGAGGCAAGGCAGCTGGCGCGTGCCGCCGAGATATACGAGCGGACCCTGGTCAACATCCGAAAGCAC
GTGCAGCGGGGAGACAACCTGAGCATGAGCAGCGAGGAGTACGAGTTCAGGGACCTGCTCTCCAGGGAGCATCTGCATCTGGTGGCGGAGCTGTCGGGCTGCATGGAGCACCGGGAGATG
CCGAACTGCACGGACATGTGCTACCACTCGCGCTATCGCAGCATCGACGGCACGTGCAACAATCTGATGCATCCCACATGGGGTGCTTCCCTTACGGCCTTCCGCCGCCTGGCGCCGCCT
ATCTACGAGAACGGATTCAGCATGCCCGTGGGCTGGACAAAGGGCCAGCTGTATGCCGGCCATCCGAAGCCCAGTGCCAGGCTGGTGTCCACCTCGGTGGTGGCCACCAAGGAGATAACC
CCGGACAGCCGGATCACACACATGGTGATGCAGTGGGGCCAGTTCCTGGACCACGATCTGGACCACGCCATACCCTCGGTGAGCTCGGAGAGCTGGGACGGCATCGACTGCAAGAAGAGC
TGCGAGATGGCCCCGCCCTGCTACCCCATCGAAGTGCCGCCGAATGACCCGCGTGTGAGGAACCGCCGCTGCATCGATGTGGTGCGCTCCAGCGCCATCTGCGGCTCGGGCATGACCTCG
CTCTTCTTTGACAGCGTCCAGCACCGCGAGCAGATCAACCAGCTGACATCCTACATCGATGCCTCGCAGGTGTACGGCTACAGCACGCCCTTCGCCCAGGAGCTGCGCAATCTGACCGCC
GACGAGGGTCTGCTCCGCGTGGGCGTCCACTTCCCCAAGCAGAAGGACATGCTGCCCTTTGCGGCCCCGCAGGACGGCATGGACTGCCGCCGCAATCTCGACGAGAACACCATGAGCTGC
TTTGTGTCCGGGGACATACGGGTTAACGAGCAGGTCGGCCTCCTGGCCATGCACACCATCTGGATGCGAGAGCACAATCGGCTGGCCACCAAGCTGCGCGAGATCAATCCCCATTGGGAC
GGCGATACGCTGTACCAGGAGGCCCGCAAGATCGTCGGCGCCCAGATGCAGCATATCACCTTCAAGCAGTGGCTGCCCCTGATCATCGGCGATAGTGGCATGCAGCTGCTCGGAGAGTAC
AAGGGCTACAATCCGCAGCTGAATCCGAGCATTGCCAATGAGTTTGCCACAGCTGCCCTGCGCTTCGGCCACACCATCATCAATCCCATCCTGCACCGTCTGAACGAGACCTTCCAGCCC
ATTCCGCAGGGCCATCTGCTACTCCACAAGGCCTTCTTTGCCCCCTGGCGCCTGGCCTACGAGGGCGGAGTGGATCCCCTGCTGAGAGGCATGCTGGCGGTGCCCGCGAAGCTGAAGACC
CCCGACCAGAACCTCAACACGGAGCTCACGGAGAAGCTGTTCCAGGCGACGCATGCGGTGGCCCTGGACCTGGCCGCCATCAACATTCAGCGGGGCCGTGATCACGGCATTCCCGGCTAC
AATGTCTACAGGAAGTTCTGCAACCTCAGCGTGGCCGAGGACTTTGAGGATCTCTCGGACATCAGCAACGCGGGAATTCGGCAGAAGATGAAGGAGCTGTATGGTCATCCGGACAACGTG
GACGTTTGGTTGGGCGGCATTCTGGAGGATCAGGTGGAGGGCGGCAAGGTGGGTCCGCTGTTCCAGTGTCTGCTCGTCGAGCAGTTCCGTCGCCTGCGCGACGGCGATCGCTTGTACTAC
GAGAATCCGGGCGTGTTCCTGCCCGAGCAACTCGTCCAGATCAAGCAGGCCAACTTCGGACGCGTCCTGTGCGATGTGGGTGACAATTTCGACCAGGTCACCGAGAATGTGTTCATCCTG
GCCAAGCATCAGGGCGGCTACAAGAAGTGCGAGGACATTCCCGGCATCAACCTCTATCTGTGGCAGGACTGCGGCAACTGCAACAGCATGCCCACCATCTTTGACTCCTACATTCCACAG
ACGTACACCAAGAGGAGCAGTCGCCAGAAGAGAGACCTCCGACAGCCCAAGGAGAAGGAGCAGGAGGAGGTCCCAGCCACCGAGAGTTACGACAGTCCCTTGGAAGCCCTCTACGATGTG
AACGAGGAGCGCGTTAGTGGCCTGGAGGAGCTGATTGGAAGCTTCCAGAAGGAGCTAAAGAAACTGCACAAGAAGCTGCGCAAGCTCGAGGACTCCTGCAATGCCGTAGATGCCGAGCCA
GTGGCCCAGGTGGTGCAGCTCGCACCAGCACCGGCCCCCGTTGCCCCGAAGCCCAGGCGCAGTCACTGCGTGGATGACAAGGGAACAACGCGGCTGAACAACGAGGTCTGGTCTCCGGAC
GTGTGCACCAAGTGCAACTGCTTCCACGGCCAGGTAAACTGTTTGCGGGAGAAGTGTGGCGAGGTGAGCTGCCCGCCCGGAATCGATCCTCTGACGCCGCCAGAGGCCTGCTGCCCGCAC
TGCCCGATGCTCAAAGGAGAGCTGCCGTAG

Retrieve as FASTA