Entry information : MmulDuOx02
Entry ID 706
Creation 2009-02-02 (Christophe Dunand)
Last sequence changes 2012-09-03 (Bruno Savelli)
Sequence status complete
Reviewer Christophe Dunand
Last annotation changes 2015-12-24 (Christophe Dunand)
Peroxidase information: MmulDuOx02
Name MmulDuOx02
Class Dual oxidase    [Orthogroup: DuOx001]
Taxonomy Eukaryota Metazoa Chordata Mammalia Cercopithecidae Macaca
Organism Macaca mulatta (rhesus monkey)    [TaxId: 9544 ]
Cellular localisation N/D
Tissue type N/D
Inducer N/D
Repressor N/D
Best BLASTp hits
Perox score E-value MmulDuOx02
start..stop
S start..stop
HsDuOx02 2518 0 1..1288 1..1284
HsDuOx02 181 3.75e-46 1295..1377 1466..1548
PtroDuOx02 2409 0 1..1288 1..1283
PtroDuOx02 181 4.08e-46 1295..1377 1465..1547
CfaDuOx02 2250 0 1..1288 1..1277
CfaDuOx02 160 1.49e-39 1295..1377 1459..1571
SscDuOx02 2201 0 1..1288 1..1281
SscDuOx02 181 4.99e-46 1295..1377 1463..1545
Gene structure Fichierperl './assets/cgi-bin/draw_exon.pl' '706' 'complement(join(23446459..23446581,23446872..23447000,23449233..23449409,23449632..23449759,23450027..23450076,23451106..23451205,23451383..23451613,23451771..23451949,23452477..23452560,23452927..23452996,23453496..23453704,23455612..23455705,23455792..23456017,23457456..23457641,23457933..23458135,23458326..23458439,23458631..23458768,23459144..23459262,23459869..23460044,23460611..23460774,23461645..23461747,23462017..23462107,23462555..23462651,23462780..23462840,23463243..23463409,23463516..23463717,23463917..23464104,23464700..23464864,23465133..23465222,23465493..23465562))' Exons
ExonStart..EndSize ExonStart..EndSize ExonStart..EndSize ExonStart..EndSize
N° 1 23465493..23465562 68 N° 2 23465133..23465222 88 N° 3 23464700..23464864 163 N° 4 23463917..23464104 186
N° 5 23463516..23463717 200 N° 6 23463243..23463409 165 N° 7 23462780..23462840 59 N° 8 23462555..23462651 95
N° 9 23462017..23462107 89 N° 10 23461645..23461747 101 N° 11 23460611..23460774 162 N° 12 23459869..23460044 174
N° 13 23459144..23459262 117 N° 14 23458631..23458768 136 N° 15 23458326..23458439 112 N° 16 23457933..23458135 201
N° 17 23457456..23457641 184 N° 18 23455792..23456017 224 N° 19 23455612..23455705 92 N° 20 23453496..23453704 207
N° 21 23452927..23452996 68 N° 22 23452477..23452560 82 N° 23 23451771..23451949 177 N° 24 23451383..23451613 229
N° 25 23451106..23451205 98 N° 26 23450027..23450076 48 N° 27 23449632..23449759 126 N° 28 23449233..23449409 175
N° 29 23446872..23447000 127 N° 30 23446459..23446581 121  
complement(join(23446459..23446581,23446872..23447000,23449233..23449409,2344963 2..23449759,23450027..23450076,23451106..23451205,23451383..23451613,23451771..2 3451949,23452477..23452560,23452927..23452996,23453496..23453704,23455612..23455 705,23455792..23456017,23457456..23457641,23457933..23458135,23458326..23458439, 23458631..23458768,23459144..23459262,23459869..23460044,23460611..23460774,2346 1645..23461747,23462017..23462107,23462555..23462651,23462780..23462840,23463243 ..23463409,23463516..23463717,23463917..23464104,23464700..23464864,23465133..23 465222,23465493..23465562))


exon

Literature and cross-references MmulDuOx02
DNA ref. GenBank:   NC_007864.1 (23465562..23446462)
mRNA ref. GenBank:   XM_001103398
Protein sequence: MmulDuOx02
Sequence Properties
first value : protein
second value (mature protein)
Length (aa):   %s   1377 (321)
PWM (Da):   %s   155074.86 (35575.5) Transmb domain:   %s   o600-622i1045-1067o1082-1104i1151-1173o1188-1210i1223-1245o (o575-597i1020-1042o1057-1079i1126-1148o1163-1185i1198-1220o)
PI (pH):   %s   8.53 (4.66) Peptide Signal:   %s   cut: 22 range:22-342
Sequence
Send to BLAST
Send to Peroxiscan
.........1.........2.........3.........4.........5.........6.........7.........8.........9.........0.........1.........2
MLRARPETLMLLGALLTGPLDPAGSQDALSLPWEVQRYDGWFNNLRHHERGAVGCRLQRRVPANYADGVYQALQEPQLPNPRRLSDAATNGTAGLPSLRNRTVLGVFFGYHVLSDVVSVETPGCPAEFLNIRIPPGDLVFDPD
QRGDVVLPFQRSRWDPKTGRSPSNPRD
TNQVTGWLDGSAIYGSSHSWSDALRSFSGGQLASGPDPAFPRDSQKPLPMWAAPDPATGQSGPRGLYAFGAERGNREPFLQALGLLWFRYHNL
CAQRLARQHPDWGDEELFQHARKRVIATY
NIAVYEWLPSFLQKTPPEYTGYRAFLDPSISPEFVVASEQFFSTMVPPGVYMRNASCHFRKVLNKGFQSSQALRVCNNYWIRENPNLNSTQ
EVNELLLGMASQISELEDSIVVEDL
DYWPGPGKFSRTDYVASSIQRGRDMGLPSYSQALLAFGLDIPRNWNDLNPNVDSQVLKATAALYNQDLSQLELLPGGLLESHGDPGPLFSAIVLD
QFVRLRDGDRYWFENTR
GLFSKKEIEEIRNTTLRDVLVAVINVDPSALQPNVFVWHGAPCPQPNQLTTSDLPQCAPLTVLDFFEGSSPGFAITIIALCCLPLVSLLLSGVVAYFRGRERK
KLQKKGKESVKKEAAKDGVP
AMEWPGPKESSSPIIIQLLPDRCLQVLNRRLTVLRVVQLQPLQQVNLILSSNRGCRTLLLKIPKEYDLVLLFSSEEERGDFVQQLRGFCVRWALGLHVAE
MSEKELLRKAVTKQQRERILEIFFRHLFA
QVLDINQADAGTLPLDSSQKVREALTCELSRAEFAESLGLKPQDMFVDSMFSLADKDGNGYLSFREFLDILVVFMGSPEDKSRLMFTMYDL
DENGFLSKDEFFTMM
RSFIEISNNCLSKAQLAEVVESMFRESGFQDKEELTWEDFHFMLRDHDSELRFTQLCVKGGGGGGECVRGIRDIFKQNISCRVSFITRTPGERSHPQGLGPPASG
APELGGPGLKKRFGK
KAAVPIPRLYTEALQEKTQRGLLAQKLQQYKRFVENYRRHIVCVAIFSAISVGVFADRAYYGFVSPPSGIAQTTLVGIILSRGTAASVSFMFSYILLTMCRNLIT
FLRETFLNRYVPFDAAVDFHRWIAMAAVVLA
ILHSAGHAVNVYIFSVSPLSLLACIFPNVFVNDGSKLPQKFYWWFFQTVGMTGVLLLLVLAIMYVFASHHFRRRSFRGFWLTHHLYILL
YA
LIIHGSYALIQLPTFHIYFLVPAIIYGGDKLVSLSRKKVEISVVKAELLPSGTRPCLTWYICERHFQKVLNRSLFTGLRSITHFGRPPFEPFFNSLQEVHPQVRKIGVFSCGPPGMTK
NVEKACQLVNRQDQAHFVHHYENF*

Retrieve as FASTA  
Remarks Complete sequence from genomic (chromo 7, 29 introns).
DNA
Send to BLAST
.........1.........2.........3.........4.........5.........6.........7.........8.........9.........0.........1.........2
ATGCTCCGTGCAAGACCAGAGACACTGATGCTCCTAGGAGCTCTTCTGACTGGACCCCTGGATCCAGCGGGCAAGTATCAGGCTCCTCTAGCGGCGGGGTGTTCCCCAGGATCCCCTGGG
AGGTGGGGAGGAGAGAGGTGCGGCAAGCGGCTCCCTGAGTGAGACTGGAAGGTCATTTCGCCGCGCAGCTCAGCGGGATGGGAAACTTCTCAGTGCGGCCGGACACTTGGGTCCCGTTAG
GGGCGCTCCGTGAGCTGGGGAAGGACTGGCCAAGGCCTTTGTTGCCAGGGAGGGGTAGCTGGGAGCGTAGTGCTGAGGCGGCCCTTCTCTGTGCCCACAGGCGCAGTCAGGACGCACTCT
CACTGCCCTGGGAAGTGCAGCGCTATGACGGCTGGTTTAACAACCTGAGGCATCACGAGCGTGGCGCTGTTG
GTGCGTTCTGCGGTCTCGGGTGTGCTGGGGCCGTGGCTCGCGAAGGTC
AGGGGCGCGGGAGGCCCTGAGAGGAGAATATGCGGGGAACACGCGCCCAGCAGCTCCGCTGCCGACACAGCGCACTCTTACGCGCTCCTGGGCCCAAGAGACCCTTGCGAGAAGGTTCTG
TCAGTGAAGGGGGATGGGGGTTGAGGGAGGCTTAGGGCGAGGTTTGGGTGATCCTTGGGGATGGAGTGCTTAGACAGAGCCCCACTCCCTGCCTCCGCAGGCGCTGCCGGTTGCAGCGCC
GAGTACCAGCCAATTACGCCGACGGTGTGTATCAGGCTCTGCAGGAGCCGCAGCTGCCCAACCCGCGCCGGCTCAGCGATGCAGCCACGAACGGCACAGCTGGGCTGCCGTCGCTCCGCA
ACCGCACCGTACTGGGGGTCTTCTTTG
GTGAGGGCAAAGGGGGAGACCAGTGGGGTTGATCTGGCGCTCTGCTCAGCCTTGGGGAGGGGCCAGATCCCCTCTGCGAGTCCACAGGAGACC
CATCCAAGACTCCCAACCACTTCCTCTCTCTCTAAGCAGCACTTCAAGACTGCCTTCATCTCGGAGAGATTTGGGATGTTGATACAGAGATATTCACTCCCCTTTCTCTTATGCCTTACT
CCAAACTAGGGGTGTCACTGGACCCCCATTATGGCTCTTGGGAACTGAGCTCCCCAGCCACCGCTCTCCTCACCATGTGTTTGTAACCATCTTACCTCTCCCTAGCCCAGAGGGAGGAGG
ACTGACTTGGGGTACTCCTACATAAATTATATCATTTTGATTCTCACAACAGCTTTATGAATTGGATAGGAAGGGAACTCACTATAGTCTCACTTTGCAGATTAGAAAATTGAGGCTCCC
TGGGGTTAACCTGCAGAGCTGGTGGTGGAGCTGGCACTGGGACCTCTGTCTTTTGGCACCTGAGGACCCTTGGAGGCCGCCCAGGCCGTGGGGAGAGCGCGATTCGGACGGTTTGTCACC
TATTTGCACCCCATGCCCGCAGGCGCTACCATGTTCTTTCCGACGTGGTGAGCGTGGAAACGCCCGGCTGCCCCGCCGAGTTCCTCAACATCCGCATCCCGCCCGGAGACCTCGTGTTCG
ACCCCGACCAGCGCGGGGACGTGGTGCTGCCCTTTCAGAGGAGCCGCTGGGACCCCAAGACCGGACGGAGCCCCAGCAACCCCCGAGACCTG
GTGAGGAGGGGAAGGCGGCAGGAAGGGG
CCGCACCCCAGTCGGTTGGGACCCAGGCTGCGGGCCTGGCAGGGCCTGGCGGGGAGGGGCGCCCACTTCCCAGCCGCGGACTACCGCCAGGCCCCGGCCTTCCCTGGCCTGCCGCCGCCC
ACCCTCTCCTCGCCCCCTTACACCCCATCTCACCCGCCGCGTGCCCCGCAGACACCAACCAGGTGACGGGCTGGCTGGACGGCAGCGCCATCTATGGCTCATCGCACTCCTGGAGCGACG
CGCTGCGGAGCTTCTCCGGGGGACAGCTGGCGTCAGGGCCCGACCCCGCCTTCCCCCGAGACTCACAGAAGCCCCTGCCAATGTGGGCGGCGCCCGACCCCGCCACCGGGCAGAGCGGGC
CCAGGGGGCTGTACG
GTGAGGCCGCCAGGGCGGGACGGGACCGGCTGGGGGTCTGCAAGTGTGGGCTCCCCCGATCACGCTACCGCTCATCTCTTCCCCTGCGCCCCCACGTCGGATGCA
GCCCCTTCGGGGCGGAGCGAGGGAACCGGGAGCCCTTCCTGCAGGCGCTGGGTCTGCTCTGGTTCCGCTATCACAACCTGTGCGCGCAGAGGCTGGCCCGCCAGCACCCAGACTGGGGGG
ACGAGGAGCTGTTCCAGCACGCGCGCAAGAGGGTCATCGCCACCTACCAG
GTCAGCCGTCCGCGCCCCGCGACGTCTCCCCTCCCGCGAGCAAGTCCACGGGAGACTCCGCTGCCCCACG
GAGCTCCCCATCTGTGGACAACCTCCACCCAGAAACCCCTCCCCAGACAGCCGAGGTCTAGGGAAGCCCCAGTAAATGATAGGGAGGCACAGCGCTGTTTAGAGGAGAAATCTGGCTGGT
GATTATTATTTATCACCTCCCAACCCCCTGCTCCCCTAAATGTCCTGGTTCCTTGAGGAGGCAGGCCTCACACCGCTCCTGTTTGAGTTGCTTCTCCCATGACTGACCCTGCCTGGTCCT
CATCTCCACCTGGAAGCCGTCCTTGGGCTCAGACCCTTCCAGGCCCCGCCCCATCCAACTGGAGCCTCCCCTCGCCTCTCTCTGCCCCTCAGAAAACATCGCTGTGTATGAGTGGCTGCC
CAGCTTCCTGCAGAAAACACCCCCGGAGTATACAG
GTGAGGGAGTGGGGGGTGGGAGGGGACCTGTGTTGAGAATCCTGAGGGGACAGGAGACAGGTGGGTGTGATGGGAGGATGTGGAG
GCAAGGAGCCTATCTCCCTATCATCTCCATCTCCTTCCTGCAGGAGATACCGTGCTTTCCTAGACCCCAGCATCTCCCCGGAATTTGTGGTGGCCTCTGAGCAGTTCTTCTCTACCATGG
TGCCCCCTGGCGTCTACATGAG
GTGAGGGAGGGGCTGGCAGAGAGGGGGCATCACACCAAGAAAGGTGCAGAATGAACTGCCTTGGAGGCTGGGGCCTTTCACACTCCTTTGCAGTTTCA
CTGGAGAATGGGAAGCAAAAATTTGGGGCCCTGAAACAGAACCCTGGGGTAAGATGTGTGGGCTTAGTAGGGAAATCTCCCCAGCTCTCCTAAGGGCTGAAATTTGGTTGCTGGGTGTAG
GATTTGTCCAGCAGCTGAGTCATTCCCTTCCCTCCTCTCTCCACCCTACCTGGACTAGGAGCGCACACTATCTTCAGTAAACACACATTGCCAAATGTCTGCCATGTTCAAGGAAGTTCC
TGGGCCGCTGCTCAATCCTATGAACCCCCACTGAGTCCCTCAGCCCACTCAATCCCATCCTTGACTCTTCTCTGAATTCTCTCATCACATCCTTTGTTCTCCATTTCAGAAAAATGCCAG
CTGTCATTTCCGGAAGGTCCTGAACAAGGGTTTTCAAAGCTCCCAAGCTCTCAGGGTCTGCAACAACTACTGGATTCGGGAG
GTCAGACTGGGGTCAGGGTCACGGGAAGATGGGTCAAG
GTCAGTCTTCACACAGGCTGGGAAAAGCAACAATCCCAGTTCTTGGGTGTTGCTACCCCAGGTTCATGGGAGATGAAGGATAGAGGAAATTATCCGGGGACAACAGCTCAAGAGAGGCTG
GAACTGGCCAAGGGCTCTTTGTCCTGGGGTACTAAAGTGGCCCAGGCTGAGAGAGACTGAGTTTTGGGTAGGGGCCCTAGAGCCTATCTTGAGTGCATTGTTTACTTCCAGAAAACCCCA
ATCTGAACAGTACCCAGGAGGTGAATGAGCTGCTGCTGGGAATGGCCTCCCAGATTTCGGAGCTGGAGGACAGTATAGTGGTTGAAGATCTGAGGG
GTGAGCTCAGAGCCAGAAGGGGTG
GATGGCAAGGGGCCAGGAAGCCTGAGGATCCTTCTGGGTTCATCAACAGCAGAACTAGGGCACCCACGATGCAGGAACACAGATACAAACAGAGATTTCAAGGAGTCATAAGAAGAAAAT
GAACTTTGAACTTTTTTTTTTTTGCTTAATTTACAGTTTTGTGTTGCTTTTATTTTAAATTAACTGTGAGTATTGATGGGGGGGTGGTACCACAATCTCTTTGGTGCTTAGTTAGGTCTC
TAAATTTTTTCATTCAGCCCTGTCTTGAAATTAGGGAGGCATAGGACAGGGTATTTTTTTTTTTTTGAGACGGAGTCTTGCTCTGTCGCCCAGGCTGGAGTGCAGTGGCGCGATCTCGGC
TCACTGCAAGCTCCGCCTCCCGGGTTTACGCCATTCTCCTGCCTCAGCCTCCTAAGTAGCTGGGACTACAGGCGCCCGCCACCTCGCCCGGCTAGTTTTTTGTATTTTTTAGTAGAGACG
GGGTTTCAGTGTGTTAGCCAGGATGGTCTCGATCTCCTGACCTCGTGATCCACCCGTCTCGGCCTCCCAAAGTGCTGGGATTACAGGCTTGAGCCACCGCGCCCGGCCAGGACAGGGTAT
TTACTGCTTGGTGACTAGAACAGTCTTGAGTCTTAGAGGAGGGGTCTTACTGGAAAAACTGGCCCCGATCACATATGGTGACACTGTGTGGTCACCACACAATGGTCAGCATGGGCACAG
ATCTCATGTGGATCGCTGGGGTAGCCAGGAAGGAAGAACCATAGTGCTAATAGTTGGATACAAGGCTGCAGGGCAAAAAAGGGTGGAGAGGACAAAGCCCAAAGTCTGCTCTTCTCTCCT
CTTCAGATATTACTGGCCTGGCCCTGGCAAATTCTCCCGTACAGACTATGTGGCCAGCAGCATCCAGCGTGGCCGAGATATGGGGCTGCCCAGCTATAGCCAGGCCCTGCTGGCCTTTGG
GCTGGACATCCCAAGGAACTGGAATGATCTCAACCCTAATGTGGATTCCCAG
GTCAGGAATACTAATGATAATAATGGCAGCTCAAGCTTTCCTATGGCAATACTGTTCCAAACCTTTAC
ATATGTTGACTCATTTAGTCTACATAATAATATCATAAGGTACATATAATCATTTTTCCCATTTTACTGATGACATAGTTGGTAAATGGTAGAGGCAGGACTTAAATTCAGCCATATGAT
TCCAGAATCTATTTCTAACCACTGCATTGTACCATTCCTGCAGGGTGGCTTCTGGGTTGGGTGCCATTGTCCTGTTGCTGCAGGGTCCCACTCCAAGGGCTGGGTGCCCCTGAAGCTGCT
AATCATTGAGGTCAGGCAGGCTGGTGATGGTCACAGGATACGCTCCTCGGGCACCTGACTGTGGTCTTACCATGGGTAGGGACACACTGATCCTTCCACCAGACTTGTCCTGCCTGAGGG
GGCTTGCCTAGGAAGAGGAAATCAGGCCTGAGCAGCAAGCCAGGCAGTTGCTGGGGTCCTGTGTCCCAGGGATGGGGCAACAGTGGCTGCCCTCCTCAGCAAACATACACTCACCCCTTA
TCTTCTGTGGTGCCCCAGGTGTGCTGAAGGCCACAGCTGCCCTGTACAACCAGGACCTATCCCAGCTAGAGCTGCTCCCTGGGGGGCTCCTGGAGAGCCATGGGGACCCTGGACCCCTGT
TCAGCGCCATTGTCCTCGACCAGTTTGTACGTCTGCGGGATGGTGACCGCTACTGGTTTGAGAACACCAGGAATGG
GTAAGGCTTGCCTGGGTCCCCACCTCAGATTCCTCCTCAGCCTG
GGCCCTAGACCTTCTGTCTGGCCTTAGACAGCCCTTATGAGCCCTTGATTCCTAGTCAGTCCACCACACCCTTCCTAACCCCTCTGGGTTCCTTTTTCTTTTCTTTTCTCTTTCTTTCTT
TTTCTTTCTTTCTTTCTTTCTTTCTTTTTCTTTCTTTTTTTTTTTTTTGAGACAGAGTCTAGCTTTGTCACCCAGGCTGGAGTGCAGTTGCATGATCTTGGCTCAATGCAACATCCACCT
CCCAGGTTCAAGTGATTCTCCTGCCTCAGCCTCCCGAGTAGCTGGAATTACAGGCATGCACCAACATGCCCAGCTCCTTTTTTTTTTTTTTTTTTTTTCAGTAGAGACGGGGTTTCACCA
TGCTGGTCAGGCTGGTCTCGAACTCCTGACCTCAAGTGATCCACTCCACTCACCTTGGCCTCCCAAAGTGTTGGGATTACAGGCATGAGCCACTGCACCTGGCCCCTCTGGGTCTCTTTT
CTCACCTGGCTCCTTGGGTCTGGGGTTGCTGGAGGCCTGCATCCCCTTCCCATCCCAGTGACTTCTACTTCCTCCAACTTAGGCGCTGTTCTCCAAGAAGGAGATTGAAGAAATCCGAAA
TACCACCCTGCGGGACGTGCTGGTCGCTGTTATCAACGTTGACCCCAGTGCTCTGCAGCCCAATGTTTTTGTCTGGCATAAAG
GTGAGTGCCCTGGGAGAACACAAGTGAGTGACAGTGG
CCAGAGAAGGATCAAGATTGAGGGTGCGGTGGAATCACTTGGTGCTGTCCAGGGAGCCAGGCACCTTCTGTGTTGGGCTAGGAGGCCTGCATTTGGCTGGCTCCCACACCAGGGACCTCA
ACGAGCACACAAGTTGCACCCTACAGTCAACAAGGGGTGGATGGGGTAGATGCCAAGAAACAAGATGTGGATGGGGACTTTGTGAGGGAGACAGTTTCAGGGAGGTGGGCCTGGGGAAGA
CAGATGATACCTTGATCCTTTATAGGTTAGAGGGGAAAGAGGTCTGGCCATGCAGTGGGATCCTCAGACTTTGAGATCTTCCATGCCCTCTCCCTCAGGTGTGCACCCTGCCCTCAGCCT
AATCAGCTCACAACCAGCGACCTGCCCCAGTGTGCACCCCTGACTGTGCTTGACTTCTTTGAGGGCAGCAGCCCTGGTTTTGCCATCACCATCATTGCTCTCTGCTGCCTTCCCTTAG
GT
GAGCTCTTAGGCAGCCTCTCTGCAGACTGGCCCTGCCCCTTGTTTCCCGCTGGCCTGAGGGGCTGGCTATTTGGTACCGTTTCAGACCAGGCTCAAGGAACCTCTGGAAGGGAGGGGCCA
TAGCCGAAGCCACAGTGAAGCTCTAGGCGAGGAGCTCCCTCCTCACTGGCTCCTTCTGATCCCCTTCAGTGTGAGTCTGCTTCTCTCTGGAGTGGTGGCCTATTTCCGGGGCCGAGAACG
CAAGAAACTACAAAAGAAAGGCAAAGAGAGCGTGAAGAAGGAAGCAGCCAAAGATGGAGTGCCAG
GTGAGAAGGGGCTGGGCAGAGGAGGGAGAAGGGAGCGGGGAGGGGAGAGACAGGA
GTCTGGGAAAAAGAACCAAGTTACAGAGTGAGAGGAAAGCCAGGGCGCCTTTAGGGTCTTCACAGTGGAATTGACCTGAAGGCAGGGACCTGGGGACATCTACTGAACTACCCGGCCCAA
TCCTCCCTTCCCCAGCGCGATGGAGTGGCCAGGCCCCAAGGAGAGCAGCAGTCCCATCATCATCCAGCTGCTGCCAGACAGGTGTCTGCAGGTCCTGAACAGGCGTCTCACTGTGCTCCG
CGTGGTCCAGCTGCAGCCTCTGCAGCAGGTCAACCTCATCCTGTCCAGCAACCGAGGATGCCGCACCCTGCTGCTCAAGATCCCCAAGGAGTATGACCTG
GTATGGCTCGTCCTGCCTCC
CCTGCCTGGGCTGCCCTCACACGACTCCATTATCACAAGCGAGGCCACCTCAGCTATAGACCTCACCTATGACAGCTGCTGCTGGGGAGAGGGGCTCCTTTCAGAGGCCCCCAGACTCAA
CCCGACCCCCTTCGGTCACACACCTGGCCCCAGCCTGGATGGATGGGAAGGAGCTTTTCTCCCCTGCCCTCAACCCAAGATCCGTTGAGGGGAGGCTGAAGCAGAAGGTCCAGCCAGCTC
CCTGCGTCGGTGCCGCCTTCCTCCCACCCAGGTGTGCTGCTGTTTAGCTCTGAAGAGGAACGGGGCGACTTTGTGCAGCAACTACGGGGCTTCTGTGTGCGCTGGGCTCTGGGCCTCCAT
GTGGCTGAGATGAGTGAGAAGGAGCTATTGAGGAAGGCTGTGACAAAGCAGCAGCGGGAACGCATCCTGGAGATCTTCTTCAGACACCTTTTTGCTCAG
GTGCCAGGATCTGTGCCTTTT
GGAGATGGGCCCAGCCCCAGAAATGGAGGAAACTTGGGCTGGATAGAACACCCTGTGGGTGAACTAAGCTTCCACTCTGTGGCCTGGAGAGAAATAGTCTTGCGTGAATCCTGGCATTGC
CACTTTACTTAGCCCTGTGACCTTAGGCAAGTCACATTATCACTGTTCTGTATCTCTGTTTCCTCATCTATAAAACAGTGATGAAAACTGTATCCATCCCACTGCATTGTTGTGAGGATT
CGGTGAGATCATCTACATGCGTGGTCCACAGAGGTTGGGCCCTGGACCCAAGGGCATCAGCCTCACCTGGGAACATGTTAGAAAAGCACCTACCCAGTCAGACTGAACCAGGAACTCTGT
GGGTGGGGCCTGGCAATCTGTGTTTTAACAAGCTCTCCAGATGATTCGGATACACTCTTATGTTTGAAAAACATTGTTTTATGTACAGTGCTTATTGGCCCAAGTGCTGGGTCAGTTGTA
GGCATTTAACAAATGGTTGTGGCCAGGTGCAATGGCTCATGCCTGTAATCCCAGCACTTTGGGNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNN
NNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNN
NNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNN
NNNNNNNGGAGAATCGCTTGAACCCCGGTGGCAGAGGTTGCAGTGAGCCGAGATCCTGCCGCTGCACTCCAGACTAGGTGACAGAGCAAGACTCCATCTCAAACAAACAAACAAACAAAA
AACAATAAATGGTTGTCACATGTGACTTTTAAACATTTTGCACAATGGACAAATCATAGGCACATGGCAGCCTTATCTGAATTGGCAAGAGAGCACAGCCCCAGCCCCTTCCTGCCTGTC
TACCATCATGTCTCTACACCTTCTGTCCCCAGTGTAGGCTTTCTCACTTTTCATTCCCCTCAACTTTGCACTTCCCCTTCCATACCCCAGCACCATGCTCACTGTATGAAATTTCCGGTT
CTTGGGCCCAGGGAGAAATGGGCAGGCTGCTAGAGATTTGATTCCCCCATCTCTAGGACAACAGAGGCCCCAGTCAGTATATCTAAGGATCAGGAGAACCTTCAGGGTTCAGCCTTTCTG
ATTTGGACTTTGGGGAGATGTAAAGGGTCACTGAACTGCTTCCAGCATAGGCTTCACCTCCTTCTCTTTCCCTCCCTCTGCTGCTGCCCAAGTGCAGGTGTGCTGGACATCAACCAGGCC
GACGCAGGGACCCTACCCCTGGACTCCTCCCAGAAGGTGCGGGAGGCCCTGACCTGTGAGCTGAGCAGGGCCGAGTTTGCCGAGTCCCTGGGCCTCAAACCCCAGGACATGTTTGTGGAT
TCGATGTTCTCTCTGGCTGACAAGGATGGCAATGGCTACCTGTCCTTCCGAGAGTTCCTAGATATCCTGGTGGTCTTCATGAAAG
GTAGGGGGCTGGGAGGTGCCAGGCTATCCAAGAAT
CTAGGGATCTTTCAGCAAGGAGATGACCTGCATCCCCTTTTCTCTTCCCAGGCGCTCCCCAGAGGATAAGTCCCGTCTAATGTTTACCATGTATGACCTGGATGAGAATGGCTTCCTCTC
CAAGGACGAATTCTTCACCATGATGCG
GTATGGGGTGTGCCTTTCTAATCCTGAGACTTCCTGGTGTGTTTCAAACAGGAAAACAGGTTGAGTCAGAGGAGGGCTGGCAAAGAAGCTATG
TGGTCATCTGTGTTGAGAGGTGGCCTAGACACTACATCCTAAACTCTCAGAGCATCCACCTTTAAATGTTTACCTGACTGGCTTCTGCCTCTGGGAGAGTCTCTGTCTGGACTGTCAACA
CCAGCCAGAAAAGTCTCCCTGGTTAAAAAACAAAAACGAAAACCCAACACCAATATGGCCAACAACAAAAATCCACGAATCCCTTTTGGGTGCCCTTGGATCTTTGTGAACTCTTTTACA
GCATGCCCAACACTGTGCTTTACCCAATGAAACAGTGAATGGATGACCTTGGTCGCTGCTCCAATATTCATCACTATGATAGTCAGGAACAGAAACGTGGAAGGCTTCTCCATATTCCAA
CCATTCTTTCTTCCTGCATCTTAAGCCATTTCTGGTTTTGTTGTGCTGGTAAAAAAACAGCTTTGTGCTTCTCATTCCTGAAGACAATGAATGCGTCAGTAACACAGCTCGCCTCCATGC
CATAAGGGCAGGGCTTGTTCCCTGTTGAATCCAGCTTCTCCACACTGTGANNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNN
NNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNN
NNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNN
NNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNN
NNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNN
NNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNN
NNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNN
NNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNTGGCACAAGGAAGCAGTGGCATTGCAGCTAGGTCCCTCCCTGGCACCCTACTGCCTGGTGCCCCCACTGGACTATGA
AAGGGGGAGCCCAGGGGTGATGTGGGAGGCATCAACGAAAGAGAGTGGACAGAGAGCCTGCCACGAGAGAGGGCCATGCACACCCTGGACACAACCCTGCACTCAGTGGACTATCTTCCC
AGTTGTAGGTGCCCCCTGTTTGAGGGCTGCTTTCTCTAATTGGTCAAGGTCACTTTCAATTCTGTTCTGCCTTTTGAGTCCATGGCCACCCCACCCAGCTTAGTGTCAGCTGCAGGCTGG
ATGAGTGCCTTCTCAGTGCCATTTTCCGGGTCACTGGTAACCACATTAGATAATCCAGGGCCTGCCCCACCCTGAACCTACCCAAGCCTGACCTTGCTGGGTGACAGGCTGCTGTGTCTC
TGGTCCTCCTCCAGGTGTCTTTCATCGAGATCTCCAACAACTGCCTGTCCAAGGCCCAGCTGGCCGAGGTGGTGGAGTCCATGTTCCGGGAGTCGGGATTCCAGGACAAGGAGGAGCTGA
CATGGGAGGATTTTCACTTCATGCTGCGGGACCATGACAGCGAGCTCCGCTTCACGCAGCTCTGTGTCAAAGGTGGAGGTGGAGGTGGTGAGTGTGTGAGGAATG
GTTGGTGTCAGGGAG
GGGGGCATGTCCTCAAAATGAGAGTTCCCGGTGGGCAGGGCCCAGGGCTTATTCTTTTCTGAGTCCTTGGTACCTAGTACAGAACCAAGCACATGTAATTGGGGTGGCACATGTAGGTGT
GCAAGTTGTCTACTGCACAGGGATGCCCACTGAGATCAAGGAGCAGGTTGAAATCTATCCGACACACTGCTCAAAGCCACGGGCATTGGTCTAGGATGTATCTGGCCAGAAGAAGAGGCA
TTGTTTCTTTTACTCAAAGGACCTGCAGGGTTGCATCCACCTAAGAGGACATTCCCTTTCTTGCGCAAAGTTGTTACATTGTCTGCCCTGTGCACAGTGAGTGCTTAGTGTGGGGAAGGA
GGGAAACAGGACTGGAGTCAGAGATCTGGACATGACTTTCCCAGAAGGGAGGAGGGTACAGTCTCCTATCCTACCCCACTGCCCTTGTGGGGAAGCCAGTCCTGCCTCTTGTTCTCTTCT
CTAGGTGTATTAGAGATATCTTTAAACAAAACATCAGCTGTCGAGTCTCATTCATCACTCGGACACCTGGGGAGCGGTGAGCAGGAATGGGGCTCTGGCGGGCTGGCCTGGCTGAGCCCC
CTGCAGAGAAATGAAGGGAGTAGGACTGGCTGATCAGCCTCGGGTAAAATCAGGCATTTGCCCTTTGAAAGTAGCTCTTGGTAGCACAAAAATTCCAGCTGCCTCTCTCACCCTATGCTG
CTCGGATGCTTGGCTCTCTCCCTGCCACTCCAGGCCAGAATCATTCTACAAAACAAATCATGAGATCCTATTAATTCATTTTGCACCTTGCCCCCTGCCTGGTCCCAGGAGCCACTCCCT
ACCTCTACCCCATGCTCTGACCTGGGAGTTCTTCTCCTGGCTGCAAAGACAAGGGGAGAACAGCCCCATTTCTTTTTCTCAGCTCTCCCACCCCCAGGGACTGGGGCCACCTGCCTCAGG
AGCCCCAGAGCTGGGAGGCCCTGGACTGAAGAAGAGGTTTGGCAAAAA
GTGAGTGTTTCCCAAATCCCTGGGCCCAAAGAGACATGGAGAGAAGTCTCAGGGTCCCTAGGCCCCATCTAC
ATATCCTTGACATATAAGGGACCATCCTGAGTCTCATTCCATTCTTGTTCCTGACCTGACTGGGGGGTTATAGCGTTGAATGACTGTTCATCCTCTTCCAACCTCTGCACCCCGTTCTTC
AGGCAAGGGTCCTGGCTCAACAGGATGATAGTAAGGGGTCTCTTGGCTCCTGCCTGCTTTGGGCACAGCCTTGAGGCCTGTGCTGGGATCAGGAAGAAAGAAGGATAGAACAGACAGGAG
GAGGGGGAGTAGCAGGGAGACAGCGAGTGGGTGGATGGAGCAAAGGTGGAAGTCAAGGGTTGGAGGAGTAAGAAGTCCGCAGATTGCTTTTTTCCCTTCATCTGTTGTGGCCCATCCTGA
TGCCTGCCCGATCCCCAGGTCACCTTTCATGGGGTGGCATTAAGGGAAGGCCAGAGGCCCTTACCACACCGCTGCCCCGCCTCCCCTTGCTATAGGGGGCAGCGGTGCCCATTCCCCGGC
TGTACACGGAGGCGCTGCAAGAGAAGACGCAGCGAGGCCTCCTAGCCCAAAAGCTGCAGCAGTACAAGCGCTTCGTGGAGAACTACCGGAGGCACATCGTGTGTGTCGCAATCTTCTCAG
CCATCAGTGTTGGCGTGTTTGCAGATCGTGCTTACT
GTAAGAGTTCCAGGCTGTGGGCAGTGGGCAGGGAGCAGGCCCCGACCCTTGGAGAGGAGTGAAAAGCCCTCTGATCCTAAGAGT
CTGCATAGGAGAGCCCAGGGCTCAGGACCCTGGCCTCTTGTGCCGAGCTGATCTAACCTCACTCCGGCCCCAGACACTATGGCTTTGTCTCGCCACCCTCGGGCATCGCACAGACCACCC
TCGTGGGCATCATCCTGTCGCGGGGCACGGCGGCCAGCGTCTCCTTCATGTTCTCCTATATCCTGCTCACCATGTGCCGCAACCTCATCACCTTCCTGCGAGAGACTTTCCTCAACCGCT
ATGTGCCCTTTGACGCCGCGGTGGACTTCCACCGCTGGATCGCCATGGCTGCTGTTGTCCTGGCTA
GTACGTGACTCCCAGGCTTTTTCCTCTTTGCTGCAGCACCCTGGGTTTAGTTGG
GGGAAACAGTGGGGAGATGGAACTCCTTATGCCTCCATCTCTCCTCCCCATGCCTCCTCTCCCTCGGGATCGGAGGCAGAGTCTGTCCTGGTTGGCATCTCTAACAGGGTCTGCCTCTTT
CAGTTTTTTGCACAGTGCTGGCCACGCAGTCAATGTCTACATCTTCTCAGTCAGCCCCCTCAGCCTGCTGGCCTGCATATTCCCCAACGTCTTTGTGAATGATGGGTCAGTTCTGGGGAA
GGTTTCTCCTGGGACTCACAGGGCGGGCCTAAGGGTATAATAGAAAAAGAAATAGGCAGAGCACAGTGGCTCACACCTGTAAGCCCAACAGTTTGGAAGGCTGAGGCAGGAGGATCGCTT
GAGGCCAGGAGTTCCAGAGAAGCCTGAACAATAAAGTGAGACTCCATTTGTACAAAAAGTAAAAAGATTAGTGGGGCATTGTGGTGCACATCTGTAGTCCTGGCTACTTAGGAGGCTGAG
GCAGGAGGATTGCTTGAGCCCAGGAGTTTGAGGTTGCAGTGAGCCATGATCAGTACCACTGCATTCCAGCCTGGGCAACAGAGCAAGACCCTGTCTTTAAAAAAAAAAAAAAGAAAAAGA
AAAAGAAAAAAAGAAATCGACAGTCACAGACACTCAGCAAGAAGCTCAGTGCTAAGCTAGTCCCCTGGGGGAAGCTGAAAGGTAAACTTCTTGCCCTCAGGAAGGAAGCTGGCTGTGATT
GGCCAGGAAAGGTGTTTGGGAGATGAAGTCAGAGACTCTTTCTCATATATGCCTGACACACAGCTTGCCATCTCTGCCTTCTATCCATTCACTGGGCAGACATTTATGGAGTGTGTCCCA
TGTGTCCACCCAGATGCCAGAATAGGGGTGTCAAGATACGTGCAGTCATTCAACAACTACTTACCGAGATTTGCTGTGTGCCTTGTGTTGTTCTAAGCCTAGGGATAGAGCAGTGAATGA
AACAAAAATCTCTGCCCTATGGAGCTTACAGAATAATGAACCAGGGACTCAGGGAAGCAGGGGTCAACCACAGTAAGAGAGTTCAGGATAACGCAGAAGGAAAAATTGCTGAGTGCAGAG
GGTGGAACCGTGGACGGCTGGGTGCCAAGTTCCTTGGAGGGATCCCTGACCATCTGGGTGGAGGCCCCATACCCTCAGCAGTCAGGGCCAGCAGGAAAGGAGTCATGCTGTGGTGTGACA
GTGCTGGAGCCCCTCCTGCCCCAGCCAGAGGCTCAAGGTGCCTTTGCCCCCCAGGTGTCCAAGCTTCCCCAGAAGTTCTACTGGTGGTTCTTCCAGACCGTCCCAGGTAGGAAATGTGGG
ATCTGGGGGTTCTGTCTGAGGACTTCTGCTTTTGTCTCCATCTCTCTGTGAATACCCACTTGTCTATAGTGGCCATGGGGTCTGTTTCTAGCCTTCAGGACAAGCCCCAGCTCCATTCCC
TCCAGGCAGTCTTTTCTGGGGGCCCTGGAGGGATGAGAGAGGAAGGAGGGCAGGATAGGGGAATGTGCCATTGTCTTTTCAGCCCAAGCTGAAGTCCTGAGACTACTCACTGGCCCTGTC
CTCCTGCCCCCAGGTGTATGACAGGGGTGCTTCTGCTCCTGGTCCTGGCCATCATGTATGTCTTTGCCTCCCACCACTTCCGCCGCCGCAGCTTCCGGGGCTTCTGGCTGACCCACCACC
TCTACATCCTGCTCTATGCCCTG
GTGAGGGACTTCCCTGGGCCAGCCCATGGAGCAGGGAGCTCAGGATGGGACAGGAAGGTGAAGGAGGGAGACTTGGATCCAAGATCTCAGAATGAGA
CCTTGAGATTTAAGACCCCAGACCTTAGCCCTATCTCCCCGGCACAGGCCTACTGCCTGGGCAAGAGGGGGTGCCGGGAAGGGGCCTGGCTGGGCCTGAGCTGTACTAACTGGCCACGTC
TCCAGCTCTCATCATCCACGGCAGCTATGCTCTGATCCAGCTGCCCACTTTCCACATCTACTTCCTGGTCCCGGCAATCATCTATGGGGGTGACAAGCTGGTGAGCCTGAGCCGGAAGAA
GGTGGAGATCAGTGTGGTGAAGGCGGAGCTGCTGCCCTCAGGTACTAGGCCCTGCCTGACATGG
GTTGGGAGTAACGGAGGCCAAATCTTCAGACGTGGGGAGACAGCTCACCAGGCCTC
CTGACCCCATTACTGCCTCTGACTCTGTGTCCAAAAACAAACAAAAGAAAACCACTCTTGGAGATTCCTGAAGGTTTCCTGATAGAAGTAGACCAGGAAGGGCTGTGCTTAGCTCCCTGG
AGACTTGCAGATGGTGAGAAGTGACCCTGAGAAGAGGCGGCTGATGGTGCATACAGAGGGGTTTGAGGATGGGAAAGGCCCCCACACGTGTGACCTGGGTACAGGGAAACGCAGGGCAGG
AAGCCACATATGCCTGTCCCGTTCCTTCTCTCAGAGACAGACAGATGCCCAGATTGCCAGCCTGTTGTTGAGAGTCAGCGTTGGCCAAAGTCGGGATTGGTATCATCATAGAGGGTGGCA
AAAGATGATTTTCTGTATCCCAGGACTGTCTAACCTTCAGGACTCACATTGTAAAAAAATTAATGTCACTGCAATATTATTTCAATTGAGATTACTTAGGGACAAAGTCTCAGCGTGGAG
TTAGTATGCCTTTCTACTCTCCAGCACTTGCTAATCTCCTTTTTTTAACAGAGAGATCAGGCCTGAGACTCAGCTATGGTCAGATAACAAACAGTATCCAGCTAAAATTTGGTAACAAAA
TATGTTGTTTTCTACATGGAACACATGATACTGGTTTTCCACTTACCTTAGCAATAAAGTTCCCTGCCAAGATAAATTGAATCGGAAAAGGTAGTTGATTTAAAGAAAAGCGTTAGATAG
ATGATAGTACAGGAGGTGTGTAGAAAGGTAACCCTCAAATGGAGGTTTGATAGCCACTGAGATGGTCCTTGGTGAGCCTGAGTCCTTTTACCCAGCAGGCTGGTATTCAAGGCAGTGCTG
TGAGGACTGGGCCTTCCATGGGAAGAGGGAGTAGAGAGGAGGAGAGGGGCTGGAAGAGGGGAGCGGAGAAACCCAGCCCTTGCATTACCTGGAAAACAGAGAAGGGGACCCTCCTACTCC
CAGCCCCAGGAGCCCCACCTGTTTAGACCATGTGACTACTCCCCAGGCCTCAGGCGTGTGGAGAGGACAGGCGCCTGTCAGTCCTCAATACCAGGAGCACGCCTTCTGATCTGTGCTTTT
CCCTGTCTATGACCTCCAGGAGTGACCTACCTGCAATTCCANNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNN
NNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNN
NNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNN
NNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNN
NNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNN
NNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNN
NNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNN
NNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNTTGGTGGGCAGATGGATTGGCAGCATAAGGCAGGATGGCCTGGGCAGGTGGGTGGATGGCCAGGCTGAGCTGGCAGGAGG
CACAGAGCTGGTGGCCCTGATCCTCAGCCTCCAGCTCCTTCCCCTCCCCAGTCTCTGTCTCCTGGGCTATGTGGGCTGGCTCGGGCTGAGTGCTGGCCCTAATTGTCTTTGGTCTGACCT
GCCCCTGTGCCCCCAGTATACATCTGCGAGCGGCACTTCCAGAAGGTGCTGAACCGGAGTCTGTTCACGGGCCTGCGCTCCATCACCCACTTTGGCCGTCCCCCCTTCGAGCCCTTCTTC
AACTCCCTGCAGGAGGTCCACCCACAG
GTCAGTCCCACTCCCTCCCACCCTGGGACTCTGGCCTTCCCCTGCCAGGACATCCTGGCCCGGAAGCACCCTGCCGCTCTGTTCTGAGCAGAG
AACTCCACCCGCTTGCCTGGCCCTAGGATGGGGTCAGGTCTTAAAGGGGGACTTCCACCCCCTCCACCTTAAGCCTCCTCCTCAAGGCCTGGATTTGAAGCCCTGGTCATTCCAGCCAGG
CTCAGGAAGCAGCTTTTTCCAAGGAGAGTGAGCACCTCCAGGCTGCAGGCCCCCCTCTCTCTCCAATCTCCTGACAGGTGTGCGCAAGATTGGGGTGTTCAGCTGTGGCCCTCCAGGAAT
GACCAAGAATGTAGAGAAGGCCTGTCAGCTCGTCAACAGGCAGGACCAAGCCCACTTCGTGCACCACTATGAGAATTTCTGA

Retrieve as FASTA  
CDS
Send to BLAST
.........1.........2.........3.........4.........5.........6.........7.........8.........9.........0.........1.........2
ATGCTCCGTGCAAGACCAGAGACACTGATGCTCCTAGGAGCTCTTCTGACTGGACCCCTGGATCCAGCGGGCAGTCAGGACGCACTCTCACTGCCCTGGGAAGTGCAGCGCTATGACGGC
TGGTTTAACAACCTGAGGCATCACGAGCGTGGCGCTGTTG
GCTGCCGGTTGCAGCGCCGAGTACCAGCCAATTACGCCGACGGTGTGTATCAGGCTCTGCAGGAGCCGCAGCTGCCCAAC
CCGCGCCGGCTCAGCGATGCAGCCACGAACGGCACAGCTGGGCTGCCGTCGCTCCGCAACCGCACCGTACTGGGGGTCTTCTTTG
GCTACCATGTTCTTTCCGACGTGGTGAGCGTGGAA
ACGCCCGGCTGCCCCGCCGAGTTCCTCAACATCCGCATCCCGCCCGGAGACCTCGTGTTCGACCCCGACCAGCGCGGGGACGTGGTGCTGCCCTTTCAGAGGAGCCGCTGGGACCCCAAG
ACCGGACGGAGCCCCAGCAACCCCCGAGACCTG
ACCAACCAGGTGACGGGCTGGCTGGACGGCAGCGCCATCTATGGCTCATCGCACTCCTGGAGCGACGCGCTGCGGAGCTTCTCCGGG
GGACAGCTGGCGTCAGGGCCCGACCCCGCCTTCCCCCGAGACTCACAGAAGCCCCTGCCAATGTGGGCGGCGCCCGACCCCGCCACCGGGCAGAGCGGGCCCAGGGGGCTGTACG
CCTTC
GGGGCGGAGCGAGGGAACCGGGAGCCCTTCCTGCAGGCGCTGGGTCTGCTCTGGTTCCGCTATCACAACCTGTGCGCGCAGAGGCTGGCCCGCCAGCACCCAGACTGGGGGGACGAGGAG
CTGTTCCAGCACGCGCGCAAGAGGGTCATCGCCACCTACCAG
AACATCGCTGTGTATGAGTGGCTGCCCAGCTTCCTGCAGAAAACACCCCCGGAGTATACAGGATACCGTGCTTTCCTA
GACCCCAGCATCTCCCCGGAATTTGTGGTGGCCTCTGAGCAGTTCTTCTCTACCATGGTGCCCCCTGGCGTCTACATGAG
AAATGCCAGCTGTCATTTCCGGAAGGTCCTGAACAAGGGT
TTTCAAAGCTCCCAAGCTCTCAGGGTCTGCAACAACTACTGGATTCGGGAG
AACCCCAATCTGAACAGTACCCAGGAGGTGAATGAGCTGCTGCTGGGAATGGCCTCCCAGATTTCGGAG
CTGGAGGACAGTATAGTGGTTGAAGATCTGAGGG
ATTACTGGCCTGGCCCTGGCAAATTCTCCCGTACAGACTATGTGGCCAGCAGCATCCAGCGTGGCCGAGATATGGGGCTGCCCAGC
TATAGCCAGGCCCTGCTGGCCTTTGGGCTGGACATCCCAAGGAACTGGAATGATCTCAACCCTAATGTGGATTCCCAG
GTGCTGAAGGCCACAGCTGCCCTGTACAACCAGGACCTATCC
CAGCTAGAGCTGCTCCCTGGGGGGCTCCTGGAGAGCCATGGGGACCCTGGACCCCTGTTCAGCGCCATTGTCCTCGACCAGTTTGTACGTCTGCGGGATGGTGACCGCTACTGGTTTGAG
AACACCAGGAATGG
GCTGTTCTCCAAGAAGGAGATTGAAGAAATCCGAAATACCACCCTGCGGGACGTGCTGGTCGCTGTTATCAACGTTGACCCCAGTGCTCTGCAGCCCAATGTTTTT
GTCTGGCATAAAG
GTGCACCCTGCCCTCAGCCTAATCAGCTCACAACCAGCGACCTGCCCCAGTGTGCACCCCTGACTGTGCTTGACTTCTTTGAGGGCAGCAGCCCTGGTTTTGCCATC
ACCATCATTGCTCTCTGCTGCCTTCCCTTAG
TGAGTCTGCTTCTCTCTGGAGTGGTGGCCTATTTCCGGGGCCGAGAACGCAAGAAACTACAAAAGAAAGGCAAAGAGAGCGTGAAGAAG
GAAGCAGCCAAAGATGGAGTGCCAG
CGATGGAGTGGCCAGGCCCCAAGGAGAGCAGCAGTCCCATCATCATCCAGCTGCTGCCAGACAGGTGTCTGCAGGTCCTGAACAGGCGTCTCACT
GTGCTCCGCGTGGTCCAGCTGCAGCCTCTGCAGCAGGTCAACCTCATCCTGTCCAGCAACCGAGGATGCCGCACCCTGCTGCTCAAGATCCCCAAGGAGTATGACCTG
GTGCTGCTGTTT
AGCTCTGAAGAGGAACGGGGCGACTTTGTGCAGCAACTACGGGGCTTCTGTGTGCGCTGGGCTCTGGGCCTCCATGTGGCTGAGATGAGTGAGAAGGAGCTATTGAGGAAGGCTGTGACA
AAGCAGCAGCGGGAACGCATCCTGGAGATCTTCTTCAGACACCTTTTTGCTCAG
GTGCTGGACATCAACCAGGCCGACGCAGGGACCCTACCCCTGGACTCCTCCCAGAAGGTGCGGGAG
GCCCTGACCTGTGAGCTGAGCAGGGCCGAGTTTGCCGAGTCCCTGGGCCTCAAACCCCAGGACATGTTTGTGGATTCGATGTTCTCTCTGGCTGACAAGGATGGCAATGGCTACCTGTCC
TTCCGAGAGTTCCTAGATATCCTGGTGGTCTTCATGAAAG
GCTCCCCAGAGGATAAGTCCCGTCTAATGTTTACCATGTATGACCTGGATGAGAATGGCTTCCTCTCCAAGGACGAATTC
TTCACCATGATGCG
GTCTTTCATCGAGATCTCCAACAACTGCCTGTCCAAGGCCCAGCTGGCCGAGGTGGTGGAGTCCATGTTCCGGGAGTCGGGATTCCAGGACAAGGAGGAGCTGACA
TGGGAGGATTTTCACTTCATGCTGCGGGACCATGACAGCGAGCTCCGCTTCACGCAGCTCTGTGTCAAAGGTGGAGGTGGAGGTGGTGAGTGTGTGAGGAATG
GTATTAGAGATATCTTT
AAACAAAACATCAGCTGTCGAGTCTCATTCATCACTCGGACACCTGGGGAGCG
CTCCCACCCCCAGGGACTGGGGCCACCTGCCTCAGGAGCCCCAGAGCTGGGAGGCCCTGGACTGAAG
AAGAGGTTTGGCAAAAA
GGCAGCGGTGCCCATTCCCCGGCTGTACACGGAGGCGCTGCAAGAGAAGACGCAGCGAGGCCTCCTAGCCCAAAAGCTGCAGCAGTACAAGCGCTTCGTGGAG
AACTACCGGAGGCACATCGTGTGTGTCGCAATCTTCTCAGCCATCAGTGTTGGCGTGTTTGCAGATCGTGCTTACT
ACTATGGCTTTGTCTCGCCACCCTCGGGCATCGCACAGACCACC
CTCGTGGGCATCATCCTGTCGCGGGGCACGGCGGCCAGCGTCTCCTTCATGTTCTCCTATATCCTGCTCACCATGTGCCGCAACCTCATCACCTTCCTGCGAGAGACTTTCCTCAACCGC
TATGTGCCCTTTGACGCCGCGGTGGACTTCCACCGCTGGATCGCCATGGCTGCTGTTGTCCTGGCTA
TTTTGCACAGTGCTGGCCACGCAGTCAATGTCTACATCTTCTCAGTCAGCCCC
CTCAGCCTGCTGGCCTGCATATTCCCCAACGTCTTTGTGAATGATGG
GTCCAAGCTTCCCCAGAAGTTCTACTGGTGGTTCTTCCAGACCGTCCCAGGTATGACAGGGGTGCTTCTGCTC
CTGGTCCTGGCCATCATGTATGTCTTTGCCTCCCACCACTTCCGCCGCCGCAGCTTCCGGGGCTTCTGGCTGACCCACCACCTCTACATCCTGCTCTATGCCCTG
CTCATCATCCACGGC
AGCTATGCTCTGATCCAGCTGCCCACTTTCCACATCTACTTCCTGGTCCCGGCAATCATCTATGGGGGTGACAAGCTGGTGAGCCTGAGCCGGAAGAAGGTGGAGATCAGTGTGGTGAAG
GCGGAGCTGCTGCCCTCAGGTACTAGGCCCTGCCTGACATGG
TACATCTGCGAGCGGCACTTCCAGAAGGTGCTGAACCGGAGTCTGTTCACGGGCCTGCGCTCCATCACCCACTTTGGC
CGTCCCCCCTTCGAGCCCTTCTTCAACTCCCTGCAGGAGGTCCACCCACAG
GTGCGCAAGATTGGGGTGTTCAGCTGTGGCCCTCCAGGAATGACCAAGAATGTAGAGAAGGCCTGTCAG
CTCGTCAACAGGCAGGACCAAGCCCACTTCGTGCACCACTATGAGAATTTCTGA

Retrieve as FASTA