Entry information : HsDuOx01 (DUOX1 / LNOX1 / THOX1)
Entry ID 3339
Creation 2006-07-26 (Christophe Dunand)
Last sequence changes 2006-07-26 (Christophe Dunand)
Sequence status complete
Reviewer Catherine Mathe
Last annotation changes 2015-12-10 (Catherine Mathe)
Peroxidase information: HsDuOx01 (DUOX1 / LNOX1 / THOX1)
Name (synonym) HsDuOx01 (DUOX1 / LNOX1 / THOX1)
Class Dual oxidase    [Orthogroup: DuOx001]
Taxonomy Eukaryota Metazoa Chordata Mammalia Hominidae Homo
Organism Homo sapiens (human)    [TaxId: 9606 ]
Cellular localisation N/D
Tissue type N/D
Inducer N/D
Repressor N/D
Best BLASTp hits
Perox score E-value HsDuOx01
start..stop
S start..stop
PtroDuOx01 3180 0 1..1551 1..1551
MmulDuOx01 3093 0 1..1551 1..1551
MmDuOx01 2898 0 1..1551 1..1551
CfaDuOx01 2896 0 1..1551 1..1551
Gene structure Fichierperl './assets/cgi-bin/draw_exon.pl' '3339' 'join(45424165..45424222,45426062..45426145,45426343..45426507,45427302..45427489,45427672..45427873,45427980..45428146,45428548..45428608,45428727..45428823,45430122..45430212,45431264..45431366,45431625..45431797,45433093..45433268,45433490..45433608,45434173..45434310,45435388..45435501,45436234..45436433,45437093..45437278,45439631..45439856,45440102..45440195,45440470..45440645,45442830..45442899,45443321..45443446,45444072..45444250,45444484..45444714,45445578..45445677,45446149..45446198,45448000..45448127,45453035..45453188,45453936..45454168,45454417..45454575,45455730..45455885,45455988..45456116,45456977..45457099)' Exons
ExonStart..EndSize ExonStart..EndSize ExonStart..EndSize ExonStart..EndSize
N° 1 45424165..45424222 56 N° 2 45426062..45426145 82 N° 3 45426343..45426507 163 N° 4 45427302..45427489 186
N° 5 45427672..45427873 200 N° 6 45427980..45428146 165 N° 7 45428548..45428608 59 N° 8 45428727..45428823 95
N° 9 45430122..45430212 89 N° 10 45431264..45431366 101 N° 11 45431625..45431797 171 N° 12 45433093..45433268 174
N° 13 45433490..45433608 117 N° 14 45434173..45434310 136 N° 15 45435388..45435501 112 N° 16 45436234..45436433 198
N° 17 45437093..45437278 184 N° 18 45439631..45439856 224 N° 19 45440102..45440195 92 N° 20 45440470..45440645 174
N° 21 45442830..45442899 68 N° 22 45443321..45443446 124 N° 23 45444072..45444250 177 N° 24 45444484..45444714 229
N° 25 45445578..45445677 98 N° 26 45446149..45446198 48 N° 27 45448000..45448127 126 N° 28 45453035..45453188 152
N° 29 45453936..45454168 231 N° 30 45454417..45454575 157 N° 31 45455730..45455885 154 N° 32 45455988..45456116 127
N° 33 45456977..45457099 121  
join(45424165..45424222,45426062..45426145,45426343..45426507,45427302..45427489 ,45427672..45427873,45427980..45428146,45428548..45428608,45428727..45428823,454 30122..45430212,45431264..45431366,45431625..45431797,45433093..45433268,4543349 0..45433608,45434173..45434310,45435388..45435501,45436234..45436433,45437093..4 5437278,45439631..45439856,45440102..45440195,45440470..45440645,45442830..45442 899,45443321..45443446,45444072..45444250,45444484..45444714,45445578..45445677, 45446149..45446198,45448000..45448127,45453035..45453188,45453936..45454168,4545 4417..45454575,45455730..45455885,45455988..45456116,45456977..45457099)


exon

Literature and cross-references HsDuOx01 (DUOX1 / LNOX1 / THOX1)
Literature De Deken X., Wang D., Many M.-C., Costagliola S., Libert F., Vassart G., Dumont J.E., Miot F. Cloning of two human thyroid cDNAs encoding new members of the NADPH oxidase family. J. Biol. Chem. 275:23227-23233(2000).
Protein ref. UniProtKB:   Q9NRD9
DNA ref. GenBank:   NC_000015.9 (45424165..45457099)
Cluster/Prediction ref. UniGene:   Hs.272813
Protein sequence: HsDuOx01 (DUOX1 / LNOX1 / THOX1)
Sequence Properties
first value : protein
second value (mature protein)
Length (aa):   %s   1551 (1530)
PWM (Da):   %s   176975.66 (174803.5) Transmb domain:   %s   o594-616i1044-1066o1081-1103i1149-1171o1186-1208i1221-1243o (o573-595i1023-1045o1060-1082i1128-1150o1165-1187i1200-1222o)
PI (pH):   %s   7.94 (7.96) Peptide Signal:   %s   cut: 22 range:22-1551
Sequence
Send to BLAST
Send to Peroxiscan
.........1.........2.........3.........4.........5.........6.........7.........8.........9.........0.........1.........2
MGFCLALAWTLLVGAWTPLGAQNPISWEVQRFDGWYNNLMEHRWG
SK
GSRLQRLVPASYADGVYQPLGEPHLPNPRDLSNTISRGPAGLASLRNRTVLGVFFGYHVLSDLVSVETPGCPAEFLNIRIPPGDPMFDPDQRGDVVLPFQR
SRWDPETGRSPSNPRD
ANQVTGWLDGSAIYGSSHSWSDALRSFSRGQLASGPDPAFPRDSQNPLLMWAAPDPATGQNGPRGLYAFGAERGNREPFLQALGLLWFRYHNLWAQRLARQHPD
WEDEELFQHARKRVIATY
NIAVYEWLPSFLQKTLPEYTGYRPFLDPSISSEFVAASEQFLSTMVPPGVYMRNASCHFQGVINRNSSVSRALRVCNSYWSREHPSLQSAEDVDALLLGMAS
QIAEREDHVLVEDV
DFWPGPLKFSRTDHLASCLQRGRDLGLPSYTKARAALGLSPITRWQDINPALSRSNDTVLEATAALYNQDLSWLELLPGGLLESHRDPGPLFSTIVLEQFVRLRDG
DRYWFENTR
GLFSKKEIEEIRNTTLQDVLVAVINIDPSALQPNVFVWHGDPCPQPRQLSTEGLPACAPSVVRDYFEGSGFGFGVTIGTLCCFPLVSLLSAWIVARLRMRNFKRLQGQDRQ
SIVSEKLVGGME
ALEWQGHKEPCRPVLVYLQPGQIRVVDGRLTVLRTIQLQPPQKVNFVLSSNRGRRTLLLKIPKEYDLVLLFNLEEERQALVENLRGALKESGLSIQEWELREQELMRA
AVTREQRRHLLETFFRHLFS
QVLDINQADAGTLPLDSSQKVREALTCELSRAEFAESLGLKPQDMFVESMFSLADKDGNGYLSFREFLDILVVFMGSPEEKSRLMFRMYDFDGNGLISKD
EFIRML
RSFIEISNNCLSKAQLAEVVESMFRESGFQDKEELTWEDFHFMLRDHNSELRFTQLCVGVEVPEVIKDLCRRASYISQDMICPSPRVSARCSRSDIETELTPQRLQCPMDTDPP
QEIRRRFGK
KVTSFQPLLFTEAHREKFQRSCLHQTVQQFKRFIENYRRHIGCVAVFYAIAGGLFLERAYYAFAAHHTGITDTTRVGIILSRGTAASISFMFSYILLTMCRNLITFLRETF
LNRYVPFDAAVDFHRLIASTAIVLT
VLHSVGHVVNVYLFSISPLSVLSCLFPGLFHDDGSELPQKYYWWFFQTVGLTGVVLLLILAIMYVFASHHFRRRSFRGFWLTHHLYILLYVLIIH
GSFALIQLPRFHIFFLVPAIIYGGDKLVSLSRKKVEISVVKAELLPS
GVTHLRFQRPQGFEYKSGQWVRIACLALGTTEYHPFTLTSAPHEDTLSLHIRAAGPWTTRLREIYSAPTGDRC
ARYP
LYLDGPFGEGHQEWHKFEVSVLVGGGIGVTPFASILKDLVFKSSVSCQVFCKKIYFIWVTRTQRQFEWLADIIREVEENDHQDLVSVHIYITQLAEKFDLRTTMLYICERHFQKVL
NRSLFTGLRSITHFGRPPFEPFFNSLQEVHPQ
VRKIGVFSCGPPGMTKNVEKACQLINRQDRTHFSHHYENF*

Retrieve as FASTA  
Remarks Complete sequence from genomic (chromo 15, )9 cDNA and 102 ESTs.
DNA
Send to BLAST
.........1.........2.........3.........4.........5.........6.........7.........8.........9.........0.........1.........2
ATGGGCTTCTGCCTGGCTCTAGCATGGACACTTCTGGTTGGGGCATGGACCCCTCTGGGGGTGAGTACAGATTGGAGGAGAAGCATGGTTAGGAGCAGAGGAACCCAGCATCCTCTGGGC
TCCCCCAAGGACAGGATCCATTTGATATTCATCTGTATCTTTAGTCTCTAGCTGACACAGTGCCCTGCACATGAGAGTTGCTCATTTCTAGCTGTTGAAGAAATAGATGGATAATGGAAT
GAAAATTATCCCCAAATACACAGAATTAGGGAGCCTTCAGCTGCTGTCAGCACTTTCTCTTTTATTCTTACAGTTTGGGGGCCCTTTCTCTCTCCTCTCTCTTCTCTTGTTTTCCATAAA
GTGAGGGCCTCTGACTCAAAATAGGAAAAGATTGATTAGACACATAAGACAATAAAGGATCTTGGGGTCAGATTGGACCCAAAGCTGAAGGGGAGTCATCAATATGTTTAAACATTATGA
TGAAAAACAAAAGCAATATCTTCCATAAGAGTTTCATGTTGCTGCCTTCAGTAGACAGTTTCTTGCTGGAATCTGAGGCTGCTCTCTGCAGTCCGGATCTCTGTGGAGGGAAGTGGGCCT
ACAGTGCCACAGTTAAAGTTAAAAGATCAAGTCCCTGATCACTGGGGATGGGAGATGCAAAAGCAGAAAGCAGAGAAAACCTGTAGAAGGATCTGTCCTGGCTGGAGGTGACAGGATGGT
GACACCTTCCCTACAACTTCTCCCTTGCCAGAGCTACTCACATCCACGTTCCTTCCAGCCCTCTTCTTTCCCTAGCTTTGGATTTCCCTTTTTTCCCTCAGCTCTCACCTCTTATCCCCA
AGAGGCGCTTTCCCTGTACTGTCCTGCAAAGGCTCTCTCCTTTATGTGCCACAGGTTCATTTTATTTTAGAGATCACATGTGTGGCCCTGCCCAAAGTCTTGAACCTTAGCTGGGTTCTT
TGCTCAAACATCAGGCCCTGAGTCATTTTCCCCCCTGACTCTCTGCTGAGTGAGTGTGAATGTGTGCGTCTGTTCTCTGAAGTGAGCAGAGAAGAAGCCCCAGGCCTATAGACTCCTAGT
TTGGGGGAAAGAATCAGGCCAGCTATCTGCTCTACTGATCAGAAAGATCCCTGCTCCATCCCTGATTCCATTATCCATTCTGGATCTTCCCTCTGTCCCTCATTCCTCAGTTCTTTCCCC
AGCAGCTGAGTGCTCCCTCCTGGACCCTCATTTTGGAGAATACAACAGGATCAAGAGGAGGAATTGTATGGGGGACAAGATGCCCCAGAAAAAGGGCAAAGTCATCCACATCACCAGTCC
AGTCCCTGGTCCTACCCAAGAAACTGGGGAAAAAAGCAGTAGCAACACCACACCCTTGCCCTGGGCTCTGCTCAGCCAGAGATGACTAGTCCCCTCACCACACCGACCCTGCCCAGGGAT
GGGTCACAGAATGACATGTCCTTCTCCGGTATGGAGTGTGGCTGGCTCTTGCTCTGTGTGGTGTCTTAGGGCTGGGGATGGATTCTGTCCTGTTGTGTCCCCTCAGGGAGCTGAAGTTTA
AGGGGAGTAAAGTCCCCTGCATCCTATTCCCCATCACCACTGCTCTCCTCCAGCCCCCTCTCCCTCTCTGAGGCCGTCTGTCAGCACACTCTGCTCCCTGCTCCACTGCTTTTTGGCTCC
CTGCCATGTCTCCCCCACTGAGCAGCTTAAAGATCTGGCTTGAGTTTGGACACTACTTTGCCCTGGTTTTTCCCTCTGCACCCTACCTCTCACATCCACTTCAGGTATCCACTTTCTGCT
GCTCCAAGTGCCAGGCTCACAGCACCCATATCCGGCAGGCCTGCCTGTCCTCTCACGCACTGACTTGCCCAGCTGCCCCTTCCCTCCATTCTCACACAGGAGCTCAGAACCCCATTTCGT
GGGAGGTGCAGCGATTTGATGGGTGGTACAACAACCTCATGGAGCACAGATGGGGCAGCAAAG
AGGTAAGTGAGAGCCAAGTGGGGATAGAACCCCAGGGCCAGGGGGGTACTGAGTGCT
GCGGGGCAAGAGCTGGCAAGTACCAGCAAAGGCCATCCATTTCCAAGGTTTTAGGATCCATGACATGGAGGAAAGCCTGGGAGAGAGGGGGTCAAGAATGCCCCCTGAAGATTCATCCTT
ATCCTTACCCCTCCTACCCCAGGCTCCCGGCTGCAGCGCCTGGTCCCAGCCAGCTATGCAGATGGCGTGTACCAGCCCTTGGGAGAACCCCACCTGCCCAACCCCCGAGACCTTAGCAAC
ACCATCTCAAGGGGCCCTGCAGGGCTGGCCTCCCTGAGAAACCGCACAGTGTTGGGGGTCTTCTTTG
TGGTGAGAACTTCAACCTCTGGGGAAGGAAGCCGGTGGGGTCGGCTGGACACC
TCTGCATGTGAAGAGGGGTCAGAGGATGAGAGAGAAGTGATGGAAGGCCTAAGGGATGAGGGTGAGAGGTGGAGGAGGCAGTGGGGTATGGTAGGGGTAGAAGCAGGTGGAGAGAGGATC
CATGCTGAAGTGAACTCTGGTTGGGGAGGCAGAAATCTGCCCCACACAGAAAGATGAGGGGTGGGGTTAGGGAAAAGGGGCATCCCTCCTCGTTGGGTTGTTGGTGGGGGTAAGGGGCTG
GTAGAGGCAGAAACATGAAAATAATCTGGGAGGGCCTCTCCGTGGGACCCTGTTCTTAGGCATTGCCCCATGGCTTCTAGGACAGGCTCAGGGCAGGGTTCTGATTCTGGCAGAGTGGAC
TGAGGGGAAGTCAAAGTGGGGGTGCTAGGATGGCTTGACCCCAGTGACAGGATTGGTGGCTGGTCCATCAGGAATGCCTGGCTCCTGCTGCGTTTTGGAGCTGGTGCTGCCGGCCCGTTT
GCCTAGGCACACTCACAGGGCACCTCCCAGTTACCAAGGAGAAAGCCAAACTGTCAACGACAACAGCTTGGATCAGGTGTAGGCACAGAGGGACAGAAGAGACAAACAACTGGGAGGGAG
TGCCTAAGGTCAGGGCTTGGACTTGCCCATAATGGCTGCTGAGTCTTTTGAAGCTTCAGGGGAGGGAAGGAAACTGGATACCTAGGGTAAGAAGGAAAGTGGCCCTCTGCTTGCCCTAGC
ACCCCCTCCTGCACTGCCCGCAGGCTATCACGTGCTTTCAGACCTGGTGAGCGTGGAAACTCCCGGCTGCCCCGCCGAGTTCCTCAACATTCGCATCCCGCCCGGAGACCCCATGTTCGA
CCCCGACCAGCGCGGGGACGTGGTGCTGCCCTTCCAGAGAAGCCGCTGGGACCCCGAGACCGGACGGAGTCCCAGCAATCCCCGGGACCCG
CGGTGAGGCGGGGAAGGCGGCGGGAAGGG
ACCGCACCCCAGCCAGGTGGGACCTGGGCTTCGGGCCTGGCAGGGCCTGGAGGGGAGAGGCGCCCACTCCCCAGCCGCGGACACCCGCCGGGCCCCGGCCTTCCCTAGCTCGCCGCCGCC
CATCGACCCGGGCTCACCCGCCGCGTGCCCCGCAGGCCAACCAGGTGACGGGCTGGCTGGACGGCAGCGCCATCTATGGTTCCTCGCATTCCTGGAGCGACGCGCTGCGGAGCTTCTCCA
GGGGACAGCTGGCGTCGGGGCCCGACCCCGCTTTTCCCCGAGACTCGCAGAACCCCCTGCTCATGTGGGCGGCGCCCGACCCCGCCACCGGGCAGAACGGGCCCCGGGGGCTGTACG
CGG
TGAGGCCACAGGGGCGGGACGGGGCCGGCTGGGGGTCTGCGAGTGTGGGCTCCCCCGATCACGCTACCGCTCGTCTCCTCCCCTGCGCCCCCACGTCGGATGCAGCCTTCGGGGCAGAGA
GAGGGAACCGGGAACCCTTCCTGCAGGCGCTGGGCCTGCTCTGGTTCCGCTACCACAACCTGTGGGCGCAGAGGCTGGCCCGCCAGCACCCAGACTGGGAGGACGAGGAGCTGTTCCAGC
ACGCACGCAAGAGGGTCATCGCCACCTACCAG
AGGTCAGCCGTCCGCGCCCCGCGACGTCCTCCCTTCCGCGTGCAAGCCCACGGGAGACTCCGCTGCCCCATGGAGCTCCCCATCTGTG
GACAACCGCCACCCAGAAACCCCTCCCCAGACAGCCGAGGTCCAGGGAAGCCCCTGTAAATGATAGGGAGGCACGCGCTGTTTATAGGAGAAATCTGGCTGGTGATGATTATTTATCACC
TCCCCACCCCCCACTCCCTCAAATCCCCTGGTTCCTTGTGGGGACAGGCCTCACACTGCTCCTGTTTGAGTTGCTTCTCCCATGACTGACCCTGGCTGGTCCTCATCTCCACACGGAAGC
CGTCCTTGGGCTCAGACCCTTCCAGGTCCCTCCCCATCCAACTCGTGCCTCCCCTCGCCCCTCTCTGCCCCTCAGAACATCGCTGTGTATGAGTGGCTGCCCAGCTTCCTGCAGAAAACA
CTCCCGGAGTATACAG
AGGTGAGGGAGCGGGGAAGGAGGATGGGAGGGCTTGCGTGTGTCTTGCGGGGTGGGGTGGGGACTACGTTGGGGATTTTAGGGCTAAATTCTTCTGTCCTCTCT
TCTCCTATTTCCCCAGGATACCGGCCATTTCTGGACCCCAGCATCTCCTCAGAGTTCGTGGCGGCCTCTGAGCAGTTCCTGTCCACCATGGTGCCCCCTGGCGTCTACATGAGAGGTGAG
GGAGGGGCTCAAAGGTGTGTGTGCTGGGAGGGATGGGGCTGTCAACTGAGGAAAATCTGCCCTCAGGAGCCCTCTGTACAGGATTATCAGTCTGAAGTGTCCCCAAGGGAAAGACCGATA
GAGAGGGGAAGAAAACAATTGTTTTAAAAGATACATCACTGTGGTTTTTAAGTATTTTTACTTCCATTCTCCTGCTGCTGGTGTAGATATTACCCTCCCCCCCACCATTAAATATAGAGT
AATAAAGGTTCAGAGGGTGTCACAAAGTCCACTGGTCTAAACCTCATGTCTTGACACTACACTGGGGACATTTTTAGCTATAATCACAGAGCCCAGGGCCTGCAACCTTTTGGGGGCAGA
GGGTTCTAAAATGTGTGCAATCTGAAGGAAAATATTGGCTTCAAAATAATGAAAAGGGAACTTTAAAAAGACATAGGCCGGGAGCAGTGGCTCACGCCTGTAACCCCAGCACTTTGGGAG
GCAGAGGTGGGTGGTTCACCTGAGGTCGGGAGTTAGAGACCAGCCTGACCAACATGCAGAAACCTCGTCTCTACTAAAAAAAAAAAAAAAAAAAAAAAAATTAGCCAGGCATGGTGGTGC
ATGCCTGTAATCCCAGCTACTCGGGAGGCTGAGGCAGAAGAATCGCTTGAACCCAGAAGGCGGAGGTTGCGGTGAGCCAAGATCGTGTCATTGCACTCCGGCCTGGACAACAAGAGTGAA
ACTCCATCTCAAAAAAAAAAAAAAAAAAAAAAAAGACACATATTTAAATAATGTATATGTATGTATATGTATATGAAACAAAGCATGTCCATTTATATTTTTACAATTGCTTTATAATTT
AAAACAATTCATAGATTAGATTTACTCTGGAATAATCTCTGGGGTATGCCTGATGATGTCTGAGAAATAGACAATTGAGTTATTCATAGACAAGTAGTTTCAGAAGAACAATTTTTGAAT
CCTTTAAATAGGTATATATGTACACATATATTGTACATATAACTGTGTATATATATATTTTATATGTGTGTGTATATACATATATTGAGAGAGAGTCAGAGAGAGAGTGTGTGTGTTAAT
GAGCTATGAGTACATTTTAGATATTTGGTCTGATGTTGTAGATGGGACCTTCCAAAAGTCAGACCCTCCAGGATGTGCTAAGGTCCGAACCACACACCACCTCGGAAAGACGATGGCCTG
GGGTCCCTGGATACCGCCTCAGAGACCAGAACTGATGCCTGACATTATAACCCCATTATCTCAATCACCATCTCCCTGCTCCTTGCATTTCAGAAATGCCAGCTGCCACTTCCAGGGGGT
CATCAATCGGAACTCAAGTGTCTCCAGAGCTCTCCGGGTCTGCAACAGCTACTGGAGCCGTGAG
AGGTCCGAGCTGGGGGCCACATATGTGGTGGATGTGTGTGTGTGCATGCTTATGTG
TGTGTGTGTATGTGTGTGTGTGTGTGTGTGTGTGTGTGAGTGCATGGTGAAAGTGGTCAGTAAGGCTGACGCAGGTAGCTGGGATAGGCTGGGGTGAAGGAGGCAGCCTGGAGGACCTCA
GGCATCTCCCCTACAAAGTCTGAGGTGACCAGGTAGCTGAACATGGCCAAAAGCCAACCCACCACCCCAACCTCAGGGCTGTGGGAAAGCCAGTGGGACCTGTGCAGGTATGGTTCCGCC
TTCGTTGCTCCCTCTCACGTTTTCCCTCTCCTTCGACAGCTTTGGGCGTCCTGTAGGCCTTGGAGTACCCCAACTAGTCTCACTTGCCCCAAGGAAACATCCCATTAGACACACATCCCG
GGCAGTGTGTGAATGCGTATGTGTGAGGGGGGTGGGGAGGAAGCAGGGGTTCCTAGAGATTATTGGGGATTGTCCATCCATCCAAATCAGGCCAGCATGGGTGAAGAACATTGATTGTGA
ATTTAAAGCAAAATGAGATACTTTGTGTAAAGTGTTTACTAAGGGGAAGGGAGTGGGGAGGAAGAAAAAAGAAGCCATAGAATCAGAGCCCTGGAGCTGAGTTGAAAGAGGACTCAGTGT
CCCCCTCATCAGAGTCCCTTTTTATTAAGGGAAGGAAACTGAGTTCAGAGAGGGCCTGTGACTTGCCCAAGGTCATACAGCTGGTTAGTAACAGAGGACTAAGACCTAGATCTTCTGGTT
ACTTTGCCAACCACACTGTCAAGTATCCCAAGGTCAAAGTAAAATTTTAAGATCCCTTGTGCTTGTATAGACCTCTGTCAGGCAGCACTGACAGAGGGGACACCCCTCACCCTGAGTGGG
GAATCAACCTGCAGAAGAGGAATGTCCCACCGGTTTGGGAGCAAACAGCAGGACTGGGTGGGCTCAGGGATAAGGATGATGGTGTGGAGGGAGCCGGCTGTCTAACCTCTGACCCCATTA
ACCACACCCCTTTCCTCCCCACCCCCAACCTATAAAGCACCCAAGCCTACAAAGTGCTGAAGATGTGGATGCACTGCTGCTGGGCATGGCCTCCCAGATCGCAGAGCGAGAGGACCATGT
GTTGGTTGAAGATGTGCGGG
GGGTGAGTCTGAGGCTGTCCCTGCAGGTTGTGAACTCCTGGCCTCTGGGAAGAGGCTCAGCTGGACTTCCAGAACCAGGTATGAGGGGGTGCCTGGGGCT
GGGAGCCTGCACTGCACTCACTGATGAAAGAGCTTCTGACATGCTGACCATGGCTCCTTCTGGCTCAAGTCTCCTGGGGCTGGTTGGAGCTTGGGGCCTGGACTGGACACTGCTGTGGTG
GCCCTGAGCTCCCTCAGACTGTCTGGTATCTTGTCTCCAGATTTCTGGCCTGGGCCACTGAAGTTTTCCCGCACAGACCACCTGGCCAGCTGCCTGCAGCGGGGCCGGGATCTGGGCCTG
CCCTCTTACACCAAGGCCAGGGCAGCACTGGGCTTGTCTCCCATTACCCGCTGGCAGGACATCAACCCTGCACTCTCCCGGAGCAATGACACT
CTGTGAGGAGGGGTCAGGACCCAGAGG
GTAGGGCGGGAGGGACAAGGCACGTGGGCCTGAGACATGGAAGATAGGCAGTGAAACTTGAGCACAAGAGACAAGCACCTAATGGGAGGGGCAGGGCTTACCCAGATCACTTTTCCCAAT
ATTACATATCAGGATGAAGATTACCAGACCAGCAAGATTTGGCTTTGCACTCGGATTGTTCAGATCCAGCAAGCGTGTATTAAGCATCTATTGTTTGCTAGGCACTTAATAGTCACTATC
TTATTTACTGTTCTCTCCACCCTCCTGAGTGGGTATAGAAGGTCCTTGACTTAGGAACAATCTGTGCTTTCTTGCACCGATGTTTAAGCCACCTTTCAGGAGTCAGCTGTTACACCCTGA
GCTACCACCTGGACTGGTGGCCCAGGCCAAGGTAGAGCCTGAGGCGGAGGCCCAGCATGCCTTACCCATTCCTGGTGTTCTCAAACCAGTAGCGGTCACCATCCCACAGCCGCACAAATT
GGTCAAGGATGATGGTGCTGAACAGGGGTCCTGTTCAGCTGCTAGCCAAGAGACCAGATTGGGATACCTTACCATCTTGACACAGCAGTGTTTCTACAGTTGTGTGCCTATTCTATTGCA
TTTTGCCCTTACTATTTTATAAAATTATTGGAAAGTGGGGATTTGAAAGGCCAATAAAGAAGTGATATGAGCATGGGGGCTCGTAAACTTTATAAAATCAATATATAGATAGGAACCACC
AGGCACACCAAAAGTAGCAAATCTTTAGCTTCAAATGGGCCTCAGTGGGATTAAGCTGGTCAAGTGCTATTTACACTGTGGAGAAAAGAAAACTAAAATAAAGGTCAGGTAGAAAATGCT
GGTGATATGTTTTTGAAAACTGTCTCCAAAAGGCAATATATAATCAGGCATTTGATTAAAAAAAATTTTAGTTCTAAAGAATAGAAACCTTTTATAACACATCATTCTGCCATTCAACTT
TTGCACATTCTGATTTTGTCCCTTCAACTTTCATACCTTTTCAAGAACAAGTTATGAATGCGGATTGGCTGGCTGTTTTATTACTTTATAAAATAGGCAGCTGAATGTGTGAGAGGGTTC
CCATAAATGGAGTTCACTTATGTAAAACACTCAGAGAAGTGCCTGGGGCATAATAAGCACTGTATAGGAGTGAGTTACTGTTACTGTCATTTATTATTATCATCCCTGGGGCTAATCCCT
GGGAGCCACCTCTTTGCCTTAGGGTGGGTGCCGTGTCCTTTCTCTTACTTCTGCCCCTTCTTGGTTCCAGGTACTGGAGGCCACAGCTGCCCTGTACAACCAGGACTTATCCTGGCTAGA
GCTGCTCCCTGGGGGACTCCTGGAGAGCCACCGGGACCCTGGACCTCTGTTCAGCACCATCGTCCTTGAACAATTTGTGCGGCTACGGGATGGTGACCGCTACTGGTTTGAGAACACCAG
GAATGG
GGGTAAGGCGTGCTGGGCCTCCGCCTCAGGCTCTACCTCGGCCTGGGCCCCAGACCCTCTTTCTGGCCTTAGACAGCCCCCATGAGCCCTTGATTCCAAGCCAGCCCACCACCC
ACTTCCCAACACCTCTGGGTCTCTTTTCTCACCTGGGTCCTTGGGCCTGGGGTTGCTGGAGGCCTGCATCCCCTTCCCATCCCAGTGACTTCTACTTACTCCAACTTAGGCTGTTCTCCA
AGAAGGAGATTGAAGAAATCCGAAATACCACCCTGCAGGACGTGCTGGTCGCTGTTATCAACATTGACCCCAGTGCTCTGCAGCCCAATGTCTTTGTCTGGCATAAAG
AGGTGAGTGGCC
AAGGGGTGGCTGGAGGAGTGGTGGGTCTGGAGCCTCGTCCCTCTTCAGCTCTGGGCTTGGCTCAATGTGGCTGAACGTCAATCTCTGTGCTCCAAGAGGGGAGTGAGCTGTGGTTCTGCC
CTTGGGAACTCCAGGTGCCAGGGCCACCACTTGCAGCTGTGTAGGGTCCCCTCTGCAGAACTCCAGGAAGTGCTGAGTGAGGGCAGAACTGAGACTCAGAGAGAGCTGGTGAGGAGGTTC
ACATTCGGCTGAGATGGAGCTCGGGCTAGGTATAATATTAGGAGGGCTCAGTGCAGTGAGGATATCCCAACCCTACAGCAGTGAGGGAAGCGTGTGTGTGTGTGTGTGTGTGTGTGTGTG
TGTACACTTCTGTGTGTGAGGGAGAGATGGAGGTTGGGATTCATTTTTTACCCCACTTGTTACCCAAGGCAGCCCCTTCCCCTCAGCTACCCAGAGTGCCCCCACCCCCTTTCTGCCACC
CTAACCTCCTCCGTGATCCTGTGCCAGCACCTGTGGCCCAGCACCCAGGACGCTGGCTTCCTCTGCCTTCCCAGGAGACCCCTGTCCGCAGCCGAGACAGCTCAGCACTGAAGGCCTGCC
AGCGTGTGCTCCCTCTGTTGTTCGTGACTATTTTGAGGGCAGTGGATTTGGCTTCGGGGTCACCATCGGGACCCTCTGTTGCTTCCCTTTGG
GGGTAAAATCATGGACAGAGTGGGGTGG
GGTGAGAGATGCAAGCTAGGGGATGCAGTTTGGGTGGTTCCACTAATGACAACAAACCACGCAAACCACACAGAGCATGGTCCCTTTGGGTAGGAGAATGAAACATTAGAGGAGGAAGGG
ACAAGAGAAGTGTTTGACCTGCAGAGAAGTCAGCTCCAGGGAGAATTGCCTCCCTGTTTTCCTGGGACTGCCCCCAGGGCCCAGCATTGGTCAGGAGCCAGGGGGAATCTATAATCCACA
GCTTTTTGCAGGCCTGAGCCTAGAGGACTGTCAGAGGCAAATCCCTGTTTAAGAACAAGGGCTAAGACGGGGCGCGGTGGCTCATGCCTGTAATCCCAGCACTTTGAGAGGCCGAGGCAG
GCGGATCACCTGAGGTCAGGAGTTTGAGACCAGCCTGATCAACATGGAGAAACCATGTCTCTCCTAAAAATACAAAATTAGCCGGGCATGGTGGCGCATGCCTGTAATCCCAGCTACTCG
GGAGGCTGAGGCAGGAGAATCAATTGAATCTGGGAGGCAGAGGTTGCAGTGAGCTTAGATCGCGCCATTGCACTCCAGCCTGGGCAACAAGAGCGAAACTCTGTCAAAAAAAGAAAGAAA
GAGAGAGAGAGAGAATGGAAGGAAGGAAAGAAGGAAGGAAGGAAGGAAGGGAGGGAGGAAGGGAGGGAGGAAGAGCAGGCTGGGGCAGACATAGTCGCTGGAGTGAGGGCTTGGGGCTTA
GTCTTTGGCAAGGCTGTGTGTGGGTCAGCTAGCAGCCTGGTGGGGTGAGTCTAGTCTCTGACACAAGGACTCTGGACTTAGTCTCTGGCCAGGAAGGGACGTGAGTGGTGGAATCTGAGG
AGGAAGGGTGGGTATCTTAGATGCTACCCAAAGCTCCCCATGGGATGCAGAGCAGCTTCCCCCAGGGACCTTCCCAATCTGAAACAATTGAGCAAGCTGACCTAGGAGGTGGGGACAATA
GGTGGTATTGCCAGGTAAGGAGCTGAGAAAGGAGCTGCTTCCATCCCCTAGACCCCCACGTCTCCCTGAACCTCTGCCTCTGCCCTCCCAGTGAGCCTGCTCAGTGCCTGGATTGTTGCC
CGGCTCCGGATGAGAAATTTCAAGAGGCTCCAGGGCCAGGACCGCCAGAGCATCGTGTCTGAGAAGCTCGTGGGAGGCATGGAAG
AGGTAGGTCTAGGGCTGGCCAGGGTGGTGGTAGGG
AGGACATGGCTCAGCGCTACAGCTACCCACCTCCACCCCAGCAGCCTAAGGAAAAGGCCTCCCTTTTCTAGGACTGTAGGCAAGGCCACAGTGGCATTAAGCAGAAGTTGGACATGGAGT
CCTGCAGCCTCCAGAGTTTCTCATGTGCAGCAGTCATTGGACCTTGCCTTACAGGGAGTAAGTGTAAGACCCTGTAGTGATCTTACAGGGTCACTATGACTTACGAATCCCTGGGAACTG
CTGATTTGTAGCATCTCACGCTGGAACTCCAGGCAACAGAGAAGTGGTTAATGAAGGTCCAGAGTCAGACTGGGATTGGAATCTTGGTTCCACCACCACTATGGGCTATTGGGCCTTAGG
CAGATTTTTGAATCTCTCTAAACCATTTTTTTCTTATCCATAAAATAGGAATAATAATAGCACCTACCTCACAGGTTTATAATTAAATGAAATAATTCATGCAAAGCACTTGGTATAGTA
TTGAGCACATAGAAAACATTCAGGAAATGATAGTTACTATTCTTTTCATGGGGCTAGAGGCACAGCCAGGCTTTCAGGGAGGGAATGACTCTCAGTACCCCAGAAGGGGACAATGAACTG
TGGAGGCCTTGATCTAATTTCAGCCCTGAGCTGGCCCTCTTCCTCCCAATGTACCTCTGATGGGTCCCAGCTGACGAAGCCCTGTCTCTCTCCCCCTAGCTTTGGAATGGCAAGGCCACA
AGGAGCCCTGCCGGCCCGTGCTTGTGTACCTGCAGCCCGGGCAGATCCGTGTGGTAGATGGCAGGCTCACCGTGCTCCGCACCATCCAGCTGCAGCCTCCACAGAAGGTCAACTTCGTCC
TGTCCAGCAACCGTGGACGCCGCACTCTGCTGCTCAAGATCCCCAAGGAGTATGACCTG
TGGTATGGCTCAGCTGGCATCTGGCTCCTTGTCCACAGCCAAGGCCAGGGAGGCAGCCAGG
TGGAGGGGAAGAAGCATGGGGTCAGGAGGCAGGAAATGATAGTTACTGTGCAGGAGTCTGGCTTCTTGTTCCCAGGTGGAGTACCTGATAGAGAGATAAACTTGGACACGTTTATCCTGC
TCCGGGTCTGCTGGATGACATAATCTCCATGGTTTCCTCCTAACTCTTGCAACCACAGTCTGCCTCCTCCTTTTCTAAGGACAACCTGTCAGGTCCCAGCTGGGTGTCCTTGGAAGAAGC
AAAGCCCTCTTCCCTGTCCTCTTATCTCAGGATGTAGTGAATTTTTAGAGATTTTGGAGGAGAGCTAAATCAGAGGCCTGGCCCATGATCCCCTCCATCCTCTGGCCTTCCTGCCCTCCT
CTGCCTTCTCCAGTATGGAATAGAGGGAAGAATGTCTTTGTACATGTGCCTTTAAATCCAAACTCTGCACCTATGAGCAAGTCACTTAACCCTACTGAGCCCCTTTCCTCTTCTGTAAAA
GGAGAAATAATATCCCTTTTTCAAGGCCACCCCAGTGGCCCCTCTGACATAATCCACATAAACTGCCTAACCTCAAGAGGCCTTGGCAAATGGTTGCTTTTGCTCGGGCTGCCCCCTTAG

GTGCTGCTGTTTAACTTGGAGGAAGAGCGGCAGGCGCTGGTGGAAAATCTCCGGGGAGCTCTGAAGGAGAGCGGGTTGAGCATCCAGGAGTGGGAGCTGCGGGAGCAGGAGCTGATGAGA
GCAGCTGTGACACGGGAGCAGCGGAGGCACCTCCTGGAGACCTTTTTCAGGCACCTTTTCTCCCAG
AGGTGTGTACATGGGACCAGATCAATCCTTATGCTGTGGTGGTGTCCTTACCTG
CATGAGGCCATGGGGTGACTCGAGGGGAAGTCAAAGCCCAGAGTTCTCAGCTAATAATCAAGTCTATTGCAATTTTGGATGAATCACCAGACTTTATCATAAGGAGTTGCCCCTGCCCCC
ATCTGAACCCCACCTCCAACTACGAGGCTCCAGTGGGGGGATGCCCAGGAATGGGCTTCTCTGCATGTGTTGCTTATCGCTCTCCTCAGGGCTGCCCAGACTTTCCATCCTCACTCATCT
CAGCCTGCCTCAAAGGCTGCAGAGACCCAGAGCAGCGTGGGCACCATAAGGGATATTGTTCACCCCTTCTTGGGGGAGTGCAGCTGCCAGAGCCCTCCATGGGGCATTGATGTCCACCTT
GCAGACTTCTGAGAGAGACTGGGAGGATATAGGGAAACAAGACAAAAATGCAGAGACTTGAGTCCACAGCTGGCACAAAAATAGAGGCAGCCTGGGAGCAGGGTAGTGGAGTGGAGAAAC
ATGGTAGATAAAGATAAATATGGTCAGGTGCGGTGGCTCACACCTGTAATCCTAGCACTTTGGGAGGCGGGGCAGGTGGATCACTTGAGGTCAGGGGTTTGAGACCAGCCTGGCCAACAT
GGTGAAACCCCGTCTCTACTAAAAATACAAAAATTAGCCGGGTGTGGTGGCGGGTGCCTGTAATCCCAGCTACTTGGGAGGCTGAGGCAGGAGAATCACTTGAACCTGGGAGGCAGAGGT
TGCAGTGAGCCGAGATTGCACCACTGCACTCCAGCCTGGGCAACAGAGTAAGACTCCATCTCAGTTAAAAAAAAAAAAAGATAAATACAGCATGGTTCTCCCACTGAAGAAGCTTATCCT
TCCACTGGGGAGATGGAACATGCACAGAAAGGAAGATAACTGGCAAAGAATGAAGGCTACAAAAGTGATTCAGGAAATCAGCAATCCGGGAGCTCAGAGAAGACAGTGAGTCTTCAGCCT
GCTGTGAGATCATGATGGAGGTGATATTTGGCAAGGACTTAAAGTTGGACAAGTTTTGGCTCAGAGGAAAGGAACAGGAGGGACTAAAAAGAGGGCTCCATGGAAGAGCAAAGAAATTAA
TGTTTGAGGAGAAAAAGCAGGATCATGTGGTTGGGGAGAGGATTCATGAGAGAAGCATCCATTGAACAGACATTTACTCTGTGCCAGGAACATGATTTAAAGTCATTTGAGGACAAACAA
GTTGAACTTGAATATGTACAAGAACAGAGGGACCCTCAGCTGGGTGACTCCTTCCCAGAGCCCAGGCAGGACTGGGCATCTCCACTCATTCAGGATCCTTCGGGGCTATGGGAACTCCCT
GTGCTGAGACACGAAGGGTGCTGATATATCCTGGGGGTGGCTGTCAGTGTCTCATATGATTTCAGCTTTCAGAACATTTTCAACACAACCACTTTTGTTGATCTCGCAATAAACCTGTGA
GATAAGTAGGGCAGATGAGGAAATGAATTCCAGGCAACCGAAGAGACTTGCATGAGGTCATTCAGCAAGCAGGTGACAGAGTTGGGACATGAACCCAGGTCTCTTGCCTCCAGCCCTAGG
CTCTTCCCACTACAGCACCTTGTTTGAAGCTGTAAGGTTTACCCCATGAAACCATCACCAGACTTGGAGTAGGCACCTCACTTGACAGGCAGGAAAGACAGGTACTGTGGCCTGTGGCCT
GCCTGCTTCCTCACTGTCTGTTCTCTCATACTGGGGAAGCCTCTGGGTGGGCTGGGGATGGCCACAATGAGAATTCCCCTTTCTCAAGCCCAGCTTTTCCTCAATCCAGCTGAAGCCTTG
TCTTAGTTAAAACTTTTTTTACACATAAATAAGAGAAACCCCAATAAGCTAGCATTGACAAAGGGTCACTTGACTCTGAGAATACAGAGATGTCTCAGGAATTCCAAAGGGAGGACATCA
ACCTGGCCTGAAAGGACCTAGAAAAGGAAAGCTGTCAGCCAATGAGGCCACTCCTCCCCTAGGCTGCCTTTGCCTTACTTGATCTCTAAGTGGCTGAGCACTTGGAGGAAGAGAGTGGTC
CACACTTTCTGAGCTTTCTTGTGCTCAAGTCAAGACAGACCTCACACTTCTTAATTCCAAGTGAGTTCCCTGTCCAGTCAGCTAAGGCAGAGGGTCAGGGTTCTGTAGAACAAACATGGC
CTCAGGCACCACCATAAGGCCTCCTAGCAGAGGGGGTCCTGGGCAGACACCTCAAAAGGTGGCTCCTACAGGGCTCAGCTGAACTTCAAGTCAAGGAAGCCTTGTCTCCTCTCCCTCCCT
CTGCTTCTGCCCAAGTGCAGGTGCTGGACATCAACCAGGCCGACGCAGGGACCCTGCCCCTGGACTCCTCCCAGAAGGTGCGGGAGGCCCTGACCTGTGAGCTGAGCAGGGCCGAGTTTG
CCGAGTCCCTGGGCCTCAAGCCCCAGGACATGTTTGTGGAGTCCATGTTCTCTCTGGCTGACAAGGATGGCAATGGCTACCTGTCCTTCCGAGAGTTCCTGGACATCCTGGTGGTCTTCA
TGAAAG
AGGTGAGGGAGGAGGGAATGATAGGAGAGGCTGGACAGGGGCTGATCTGTTGGAGATGGGGAAGCCCTGGGACCAGGTCTCCAGCCATGGCAGATGCCCAGAAGTGCCCAGGCC
GAGGTCAGGAAGCAGAGCGACCCTTGCTGTGTCCAGAAGTGGGTCATCACACTGGTGTGAGGCCCCTTTTGGCCAGCCTGGGGGTTCAGGCAGGCAGGCGGGGGCTCTCCTTATGGAGTC
CTCCCTCTCCCAGGCTCTCCTGAGGAAAAGTCTCGCCTTATGTTCCGCATGTACGACTTTGATGGGAATGGCCTCATTTCCAAGGATGAGTTCATCAGGATGCTGAGAGGTTTGTTCTCT
GGGACAGCCAGGAGAATGGGCCAGGGCAGGGATGCCAGGGCAAATAGATGGGACCTGAAGGAGAGAGCAGAAGGGCCAAGGAAGGAAGCCTCCTCCTTACCCATGTAACCAGAGGCTGTT
CAAACTAGCAGGGGGCTTGGGATGTGGCCAGTCGGGGCCCCTCCACATGGGCACAGAGAACTTGGTGCGATGGAGTCAGTGTGTCCTGTGTCTGGGTCGCAGGCCCCAGTCAGGGCCGGA
TGGTTCCTCTCCCCCAACCCCAGATCCTTCATCGAGATCTCCAACAACTGCCTGTCCAAGGCCCAGCTGGCTGAGGTGGTGGAGTCCATGTTCCGGGAGTCGGGATTCCAGGACAAGGAG
GAACTGACATGGGAAGATTTTCACTTCATGCTGCGGGACCACAATAGCGAGCTCCGCTTCACGCAGCTCTGTGTCAAAG
AGGTGGGGCAGCCTGGTAGGCAGCACTGACTCATTGGTTAG
GCATAGTAGGCACAATGCCCACATACTTTTAGGAGTCCACAGAAATGTTTTAATTTCCATTAAAATCAGAAGAAAAAAATGAATGTAGTAATATGTAATAATGTATCCAATCTGGATTGT
AGTTTTCTTTATATCAACATAATTATACAATATAATTTTAAAATATTATTTTTTATTTTTATGGAGGAAAGGACCCAAAAAGGTGAAAGTGCCTAGGGCCCAGGAAAGTCATAATGCAGC
TCTGCTGGTGGGGTGGGTAGGGTCAGGATAGGGTAGGAAATGGTGATTGGAACTCTGCTTCCCCTGGCTCCCTGCCAAAGCCCCCTGTGCGTGGTGCCCAGGGCAGGGGGAGACTGTCTC
CCTGTTACAGCCCTACCCAGGCCATACTTCCTTCAGCTGGGATGTCTGGTTGTGGGGGTGGGGTGGTATCTCTTGGGGTTTTTTTTTGCCACTATATCTCCTGGCTGAGTGCTCAGTAAA
TGTTTGCTGAATACATGAAGACCCAGTTCTGTTGTCCTTCCCCCTACTTCCTGCCCCACCAGTGTCTGATTCAAGTGGGTGGGGGCTAGAGTTAGAGTGAGGAATGGGCAAGCAGCAGGC
AGGAGGGTCTGTCTGTGGCCTGCTCTGAGCTGTTATGGTATCTTTGCATCTTAGGACTCTCACTGGGGCATGTTGAGACCAAACCCAGCCCCTCCCAGAATTATGAGAAGGGGTAGGCTG
AGCAGCCTCCACAGGGAAGAGGGGCAAGTTAGATAAGACACAGATGAGCCTCTGGTCAGGGCTTCCCCCTGGGGGTAAGGAGAGCTGGAAGAGAAGGAACCCAGCACTCCTGATTCTGCA
GCTTTTTACCTTTCCCAGGCAGCCAGGGCAGCAGGCACCTGAGCCTAAGGGCAGGGGCAACGAAGGTGCATGGAAGGGAGCCCACACCACAGAGTCCCAGGGACATCCAACACCTGGGGT
GGAGGGTAAAACCAGGCTTGGGGATTGAGGAAGATTAGCACTGAGTATTGTTTTCTTTTAATTGTTATTGACTCCAGATTTTTTTTATAAAGATTTTAAAACTAAAGAAAAGGAACAAAA
GGCCAGGTGCAATAGCTCATGCCTGTAATCCCAGCACTTTGGGAGGCCAAAGTGGGAGGATCACTTGAGGTCAGGAGTTTGAGACCAGCCTGGCCAACATAGAGAAACCCCATCTCTATT
TAAAAAAGTACAAAAATTACCCAGGTGTGGTGGCACATGCCTGTAGTCCCAAGTACTCCAGAGGCTGAGGCACAAGAATCGCTTGAACCAGGAGGCGGAGGCTGTAGTGAGCCAAGATTG
TGCCACTGCACTCCAACCTGGGCGACAGAGACTCTGTCTCAAAAAAAGAGATGTTCCTTCACTTTTCACCTAGATATGCTATTTGTTACCATTTTACCACATTTGCTGTTTGTGGAACAT
TTGAGAGTACAGGCATTAATACTTTGTTTTAAACATTCTGATTTAATTTTAATATAGTTCAGCTTATCTTCTTTTGACCTCAACTATAGACCAGGGTCATCTTCAGGCTTGGTTCAGCAC
TTGGAAGAAACCCAGAGATTCCCAGGACTCCTGTAAGAATTTGGCTTTGCAGCTTAGTCCTGCCCTGCCTTCAGTGCTCAGCCTAGACAGGAACAATGGGGTGGCTCCTGGGAGAGCTTT
TCTCCTGCATATTTTATTATGAACATTTTCAAACACAGAAAAGTAAAAAGAACTGTAGCGTGATCTCCCATATACCTACCACCTAGATTCTACATTTAGCTCTTGGAGGTTTTAAGCTAG
AGGATCTTAAGCCAAATTTTTATGGCAGAGCAGGCTCAGAAGGCAGGTTATATACAGTTCCCGAAATCTACCCCATAGTCCCAGAAGTGGGACTGATGATGGACACACTTTAGGGAAGCC
TGCCTGGCCAACTTTCTTTCAGGCCTTTTCCCTGCCCACCTATGGCTGGGTCCAGCTCCCATTGGGGACAAGGGCTGAGGTTGGAGCACCCGAAAGCAGGGCCTCCATTGGTCTGGGACT
GTCTACTGTGCCTGAGCATGGGATGGTAGGAGTGCTGTGCGTTTGAGCCAGGTCTTCCTGGGCTCTGGACCCTGAGCTGCTCCCTAGCCTGGCTCTGCTTTGCAGGGGTGGAGGTGCCTG
AAGTCATCAAGGACCTCTGCCGGCGAGCCTCCTACATCAGCCAGGATATGATCTG
TGGTGAGCACCCATCTGGGAATGTCGGGGGGAGGAGTTGGGGAGTTGCCATTTCTCTCCCCTGAA
TGGCTGGGATCAGGGCCACCGCTAGCCCATGCAGCACCTTCAAACAAATTAGAAAAGGACACCCCTTTCTCTAGGCAGACACAGCCCTGTGCCAGGGCAAGCAGAAAGCCTGCTGGATTT
CCGCTCTCACTTACGGCCTGGCCCAGATGCCCTTGTGAAGGGTAAAGGCATATGCAGCAGCCTTAGCGAGGACCCCCAAGATCAGACTCTGTCTATAGGTGACTGTGGGAATCCTGCTGT
CCCCTTGCTGACAGCTCTGATCCTTCCTCAGCAGAATGGGTTTGGAGGCAGACCAGGATAGCAGAGGAACGAGTGGTTGAGATGGCCAGCATCCTATCTCTTACCATTCTTGTCTTAGTC
CCTCTCCCAGAGTGAGTGCCCGCTGTTCCCGCAGCGACATTGAGACTGAGTTGACACCTCAGAGACTGCAGTGCCCCATGGACACAGACCCTCCCCAGGAGATTCGGCGGAGGTTTGGCA
AGAA
AAGTATGTCTGCTCTTCCCCTTAAGCCCAGGCAGTTCATCCATTCCTTCAGCTTATAAACATCTTTTCCTTGGTGCCAGGCACTGTGCTAAACATTGTGGGTACAGGCAGGGTGAA
TAGTCCTTGCCCAAGAACATCACAATCTAAGAAGAGAATCTGGTACACTGATAATTTCAATGTCCCACTGATGACTGTTCTAATAGTGGTAAGAGCGAAGGGCTTTGGAACTCAAGGAAG
CACTCACCTCTGCCAGGGAGGTCAGAGATGCCTTCATGGAAGAGGTGTCCTTTGAGTCTCTAGTAAAGGCTGAATATGGGCGCCGGGTGGGAGTTGGGAGCTACAGACTGGAAGGAGAGG
GACTGAGTGTGAATGAACACAGGTAGATGGAGTACATCCCCTGGGAAGAAATGACCACTCCAAGCCATCCCTCCACTGCCAGCTTGCTTATGCAGTGCAATATAGCCTGATGCGGTGAGG
TCTGAACCCTTCTTCCACAAGGGTAACTAGGTTTCTTTCTCGGAAGCAGTGGGCTTCCCTCACTTCTGGGCGGCTCACCTCCGTGAAGTGGGGCCCCACTAGCGTTGGGTCCCATGGTGG
GTGCCAAAGGCTAAGGCTTCCTGTCTCCCAGGGTAACGTCATTCCAGCCCTTGCTGTTCACTGAGGCGCACCGAGAGAAGTTCCAACGCAGCTGTCTCCACCAGACGGTGCAACAGTTCA
AGCGCTTCATTGAGAACTACCGGCGCCACATCGGCTGCGTGGCCGTGTTCTACGCCATCGCTGGGGGGCTTTTCCTGGAGAGGGCCTACT
CTGTGAGTGACTTTACTTACCACGAGCCCT
GTCCCTAGGCTTGCAATGAGTGATCGCCCTGGGGGTGGGGCCTGCGATAAGTGCCAGCCCTGGGTAGGGTAAGTGGAGCCTGTCTGAGTGAAGACTGTGCGGGGGAGGCCTGTTGTGAGT
GGCAGCCGGCCAGGGCCTACCGCCCCTAACCAGCTCTCTGTCCTCTGCACTGACCCTCGCTTGCCTGCCTGGGCCCCCTCCACAGACTACGCCTTTGCCGCACATCACACGGGCATCACA
GACACCACCCGCGTGGGAATCATCCTGTCGCGGGGCACAGCAGCCAGCATCTCTTTCATGTTCTCCTACATCTTGCTCACCATGTGCCGCAACCTCATCACCTTCCTGCGAGAAACCTTC
CTCAACCGCTACGTGCCCTTCGACGCCGCCGTGGACTTCCATCGCCTCATTGCCTCCACCGCCATCGTCCTCACAG
AGGCAGGGCCTGGGTGTCCCTGGGAGGCTCTCCAGGGCCTCCCG
CCCCCGCTGACTTCCCCTCGTATGAGAGCCCCCCTCTCTGCTGGCACTTACCTTTAATGTCCTTCTCCATCAGGATGGAGTTGGCCTGGGCCAGGGTGTGAAGTAAGCCCGGGAGCCTGT
CGCTGGTCACTTCCAAGTGCCTCTTCCCGCACTTTCCAGGGCGCCTCACCAGCCTCAGCTGACAAGTTACTAACACCCCAGAATGAGTGTGCATAATGTGTCAATTCTCCCAATTTTTAT
GTTTTAAAGCATGACTCTGAGAGAAAGCAAAGGAGTGAACTTCTAATGCTGTAATTTCAGACTCACATGGTTGCGCACATGGATGGTGTGTCTGTGGGGTGTTGGGTAGGGCCAGGTGAT
TGTTTAGAGGTCAGAAGTCCAGGCCGGCGCGGCGGCTTATGCCCCTAATCCCAGCACTTTGGGAAGCTGAGGCGGGCGGATCACTTGAGGTCAGGAGTTCGAGACCAGCCTGGCCAACAT
GGTGAAACCCCGTCTCTACTAAAAATACAAAAATCAGCTGGGTGTGGTGGCACATGCCGGTAATCCCAGCTACTCAGGAGGCTGAGGCAGGAGAATTGCTTGAACCTGGGAGGCAGAGGT
TGCAGTGAGCCGAGATCGCACCACTGCCCTCCAGCCTGGGTGACAGAGTGAGACTCCATCTAAAAAAAAAAAAAAAAAAAAAAAAAAGAGGTCAGAAGTCGAGACTCCTAAGGTACTTCT
CTGGGACCCCCACTCTGGCCAGGGTCCTCGATCTTGGGCTGAATGAGTGAGCACCCACCCTGGGCTGCCCCAAGCTCACCACTTGGTCTGCTCTTCCTTAGTCTTACACAGTGTGGGCCA
TGTGGTGAATGTGTACCTGTTCTCCATCAGCCCCCTCAGCGTCCTCTCTTGCCTCTTTCCTGGCCTCTTCCATGATGATGG
GGGTGAGTAAGTGCGAATGTGTGTGTGTGTGTGTGTGTG
TGTGTGTGTGTGTGTGTGTGTGTGTGTGTGTATAATGGGGAGGATTCCTTTTGGGTGGAAAGAAATTATCTGGCTCCAGAGGAGACTGTCACCTTCTAGGTTACAGGACAGACAGTGACC
AGCCTGAGCCCCTAATGCCAAGTCAGCAGGAGAGACTGGTGTCTGAGTTGGGGTGCCTCCCCTGAAGGGTCCCATCTGGAATTCCCAAAGATCTCCTCTCATTAGCTGGGCATGGTGGTG
CGTGCCTGTAATCCCAGCTACTGGGGAGGCTGAGGCAGGAGAATGGCTTGAAACCAGGAGGTGGAGGTTGCAGTGAGCCAAGATCATGCCACTGCACTCCAGCCTGGGTGACAGAGCAAG
ACTCTGCCTCAAAAAAAAAAAAAAGAGAGAGAGATCTCCTCTCAAGGTGTCTCTTTGCTGTCCCTTCCACACAGGTCTGAGCTCCCCCAGAAGTATTACTGGTGGTTCTTCCAGACCGTA
CCAG
AGGTGAGAACCCTCCTTGATCCATGAATTTCTGGACCTGACTGTGAGTTCAAGGCTCTGGGTTCTCTGCACCCCAGAGCAACCCACGTTCACTCACTCAGCTCTTCCGGTGACCCA
GGCTCTGTCCTCTGGCCTGAGGACACTACTGGGTGGGCAGGAGACTTAGACTACTCTGCATTCCAGCCCTCCTCGCAGGAGCTCAACTGGGTTCCTGCCCCTACTTTGGGCTAGTTCCTT
CTCTAGTAGGGTTAGGAGGAGAAATATCTCCTACTATGGACTAGTTCCAGTGGAATAGGAGTGGCTGGCTACCTCTTCCCCCAATACACACACATACCCTAACAGTAGCTTTGAGGAGTG
CTGTGCCCCAGCTGCATGGGGGAAGGAGCAGGCTCTTTGTCAAGCCAGACAGAGGCGCCTACCCAGTATGCCTTGAAAGGAGCGTTTGGGGGATATTCCTAACTCCCTATAATCACCCAT
CTTAGAGCTATTAGCTGTGAATCTATTCAAATTCTCTTGAACCTATTTATATTTTCAGTCTGTTCCTTCCTTGGAGTAACATAATTTCTATACTGGCTCCTCTCTGTGAAACACAGCATG
GTTTTTTTTTTTTTTTTAATCCTAAAATGATCTGCTTTGAACTTCAGAGGGTGCTTGCTAATTCCTGCACACTGAGATTTAGTGGATAAGGCTGTGTTTATGCTCTCCTCTCCTTTTAGG
ACTTTACAGGCTTTGGTTAGATCCCTTCTTAACCTTTACTTTTTTATACTTCAGGGCTCTAATCTTCTTAACTTCTGCCTCTCTTCCTTGATCACTTGAGGGAGCATTCTCTGTCCTCTC
TGCCCTGCCCTGGCTCCAGTATTATCCCCCTGTATACCCATGGCCTGGATCCCCTGTGAGAGGAGGGGCCTCCCACCAACTCTGGGTTGTACTTGGGGACCCTAGTGATGAGCAGGGACA
GTGTGGTCCTGGCCAGTCACTAACAGTGCCAGTGCTCATGCTGTGTGCTGTGGCAGCCAACAAAGAGTGGTGCAGGGACAAGGACCAAACCTTGACTCGATGAGAGTACAGCCTGTCATT
TCCATTGTTTGGCCCAGAGGGCCCAGAGTGGACACCCTCTGCATCCATTCATTCAGCACACATTTATATAGGGCCCACGGTGTGCCAGGCCCTTGCACTAGGCACTAGGATTCAGTGGCA
AACTAGCTTAGCCTGTTCCTCATTCTTGAGAACCCGAGAGTCAGGAAAGTGACAGACAAAAACCAAATAATGACATTGGTGAAAAATTGGCTGGGTGTGGTGGCTCATGCCTGTATTCCC
AGCACTTTGGGAGGCCAAGGTGGGCGGATCACCTGAGGCCAGGAGTTCAAGACCAGCCTGGCCAACATGGCAGAACCCTGTCTCTACTAAAAATAAAAAAAATTAGCTGGGCATGATGGT
GCATGCCTTTAGTTCCAGCTACTCAGGAGGCCGAGGCAGAAGAATCATTTGAACCTGGGAGGCAGAGGTTGCAGTGAGCCGAGATTGCACCAATGCACTCCAGGGAGCCTGGGTAACAGA
GCAAGACTCCATCTCAAAAAAAAAAAAAAAAAAAATCATGGTGACAAGGGCTATGAAAAGTGAACAATGTCTGCTGAAATGAAAGGTACCTGTGATCAGTGGTGTTGGGGGAGAGGACCA
GAGGAGAATGCTGGGACATTCTTACTCCAACTTGGGCAGTGGAGTGGAGAGGGGACCTTGGAAGCTCCAGAACAGTTCCCTTTCAAGGCACTGATCTTCTGCCTTCCACCCTATATTCAT
TTTGCAGGCCTCACGGGGGTTGTGCTGCTCCTGATCCTGGCCATCATGTATGTCTTTGCCTCCCACCACTTCCGCCGCCGCAGTTTCCGGGGCTTCTGGCTGACCCACCACCTCTACATC
CTGCTCTATGTCCTG
TGGTGAGGGCTTTTGGCTGTGAGCCAGGCCAGGAGGGTATGGGCAGGACATTTCCAGGGAGGCAAGGAGAGCTGGGACATTCATTCAATTTCTAGCTATTTGAAG
CATCCTCTTCACTTCTCGACGTCCCTCTTTGAAGGTGGAGATGATAGTACAGGGCTCTCCATTAGGATGACCCCAATATGCAGCACTTGGAAGCTCGGCTCCCTGGCATGGGTTTTGCAG
TAGCAGCCCTGCCAGACTGATCTCCATTCCTCGCAGACACCTCCACCCCACTTCTGCCCAGCTAGTCTTTCCCCACTTTGGGTGGCTGGTTTTGGCTCCTGGGTGGACTACCTGCACCCA
AGAGAGCATCACCCTTTTTGAAGAAACTTATCTTAAATGGGTTCTGCCACTTTTGTTCTGATCCTATTTTTGGGGGCATTAGTCACACCTGCTAAGTGCCACCTGTACAGTATATAAGCT
TGTTCAAAATGACTTCAGTTGGATTGTTGATCTAATGAATTTTTATTTTTTATTTTTTGAGACAGAGTCTCGCCCTGTCGCCCAGGCTGGAGTGCAATGGCATGATCTCAACTCACAGCA
ACCACCGCCTTCCGGGTTCAAGCAATTCTCCTGTCTCAGCCTCCTGAGTAGCTGGGACTACAGGCACGCACCACCATGCCTGGCTAATTTTTGTATCTTTAGTAGAGACAGGGTTTCACC
ATGTTGGCCAGGCTGGTCTCGAATTCCTGACCTCGTGATCCACCCACCTCGGCCTCCCAAAGTGCTGGGATTACAGGCGTGAGCCACCGCGCCAGGCCAATCAAATAGATTTTTAAGACT
CTACCTATCCCCTTTCCTTCTCCTTAACCTTGGCTCCTGCCTCCCTCTCTCCATGTTGCCTCCCTTCTCATGTTCCCTTCACCCTCCCCTGGGTTTCTCATCGGCTGCCTAGCTGGTTAG
TTCCCGAGCTGATTCTGTGTCCCTGTGTTGTGGCTCTGCTTGTCTGCCTTCCTCATGAGCAGAGGCAATTCCCTAAAGGGTACTCTGAGGGCCTCACTTAGAAGCTAAACCAAATCCCAA
GGCTCCAGCTGTTCCCTGCAGCAGAACTGCATACAGGGCTCTCCTTTGTGCCGAGACAAAGAGAAACTGCACCATGGCCTCGGTCAGGTTGGCTCCTGCCATCTGACCTGCCTTCTCTCA
GGTGCAGTTCTGTCCTCACTGACAGATAGGAGGGTCTGTTTCATCTTGCCTGTTGCTCAGCATTTCACAGTTCTGAAGATGACTACTTAGCCTTGGCTTAGGATGTTTCCTAGAACTGAT
GATATGCGTTTCCTAAGAAACATCTCTTTCTTTGGAAGCTATGACTAATTTACCCTCTCTCCTCCTCTTCCCAATAATGAGTGGAGCCCTAGCATTCTCGGCTACTGCTTCTGTTGGGGG
TGGAAATGACGTTCTTGTTCCTGGTGATAAAGCAGCTACTTTGAACTAGATGAGAGATCCAGAAGAGTGGCCAGGCAGGCTCCCTAAGACCCAGAGCCCTTTCTGATTTGTCCTGGGGTG
TAGGCATCACTAGGATTCCAAGCCACCCTTCCTGCCAGCAGGAAAGTCAGGGGTTCGAAAGGTGTGGCCGGGCACAGTGGCTCACATCTGTAATCCAGCACTTTGAGATGCCGAGGTGGG
TGGATCACCTGAGGCCAGGAGCTCAAGACCAGCCTGGTCAACATGGCGAAACCTCGTCTCTACTAAAAATACAAAAATTAGCTGAGTGTGGTGACGCACACCTATAGTCCTAGCTACTCA
GGAGGCTGAGGCATGAGAATCGCTTGAACCCAGGAGGTGAAGGTTGCAGTGAGCCGAGATCATGCCATTACACTCTACCCTGGGCAACAGATGGACAGCCTGTTTAAAAAAAAAAAAAGA
AATAAAAAAGGAAAAAAGAAAGGTGGAGGCCTCTATGCTAGGCTATCCTCACCAGCAGTTTGGGACAATCTCATCTCAGGGAGATCAGGGCAAGAGCACCCTCTTTGTTCCCACTTGTCC
CAGGCGTGACCACTTGACTTCAAGCAGAAAGATAAATTACAAGTAAAGGACCCCAGTATAGACTAGACCCTGGGGAAGGATGGCCAGGCTTCTGATCCAGGCAGAAGAGGGCAGGGCTAG
CACATCAGAGGCTGAGCTGGCTGAGCCACCACCCCATCCCACCAGCCAACAAGGAATCTGCAGCAGCCTTGTGGTTTAAAAGGAAAGCCAGCACAAAATAGGGTGTAGAAACAGAGCAGC
ATGACGTCACCCTGCCATTGTACTCAGCCTGTTTGGATGCTCAGAAAGCCTTAGGGGTGGGTGTGGGCTCCACAATGAAACAGTGAACTTGGGGAAGAAAGAGCCAGTTCAAAGAAGAGC
TCAGAGAGGTTTAAGTTAGTCTTGTAGGGAAAGGTTCAAGAATCTAGGAAGACTTAGCTGGGGGAAGGGAACGCCTGAATGGGGAACCAGAGTTCAGTTTAAAAGCTATCTGTTGGCCGA
GTGCAGTGGCTCGCACCTGTAATCCCAGCATTTTGGGAGGCTGAGGCGGGTGGATTACCTGAGATCAGGAGTTGGAGACCAGTCTGGCCAACATGGTGAAACCCCGTCTCTACAAAAAAA
TTAGCCGGGAGTGGTGGCATGCGCCTCTAATCCCAGCTACTCGGGAGGCTGAGGCAGGGGAATTTTTTGAACCAGGGAGGTGGAGGTTGCATTGAGCCAAGATCACACCACTGCACTCCA
GCCTGGGTGACAGCGAGACTTCCATCTCAAAAAAAAAAAAAAAAAAAAAAAAAAAAAGCAGCTATCTGTGTATCTGTGGAGGGTCTTTTATGTACTAGCTCTCTGTAGGGCCCAGCAGCA
GAAGGGGAGACTCTAATCTCTTCCTTAGGAACTTAGGATTCGCTGTGACAAAAAGTGGCACATGCATGAAATGATCAAAATGAACACTTGATATAGGATTAATTGCTGAGCGGAGTGTTC
CAAACAGTAAGTTATAGGAGTTCACTGAAGGGAGTTTCACTTGGGCTAAATTGGCTAGGGAAGACTTTGAGGAGAAGACTGGACTTGGGCCAAGGGTCTGGGGGAGGTAAGCTAAGCTTT
GACAGGGCAGAGTGGAAACTGAAGAGCAATCCAGGTAAAAGGAAGGGCATGAGCACAGGCAGAGGTGGGAATGGTGATAGGATATTCAGGAGGCAGAGGGGACAGGGCTGGCTGCAGATG
GCCATAGGGCATTAGGTGGTGGCAGGACATGCTGCTGGTGATCATGTGGCAGCTGCAGTTGAAACTGAGGGGCTGTGGCTCAGCTGAGAGGCCATGCCACATGGCGGAGCATAGAAAGAG
ATGAGCAGAGGGTGACAAAACCAGGCTTGGGCAACACCCATGACTGGGGGTGGCAGAAGGAATTAAGCCAGTGGAGGAGATGGAAAGCGAGTGGCCCAAAGTGGGAGAACCAGTAGAGCA
TTGTTGCAGAAGTGGCGAGAAGGAGGTGGTCACTAGTGCCTCAGGCTTCTAAGAGTAGGAGCCAACATGGCCTGAAAAGATATATTTTACCCATTTGTGGGGCTGCTGGGCCCTCAGTGG
GAAGGTCCTAAGGAGAGGTGGACGCAGAAGTCAAACTGAGCCAGACTCAGGAGCAAGCAAGTGGGAAGAATGTGACCAAGTAGTGTCTGACCATGTTAGACAAACTTATTAGGGAAAGGA
CCAGAGCTGGATGATACTTGTCAGTTATTACATGGAAAAGGGAGACTAGTTAGGTATTCTTAGCCTGGAAAAGGAGAGGATAAACTGGGGAAGCGCCTTTCCCCAACAGTTACTGTGTGC
CTATCATATGCCAGGCAGAAACTGCTAGGAGCTTATAATGGCAAACAGTCCCTACCCCTTCAGTCCAGTGGGGAAAACGACAGTAAACATGTAAATGAATGAGGAAGCAAAAGTACTGTT
GGCCGCGCGCGGTGGCTCACACCTGTAATCTCAGCACTTTGGGAGGCCGAGGTGGGCAAATCATTCGAGGTCAGTTCGAGACCAGCCTGGCCAACATGGTGAAACCCCATCTCTACTAAA
AATACAAAAATTATCTGGGTTTGGTGGCAGGTGCCTGTAATCCCAGCTACTCGGGAGGCTGAGGCAGGAGAATCTCTTGAACCTGGGAGGCGGAGGTTGCAGTGAGCTAAGATCATGCCA
TTGCACTCCAGCCTGGGTGACAGAATGAGACTCTGTCCCCCCAACCCCCAAAAAACAAAAGTACTGTGGATGACAACGCATGCACTAAGAGAGTTAACAGGCAGATGTGGTAGAGAGGAC
TGAGGGGAGGCCTGAGGAGGTGGTGTTTCTACTGAGACCTGAAAAATGAGATGGAGCTAGCCATGGGAAGAACCAGGAGAAGGGCATTCCAGGCAGGAGATGCAAAGATCCTTAAGCAGA
AAGGACTGGCCGTGTTTGAGGAACAGGGAGGGGAGAACGGCTGGCAGGCAGAAGTCAAGAGTACTTGTGAGGCCTTGTAGTGCAATGGGAAGGCCACCAAAGCTTTTAAAGCTGGGGATG
AACAGGATAAAATGAACTTTAAAAAGTCACACTGGCTGCCTTGAAGAGACAGGAATTACAGTGAGACAAGACCAGTAGAAGACCTACTTCCACTGGAAGGCTCCTGCATGAGTCCAGGTG
ACAGGTGACATCACTTGGTCTAGGGTGGTGTCAGGGATCTAGAAAATACTCTAATTCCTCATGGAATGCTATGGAGGATGGAGTTGGGAATGGTGAGGTTTTGCAACCTAGAAGGAGCAT
GGTACGTGGTGAGGTGGGGGCTGGATATACCAGCGGGGAAGTATGGAGGGTCTAAGGCCTGAGCTGGCCCTGTATTCTGCTATGGGCATCGCTAGGTCTGAGCAGAGCTCTTTCCTCCAT
CTAGCTCATCATCCATGGTAGCTTTGCCCTGATCCAGCTGCCCCGTTTCCACATCTTCTTCCTGGTCCCAGCAATCATCTATGGGGGCGACAAGCTGGTGAGCCTGAGCCGGAAGAAGGT
GGAGATCAGCGTGGTGAAGGCGGAGCTGCTGCCCTCAG
AGGTACCAGCCTGGCAGGAGATCAGCTTGGTGACACTGAGGGAGCTGACCGGGCAGAGGCAGAGTCTAGACCGCACAGTCTC
CTGGTCGGGCCCAGTGGAGCTGGCAGGTGCCTTGGGTGGCAGGGACAAAGGAGCTGGTAGGGGCAAGGCGCAGTGGCTCACGCCTGTAATCCCAGCACTTTGGGAGGCCAAGGCAAGTGG
ATCATGAGGTCAAGAGATCGAGACCATCCTGGCCAACATGGTGAAACCCTGTCTCTACCACAAAAAAAAAATACAAAAATTAGCTGGGTGTGGTGGCATGCACCTGTAGTCCCAGCTACT
CAGGAGGCTGAGGCAGGAGAATCGGTTGAACCTGGTAGGCAGAGGTTGCAGTGAGCTGAGATCGTGCCACTGCACTCCAGCCTGGCAACAGAGTGAGACTGTGTCAAAAAAAAAAAAAAA
AAAAAAAAAAGGAAGGAGGGAGGGAGGGAGGGAGGAAGGAAGGAAGGAAAGGAAAGGAGGGCTGTTTGCCAGGGGACCTGAGCCTCTCTCATCGCAGAGAGCAGAGAAGCTCAGGCCAGG
CTGCCCCTTGCCTGGCTGACCCTCAGCCCCAGTGCTGCCTGGGCCCCCACAGGGTGAGCTTCTGATGGGGGAGGCCTCCTTTGTCATATTCCAGAAGCAGAGCTGAGGCTGTGGGTGGGG
AGCTCTCTGGAGGCCACATTGTTGCTGGGTTCAGGGCAGCAGCTTCTCACCCACCATCCCTCCCCAGGAGTGACCCACCTGCGGTTCCAGCGGCCCCAGGGCTTTGAGTACAAGTCAGGG
CAGTGGGTGCGGATCGCTTGCCTGGCTCTGGGGACCACCGAGTACCACCCCTTCACACTGACCTCTGCGCCCCATGAGGACACGCTTAGCCTGCACATCCGGGCAGCAGGGCCCTGGACC
ACTCGCCTCAGGGAGATCTACTCAGCCCCGACGGGTGACAGATGTGCCAGATACCCAAAG
AGGTACCAGACCCTGGCCAGACATGCCACATGCGCCCATATCCCCTTTGAAGGCTTTGGC
CCGGCAGCACTAGACTCCCCGCTCTGTGCCTCCCCTCTCTGCATCTAGAGACTGGTTGTTCCAGATAATGCCACAAATCCTCCTCTTCCCCTCACTCCATATACGTATCTACCTTTCCTT
TTCTCTCATTTCTGGCTTCCGATCTATGGTGGTGGCCAAGCTTAACTGAAACCCACGCCCCTCCCTACAGCTGTACCTTGATGGACCATTTGGAGAGGGCCACCAGGAGTGGCATAAGTT
TGAGGTGTCAGTGTTAGTGGGAGGGGGCATTGGGGTCACCCCTTTTGCCTCCATCCTCAAAGACCTGGTCTTCAAGTCATCCGTCAGCTGCCAAGTGTTCTGTAAGAAG
AGGTGAGTACT
GCCCCCACTTCCCACCAACCCATGGCCCCTTCTTCCTTGTCATGGTACCCATCTGTGCCTTTCTCCTCTGCCTTTGTTCTTGGCTTCCCTGAACTGGGCAGGGGCAAGTTGGTTTGAAAC
CTGGGGTTGGGTGGTGAGTGGGGTGGTATCAGGATATCCCTAGACCCTTTAGGACAGAACTGGTTTGCGGATGCCTCCAGAATCCTGCCTGGTTTACAAAGACTCTTTCCCTCAGTTAGG
GGTGCTCCACATAGGATTTGCTTCTGGGAGTTCCTCCAGGCAAGGTTGGAGGGGGTAAAGAAACCCATTCCTTCTTACAAAGGAGCCCAGCACTTGGAGCCTTATTGGGCTGGGAGAGGG
AAGGGCATGTTGGGTGGAGCACAAGTGAAACTGGCTCAGCTCTCAAGAGGAACAGGAGGAGTGGTGGCAAGGCAGGGCTGGTGGGCCAGGTTGGCCCCAGGAACACAGCTCAAACCCTCA
GGCCCTCACACGGGTCCTTTCACACCAGCCTGGACCTGGTGGCAGGCCCCTGCTCGTTTCTCTGGGCCTGGCAGGTGAGTCAGGGGTGCTGGAATGCAGTGGGCAGAGGCCCAGCCCATG
TGCTGTCAGCATTCAGAGGGTGACAGCTCCCCTCCTGTCCTCTGCCCTCTGCTTCTTAACTATACAGCCCCGCTTTCCTCCCTCGGAAGCTGCAGAGGGCTCCCCGCAGGGCAGCTTCCA
CCCGCTGGTATCTCTTTGAGCCCATTCCCATCTGCTCAGGCAGCTGCTGGGTCTCCCTGAGGCTGCTCTGTGAAATGCCCTTTAGCCACTAGGTGCCTGGCAGCCTGACCAGGGACCACT
GGAGGGAGCTGTAGGACAGGCTGAGCATGAAAGCTGCTTCCCACCCCACAGACCATCAGCTCCCAATCACCAAGGGATCTTTCCCATGGCCCTGGGGTCTCTCACTTCCACCTCTCTATG
CCCTGGGCCAACTCCATCTCTGGCCTCAGAGCCTGGCCCCGTGGCCCATACACTCCATCTCCCCAGGCTGTCTAGGGAAGGGCACAAGCTAGCTGTGGTCAGAAGAGTTCTATCAGTATG
GACACCCCTGGGGTTTAGGGAGACTGAGCTGAGATGGGTCCTGAACTCCAGCCCTGTGTCCCCAGATCTACTTCATCTGGGTGACGCGGACCCAGCGTCAGTTTGAGTGGCTGGCTGACA
TCATCCGAGAGGTGGAGGAGAATGACCACCAGGACCTGGTGTCTGTGCACATCTACATCACCCAGCTGGCTGAGAAGTTCGACCTCAGGACCACTATGCTG
TGGTATGTCAGGGCCCACC
AGGAGGGTATGCGGGCCACTGTCTGAGCTAGGAATTGACCCTAGCTGTGCCTGGCTGAACTTTGTTCCACCCTTCCCTACCATAGTACATCTGTGAGCGGCACTTCCAGAAGGTTCTGAA
CCGGAGTCTATTCACAGGCCTGCGCTCCATCACCCACTTTGGCCGTCCCCCCTTTGAGCCCTTCTTCAACTCCCTGCAGGAGGTCCACCCCCAG
AGGTCAGTCCAACCCATAACCAGGTT
CTCTTCCTCTTTATCATTTGGGGTCTGAGCAAAGCTCCCAAACCTTCCCCATCGAGGAAGAAATCGACTGCTGATGAGAACCCATCCCCTGGGAGATTGGTGGGGTAATGGAATGGAAAG
GATAGGTGGGTGTCACCCTTAGGTGCTAAAGGAGGAGGCAGGCAATAGGGACTTGCCATCTCTGAAGCCAAGATGTATTGTCCAGAAGGAAGAGCTCAATCATTGAGCTAGTCCTCGCCA
AAAACTCGGGGCTACCCTCACCACTACCCTCACCAATACTGAGTCAGTCCTCACCCTGGACACTGAGGGAACCTTTACCATGAGCACTGGGCTAGTCACTACTCACAAATACTGGTCTAG
CCCTCCATGAGACCCTGAGCTGTTCTTCAACTTGAACACTGATTAACCCAACCTCTATTAGAAATGTTGATCTAGCACTTAGCAAAATTCCACTAGACCTATAAACACTGAGCCAGCCCT
CACCATGCATAGTACAAAGCAAACCATCAGGAGCACTGCTGGCTTTTTTTTTTTTTTTAATTTAAAGAAAGGGTATTGCTCTGTTGCCCAGGCTGTAATGCAGTGATGCAATACCAGTTT
ACTGTAACCTCCAACTCCTGGGCACAAGCAATCCTCCTGCCTCAGCCTCCCAAGTAGCTGGGACTACAGGAGTGCACCACTATGCCTGGCTAATTAAAAATAAAAAAATTAGAGTCAGGG
ATGTTGCTATGTTGCCCAGGCTGGTCTTGAACTAGGGAGCTACCCCTTCCTCTGAACACTGCGTTATCCCTGCCTCTGAGCAAAGAGTTAGCCTCCACTTATTCCTCCTGCAACAGGTCC
GGAAGATCGGGGTGTTTAGCTGTGGCCCCCCTGGCATGACCAAGAATGTGGAAAAGGCCTGTCAGCTCATCAACAGGCAGGACCGGACTCACTTCTCCCACCATTATGAGAACTTCTAG

Retrieve as FASTA  
CDS
Send to BLAST
.........1.........2.........3.........4.........5.........6.........7.........8.........9.........0.........1.........2
ATGGGCTTCTGCCTGGCTCTAGCATGGACACTTCTGGTTGGGGCATGGACCCCTCTGGGAGCTCAGAACCCCATTTCGTGGGAGGTGCAGCGATTTGATGGGTGGTACAACAACCTCATG
GAGCACAGATGGGGCAGCAAAG
GCTCCCGGCTGCAGCGCCTGGTCCCAGCCAGCTATGCAGATGGCGTGTACCAGCCCTTGGGAGAACCCCACCTGCCCAACCCCCGAGACCTTAGCAAC
ACCATCTCAAGGGGCCCTGCAGGGCTGGCCTCCCTGAGAAACCGCACAGTGTTGGGGGTCTTCTTTG
GCTATCACGTGCTTTCAGACCTGGTGAGCGTGGAAACTCCCGGCTGCCCCGCC
GAGTTCCTCAACATTCGCATCCCGCCCGGAGACCCCATGTTCGACCCCGACCAGCGCGGGGACGTGGTGCTGCCCTTCCAGAGAAGCCGCTGGGACCCCGAGACCGGACGGAGTCCCAGC
AATCCCCGGGACCCG
GCCAACCAGGTGACGGGCTGGCTGGACGGCAGCGCCATCTATGGTTCCTCGCATTCCTGGAGCGACGCGCTGCGGAGCTTCTCCAGGGGACAGCTGGCGTCGGGG
CCCGACCCCGCTTTTCCCCGAGACTCGCAGAACCCCCTGCTCATGTGGGCGGCGCCCGACCCCGCCACCGGGCAGAACGGGCCCCGGGGGCTGTACG
CCTTCGGGGCAGAGAGAGGGAAC
CGGGAACCCTTCCTGCAGGCGCTGGGCCTGCTCTGGTTCCGCTACCACAACCTGTGGGCGCAGAGGCTGGCCCGCCAGCACCCAGACTGGGAGGACGAGGAGCTGTTCCAGCACGCACGC
AAGAGGGTCATCGCCACCTACCAG
AACATCGCTGTGTATGAGTGGCTGCCCAGCTTCCTGCAGAAAACACTCCCGGAGTATACAGGATACCGGCCATTTCTGGACCCCAGCATCTCCTCA
GAGTTCGTGGCGGCCTCTGAGCAGTTCCTGTCCACCATGGTGCCCCCTGGCGTCTACATGAG
AAATGCCAGCTGCCACTTCCAGGGGGTCATCAATCGGAACTCAAGTGTCTCCAGAGCT
CTCCGGGTCTGCAACAGCTACTGGAGCCGTGAG
CACCCAAGCCTACAAAGTGCTGAAGATGTGGATGCACTGCTGCTGGGCATGGCCTCCCAGATCGCAGAGCGAGAGGACCATGTGTTG
GTTGAAGATGTGCGGG
ATTTCTGGCCTGGGCCACTGAAGTTTTCCCGCACAGACCACCTGGCCAGCTGCCTGCAGCGGGGCCGGGATCTGGGCCTGCCCTCTTACACCAAGGCCAGGGCA
GCACTGGGCTTGTCTCCCATTACCCGCTGGCAGGACATCAACCCTGCACTCTCCCGGAGCAATGACACT
GTACTGGAGGCCACAGCTGCCCTGTACAACCAGGACTTATCCTGGCTAGAG
CTGCTCCCTGGGGGACTCCTGGAGAGCCACCGGGACCCTGGACCTCTGTTCAGCACCATCGTCCTTGAACAATTTGTGCGGCTACGGGATGGTGACCGCTACTGGTTTGAGAACACCAGG
AATGG
GCTGTTCTCCAAGAAGGAGATTGAAGAAATCCGAAATACCACCCTGCAGGACGTGCTGGTCGCTGTTATCAACATTGACCCCAGTGCTCTGCAGCCCAATGTCTTTGTCTGGCAT
AAAG
GAGACCCCTGTCCGCAGCCGAGACAGCTCAGCACTGAAGGCCTGCCAGCGTGTGCTCCCTCTGTTGTTCGTGACTATTTTGAGGGCAGTGGATTTGGCTTCGGGGTCACCATCGGG
ACCCTCTGTTGCTTCCCTTTGG
TGAGCCTGCTCAGTGCCTGGATTGTTGCCCGGCTCCGGATGAGAAATTTCAAGAGGCTCCAGGGCCAGGACCGCCAGAGCATCGTGTCTGAGAAGCTC
GTGGGAGGCATGGAAG
CTTTGGAATGGCAAGGCCACAAGGAGCCCTGCCGGCCCGTGCTTGTGTACCTGCAGCCCGGGCAGATCCGTGTGGTAGATGGCAGGCTCACCGTGCTCCGCACC
ATCCAGCTGCAGCCTCCACAGAAGGTCAACTTCGTCCTGTCCAGCAACCGTGGACGCCGCACTCTGCTGCTCAAGATCCCCAAGGAGTATGACCTG
GTGCTGCTGTTTAACTTGGAGGAA
GAGCGGCAGGCGCTGGTGGAAAATCTCCGGGGAGCTCTGAAGGAGAGCGGGTTGAGCATCCAGGAGTGGGAGCTGCGGGAGCAGGAGCTGATGAGAGCAGCTGTGACACGGGAGCAGCGG
AGGCACCTCCTGGAGACCTTTTTCAGGCACCTTTTCTCCCAG
GTGCTGGACATCAACCAGGCCGACGCAGGGACCCTGCCCCTGGACTCCTCCCAGAAGGTGCGGGAGGCCCTGACCTGT
GAGCTGAGCAGGGCCGAGTTTGCCGAGTCCCTGGGCCTCAAGCCCCAGGACATGTTTGTGGAGTCCATGTTCTCTCTGGCTGACAAGGATGGCAATGGCTACCTGTCCTTCCGAGAGTTC
CTGGACATCCTGGTGGTCTTCATGAAAG
GCTCTCCTGAGGAAAAGTCTCGCCTTATGTTCCGCATGTACGACTTTGATGGGAATGGCCTCATTTCCAAGGATGAGTTCATCAGGATGCTG
AG
ATCCTTCATCGAGATCTCCAACAACTGCCTGTCCAAGGCCCAGCTGGCTGAGGTGGTGGAGTCCATGTTCCGGGAGTCGGGATTCCAGGACAAGGAGGAACTGACATGGGAAGATTTT
CACTTCATGCTGCGGGACCACAATAGCGAGCTCCGCTTCACGCAGCTCTGTGTCAAAG
GGGTGGAGGTGCCTGAAGTCATCAAGGACCTCTGCCGGCGAGCCTCCTACATCAGCCAGGAT
ATGATCTG
TCCCTCTCCCAGAGTGAGTGCCCGCTGTTCCCGCAGCGACATTGAGACTGAGTTGACACCTCAGAGACTGCAGTGCCCCATGGACACAGACCCTCCCCAGGAGATTCGGCGG
AGGTTTGGCAAGAA
GGTAACGTCATTCCAGCCCTTGCTGTTCACTGAGGCGCACCGAGAGAAGTTCCAACGCAGCTGTCTCCACCAGACGGTGCAACAGTTCAAGCGCTTCATTGAGAAC
TACCGGCGCCACATCGGCTGCGTGGCCGTGTTCTACGCCATCGCTGGGGGGCTTTTCCTGGAGAGGGCCTACT
ACTACGCCTTTGCCGCACATCACACGGGCATCACAGACACCACCCGC
GTGGGAATCATCCTGTCGCGGGGCACAGCAGCCAGCATCTCTTTCATGTTCTCCTACATCTTGCTCACCATGTGCCGCAACCTCATCACCTTCCTGCGAGAAACCTTCCTCAACCGCTAC
GTGCCCTTCGACGCCGCCGTGGACTTCCATCGCCTCATTGCCTCCACCGCCATCGTCCTCACAG
TCTTACACAGTGTGGGCCATGTGGTGAATGTGTACCTGTTCTCCATCAGCCCCCTC
AGCGTCCTCTCTTGCCTCTTTCCTGGCCTCTTCCATGATGATGG
GTCTGAGCTCCCCCAGAAGTATTACTGGTGGTTCTTCCAGACCGTACCAGGCCTCACGGGGGTTGTGCTGCTCCTG
ATCCTGGCCATCATGTATGTCTTTGCCTCCCACCACTTCCGCCGCCGCAGTTTCCGGGGCTTCTGGCTGACCCACCACCTCTACATCCTGCTCTATGTCCTG
CTCATCATCCATGGTAGC
TTTGCCCTGATCCAGCTGCCCCGTTTCCACATCTTCTTCCTGGTCCCAGCAATCATCTATGGGGGCGACAAGCTGGTGAGCCTGAGCCGGAAGAAGGTGGAGATCAGCGTGGTGAAGGCG
GAGCTGCTGCCCTCAG
GAGTGACCCACCTGCGGTTCCAGCGGCCCCAGGGCTTTGAGTACAAGTCAGGGCAGTGGGTGCGGATCGCTTGCCTGGCTCTGGGGACCACCGAGTACCACCCC
TTCACACTGACCTCTGCGCCCCATGAGGACACGCTTAGCCTGCACATCCGGGCAGCAGGGCCCTGGACCACTCGCCTCAGGGAGATCTACTCAGCCCCGACGGGTGACAGATGTGCCAGA
TACCCAAAG
CTGTACCTTGATGGACCATTTGGAGAGGGCCACCAGGAGTGGCATAAGTTTGAGGTGTCAGTGTTAGTGGGAGGGGGCATTGGGGTCACCCCTTTTGCCTCCATCCTCAAA
GACCTGGTCTTCAAGTCATCCGTCAGCTGCCAAGTGTTCTGTAAGAAG
ATCTACTTCATCTGGGTGACGCGGACCCAGCGTCAGTTTGAGTGGCTGGCTGACATCATCCGAGAGGTGGAG
GAGAATGACCACCAGGACCTGGTGTCTGTGCACATCTACATCACCCAGCTGGCTGAGAAGTTCGACCTCAGGACCACTATGCTG
TACATCTGTGAGCGGCACTTCCAGAAGGTTCTGAAC
CGGAGTCTATTCACAGGCCTGCGCTCCATCACCCACTTTGGCCGTCCCCCCTTTGAGCCCTTCTTCAACTCCCTGCAGGAGGTCCACCCCCAG
GTCCGGAAGATCGGGGTGTTTAGCTGT
GGCCCCCCTGGCATGACCAAGAATGTGGAAAAGGCCTGTCAGCTCATCAACAGGCAGGACCGGACTCACTTCTCCCACCATTATGAGAACTTCTAG

Retrieve as FASTA