Entry information : PtroLPO
Entry ID 4294
Creation 2007-01-02 (Christophe Dunand)
Last sequence changes 2011-01-31 (Christophe Dunand)
Sequence status complete
Reviewer Not yet reviewed
Last annotation changes 2011-01-31 (Christophe Dunand)
Peroxidase information: PtroLPO
Name PtroLPO
Class Lactoperoxidase    [Orthogroup: LPO001]
Taxonomy Eukaryota Metazoa Chordata Mammalia Hominidae Pan
Organism Pan troglodytes (chimpanzee) [TaxId: 9598]
Cellular localisation N/D
Tissue type N/D
Inducer N/D
Repressor N/D
Best BLASTp hits
Perox        Score   E-value   PtroLPO start..stop   S start..stop
HsLPO        1471    0         1..712                1..712
PpyLPO       1447    0         1..712                1..712
MmulLPO-B    1442    0         1..712                1..712
PabeLPO      1348    0         1..712                1..682
Gene structure
Exons
Exon   Start..End            Size
1      57414845..57414920    74
2      57415859..57415946    86
3      57419323..57419483    159
4      57420903..57421020    116
5      57421409..57421538    128
6      57422258..57422464    205
7      57424027..57424351    323
8      57428704..57428864    159
9      57438611..57438863    251
10     57440036..57440209    172
11     57441234..57441471    236
12     57441672..57441879    206
join(57414845..57414920,57415859..57415946,57419323..57419483,57420903..57421020,57421409..57421538,57422258..57422464,57424027..57424351,57428704..57428864,57438611..57438863,57440036..57440209,57441234..57441471,57441672..57441879)
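As a quick consistency check on the exon coordinates above, the following Python sketch parses the join(...) location string and sums the inclusive exon lengths (end - start + 1). The helper name parse_join is hypothetical and not part of the database. Under the inclusive convention the 12 exons total 2139 nt, which equals 3 x (712 + 1), that is, 712 codons plus a stop codon. The sizes listed in the exon table are each 2 less than the inclusive length, so they appear to follow a different counting convention.

```python
import re

# Location string copied from this entry (whitespace removed).
LOCATION = (
    "join(57414845..57414920,57415859..57415946,57419323..57419483,"
    "57420903..57421020,57421409..57421538,57422258..57422464,"
    "57424027..57424351,57428704..57428864,57438611..57438863,"
    "57440036..57440209,57441234..57441471,57441672..57441879)"
)

PROTEIN_LENGTH = 712  # "Length (aa)" reported for the precursor protein


def parse_join(location):
    """Return (start, end) integer pairs from a join(...) location string."""
    return [(int(s), int(e)) for s, e in re.findall(r"(\d+)\.\.(\d+)", location)]


exons = parse_join(LOCATION)
# Coordinates are treated as inclusive, so each length is end - start + 1.
lengths = [end - start + 1 for start, end in exons]
total = sum(lengths)

print("exons:", len(exons))
print("inclusive lengths:", lengths)
print("total coding length:", total)
print("matches 3 * (aa + stop):", total == 3 * (PROTEIN_LENGTH + 1))
```

If the totals ever disagree for another entry, the first things to check are the coordinate convention (inclusive or exclusive) and whether the stop codon is counted.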



Literature and cross-references PtroLPO
DNA ref. GenBank:   NC_006484.2 (57414845..57441878)
mRNA ref. GenBank:   XR_024789
Cluster/Prediction ref. GenBank:   468421
Protein sequence: PtroLPO
Sequence Properties
First value: protein; second value (in parentheses): mature protein
Length (aa):   712 (686)
PWM (Da):   80019.88 (77295.3)
PI (pH):   8.84 (8.67)
Peptide Signal:   cut: 27 range: 27-712
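The mature-protein length in parentheses follows from the signal peptide range. A minimal sketch of that arithmetic, assuming "range:27-712" means the mature protein spans residues 27 to 712 inclusive (the variable names are illustrative only):

```python
# Values copied from the Sequence Properties of this entry.
precursor_length = 712
mature_start, mature_end = 27, 712  # assumed meaning of "range:27-712"

# Inclusive residue range, so add 1 to the difference.
mature_length = mature_end - mature_start + 1
# Residues before the first mature residue form the signal peptide.
signal_peptide_length = mature_start - 1

print("mature length (aa):", mature_length)
print("signal peptide length (aa):", signal_peptide_length)
```

The result agrees with the 686 aa reported for the mature protein.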
Sequence
.........1.........2.........3.........4.........5.........6.........7.........8.........9.........0.........1.........2
MRVLLHLPALLASLILLQAAASTTRAQTTRTSAISDTVSxAKVQVNKAFLDSRTRLKTAMSSETPTSRQLSEYLKHAKGRTRTAIRNGQVWEESLKRLRQKASLTNVTDPSLDLTSLSLE
VGCGAPAPVVRCDPCSPYRTITGDCNNRRKPALGAANRALARWLPAEYEDGLSLPFGWTPGKTRNGFPLPLAREVSNKIVGYLNEEGVLDQNRSLLFMQWGQIVDHDLDFAPDTELGSSE
YSKAQCDEYCIQGDNCFPIMFPPNDPKAGTQGKCMPFFRAGFVCPTPPYKSLAREQINALTSFLDASFVYSSEPSLASRLRNLSSPLGLMAVNQEVSDHGLPYLPYDSKKPSPCEFINTT
ARVPCFLAGDSRASEHILLATSHTLFLREHNRLARELKRLNPQWDGEKLYQEARKILGAFVQIITFRDYLPILLGDHKQKWIPPYQGYSESVDPRISNVFTFAFRFGHLEVPSSMFRLDE
NYQPWGPEPELPLHTLFFNTWRMVKDGGIDPLVRGLLAKKSKLMKQNKMMTGELRNKLFQPTHRIHGFDLAAINTQRCRDHGQPGYNSWRAFCDLSQPQTLEELNTVLKSKMLAKKLLGL
YGTPDNIDIWIGAIAEPLVERGRVGPLLACLLGKQFQQIRDGDRFWWENPGVFTNEQKDSLRKMSFSRLVCDNTRITKVPRDPFWANSYPYDFVDCSAIDKLDLSPWASVKN

Remarks Complete sequence derived from genomic DNA (chromosome 17, 11 introns). No EST support. A sequence shift in the genomic sequence is probably due to a sequencing error (a missing "a").
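The intron count in the remark can be cross-checked against the exon coordinates listed under Gene structure: 12 exons imply 11 introns, each spanning the gap between two consecutive exons. The sketch below assumes inclusive coordinates; the function name intron_lengths is hypothetical.

```python
# Exon (start, end) coordinates copied from the Gene structure section of this entry.
EXONS = [
    (57414845, 57414920), (57415859, 57415946), (57419323, 57419483),
    (57420903, 57421020), (57421409, 57421538), (57422258, 57422464),
    (57424027, 57424351), (57428704, 57428864), (57438611, 57438863),
    (57440036, 57440209), (57441234, 57441471), (57441672, 57441879),
]


def intron_lengths(exons):
    """Length of each gap between consecutive exons (inclusive coordinates)."""
    return [
        next_start - prev_end - 1
        for (_, prev_end), (next_start, _) in zip(exons, exons[1:])
    ]


introns = intron_lengths(EXONS)
exon_total = sum(end - start + 1 for start, end in EXONS)
locus_span = EXONS[-1][1] - EXONS[0][0] + 1

print("introns:", len(introns))
print("intron lengths:", introns)
# Exons and introns together should tile the whole locus.
print("tiles locus:", exon_total + sum(introns) == locus_span)
```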
DNA
.........1.........2.........3.........4.........5.........6.........7.........8.........9.........0.........1.........2
ATGAGGGTCCTTCTCCATCTCCCAGCCCTCCTGGCTTCCCTCATCTTGCTTCAGGCTGCAGCATCTACCACAAGAGAGGTGAGTGTCTCCCTTTACGGGTTTTCTGGGGCTGCTGTCACA
AAGCACACCAACTGGATGGCCTAAGACAACAGAAATGTATTCTCTCACAGTACGGGAGGATAGAAGTCCAAAATCAAGGTGTTGGCAGGGCTGTGCTCCCTCTGAAGGCTCTAGGGAAGA
ATGCTTCCTTTCTTCTTCCAGCTTCTGGTGGCCCTGGTATCCTTGGTGTACCCTGGCTTGCAGCTGCAACACTCCAATCTCTGCCTCAGTCTTTGCATGGCCTTTTTCTCTGTGTGTCCT
CTCCTCTTCTCATAAAACCACCATTTATTGGATTTAGGGCTGATGCTAAGGCAGTATGACCTCATTTTAATTCATAACATCTACAAAGACCCTACTTCCAAATAAGGTCACATTCTGAGG
TTCCATGCAGATGTGAATTTTGGGAGGGCCATTATTTAACCCACTACACTCTCCTTGTTGAGAAAACAGATCCTGTCCTCCATCCTCCAGACCCCAATACAAGAGGGAACAGGGAATCCC
ACCATTACTCTCATGCCATCCCCTTCTGTACGTCAGGGTCCTGGTGATCAGTGGGTCCTCTTTGATGAATTTGAGATAGTCCACAAGACCATCTCACCAGGAACCAGAGAAATCCCCACA
GTCCAGTAGCCCAAGCCACCCTGAGTGGCCACACTCCCTCTTCATCCCATCTCTGGGCATTCTAGTGAGATCTCAAAGAGCAATTTTGACATCTTGTTGAAGATCGAAGAGTATTCTTCC
CATTCTGTGCTTTTTCCCCTTAAGGACTCTTATGAAACTTTCCTAATCTTGATCTCCTGCCTACCCAGATTTGAGGGGTGGGGGTAATGGCTGAGAAAGAGGGAGACAAAAGCAGAAGAC
ATCGCAAAGGAGTGGTGTCTGACACCCTACTTCCTGCTCCCCACTCCAACCCCAAGCGCAGACTACCAGAACCTCTGCCATCTCCGATACTGTGAGTCANGCCAAGGTCCAAGTCAACAA
GGCCTTCCTGGACTCCCGAACCAG
AGGTACGTGAGACACACACACACACACACACACACACTTCCCTTCACAGGCTTCCTGTTCATGACAAGCACATCTAGGCCAGCCCGGATGAGCCAA
TGAGGCGAACAATGATAAAACTTACCCTTCCATCCTCCTAGCCCTCCAAGTACATGACAAGTAGGAGCAATAGCCCACAATTCCTAAAGTCCTCAGCCTGTACATCACCTGAAAGAGGAC
ATCTAATCTAACCCTAGAAGGACTTTCTTCCACGTGGCCCAATGTTTCCTTGGGCCATGCATTCCAGAGTCCCAGAAAGCCTCTCAGTAAAAGTTCTTCATTCATCCAGCTGATATTGAC
TGATTCAGCCAGGTCCACCAGAAGCCGAATCCGAGCCCTGCATTCAGGTGCAGGTGACATATTAGGGGAATGCTCACAAGGAAGCTGTGATGGGGTGAAGAAGCAGAAGTGAAGGGGGAG
GTTAGACAAATAAAGTGGCTTCAAGTCTATCCTCAGCCTGACCCCACAGGGAGACCTGGAGCAGAAGTTGCTCCTCAGAGCCTGTCCCATCTTGGAGCGAGGGAGCAGAGCCCTCATCTG
CCCACATCAGTCAGACATGGCTGCAGGCTTGCAGGTGGCTGAGAGGTGTTGTGACACACCTGTGGAATTCCCGGGCACCCCCAGCCAGGCCACTCCAGTGGCTCAGAAGCAGGCGCCCAA
CAAGGTTACTGGAGTGAGCCTGTAGCAGATGCACCCATGGCAGCAAGGGGAAGAGCCCGGGGCCCACAGGGCACCAGTAGAGTCATCTGCCCTCCGATCCCGTACAGGGGCAACGTGGGC
GCCTTCCCTCAGGAAACGCACACTCTAGGGAGAGGGCAGACATTGAGAAAAAAAGTAACCTTATTCTTTCCTCTCGCTGTCCCTTTACACTGGACCTGGGCCTCCACTCACTCCTCCTCA
GCCCTTGAATGAACTTTCAAGCCGTCAGATAACTTTTTTGGTGACTTAGATGATAAAACACGCTCTGGACGCTTCTAGTCCCCAGACCAGGATTATCTCAGGAATTAAAGTGTCAGCATA
AGAAGATAAAGCCAGACCTGCTCTATTCCTTCAGCTCAGAACTGCTTGAGGCCTCTGGCGGGAGAAGAGAGTGCAGTCTCCTTCTCCCTCTTTCTGGGGCTCCTCCTTCCCAGCTTGGGT
TTCTGCTCCTTGGGGGTTAAGCTGAAGGAGGAAGGAAAAAGGGGTGGTGAGTCTACATCCCCTGAACAGTTCAGTCCTCAAGGGGCCGGAAGCTCTCTCTCTCTCTCCCCTTGGGTGCTT
TCCTGGTTTCAAAGAGAGATTCTGCTGCCAACTCTGTGATCTGGAAAATTCTCTGGCCAGTGGCTCCCAGCTCTAACCCTCTCTGCCTGGGAGCATCGGGCCCACTCCTTCACCCCTCCC
CACCCTGTCTTGGGGCAGGTCCCTTTTAAGCAGCCTCCAGAAGACTAGCAGCTTCGCCCTACTGTTGAGGCGGTGCAGATTCCAGATTCCGCCCCTCTGTCCGTGCTCTCAGGACACTGC
TCTTCAGGCAGGTGTCAAACCCCAGCCATGACTCCAGGACACACAGTTAAGCCCTCCTCCTAAGACCTTTCTCCCTAGAATTAAACTAGCAAAGGAGGAGCCCCATGGTGTTGGGAGGGA
CGTGCCCAGTATTTCTGCAAATGTCTCCAAAGGAAAATTCTCTCTCGCTCCTCAAATTCAATCCCTTTATAGCCTCAAGATGGGGGACAGCATAAAAGCCACAAAACCGGTGTTCATACA
CCTCCTTTGCAAGTCCTGCACAGGCCAACCAGCGCCCTGCTTTGAAATGTGGAGGAGCCCTTCCCTGCTGGTTAGTCCTTGGACTCAGGGGCCTCACAGAGACAGAACGAGTCACATTTA
AACATTCTGTTACCCAAGTACATACACAGTGCAATGACTCACAATTTATGATAAGGAAAATGAAGGAAATAAACAGGGCAAGGGATGCCACTCTAGGCACAGGGATCAGGGAAGCCTGTT
GGAGAATGGGAAGGAGCTACCCAGGCAGAGGGCATGGGGGTGGCATTCCAGGCACTGCAACAGCATAGGGTCTGAGGCAGGAAAGACCGTGGGTGCTCAGGAAAAGGAAGGCCAGTGGAG
GCGGGAGGCAAGGAGGATCACGCATAAGGTGAGAGCAGAGAGGTGTGTCGCCCCAGGGTAAGCCATTAGAACATTTGAATCAGCGGTGGCATGGTCTGATTTTTATGCTAAAAAGATCAC
TCCAGCAGCCCTGTGGAGAACGAATGGGAAGAGGACAGCAGTGGAGGTAAGAGCTGTCTAGATGGGAGAGGATGATGGCTAGGACCGGGGTGGAGACTGTGGAGATGGCAAAGAGGTCCT
TCCAAGACCCTGCCTGCCCCCAACAGAGCCCAGCCCCAGGGCGGAGCAGGAGGTGAGTCCCTGGTGAATCTGTGTTCAGGCAGAGGAAGCATGCAGAGCCCAGGGCCGGGCGGGAAAGGA
AGGAATATCTGGAACCTTGGATGGGACCTTACAGGCTAAGCCCAGGGAGGCCCAGCGGCCTGCCTGCCTTTGTTCATGGTGTGAGGTGGGGACTCTAAAATACCTTGCAGCAGCCCCTCT
GGGACCCAGCTGATGGAGACAGTGATGCTCACACATGTGAGTGTGTGCTCTGTGCAGATGCAGCTCTGGGGGCCCTACATTCATTCCTCCCCAAAATCCTAGGAAGTACTGAATCGAACT
GTAGAAAACAGCATTTTCAAGGTCAAAAATGATGAAACAACATGGTTCAACCTAAGGTTACTATCATTATTTTCATTATACCAATAAGGAAACTGAGTACAAAGAGCTTAAGTAATTTGC
CTCAAGGGCACACAACTAGAAAGTGGCAGTCAGGAGCCAATCCCAGTCAGTCTATCCCAAAGTCCATACAGAAGTCTCTGATGGGAGGAAATTGTGATTCCCAGTTCATAAATGAGGGGA
AGTCACTCAGAGGTCATATAATTAAACCAATCCTGGTGGAACCCAGATCCACCCCTGTGTCTGTCCAGCTCTAAGCCCACTTTAGGCTAGCAGCATCCAAGATAACTTTCTAGGATGATG
AAATGTTCTATATCTGCACTGTCCAACACAGTCATCACTAACCACACCTGGCTATTGAGCACTTGAGGTGTGCCTAGTGCTGCTGAGGAACTGAATTTTTAATTTTAATTTTAATAAGTT
TCAATTTAAATAGCCACATGTGGTTAGTGGCTACTGTATTGGACAGTGCAGCAGCCTGGATTATGAGGCCATGTTGTACACACACCCCTCCCACCCCTCCAGCGGCCCCCAGCCCCCTTC
CTACAGGGTCCAGGCCATGATCCCCATCTCCCTCCACTGTAGGCTGAAGACCGCCATGAGCTCTGAGACTCCCACCAGCCGACAGCTCTCAGAATACCTCAAGCATGCCAAAGGCCGGAC
GCGCACAGCCATCCGCAATGGACAGGTGTGGGAGGAGTCTTTAAAGAGACTGAGGCAGAAGGCATCCTTGACCAATGTCACAG
AGGTACAGAAAACCCACCAGCCCTCTGTGGCAGGAAG
GGGTCATCTCCCCACAGGAGAAAGCAGAGGCCACCCAAAGCTGCACTTCCTGTTGAGAGCTATCTACTCAGAGGAACACTCCTGTCTCTCCCTTCTTAGACCCGTAGACGAGACTGGCAG
AGCAAGGAAGGTCTAAGAGGACAGCTAGGGCAATGCTGTTTTTGCAGAGGAAGAAACTGAGACCTAGCAAGCTCACATTGCCGACTCCAAGTGAGGGGACATTGGTGCTCTTTGTATAAC
ACCGTCTGTAAGGCCTCTGCCCAAAGGAAGTGCTAAAGTCAATAAACATGGGTTCCTTGTCCTAAACCACAGTGTCACCACTTCCTCTAGTAGAGGAGACATTTAAGCCACATGTGTCAG
GGGGATTTACTGGGATAATGTCACAGGAGCTAAGATTTCCATCGTGTTCTGGGTAAGATGGAGCTATCAGTAAATGAACTTAACTGATGGCTTGGTCATGGCCTCCTCCATCCCCTCCAC
TCTCACCCCCAACCAAATGCCACAGCCCACAGAGGGTCATGAGGAAGGGGTTTGGTCCTGACAGGAGGAAAAGGCTGCAGGTTCTCCCTGGGAAAACATCAAGTGAGGGGGGATACCTGG
CCCAGGAGTTGGGCAAATTCCAAAGCTCCAAAAGAAGGAAACTGGGACAAGGTAAGAAGTAGGAAGCACTTTGACCTCTGAAATACAAAAGGTTCTGAGAGCTGAAATCCTGCCACCTTT
CAGAGATGGCCCCCAGAAGAAGCCCCCATGAGCCAGATTCCCTCACCTCCCTGCCCTTGCAGACGCCAGGGTGGATGAGGGTATGCGTTCAGAAGTGGGCATCTCTCTCCACTCTGTGCC
CACACTGAACACTCTGATGCCTAAAGCCCCCCTACTCTGTCACTCCTCCCACAACCCTTGCCCCTCACTGAAAAATTACCTGTCCCTTCCCAACCTCTATCCTAGCTATGTCCCTGCTTG
CTTGGTACTTCAGATGTGAACGAAGCTTTAGAAGAAAGCAGTCTTATCACACTAACTCATACTAACATATGCTTAACCTCCCGGGGTATTTCCATTCTAAAAGAAAGATCTGTCAATTCA
AATGAATCTGGACCTGGTATACCAGGAATTACTAGCAATTCTGGGGACATTCCCAGTTACTCTGTGTGGCCTTGGGCAAGTCTCTTGAACATTGTGTTCTCATTTTCTTCCCTGTAAAAT
GGGAATAAAATGTATTACCTTAATGTGCTGTCAGATTAATTGAGAAACCACTTGAGAAACGCTTACTTAGAATGGAATGCATTCCAAAACATGCAGGTTCACCTCTGGTCTCCATTTCCT
GCCTCCCACGGGTGCCTGTGAATGGAGAAAGAACTCAGTCCCTTTGGGGTCCCTTCTGTTTCAGATCCCAGCCTGGACTTGACTTCACTGTCTCTAGAGGTGGGCTGTGGTGCTCCTGCT
CCCGTGGTGAGATGTGACCCGTGCAGCCCTTACCGCACCATTACGGGAGACTGCAATAACAG
AGGTGGCGGGGCTTGGGGTGTGGGGGCCGACCATTCCAGCCACTCCGCCCCGCCCCGC
CAAGGCCTTTGTCCCTTGGGCACTCTAGGCAGATCTGCCACTGCCTTGCCCACCTGGGCTGGCGCTCCCACCTTCCCCACCTTCTCAAGATCGCGCGTCTCCAGCCCTCTCCCTCCAGCC
CACTGTGTGTCTCTGGAAGCGGCACCTTTCCCCGGGGCGGGGGAGCCCCGCGCCTTCAGGGGGTGGGCGCAGTTCAGAGACCCCACATTTGAGAGGCCCCCTTCTCTGCGTCCCCTCCGC
CTCGAGCAGAGGCTCCTTCCCCCATGGGGTCCTGGGCTTTGGAGCCCCGGAGCCGGCCTGCCGAAGCTCAGCGAGGATCGTGCTGGTTTCAGGAGGAAGCCTGCGCTGGGCGCCGCCAAC
AGGGCTCTGGCGCGCTGGCTGCCCGCGGAGTACGAGGACGGGCTCTCCCTGCCCTTCGGCTGGACGCCGGGGAAGACGCGCAACGGCTTCCCTCTCCCTCTG
TGGTGAGGGCAGGCCGGG
CGGGGGTGAAGGATGGGAGCCAGAGGGGACGGAATGCACTACCCAAACACGCAGTGAAAGGAGGAGGAGGGATCGGTCGGGGCAGCAGGGGTGCGCGATGGAGATCTGCACAGACCCCCA
CCTTCCGCTAGAGGGCAGCAGAGCTCGGAGCCCGGAGCCCGGCCACGTGGTGGTCACGTCCTGGCCGCCGCAGGCCAGATACCCTGGGGTCTCCCGGGCAGATGGCCTCCCTGCCCCATC
CCGCTCGCACTTACCCCAGCTCCCGCCCCTGGAGTGCATCAGCTGTTGTTTGCCTGGAGGTCCTATTTGTTGGCCTTATCGAGCTGGCAGCCCCTACCCAGAGCCCCTCTCCTATTCCTG
GACACCTGGCACCTCCGTAGCTCGCATTTCAGAGGCAGCCTGTCTTTGTTTTCCCTCTCTTAGTGGAAAGGGCACTGGCCTGGCAAATCATCAAGTCATACGGATCTGGGTTCAATCCCA
TCCCTGTCCCTTTGTAGCGGCAGGACCTTGGAGAACTGACTTCACCTGTCCAGGATTCAATCTCCCCCTCTACAGAATAGGCTAATGCCACCCACCTCACAAACTCCTTGTAGGGATTAA
GTGTACTTGTAACATGGTAGGTAGTTAATAAGCTGTAGCTGCATCTGAATCCTCAAAGCACTTTTTCCTCTAATCTGCCCTCCCCTTCTCTCTCCATCTGTAGGCCCGGGAGGTATCTAA
CAAGATTGTTGGCTATCTGAATGAGGAGGGTGTTCTGGACCAAAACAGGTCCCTGCTCTTCATGCAGTGGGGTCAGATTGTGGATCACGACCTGGACTTTGCCCCTGACACCGAGCTGGG
GAGTAGCGAGTACTCCAAAGCCCAGTGTGATGAGTACTGTATCCAGGGAGACAACTGCTTCCCCATCATG
TGGTACGGCCCTGCAGCTGGGCATCTCTGCCTAGCCCCTTGCCCACCCTG
ATGTAGCAGACATTCCCAGCTCATCAGCTAGGATCTCTGGATAGATCTTGGATACTGATTTGGTAGTTTCCCATGCCTCACGAGTCCATTCGTTCACTCATCCGTTTATTCAGTAAATAG
ACATTGACCACCCCATACCAGGCAGTGCACTAAGCGCTGAGGATTCCAAAGTGAACAGGACACTGCTCCTGCTCTCAAGAACAAAGGGCCAGTGTGGGAATTGGACAAGTAAACTACTGA
TAGCAGAATGGGAACACTGGAGAAGGTTGTCTACATTCCCGGCCCAGCAGCATTGGTAAAACCTGCTTATTAGAAATGTAAAATCTCAGTTGGGCGAGGCGGCTCATGCCTATAATCCCA
GCACTTGGGGAGGCTGAGGTGGACAGATCACCTGAGGTCAAGAGTTTGAGACCAGCCTGGCCAACATGGTGAAACCCTGTCTCTACTAAAAATACAAAAATTAGCCTGGCGAGGTGGCAC
ATGCATGTGGTCCCAGCTACTCAGGAGGTTGAGGCAGGAGAACCGCTTGAACCTGGGAGGCGGAGGTTGCAGTGAGCCGAGATCACGCCACTGCACTCCAGCCTGGGCGACAGAGCGAGA
CTCCAACTCAAAAAAGAAAAAAGAAAAAGAAAAAAAAGAAATGCAAAATCTCAAGTCTCACCCAAGATGTTAGGAAATAGATTCTACATTTTTACCAAGATCACCAGAGATTCATTTTAT
ACATGATAAAGTTTATACTGCTGTACATCATCGTGGAGGATGACTTCAATTCCACTCAGAAGGACTTGAAACCTGGATGAATCCGAGTGAGCAGCAGGGATTTGTAGGGATTTCCATCCG
GAAGGGAGAGGAGCTTGCAGCAATGCATCTTTTCTTATTATTCATGGGTATTATAGTTGGAGAGTCATTTCTTCACTCAAAGAATAAATCGAGGCTATGTGCTGTGGTATAAAGAGCATA
AACTGTGGAATGAAGCCAATGATCTGAATCCTACATCTTTCCCTTCCAGTGTATAAGTTCTTAGGCAACTTACTTAACTTTTCTGAGCCCCAGTTTTCTCATCTATAAAATGGAACTGAA
TTGTCCACCTTACAGGGTTTTTTTGAGGACTGAGATGAGTTAAAGTATGCATAATTCCTGGCACATGGTAAGTCCAAAATAAATACATGGCAGCTGTTGTTAGTAGTAATTATTAACAAA
CCTGTTCCCACCTCGTGTCCTGCTTTCCTGTCCAGGGAGCTCAATACTCCCTCCTCACCAGAAATTTGGAGCCCTCACTGTGAGGACTGAATGATTACCCTCAGTTCTTTCACACAACAG
GATGTAATGAACTTCAGGATATTATGAGCCCAGGGTGTGTAATGTCAGCAAGAAGTGTCTGGGGTCAGTGTCTGGTACCAGGGTCCTAGGAGGGTGCCATCAGCATCTTTGCATCCAGAC
TTGCCCTGGGGAGAGGTTGCCAGGACCCAGCTATTCCACCCCTGCCCAGTCACTTATGCCCACTCTCTCTGCAGTTCCCACCCAATGACCCCAAGGCGGGGACTCAAGGGAAATGCATGC
CTTTCTTCCGAGCTGGGTTCGTCTGCCCCACTCCACCCTACAAGTCCCTGGCCCGAGAGCAGATCAACGCTCTGACCTCCTTCCTGGATGCCAGCTTTGTGTACAGCTCCGAGCCAAGCC
TGGCCAGCCGCCTCCGCAACCTCAGCAGCCCCCTGGGCCTCATGGCTGTCAACCAGGAGGTCTCAGACCATGGACTACCCTACCTGCCCTATGACAGCAAGAAGCCAAGCCCCTGTGAGT
TCATCAACACCACTGCCCGTGTGCCCTGCTTCCTGGCAG
AGGTGAGTCTAGCCTGGGAACAGAAGGGCCCAAGAACAAGGGGATTATCCAGGGAAGCTTTACCACTGCCCCCTTCTTGTC
ATCTTTCTGGAAAAATGCACACTGAGCCATTGCTGTCCCTAGCAGTGGTCCAGCCAGGACTAGCAGAAGGGCCCAGATGAATAAAGAATTCAAGAACATATGTTCACTGTAAAACAAAAT
CAGGAAGTCCAGAAAAGTAACAAAGAAAATAAAAAGAAAGTGCCAGGCTTGGTGGCTCACGCCTGTAATCCCAGCACTTTGGGAGGCCGAGGCAGGCAGATCATGAGGTCAGGAGATCGA
GACCATCCTGGCTAACATGGTGAAACCCCGTCTCTTCTAAAAACACAAAAAATTAGCCGGGCGTGGTGGCACACCCCTGTAGTCCCAGCTACTCTGGAGGCTGAGGTAGAAGAATCACCT
GAACCCAGGAGGCGGAGGTTGCAGTGAGCCGAGATCACGCCACTGCACTCCAGCCTGGGCAACAAAGCAAGACTCCATCTCAAAAAAAAAAGAAAGAAAGAAAGAAAGAAAGAAAGAAAG
AAAGAAAGAAAGAAAGAAAGAAAGAAAGAAAAGAAAAGAAAAAGAAAATTCACACAATCCCACTACCCACCAATAATCATCGTTAACATTTTCATGCTTATTTTCCAGTATTTTTTCTAT
CACAGATACAGTTCTACATTGTTTTACATAATCGGGTCTTACTATAAATAATGCTCCAAAACCTGATGTCTTCACTTAATCTACTCCATGCCATTAACTGTTCTTCTATGATGTGATTTT
TAATGGCTGCCTGGTATTTCATGACACATATTACCATCATTTATTTTCTCAATCCTACTTGGGGGAATTTAGTTATTTCAAGTTTTCCTGTGTTTTGTTGCTGTTGTTTAGGTAATGAAC
ATCTTCATTGATAAATCTTTGTTCTCATTTTTAATACATTTATTAAGATAATTTCCCAGAAGAGGAAGTTCTAGATGAAGAGGAATGTATATTTTTAAAGCTTTTTATAACCTGGGGACA
AAATCCCCATTCCATTCAGCCTGTGCACCACCAAAGCCCTGCTAATTCTGTTCATTCCAGGTGCCCTCCTGGTCCTCTCTGTCTCCGCTTCTAGAGGCCATCCCACCCTCAAACATCCTC
TCTCTTTGGACAAACCCCCTGGCCACCCTGACATTGCAGAAGCCAGCTCTTAGCCAAGATACCTTTATAGATCTTTTGGGATTTTGTGTTCACATATATCCCACATCAATCCAGACACAA
ANNNNNNNNNNCCTACCTGCCTAATGACAGCAAGAAGCCAAGCCCCTGTGAGTTCATCAACACCACTGCCCGTGTGCCNTGCTTCCTGGCAGGTGAGTCTAGCCTGGGAACAGAAGGGCC
CAAGAACAAGGGGATTATCCAGGGAAGCTTTACCACTGCCCCCTTCTTGTCATCTTTCTGGAAAAATGCACACTGAGCCATTGCTGTCCCTAGCAGTGGTCCAGCCAGGACTAGCAGAAG
GGCCCAGATGAATAAAGAATTCAAGAACATATGTTCACTGTAAAACAAAATCAGGAAGTCCAGAAAAGTAACAAAGAAAATAAAAAGAAAGTGCCAGGCTTGGTGGCTCACGCCTGTAAT
CCCAGCACTTTGGGAGGCCGAGGCAGGCAGATCATGAGGTCAGGAGATCGAGACCATCCTGGCTAACATGGTGAAACCCCGTCTCTTCTAAAAACACAAAAAATTAGCCGGGCGTGGTGG
CACACCCCTGTAGTCCCAGCTACTCTGGAGGCTGAGGTAGAAGAATCACCTGAACCCAGGAGGTGGAGGTTGCAGTGAGCCGAGATCACGCCACTGCACTCCAGCCTGGGCAACAAAGCA
AGACTCCATCTCAAAAAAAAAAGAAGAAAGAAAGAAAGAAAGAAAGAAAGAAAGAAAGAAAGAAAGAAAGAAAGAAAAGAAAAGAAAAAGAAAATTCACACAATCCCACTACCCACCAAT
AATCATCGTTAACATTTTCATGCTTATTTTCCAGTATTTTTTCTATCACAGATACAGTTCTACATTGTTTTACATAATCGGGTCTTACTATAAATAATGCTCCAAAACCTGATGTCTTCA
CTTAATCTACTCCATGCCATTAACTGTTCTTCTATGATGTGATTTTTAATGGCTGCCTGGTATTTCATGACACATATTACCATCATTTATTTTCTCAATCCTACTTGGGGGAATTTAGTT
ATTTCAAGTTTTCCTGTGTTTTGTTGCTGTTGTTTAGGTAATGAACATCTTCATTGATAAATCTTTGTTCTCATTTTTAATACATTTATTAAGATAATTTCCCAGAAGAGGAAGTTCTAG
ATGAAGAGGAATGTATATTTTTAAAGCTTTTTATAACCTGGGGACAAAATCCCCATTCCATTCAGCCTGTGCACCACCAAAGCCCTGCTAATTCTGTTCATTCCAGGTGCCCTCCTGGTC
CTCTCTGTCTCCGGCTTCTAGAGGCCATCCCACCCTCAAACATCCTCTCTCTTTGGACAAACCCCCTGGCCACCCTGACATTGCAGAAGGCCAGCTCTTAGGCCAAGATACCTTTATAGA
TCTTTTGGATTTTGTGTTCACATATATCCCACATCAATCCAGACACAAAACCGAGTTCTTTCTTGTTTTTCATAAAAATATGCACTCACACCAAGAATTTACAATTTTAAATAGATTTCC
TGGCCTGGTGCAGTGGCTCACACCTGTAATCCCAACACTTTGGGAGGCTTAGGTGGGAGGATTGCTTGAGGCCACAAGTTCGAGACCAGCCTGGGCAACATAGCAAGACCCCTGTCTCTA
CAAAAAATTTAAAAACTAGCAACACATGGTGGCCCATGCCTGCAGTCCCAGCTACTCTGTAGGCTGAAGCAGGAGGATCACTTGAGCCCACGAGGTTGAGGCTTCAGTTAGCCAAGATAA
TGCCATTGCACTCCAGCCTGGGCAACAGAGCAAGACTGTCCAGATAGATAGATGATAGATATAGATAGATAGATAGATAGATAGATAGATAGATAGATAGATAGATGATAGATAGATAGA
CTCCCCTAAAGATTAGAGGGCACAGGGAGAAGTCCTTTATCCCCTTATGAGACACCTGGTCTTTGGCCATTTAACTGTTCCCCCATCGTTACTCCTCTCTCCTCCCACTACACACCCCAC
TCCAGATTATTCTCCCTTTATTACCAGAGGACTCAGCCTACCAAGCTCTCACTCAAACCCGCATTCTAAACTTCAATTTGCTCTGTGAGTAAAAGCTCTCATCATCGAGCTCCCTTCTAG
AGCGTGGTAATAGCAATGTACTGGTAGTTGTCCACAGCTTGTCATGCTGAAGCTATCCATTCAGTAGAACGAGCCAAGTAGTACTGCAGCAGCTAAANNNNNNNNNNNNNNNNNNNNNNN
NNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNN
NNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNN
NNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNGGTC
TTTGGCCATTTAACTGTTCCCCCATCGTTACCATGATGCGCCTCCCACTACACACCCCCACTCCAGATTATTCTCCCTTTATTACCCAGAGGAACTCAGCCCTACCCAAGCTCCTCACCT
CAAACCCGCATTCCTAAAACCTTCAAATTTGCTCTGTGAAGTAAAAAGCTCCTCATTCATTCGAAGCTCCCCTTCCTAGAAGGCGGTTGGGTTATAATTAGCAAAATGTTACTGTGTAGT
TTGTCACCACCAGCCTTTGTCACAGTGCTGTGATAGGCTTTATTCTCATTTTCATAGGTAAGGAGACCGAGGCTCAGAAGTGTTAAGTAACTTGTCCAAGGTCACACAGCTAAAAAGTGT
GTATATAGAACTGAGAGTCAAACCTATGATTCAGTACCCCCTCCCCATCCCTATTCACCTGACGGGAAGTAGGGTTTGTGGACGGGGCGGGGGGGGGGGGGCGGTCCTGTGGGGCACCAT
CATTTCTTTGTGCAGGTTACTGGGTCCTGGGGCTGTTAGGGAGAATCTACCTTCCTGCCTTCTGGGCTTTCAGGAGATTCTCGAGCCTCAGAGCATATTCTGCTGGCCACATCCCACACC
CTCTTTCTCCGCGAGCATAACCGGCTGGCCAGAGAACTAAAGAGACTCAACCCTCAGTGGGATGGAGAGAAGCTCTACCAGGAAGCCCGGAAAATCCTGGGAGCCTTCGTGCAG
AGGTAG
GGAGTCCCAGGAGCACTGTCACCTGGTCCCACCTGGCTCCCACTCCTGGCTCTGCCCTCTGCTGGCTGCTGTGCCTCAGGCTCTCTCTTCTGTTCCCACACCTCTGTTTCCCCCGGCCCC
TCTGCTGCTTCTCTCTTGTTGTCTCATTCTTTGAAATCCCCTTCCTTTCATTTCTTCAGCTTCTCTGTCTCAAAATGAATCTCTGGATTTGGAGTGTTGGAAAGTCACTTGTGCGTTGGT
CCCTTCAGTGTGCCTGCCTGGGTGCCATTACTGGGACTGTAGGCTCCTTCACTCCTGGGACTGCTACATCGCAGTGATTTGGGTTATTTTGATTTGATTCTCTTCCTTTTTAATTCTCTA
CATCTACGCCCATCTCATGGGCATACTCAGTGGATACTGTTGATTGGATGATCAACTAACTAGAAGAGATTTACAAAGGTTGATGTGTTGATATTAACCATGCACATATTTTTTAAACAT
TTTATTTTGAAATAATTATAGATTCTAATATATTGCAATAGGAGGGGGGTTTCAAAGGATGAGACAACAGGAGAGAAGCAGCAGAGAGGCCGGGGGAAGCAGAGGTTTGGGAACAGAGAA
GGTAGCTGGAGGCACAGCAGCGAGCAGAGGGCAGAGCCAGGAGTGGGAGCCAGGGGGGCCCCGGTGGCAGGGCTCTTGGGCCTCCCTCCCTGCAGATTATTAGATTCTAAGAATTGTACA
GGAAATTTTAAAAAATGTACAAGGAGGTCCCATGCACCCCTCACCCAGTCTCCTCCTTACATTAGCATGTAAGTGCTAATGTCTTACATAACCAAGATACACTGGCAAAACCAGGAAGTT
AACAATGATATAAACCATAGAGCTTACTCAGATCTCACCAGTTATAAATGCAATCAACTGTGTGTGTGTGTATGTGTCTGCATAGCTCTATGCAGTTTTATCACATATGTAGCTTTAAGT
ACTACCACAATAAAGACTGTTAACCATACTGTCCCCTAAAACACCCTCATGTTATCTCTTCATAGCCACATCCACCCTCACCCCTATATCCCTAACCCCTGACAAGCCCTGATCTGTTCT
CCATCTCTATGTTATTTCACAGATGTTATATACATGAAATTATGCAACATGTATCCATCTGAGATGGGCTTTTTTCATTAAGCATAATATCTTTGAGGCTCATCCATGCTGTTGTGTATA
TAAATAGTTCATTCCTTTTTGTTGCTGAGTAGTATTCCATGGTGTGAATGAACTACAATTTGTTTAATCATTCAACTTTTGAAGGGCATTTGGATAGTTTCCAGTTTTAGATTATTATGA
ATAAAGCTACTACAATCATTCAAACATAAGTTTCTGCATGAAAAGAAATGTTTATTTCTTTGAGATACATGCCTAAGAGTGCAATTGCTAGGTCACATGGTAAGTCCATTTTTAGTTTTA
AAAAAAATTTTTTTTAATTTTTTAATTTTGTGGGTACATAGTAGGCGCCTACATGTATGGGGTACATTAGATGGCTTGATAAGGGCATACAATGTGAAATAAGCACATCGTGAAGAATAG
GGTATCCTGCTGGGCACGGTGGCTCACGCCTGTAATCCCAGCACTTTGGGAGGCTGAGGTGAGCAGATCATAAGGTCAAGAGATCGAGACCATCCCGGCCAACATGGTGAAACCTTGTCT
CTACTAAAAATACAAAAATTAGCTGGGTGCGGTGGCATGCGTCTGTAGTCCCAGCTACTCGGGAGGATGAGGCAGGAGAATCATTTGAACCCAGGAGGTGGAGGTAGGAGGCGGAGGTTG
CAGTGAGCCAGGATCACACCACTGCACTCCAGCCTGGGCGACAGAGTGAGACTCCCTTTCAAAAAAAGAAAAAAAAAAGAATGAGGTATCCATCCTCTCAAGCATTTATACATTGAGTTG
CAAACAATCCAGTTACACTCTTAAGTTATTTTAAAATGCACAATTAAGTTAAGTTATTATTAACTATAGTCACCCTGTTGTGCTATCAAATAGTAGGTCTTACTCTTTTTTTTTTGAGAT
AGAATCTTGCTCTGTTGCCCAGGCTTCCTCAGCCTCCTGAGTAGCTGGGGTTTCAGGCACCTGCCACCACGTCCTGCTGATTTTTGTATTTTTAGTAGAGATAGGGTTTCACCGTGTTGG
CCAGGCTGGTCTCAAACTCCTGACCTTAAGCGATCCACCCACCTTGGCCTCCCAAAATGCTGAGATTACAGACATGAGCCACCGCGCCTGGCCATCATTCTATTTTTTGGTACCCACCAA
CCATCTCCACCTCCGCCCCCCCACAGCCCCTCATTACCCTTCCAGGGATCTGGTAACCATCTTTCTACTTTCTATGTCCATGAGTTCAATTGTTTTGATATTTAAATCCCACAAATAAGT
GAGAACATGCCATGTTTGTCTTTCTGTGCCTGGCATATTTCACGTAACATAATGATTTCCACTTCCATCCATGTTGTTACAATTGACTAGATTTCATTCTTTTTTATGGCTGAATAGTAC
ACCATTGTGTATATGTATCACATTTTCTTTATCCATTCATCTGTTGATGGACACAAGTTGCTTTCAAATCTTAGCTATTGTAAACAGTGCTGTAACAAATGTAAGAGTGCGGATATCTCT
TTGATATACTGATTTTATTTCTTTTGGGTATACACCCAGTAGTGAGATTGCTGGATCATATGGCAGCTCAATTTTTAGTTTTCTGAGGAATCTCCAAACTGTTCTCCATAGTGGTTGTAC
TAATTTACATTCCTGCCAACAGTGCCAAATGTGTTCTCCACATTCTTGCCAGCATTTGTTATTGCCTGTCTTTTTTATATAAGTTATTTTAACTGGGGTGACATAATATCTCATTGTAGT
TTTGATTTGCATTTCTCTGATGATCAATGATGTTGAGCACCTTTTCACATGCCTGTTTGCCATTTGTACGTCTTCTTTTGAGAAATGTCTATTCAAATCTTTTGCCTATCTTTTGATTGG
ATTATTAGATTTTTTTCCTATAGAGTTGTTTGAGCTTCATATATATTCTGTTTGTTAATCCCTTGTCAGAAAGGTAGTTTGCAAATATTAATATTTTCTCCCATTCCGTGGGTTATCTCT
TCACTTTGTTGATTGTATCCTTTGTTGTGCAGAAGCTTTTTAATTTGATGTGATCCCATTTGTCCATTTTTGCTTTGATGGCCTATGCTTGTGGGATAGTGCTCAAGACATTTTTGCCCA
GACCAATGTCCTGGAGATTTTCCCCAATGTTTTATTGCAGTAGTTTCACAGTTTGAGGTCTTAGATTTGTCTTTAATCCATTTTGATTTGATTTTTATATATGGGTAAAAGATAGGGGTC
TAGTTTTATTCTCTGGCATATGGATATCCAGTTTTCCCAGCACCATTTATTGAAGAGACTGTCTTTTCCCCAGTATATGTTCTTGGCATCTTTGTTGAAAATGAGTTAACTGTAGATGTG
TGGATTTGTTTCTGGGTTCTCTATTCTGTTACATTGGTCTTTGTGTCTGTTTCTATGCCAGTACCATGCTGTTTGGTTGCTATAGCTCTGTAGTATAACTTGAAGTCAGGTAATGTGATT
CCTCTGGTTTTGTTCTTTTTGCTTAGGATAACTTTGGCTACTCTGAGTCTTCTACTTTTAGTTTTAAAAGGAACTGCCAAACTATTTTTTAGAATGGCTGTACCATTTACATTCCTATCA
GCAATGTATGCATGATCCAATTTCTTAGCATCCTGACGGTGTCACCACCATTTTTTGTTTTATCCTTTCTGATAGGTGTGTAGTAATAGCTCATTGTTTTAATTTGCATTTCTCTAATGG
CTGGTAATGTTGAACATCTTTTCCTGTGCTCATTTGCCACTTGTGCATCCTCTTCAATAAAATACTTGTTCATATATTTTGCCTATTTTCAAATTGGACTATTTGACTTGTAATGTTGAG
TTTTGAGAGTTCTTTACGTATTCTAGACACAAGTCCTTTGTCAGATATGTGATTTGCAGTGATTTTCTCCCAGTCTATAATTTGGTGGGGGTTTTTTTTCATCCTCTTATGAAGGTCTTT
TGTAGAGCAAAAGCTTTTAATTTTAATGAAGTTCTTTTAATCAAATTTTCATTTCCTGGATTGTGCATGTGATGTCAAGTCTCAGAACTTTTTTCCTAGTCTTAGGTCCCAAAGACTTTC
TCCTGCTTTTTCCTTCTAGGATGTTTATAGTTTTATGTTTTACATTTAAGTCTATGATCCATTTTGGGTAAATTTTTGCATAAAGTGTGAGGTTTAGGTCAAGATGCTTTTTTAAATTTT
TTCTGGCTATGGATGTCAAATTGCTCCAGCACCACTTATTGAAAGGCTATTGTTTCTCCATTGAATTGCTTTTGTTTCTTTGGTAAAAAATATTTGGGCATATTTACATGGATCTATTTC
TGGGTTGTCTATTCTCTTCCATTGGTTTGTATGTATGTCTATTCCTCTCATACCACAGTGCTCAATTACTATAGCTATACAGTAAGCCTTAAAACACTTTATTCTTCTTTTTCAAAATTG
TTTTGGATATTCTAGGATCTTTCCCTTTCCATACATATTATTAAATAAGTTTGTGTATGTCTATCAAAAAACCTTATGGAAATTTTGATAGAAATTGCATTAAAATTATATATACATTTA
GGGAGAGTTGACATCTTTACAATGTTGAGTTTTCCAATCCATAAACTAAGTATGTCTCCCCAATCTTTAGATTTTTTATTTCTTTAATCAACATTTTGTAGTTTTATCACACAGATTCTG
TACATGTTTTGTTAGATTTATACCTCAGTATTTAATTTTCATTTTGTTTTGTTTTGTTTTGAGACGGAGTCTTGCTCTGTCACCAGGCTGGAGTGCAGTGGCACGATCTTGGCTCACTGC
AACCTCCCGGGTTCAAGTGATTCTCCTGCCTCAGTCTCCTGAGTAGCTGGGATTACAGGCGCCCGCCACCATGCCCAGCTAATTTCTGTATTTTAGTAGAGACAGAGTTTCACCATGTTG
GCCAGAATGGTCTCAATCTCTTGACCTCGTGATCCGCCCACCTCAGCCTCCTGAAGTGCTGGGATTACAGGCGTGAGCCACGATGCCCAGCCCAGTATTTCATTTTCTTTGGAGGAATTA
TAAATGGTATTACCGCTTTATTGCAGTTTCGACTTATTTTATCAGTGTATGAAAATATAATTGATTTTTTTATGCGTTGATTTTGTATCCTGTGACCTTGCTAAATTCAATTAGTTCTAG
GCATTCTGCTGTAGATTACTTGGATTTTCTAAACAGGTAATCATGTAATATGCCAATAAAGACAGTTGTGTTTCTTCTTTTCTAATCCATATGATTTTTCTTTCTTTTTCTTGCCTTATT
ACGTGGCAATTCCTGGAACTTCCTAGTGGCTCAAACTTCCAATACTATATTAAATGTGCATGATAAGAAAGCAGGCATCCTTGTCTTCCCAATCGAGAAAGTATTCAATCTTTCACCATT
AAGTATAAGATTAGTTATAAATGTTTTGTAGTTGCCCTTTATGAGGTTGAGGAAGTTCTGCTTCATTCCTAGTGTAATGAGAGTTTTTATCATGAACAGGTGTAGAACTTTGTCAAACGG
TTTTTCTGCATCAATGGATAGGATCATATGATTTTTATCCTTTAAACTATTCTGCTTGCTTTGGATTTATTTTGCTCTTATTTTTCAAGTTTCTTGAAGTAGAAACTTAGATGACTGATT
TGCAGTCTTTTCTATTTTCTAATGTATTTAGTGCTATAAATTTCCCTCTTAGCACTGAGTTGACTGCCTCCCACAAATTTTGATATATTGTGTTTTTATTTTCATTCAGTAGGTATTTTT
TAAGTTTCCTTTCAGACTTCATCTTTAACCTATGTATAATTTAGAAGAGTGTTATTTAATTTCCAAGCGTTTGTAGATTTTCCTGTTGCCTTCCTGATTTCTAGTTTGATTCCATCATGA
TCAGAAACTATACTCTGTATGATTTCAATTTTCAATTTGTTAAGATTTGTTTTATAACCCTGCTGTGGACTATCTTGTTGAATATTCTGTGAGTTCCTAAAAAGAATGTGTATTCTGCTG
TTGTTGAATGCAATGTTCTGTAAATGTCAGTTGGATCTGTTGGTTGATGGTGCTATTCAGTTCTGTACTCCAGCTAATTTTCTGTCTAGTAATTCTCCCACTTACTGAGAGTCAGGTGTT
GAAATCCCTAGCTATAAATGAGAATTTGCCTATTTCTTCTTCCAGTCCTGTCAGGTTTTGCTTCATGTATTTTAAAGCTTTAAAGTTCTGTTGTATGATCCATATCATTTAGGATTGCTT
CATCTTCTTGATGGATAGATCCTTTTATCATTATGTAATTTCCACTTTTGTCCTCTGAAGTCAACTTTATCTGATATTAATATAGTCACCCCTGCTTTTAAAAATTTATGTCTGCATGAT
CTTTCTTATTCCATCAATTTCAACCTACCTATGTCATTATGTTTGAAGCCAGTTTCTTTTAGACAGCATATAGTTGAGTCACTTAAAAAAAAAAGGAAAAGTAATATACTCTGCCAATCT
CTGACCTTTAGTAATTATTACTTTTAAATTATCTTTAAACTAATTATTAACATGTTAGGGATTAAATCTCCCATATCTCTCATTTTATTATTTGTTTTCTGTTTGTTCCTTCTGTTTCTT
CTTCCTCTAATATTGTTTTTCTTGCCTTCCTGTGGGTTAATTGAACATTTTTTAGAGGTCTATTTGATCACATATTGCTTTGTATAATTTTCTTAGTGGTTGTTCTCAGGATAGATAGGT
AGATAGATATAGAAATAAAGATAGAGATAGAGATATGGATTTACGACGTATCACAACCTATTGGTGTCAGTGCTTTATCATTTCAAGTGACATGTAGAAAACTCATTTCCATTTGGTCCC
TTCACCTTCCCCACTTTTGAAATATAATTGTCTGAAGCATTTCTTCTATATACGTTTTATAGCACATCAGATGGTGCTTCAACCATAAAATATAATGAAAAAATTCATGAGGTGGAAAAG
AATTGATGTTTACTTCTATTTTTACCCATTACATTGTTCTTCTTTCCTTTCTGAAGGTCCTGTCCTTCTTCTGTTACCATTCTTGATGTTTACAGAGTTTCCTTTAGCCATTGTTTAAGG
ATAGGTCTGTGTGCAGCAAATTCTCTGAGTCTTCCTTTGTGTGAGAATGTCTTTATTTCTCCTTTATTCCTGAAGAATCGTTTCACCAGCTATTGGATTCAGGGTTGGCAGTTCTTGTCT
TTCGGCACATGAAAAATGTGGCGACACTTCCCCTGGCTTCTGTGGTTTCAGATGAGAGCTTCACCGATATTTTGTTTGTTTGTTTATTTCTTTGTTTCTTTGAGAAGGAGTCTTGCTCTG
TTGCCCAGGCTGGAGTGCAGTGGCAACATCTCGACTCACCGCAACCTCCGCCTCCCGGGCTCAAGCGATTCTCTTGCCTCAGCCTCCCGAGTAGCTGGGATTACCGATGCTCACCACCAC
ACCTGGATAATTCTTTTTGTATTTTTAGTAGAGACAGGATTTCGCCATGTTGGCCAGGCTAGTCTCAAACTCCTGACCTCAGGTGATCCACCCGCCTCAGCCTCCCAAAGTGCTGGGATT
GCAGGCGTGAGCCACCACACCCAGCCCTTTACCATTACTTGAATTGGTGTTTCTTTATAGGTAATGTGTCATTTCTCTTTGACTATTTTAATACTTTATGTTTAGTTTTCAGAAATTTAA
TTATGATATGCCTTGGAGTGAATTTCGCTGTTTATCCTGTTTGGGATTCATTCATCTTCTTGAATCTGTAAATGTGTGCCTCTCACCAATTTTGAGAAATTTTCAGGCATTATTTACTCA
AATCCTTTTTCAGTTTCACCCCCTTTCTCCTCTGTTTCTGGAACTCCATTGATATGAATGCCAGATCTTTTGTTATGGTCTCACAGAACCCAGAGGCTCTGGTTTGTTTGAGCAATTTTT
AAAGTCTATTTTCTTTCTGTTGTTCAGATTGAGTAAATTCTATTGATCTGTCTCCAATTCTATCCTCTGTCACTCCCACTCTGTTATTGAGTCTATTCAAGCTTTTTTAAAATTTCAGCT
AGTGTATTTTCAGTTCTGTAATTTTGATTTGTTTCTTTTTTATAATTTCTACTTCTTTGCTATGATATTGTGTTTGTTTCAGGAGAATCTATAATTGATTATTGAAGCATTTTTATAATG
GCTGCTTTTAAAATTTTTATCAGATAATTATAGCATCTGATTCATGTCATCCTTGGCATCAGTTGATTGTCTGTTTACTCAAAACGTGGCATTTCTGGCTCTTGGTATTATGAGTGATTT
TCAGTTGTATCTTAGACATTTTGGTTGCTATATTAGGAGACCCTTGATCCTATTTAAATTTTCTATTTTAGGCCGGGCACAGTGGCTCACGCCTGTAATTCCAGCAATTTGGGAGGCCGA
GGCGGGTGGATCACCTGAGGTCGGGAGTTCAAGACCAGCTTGGCTAACATGGTGAAACCCCGTCTCTACTAAAAATAGAAAAACTAGCCGGGCTTGGTGGCATGCACCTGTGGTCTCAGC
TACTTGGGGGTGCTGAGGCAAGAGGATCACTTAAATCAGGGAGGTCAAGGCTACAGTGAGCCAAGATGGCATCACTGCATTCCAGCCTGGGTGACAAAGTGAGACCCTGTCAAAAAAAAA
AAAAAAAAAGCACTTCATTTTAGCACATAGTTACCTTGTTTAGGTTTAGCATGTAGGTTCTGGACTACTTTTGTGGGCTTTAGGTCCACCAAAGCTTAGTTTCCTGAGCCTTGCAATGCT
ATTCTGGTCTACTTCACTCTTCTGGTGTTTGGGGGCTTCCCACTCAATCCCTGCTGGTGCTGTCTGCAAGGCAGGAAAGTACTTCCCTGGCTGCCTGCTGTTGCTGAGCAACCTTCTAGG
GGAACAGGAACCACTGCACCTGAGCTCCTTATGGCACTGGGTTAAGAGCAAAGAGACACAGAGCTTCACTGTGAAAGCTAGATAGAAATGTGGAAAATGTGAAGCAAAAAAGCAGAATGC
ACAGGTATGCATTCTGTAGGATTATAACTATGTAAAAGTCGGTAACATATAGACAAAGACTACAAGAATAAAAATTAATATAATTATTAGGTTATGTGGAGGCATTTTTCTTTTCTGAAA
ATTTTTTATTTGTGTTAGTTCATTGTGTTTTTATGTAAAAGAAAACCAGAAACCAGTCTTTCCATTCTTCTGTATTTCTTTCAGAACTGGAGATTGGATGAACTTTCCTCAGTTGCCCCT
GAATCATTCCTGTGGTTTATCATTCCTCTTGCCCTGACATTGTTACATTAAAGTGACACACAAAGACTGTCACTTTAGCATCAAGGTCATAAACTTAGCACAAAGATGACAAAGATAGTA
CGTTTTATATGCCAAATTCAAATGACTGATTAGTAGAGCCTATGTTGAGACAAATCGTAAGAGATCAGCAGAAAAAAATGTCATGATTGATTAGCAATGTCTGCTATGCTCATAGGAATG
GAAACTGGGCACATATGCTATCCATTTGCCATCCTTGGTTTTCAACAGCTCATCACTCTTGGCTGCAGCTCTTTCACAGCCTCCCTCTGCAGAGGTTTAAACTTTCTTACACCCTTTCCT
CTTGATTTTATTCTCCCTCCAGATTATCACCTTTAGGGACTACCTACCCATTTTGCTAGGTGACCACAAGCAGAAGTGGATACCCCCATATCAAGGCTACAGTGAATCTGTGGATCCCAG
AATTTCCAATGTCTTCACCTTCGCCTTCCGCTTTGGCCACTTGGAGGTCCCCTCTAGTATGTTCCGCCTGGATGAGAATTATCAGCCATGGGGGCCAGAACCAGAACTCCCCCTCCACAC
CCTCTTCTTCAACACTTGGAGGATGGTCAAAGATG
TGGTAGGCCCTTTCAGGGAAGTGCTGTCACCTGGGTCTCCCACTCCGCAGCCTATTGTAGGGAAACTTGGGTTGGAAGTCAGATT
CCAAGCACTTTTACATGACCTTGGGCAAATGAATTCACCTCACTGAGCCTCAGTTTCCTCATTTATAAAATGGGGACAATAATAGTACCCATCTCACGAGGCTATCCAAAACATTAAATG
AGATATACATGTGGAGCACCCTACAATGGTTCCTGACACACAGGAGGTGATCAATAAATATAAGCTATTCTTTCTTTCCTCTTGGCTTCTAAGATGACTGGGATTTCCCTACCTCCTTTC
CCTGCCCTAGGCTTCTCTGACTTCTACAGGGTTTGGTAAATAAATTGCTCTTTTCTGATCTCTTCTAGAGTTTTTCACTTCATAGAGGTGCCAGAGGTTGGACCCAACTTTCCTTTATCC
CTAGGCTATGTTGTGTTGAAATTAACTTTACTTAAAGCCTAGCAAGTCAAGGACATTTCCTTGAGATTTATGCAACCTCTTCCCATTATGCAACGTTCCTCAGCAGAACATATGGAAGAA
AAAGCTGACTCCATGAACTAGAGCTTTAAAAATACCTTTTTTTTTTTTTTTTTTTTGAGACAGGGTCTTACTCTGTCACCCAGGCTGGAGTGCAATGGCGCAATCTCAGCTCACTGCGGC
CTTGACCTCCTGGGCTCACATGATCCTCCTACCTCAGCCTCCCTTGTAGCTGGGACTACAAGGCACACACCCCCATACCTGGCTAATTTTTTTGCCTTTTTTTCAGAGGCGGGGTTTTGC
CATGTTGTCCAGGCTGGTCCCAAACTCCTGGGCTCAAGCAGTCTTCCCACCTTGGCCTCCCAAAGTGTTAGGATTACAGGCATGAGCCACCATGCATGGCCAAAAAAAATCTCAATGTTT
ATATTCATGCCATCATCCCATGAGTGCCCCTTTCTGCCCACCCCCAGAGCAGACAATCTGGATGCCCCTGTCTCCCTGAGCACACCCTTCAGGCTGCACTGTGCTCTTCTCTCCCTCCTT
CCAGCCTGGGATGTCCAGGTGACCCACATTCAACTAAGAGAAAATGTCTGGTGGAGGCAGAGGCAAGCCATGAATGATGTTCCCACTCTTCCCCCAACCTAAGCATGGCTTCTGGGTCTC
CCTTGGCAGGTGGAATTGATCCTCTGGTGCGGGGCCTGCTGGCCAAGAAATCCAAGCTGATGAAACAGAATAAAATGATGACTGGAGAGCTGCGCAACAAGCTTTTCCAGCCAACTCACA
GGATCCATGGCTTTGACCTGGCTGCCATCAACACACAGCGTTGCCGGGACCATGGGCAACCTG
TGGTGAGTGTCTGAAGTCTGGCCTGCACTGGGGAAATTTTAGCTAACTTTCTAGGGC
TCCTGGAGACTCTCTTCTATAACCCTGAGGATCTGATCATCTGATCAATTCTGAAACAAAAGTCAGGCCAGTTTTTTTTGTTTGAGGGTTTTTTTTTTGAGACAGGGTCTTGCTCTGTTG
CCCAGGCTAGAGTACAGTGGTGCGATCACAGCTCACAGCAGCCTCAAACTCCTAGCTTCAAGTAATCCTCCTGCTTAAGCCTACCAAGTAGGTGGGACTACAGGTGTGCCACATGCCCAG
CTTTTTTTTTTTTTTTAAATATAGACATAGTCTTGCTATGTTGCCCAGGCTGGTCTCGAACTCCTGGGCTCAAGCAATCTCCCACCTTGGCCTCCCAAAGCACTGGCATTACAGGTGCAA
GCCACCATGTCCAGCCCCAGGCCATTCTTAAAACCCACTTTGTTCATCTATGAAACAGAATGTCCCATTTCCCCTAATGCAGAGAAAAGAGCATATATCTGGTACTATCTTGGCATTCCT
TAAATGAAAAAGAGGTCACCAACTAACTTTAAAAACAGTTTAAAAGCCCTGAAAACATTAAGAAAACAATAGAGAGAAACCAATGACTGTTGGACCTTTCTTTTGTCCCCATGAAAGGCA
ATTTAATTAAGAACCAGACTGCCTGAGTTCAAATCCCGCCTATGCCATTTACTAGTTATAAGACTTTGGGCAAATTACTTAACATTTCTATTCTTCATTTACCTATTTGCACAGTGCTTA
GAATAGTGCCTGGACTCATAGTACATACCACATAACTTTCATGATTGCTTAGTGGTTACGTAGGACTTAACCAACTAGCCCCCTTAAGATTGAAGTGCAGATAAAAAACAATGGATTCCC
TCCTTCCCCATCCCAAAGGCTGTAAGCCCAGACTTTCTAGGGCTGATAGAGCTGGAGTGGACATGGCCCCAAAGGGGGCTGTGAACAGGAAGTCCCCGGGATGAGACAGCCTCAGTCTCT
CCACCCTAGGGTACAATTCCTGGAGAGCCTTCTGTGACCTCTCACAGCCGCAGACACTAGAGGAGTTGAACACAGTGCTGAAGAGCAAGATGCTGGCCAAGAAGTTACTGGGTCTCTACG
GGACCCCTGACAACATCGACATCTGGATAGGGGCCATTGCTGAGCCGCTGGTGGAAAGGGGTCGGGTGGGGCCTCTCCTGGCCTGCCTCTTGGGCAAGCAGTTCCAGCAGATCCGTGATG
GAGACAG
AGGTAAGTGCGTCCTCAGCCAGGGAGGGAAGGGCAGGGCCCTTCTCCAAGAGGGGTGTCCCAAGGTCCTGCATGAGCGCTGACTCCGGATCTCAGTGTGAGTAGGGCTTTTCA
TGGTCCCTGTGACCCTTTCCTTCCCCTGTGACAGAGTGGGGAGGGGCCAGCGGCCAGACTGTGGAGTGACTCTACTTCTGTCTCTGCAGGTTCTGGTGGGAAAACCCTGGGGTCTTCACG
AACGAGCAGAAGGACTCTCTACGGAAAATGTCCTTCTCACGCCTTGTCTGTGACAACACCCGCATCACCAAGGTCCCACGGGACCCATTCTGGGCCAACAGCTACCCCTATGACTTCGTG
GATTGCTCAGCCATCGACAAGCTGGACCTGTCACCCTGGGCCTCAGTGAAGAATTAG

CDS
.........1.........2.........3.........4.........5.........6.........7.........8.........9.........0.........1.........2
ATGAGGGTCCTTCTCCATCTCCCAGCCCTCCTGGCTTCCCTCATCTTGCTTCAGGCTGCAGCATCTACCACAAGAGCGCAGACTACCAGAACCTCTGCCATCTCCGATACTGTGAGTCAN
GCCAAGGTCCAAGTCAACAAGGCCTTCCTGGACTCCCGAACCAG
GCTGAAGACCGCCATGAGCTCTGAGACTCCCACCAGCCGACAGCTCTCAGAATACCTCAAGCATGCCAAAGGCCGG
ACGCGCACAGCCATCCGCAATGGACAGGTGTGGGAGGAGTCTTTAAAGAGACTGAGGCAGAAGGCATCCTTGACCAATGTCACAG
ATCCCAGCCTGGACTTGACTTCACTGTCTCTAGAG
GTGGGCTGTGGTGCTCCTGCTCCCGTGGTGAGATGTGACCCGTGCAGCCCTTACCGCACCATTACGGGAGACTGCAATAACAG
GAGGAAGCCTGCGCTGGGCGCCGCCAACAGGGCTCTG
GCGCGCTGGCTGCCCGCGGAGTACGAGGACGGGCTCTCCCTGCCCTTCGGCTGGACGCCGGGGAAGACGCGCAACGGCTTCCCTCTCCCTCTG
GCCCGGGAGGTATCTAACAAGATTGTT
GGCTATCTGAATGAGGAGGGTGTTCTGGACCAAAACAGGTCCCTGCTCTTCATGCAGTGGGGTCAGATTGTGGATCACGACCTGGACTTTGCCCCTGACACCGAGCTGGGGAGTAGCGAG
TACTCCAAAGCCCAGTGTGATGAGTACTGTATCCAGGGAGACAACTGCTTCCCCATCATG
TTCCCACCCAATGACCCCAAGGCGGGGACTCAAGGGAAATGCATGCCTTTCTTCCGAGCT
GGGTTCGTCTGCCCCACTCCACCCTACAAGTCCCTGGCCCGAGAGCAGATCAACGCTCTGACCTCCTTCCTGGATGCCAGCTTTGTGTACAGCTCCGAGCCAAGCCTGGCCAGCCGCCTC
CGCAACCTCAGCAGCCCCCTGGGCCTCATGGCTGTCAACCAGGAGGTCTCAGACCATGGACTACCCTACCTGCCCTATGACAGCAAGAAGCCAAGCCCCTGTGAGTTCATCAACACCACT
GCCCGTGTGCCCTGCTTCCTGGCAG
GAGATTCTCGAGCCTCAGAGCATATTCTGCTGGCCACATCCCACACCCTCTTTCTCCGCGAGCATAACCGGCTGGCCAGAGAACTAAAGAGACTC
AACCCTCAGTGGGATGGAGAGAAGCTCTACCAGGAAGCCCGGAAAATCCTGGGAGCCTTCGTGCAG
ATTATCACCTTTAGGGACTACCTACCCATTTTGCTAGGTGACCACAAGCAGAAG
TGGATACCCCCATATCAAGGCTACAGTGAATCTGTGGATCCCAGAATTTCCAATGTCTTCACCTTCGCCTTCCGCTTTGGCCACTTGGAGGTCCCCTCTAGTATGTTCCGCCTGGATGAG
AATTATCAGCCATGGGGGCCAGAACCAGAACTCCCCCTCCACACCCTCTTCTTCAACACTTGGAGGATGGTCAAAGATG
GTGGAATTGATCCTCTGGTGCGGGGCCTGCTGGCCAAGAAA
TCCAAGCTGATGAAACAGAATAAAATGATGACTGGAGAGCTGCGCAACAAGCTTTTCCAGCCAACTCACAGGATCCATGGCTTTGACCTGGCTGCCATCAACACACAGCGTTGCCGGGAC
CATGGGCAACCTG
GGTACAATTCCTGGAGAGCCTTCTGTGACCTCTCACAGCCGCAGACACTAGAGGAGTTGAACACAGTGCTGAAGAGCAAGATGCTGGCCAAGAAGTTACTGGGTCTC
TACGGGACCCCTGACAACATCGACATCTGGATAGGGGCCATTGCTGAGCCGCTGGTGGAAAGGGGTCGGGTGGGGCCTCTCCTGGCCTGCCTCTTGGGCAAGCAGTTCCAGCAGATCCGT
GATGGAGACAG
GTTCTGGTGGGAAAACCCTGGGGTCTTCACGAACGAGCAGAAGGACTCTCTACGGAAAATGTCCTTCTCACGCCTTGTCTGTGACAACACCCGCATCACCAAGGTCCCA
CGGGACCCATTCTGGGCCAACAGCTACCCCTATGACTTCGTGGATTGCTCAGCCATCGACAAGCTGGACCTGTCACCCTGGGCCTCAGTGAAGAATTAG
