Entry information : EsilPxd04 (Esi_0209_0042)
Entry ID 16973
Creation 2021-02-04 (Christophe Dunand)
Last sequence changes 2021-02-04 (Christophe Dunand)
Sequence status complete
Reviewer Not yet reviewed
Peroxidase information: EsilPxd04 (Esi_0209_0042)
Name (synonym) EsilPxd04 (Esi_0209_0042)
Class Peroxidasin    [Orthogroup: Pxd003]
Taxonomy Eukaryota Phaeophyceae Ectocarpaceae Ectocarpus
Organism Ectocarpus siliculosus    [TaxId: 2880 ]
Cellular localisation N/D
Tissue type
Inducer
Repressor
Best BLASTp hits
Perox score E-value EsilPxd04
start..stop
S start..stop
EsilPxd02 865 0 1..811 17..891
EsilPxd02 256 1.11e-69 849..1095 1007..1268
EsilPxd03 842 0 1..811 17..890
EsilPxd03 269 7.65e-74 849..1095 1005..1262
EsilPxd01 606 0 249..811 356..989
EsilPxd01 339 1.89e-96 812..1206 1073..1455
EsilPxd01 219 5.66e-58 1..178 89..302
EsilPxd01 50 0.00000706 1233..1264 1596..1627
EsilPxd05 431 2.36e-135 1..646 94..763
Gene structure Fichierperl './assets/cgi-bin/draw_exon.pl' '16973' 'complement(join(180287..180382,183361..183463,183851..184026,184743..184839,185374..185410,186168..186282,186631..186831,187586..187642,188136..188324,188861..188965,189457..189642,191290..191449,192574..192674,192931..193065,194596..194695,194948..195165,195642..195848,196050..196154,196422..196511,196890..196998,197632..197849,198226..198466,199285..199419,199871..200052,201017..201049,201853..202039,202436..202647))' Exons
ExonStart..EndSize ExonStart..EndSize ExonStart..EndSize ExonStart..EndSize
N° 1 202436..202647 210 N° 2 201853..202039 185 N° 3 201017..201049 31 N° 4 199871..200052 180
N° 5 199285..199419 133 N° 6 198226..198466 239 N° 7 197632..197849 216 N° 8 196890..196998 107
N° 9 196422..196511 88 N° 10 196050..196154 103 N° 11 195642..195848 205 N° 12 194948..195165 216
N° 13 194596..194695 98 N° 14 192931..193065 133 N° 15 192574..192674 99 N° 16 191290..191449 158
N° 17 189457..189642 184 N° 18 188861..188965 103 N° 19 188136..188324 187 N° 20 187586..187642 55
N° 21 186631..186831 199 N° 22 186168..186282 113 N° 23 185374..185410 35 N° 24 184743..184839 95
N° 25 183851..184026 174 N° 26 183361..183463 101 N° 27 180287..180382 94  
complement(join(180287..180382,183361..183463,183851..184026,184743..184839,1853 74..185410,186168..186282,186631..186831,187586..187642,188136..188324,188861..1 88965,189457..189642,191290..191449,192574..192674,192931..193065,194596..194695 ,194948..195165,195642..195848,196050..196154,196422..196511,196890..196998,1976 32..197849,198226..198466,199285..199419,199871..200052,201017..201049,201853..2 02039,202436..202647))


exon

Literature and cross-references EsilPxd04 (Esi_0209_0042)
Protein ref. GenBank:   CBJ30678.1
Protein sequence: EsilPxd04 (Esi_0209_0042)
Sequence Properties
first value : protein
second value (mature protein)
Length (aa):   %s   1264
PWM (Da):   %s   136702.27 Transmb domain:   %s   o623-645i728-750o
PI (pH):   %s   4.4
Sequence
Send to BLAST
Send to Peroxiscan
.........1.........2.........3.........4.........5.........6.........7.........8.........9.........0.........1.........2
MTDVFLTSTAGLSAKSSLFIAWGQLLTYDLALTVANGTESLDVPCNDADQNGGIDVWCPLGAASEDIPFSRSDAAEGTDGVRSPINYASSYIDLDFVYGRSAEEAELLRTMEDGFMNVTD
SGVPFQNEDGTW
VAPDMGFEGDEDIFQACRGWTIAIFQHVTQNDFLIRLLGITLTDLGLAMYLGSDDDFTSTYDSEGAHRRSRRGRRLYVSSNDYNTTINAAADAFTVTAGGAAFESALP
GTVRIVSD
GYVSTDDDNVELNVASADMAGIFARNNVADVLRGAVLSPALTVDAYYSPVVSNLSPLFKLPVDGVQRGRDHGLPSYNGARAFGLDPATTFEDVSDDADLASRLSDAYGGDIN
GLDAFTGALAEGTHSSTGGVLGDLLVAAWSDQLTRSIAGD
RFYHLHARYMENVANTTLMDVIGRVTNATDLPLSVFAPSITVCDGGCAGDGDGIAVLSDNFELEWEELEDDQMAITFRCK
DLGTSGWMGVGWGGLTMELAQ
DFIICEITDESTASCTDRAYTTEREAPPLDSAGETSLNFTDLSMEDGWTSVTFLRDRGAFDDQDYDLGSDIDNAADTLMIYAYREGEGIGQHPNGNRGA
ATVNFATGNVEAECDDDDFVLLHGALMLVAWMVLAPVGIYYV
RYRKGERVKWAGFEWFEMHQEIMIVASEAVLPLFNKHFHIWAGRFAYLAGVVQCYRGLELVSSDDNLVLSAGDGLDLEIGSFGVFRDVGFPIWFALVGLGFLVLETRKQYRRYFRKGAANLCGCVELINEEYTGEKGDDGEKVEDRLVPRTEALPLYTVEEFNDEEEEGDEEEEEEEKDKSVRKAGAAGEETQAAAGG
KEPNRAFRIAGKAVLMQARGTAGRTFDL
ETKAAKANELAVVPDGPSAPSAGAGVTTFGVPVVSPGAGPAALKRTWSSKKLLQRFHVCPLLFREKMGTDSPIGRGLLFTKRPTYRYIFSCP
GQAQAL
VETIHGVCHFHMPGQVPGKGVIQRAYNAYAVRVQGFVDGKDGSGKGATPPRVVPAQETSEGVLCIEMRIRLYHDGAMSQLLEKLSKDTDNPAIQLQGPFIITKLVPPPAHRNVV
MIAAGTGVNP
MVQQIRDYLALPRDQAHSTRSRLCLIWQSMSEAELYGSEEITEMQAKSKGLLEVIVLVSGDQRRRNVPGAAFRRGKKMMSKAMAMVSPVSSSPSSSVIAMPARPPKVYDM
SPNRPDDEERSSGGIAGHKRGRRRSD
QVVVSGPSVFVFYVETILAEMGVPSEAIVFLD

Retrieve as FASTA  
Remarks Missing part due to N in the sequence.
DNA
Send to BLAST
.........1.........2.........3.........4.........5.........6.........7.........8.........9.........0.........1.........2
ATGACGGACGTCTTCTTGACGTCGACGGCCGGGCTGTCGGCGAAGAGCTCCCTCTTCATCGCGTGGGGCCAGCTGCTGACCTACGACCTGGCCCTCACGGTGGCCAACGGGACCGAGTCC
CTCGACGTGCCTTGCAACGACGCCGACCAAAACGGGGGGATAGACGTGTGGTGTCCCCTAGGTGCCGCCTCGGAGGATATTCCCTTCTCCAG
GTCAGTTGTCTCGTCGCGTGTCTAGTTT
AATGCGTGGCGGGGGCCGGCGGGAGCTCCCCCCCCCCCCCTCCCTTCCCCCGCCAACGGAGTTCTGAAGAGCTAAACGCCCCGCCTGCACGTCAGTAGAAGTATGTTTGTTGTCAAAGAG
GCGAGCATGCGGTGTCCACACGTACACACGTAATTGTTTTTTGTTTGATTTCGTGCTGTGCTCAGCAGGAAGTTTCTTGTAATCATTTTTTTATTGCTTCCCAGGAGTAGATGATGAAGC
ACCATGTTGACTCTTACAGCTATTTAGTCGCGCATCTGCTTCGTTCGATGGACTGGGGTTCCTTTTTATTTTTATTTTTTCCAATTTTCGTTTCGTGTATCTCCGCGCGATACATTGCAC
ACGCACAGATATCGGACGCCGCGGAGGGAACGGACGGCGTGCGCAGCCCCATCAACTACGCGTCCTCGTACATCGACCTGGACTTCGTGTACGGGAGAAGCGCGGAGGAGGCAGAGCTCC
TGAGGACCATGGAAGACGGTTTCATGAACGTCACGGACAGTGGAGTGCCCTTCCAGAACGAGGACGGAACGTGGCTG
GTGAGTGTGGGAGAAAGTCGGGCAGAGAGGCAAAGAGAGACAC
TTGTGTGCTTGTGTGGCCGGTCGGTCGGTCGGTCGGTCGGTCCTGAGTTTTTTGTTTTTGCGGCTGACGGGGGCATGTATAGCGTTTTTTGTTTTTCTACGTTTGAGTAGAGTGATCTGA
ATAGAACGTTGGTCGGGGAGGGGGGGTTATCGTGCGTGACGTGATCTCCAGACGAAAGACCCAGGTATTGCACCCCGTTCGAAGTTGGGGAGGAGGGGGAGTTGACCCATGTGTGTAAAT
TTTATTGAGAGGCGATACGGAGGGGGGGGGGGGGGGGGGGCCAATCCTGCCCCAAGGGGGGGTCCGTCCCCGAAAAAAAATAGGGGGAAAGGGGGAAAGACCACCCCGACCGACCAACCC
AAACCCAGACCCGGGGGGGGCCAAAAAACCGANNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNN
NNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNTCTTTTTTTTTGCCCCCCTTTTAGGAGCCCTTTTTGCTTTTTTTTTTTCTTTCCCCTGGCCGCTGCCC
ATTTTGGGGGGGGGACTTTTCCTGGGGGGGTTCGGGGGGGGGGGGGGGGGTCGGTCGGCACCGGCACGGGCAGATGGCGGACCAACGCCCCGCACAGTACCCTGTCACGTTCGCGCTCCA
TATCATGCTGCTGCTGGAGCACAACCGCTGCTGCGTGGAGGTGTGGCGCCGGACATGGGCTTCGAGGGAGACGAGGTGACCGAAGGAAGTGTTGATTTTTATTTCGTCGAATTTTTTTTG
CAGGGGAACGATTGGCCATCGGTGATCGGTCGGTCCTGTAGGAACAAGTACCGTTCTCTCGACGACGCGTGGTCACTGGTCTATAACAACGAAAACATGCAAAACATGAGTGCGCCCCCG
TTCTTTTGACGAGAGTATTTTTCATGTGTCTCGTGTGTCGAGCCTGCCGGGTTAAACGCTCCTGCTGTGCGCCACCGCTGTAGCAGCAGCACCGCTGTCGGGGGGTGAAGTGCGAACGAA
CTTCAAGTAGACCGTGGTGCTTGATTTCTGACGTGGGTTTCCTTCTCTTGTTGTACATGGCCCAGACCACGGCTGAGCACGCCGCCGCGCCGCTTGTCCTTGAGTTGGTGGGTGGCTTTG
GCTTGCAGGGCGGTACTGTAGCCTTCCCGGGGATATCAAGTTTGGTTTTTGTAGGGGTACGGGGTGGGGGTAGGATAAGGGATAGGGCCACGAACCCCCCCTCTCTATTTCATAGAAGAA
TAATCAAGTCCTCGCGAGGGCGGTCCCTTCGAGGCCCGCCGGCGGACTGTGTAGAGTGTTTTGTGCGTACGCCCGATGGATTTTTTTCTGTCCATGCACAATCTGTTTTTTCTTCTTTAG
GAAGGATATTTGGTTTCACATGCTTCCACTGTTGTATTCGTTTCCTTGTCTTGCAGGAAACCGTCTGCATAAAACAATTAATTCTTCAGTAGAAAGAACAGTTGTGTACAAGACGACAGC
CCTAATACGTGTTGGTGTATACTTGAACATACTTTTGTTTATTCTTGACGAAAACCAAATGCAATCAAATGCACAGGCAACACCCCCGTTAACTGTTTTTTTTGTTTCAAGGTGAGAACC
CAAACGAAAACCATTGATGAAATAGGAACAGATGATCGTTGGTCACATTGACCCACTTTTTTTTTTCTTTTTTTTCCAGGAGACATTTTCCAGGCCTGCCGGGGATGGACGATCGCCATC
TTTCAGCACGTGACGCAGAACGACTTCCTCATCAGGCTTCTCGGCATCACTCTGACGGACCTGGGCCTCGCCATGTACCTTGGCAGCGACGACGACTTCACGTCCACATACGACTCGGAG
GGCGCGCACAGGAGATCGCGTAG
GTATGAAGATGTATGTGCACTAAAATTACAAAGTGATCCATTTTTTTCCGGAGTGGGCGCATGAGAGCACAGCAGCAGTGTACATCAAGGGGTTATG
CAAGATAAGCAGGTGTTTATTTTTGCAGCAAGGACCAACGCACAGCTGGCTTTCCACCTGCTGTATGACGGTTCCCAAGCTCACACAGCTGAGGACGTGTTTTTATTTCGCAACAGCCCA
GGGAGCCTCAACGGACCAAGAAAGGGAACTCAAGGCGCTTGCCATGGATCAGAACGCACAACTCTCGGTTTTACGCTGAACGCGCTCGCTTGTTCATGGACGTACTGTGTGTATGTTGGT
GCGCACGCGAATTATCGCTTTTTCAGATGCATGCATGCGTTGTTGTTGGCGTGATGCCTTTTTTTTTTTTTTTGCGTTCGTTCGTTCGCTCCCGTGTTGTTATTGGCGCTTCAGAGAGGG
AGGAGGCTCTATGTTTCTTCGAACGACTACAACACGACGATCAACGCAGCCGCCGACGCGTTCACCGTCACCGCCGGAGGCGCGGCGTTCGAGTCGGCCCTGCCGGGCACCGTGCGCATT
GTGTCGGACGG
GTGAGAGAGCGTTTCATTGTGTCGATGAGATATCCTTGCATCTATTTCTCCCTAGCCCGCCGGGTCGGATGACTGGCCCGTGTATGTCGTGCTAACAGCAGCAGTGCAT
AGGGACAAGGAGCACGCAAGAATAACAAACCCTGGCGTGATTGACCTAGGCGCCAACCAACCCTTTCCCCGACGAAGCCCCCGTCCTGGGCGGGCGAAAACCCGGCGTTTCGCTCCGTCG
TAGGGATGCGCTTTCATGACCCCCCCCCCCCCCCCCCNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNN
NNNNNNNNNNNNNNNNNCCCCCCCGGCCCCCTGGCCCTTTTTTTGGGAGGGGGGAGAGAGGTTTTCTTGGGGGGGAGGGAATTCCCTGGCTTTTTTTTCCCCCACCCCCCCGGGGTGGAA
AGACGGCCCCGTTTATTTCGTCCTACCCCCCCCAGTCCTTGGGGCCAGGGGCCCCCCAAAAATACCAACCCCGGGGGGGTTGGCCTTGGGCCCCACCCACCCCTTTCCCGGAGAAACCCC
CCTCCCGGGGGGGGGAAAACCCCGGGTTTTCCCCCCTTTGAGGGGGGGGGTTTTCAGGCCCCCCCCCCCCCCCCCCCCAGGTACCGTTGGCAGAGTAGGGAGGAACGATTGGCCATCAGT
GATCGGTCCTGTGGGAACAACGCTCCGTCCTCTCGAGGACGCGTGGTTAATGGTCTAACCGCTAAGGCATGCAAAACACAAGAGTACGCCCCCCCGCCCCTCTCCCCAGGTGTATGTCTC
AACCGACGACGATAACGTCGAGCTGAACGTCGCGAGCGCCGACATGGCCGGGATCTTCGCGAGAAACAATGTCGCCGACGTCCTTCGCGGCGCCGTGCTCTCGCCGGCCCTGACGGTGGA
CGCCTACTACTCCCCCGTGGTGTCCAACCTGTCCCCGCTGTTCAAGCTCCCGGTCGACGGCGTCCAGAGGGGGCGGGACCACGGGCTGCCCTCGTACAACGGTGCCCGAGAG
GTGAGGAT
GAGATAGACACGTGGTCAGGCGGGTTTAGGTGTCAGTCGTATTTAGGAGATAGGTTCCCGTGCTGCCAGTAATGGTTGAGTTCAGGAGACTTGCCATGCTGCCGGTTGTAGTCGTCAGTT
GGGGCCACGAGACTCTTGCGCCTTCCAGCAGTACTGTAGCCAGTCGGGGGTCCAGAAGACTCTCGTTCTACTTGTAGTCAAGTCAGGTTCACTCAGACAATGAACCGGAACACTCGGGGT
GTACATGGAGTTACCTATTTACCCCCCCCCGTCCGTAGGCGTTCTTTCCACGGCTGCTGCTGCTGCTGCTGCTGCTGCTGAACGTCGTCGGTCGTTGTTTGGTGGCGTTTTTCGTTTTTC
GCGCGAAGGCGCGTTCGGCCTGGACCCGGCGACGACGTTCGAGGACGTCTCCGACGATGCCGACCTGGCCTCGCGCCTGTCCGACGCGTACGGCGGGGACATCAACGGCCTCGACGCCTT
TACGGGGGCGCTGGCGGAGGGGACGCACTCGAGCACGGGGGGCGTGCTGGGGGACCTCCTCGTCGCCGCGTGGTCGGACCAGCTCACCCGCTCCATCGCGGGGGACAG
GTTGGTTGGTTG
GGTTTTGCGGTTGTTTGCGGTGTGTTGGTGGCTGTTGTTTAGTTGCAGCAGAGGCACTGGGAGCGGCTGCGGTTAAAAAACGGAAGTATATTAGTTGTATTCGTTTGAGAGGGGAGTGTG
GGTGTTTGTAGAGTCAAGAGGTGGCATGTCCCGCCCACTAGGAGTACTCGTTGGATAGTACTCCGTATAATACGTTTGGAGAGAAGACGAGATTTATTTTCGTGGCCTTGTCCCGACCCT
CTCCCGAACCTGTGCCGGGAGATGGGCGACTCTACCATGCGTCGCACCATCTCCGTCTGTAACAAGGCGCGAATGCCGCACGGGAGCCTCTGTGTGATCCCGACCCTTCTCCTGTGTTCA
GTGTGAACTCTCCACGGCACTGGTGCATGTTGCATACCGTGACTGTAAAATCAATACCACGGTAATACAGCAGTCTCCCTAAACGGTATTGCACTCCTCCCGAGACCTTCGGCCGTTTCA
CCGCTGTTCTTAACGCCTCTTTTTTTTTCTTCTTTCTCTCTTTTTCGCAAATCTGGCTGTTCTCGTCCGCAACCCCGCCGGGCCTGTCGTTGCTATGTATGTGAACGCCGCGATTTTTTT
TACGCACGTCCCGAAACCCAGGTGTTCTATCACCTACACGCCCGTTACATGGAGAACGTGGCCAACACGACGCTCATGGATGTCATAGGCCGCGTCACCAACGCCACGGACCTACCGCTC
TCCGTATTTCAG
GTGTGCATACAAGTACAAGTAGTCAAATTCGTTCGGCCACTTGTGCGCATCTATTTGTGCCCATCCCATCAAATCGTTGTTAAAGTCCCACCAATCGTTGATGAAGTC
CCATCAATCGTAAGAGTAGGCAGCCCCCGTGCCTGAGTGTCGAAAGTACGGTAGATGCTGGGGGCTCATTCGTCGGCCGTCATTCGCACTTGAATGTTGCAGGCCCATTTCCTTCCGAGC
TTCCTCCTGTTATGTTGTGCTAGAATGTATGACGCATGGAAGAGCTCATCCTGTTTTGTAGACGACTGACTTTGTTCAACTCTGTTTTCTTTTTCTTCTTGTTCTTGTTCTTGTTCTTCT
TCCTTCTCATGCACCTCGTCATTTCCAAAGGCGCGCCCAGCATCACCGTCTGCGACGGAGGGTGCGCCGGGGACGGAGACGGCATCGCCGTATTGTCGGACAACTTCGAGCTAGAGTGGG
AG
GTCAGTCGTCATAGACTACGAGATCAGGCATGGAGAGTATTTTGCAAGGCCTCGTTGTATGTTTGGTGCGGGTATTCTATTTTCTATCTAGCGCATCGTCCCCAAGAAAATTGTTTTT
TTTTTGCTCCCCAGACAAGAGGGTGTGCGCTTGTGCACCGCTTGTGCGTTTCATCTTGGCGCGCGCGAAGTCAAGGTTCGACACGCTTATTTTGAGGTGTTTCTTTTTTGGCCTTGCTGT
GATGTTTATTCAACGTGGACCGTCGCCAGGAGAACTGGAGGACGACCAGATGGCCATCACGTTCCGGTGCAAGGATCTCGGGACGTCGGGATGGATGGGGGTAGGGTGGGGAGGGCTGAC
GATGGAGCTGGCCCAG
GTAAAGAGGAAGGGTTGGACAAGGTAGACATCTCCACCCTGCCTGTTGCCGGTGTTCCTAGTAAAATGCTGTCCCCGCAAGCTATATCTGCGAAAACAAGTCGA
TTAACGGTAGAGACCCCACAGGGCCGACCCTGACCACACGTAAAACGAACTGACCAAGCTTCCCCCCCGCCCTTCCTCCCACGCCCCTGCCTCGAAGGAGATTTTATCATCTGCGAGATC
ACCGACGAGAGCACTGCTTCCTGCACAGATCGTGCTTACACCACCGAGCGCGAGGCACCCCCCCTGGACTCCGCGGGCGAGACCTCCCTTAACTTCACCGACCTCTCCATGGAGGACGGG
TGGACGTCCGTCACCTTCCTGCGAGACCGGGGGGCGTTCGATGACCAGGACTACGACTTGGGTTCG
GTGCGTTGTTGGTGGTGATAGTGATGGTGGTGTTGTCTTACCGTTGTCATCGCT
TCCTTACAAGGTAGTGGAGGGCCGTTGTTCGCCTCATTCTCGTACCCCCCCTGGCGCGGCGTCCTGCTCCAGCCGGTAAGGTCGGCTAGGCGCCCCTTCTCGGAACGGTTTCGTCTTTTA
AGTCAATTCCCACACCCTTTCAAGGCCGTGTGCCCCCTCTCGCAACGGTTTCGTCTTTTGAGTCAATTCCCAAACCCTTTTTAAGGCCGTGTGCCCCCTCTCGCAACGGTTTCGTCTTTT
GAGTCAATTCCCAAACCCTTTTAAGGCCGTGTGCCCCCTCTCGCAACGGTTTCGTCTTTTTAAGTCAATTCCCACACACTCGAAAAGCCTGTGCGCCTTCTCGCACCGCTTTCGTCCTTG
AGTCAATGCCTGCCAACTTAACGAAACAAGCAAGAACACCCCGCCTCCAATCCAACAAACAGGAGACATCGACAACGCCGCGGACACTCTGATGATATACGCCTACCGAGAGGGGGAGGG
TATCGGGCAGCACCCGAACGGAAACCGCGGGGCCGCCACGGTCAACTTCGCCACCGGCAACGTGGAGGCGGAGTGCGACGACGACGACTTCGTGCTCCTGCACGGGGCCCTGATGCTTGT
CGCGTGGATGGTGCTGGCACCCGTGGGCATCTACTACGTCAG
GTGTTTGTTTTTGTTCCCGTGGTGTGTAATAATTCAGTCGCTGGTGATGTTGTTTTGCGTTTTTTGTGTAAACAATGC
CTAAATTATGACCACTCGTTGGTCTAGACATTTGTGTCAATATTTCTTCCAGCTCGTTGTTTGTGTTGAACGAAAGGCCAACTGTCTTTCGTTCGCGCTTTGTCTCTTTGTACGTCGTCT
GACTGCTGATATGAAGCCCGTTGTTTTGTATTTCGAACATCGCGGTGGCGCCAGGTGTACCGCAAGGGGGAGAGGGTGAAGTGGGCCGGTTTCGAGTGGTTCGAGATGCACCAGGAGATC
ATGATCGTCGCTTCCGAGGCCGTGCTCCCTCTCGGG
GTACGCTTCATTCGCCAATCGTCAACCACCAATCACCAATCACCAATCACCAATCACCAATCGACAACCACCAACCACCACTTA
CCGTTCACCAATCACCAGTGACCAGTCATCAACCATCATCACCAAATACCACTTATCGTCCGCCAATCACCAATCGCCAATCATGAACCATGAGTCACCGCGCTTTCGTTCGTTTTTGGC
CTTTTTCTTTATATATTTTCGTGCGTGTGTTATCGGGTCGTCCCTGATTTTTCGCGTGGCTACGTAAATTGCTCATTTTTTATTTCATGGTGAAATGCTGGTGTTGCCAGCCCGTCACCA
CGAGTTGTTTTGTTGACGTTGAGGATTGTTGCCCGCTTGGTGTTGTCGCTGTGGTCGCACTTGTGAGGTCATGTTTTTTTGTTTTTGTTGTCGGATTCGCAGATAGATCCCCACCGTTCT
CGCCTCCGGCCGCCAGCAAGAGCACGGCCTGCACGCACAACGGAGATATTCATTAACCGCCTTTAGCCGCACTCGTCACATATCCCCAGTTTGGCCGTGATTGAGAGAGAGGGGGGGGGG
GGGGGGGCAAATTTTTTTCCCTTTTTCAAAAAATTTTCCCCCCCCCCCCTTTTTTTCCCCCCCACCCCTTTTTTTTTTTTTCCCCCCAAAAAAAATTTTCCCCAAAAAACCCCCCCGCCA
AAAAAAAAAATGGCAAAAAAATTTAAACAAAATTTNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNN
NNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNN
NNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNN
NNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNN
NNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNN
NNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNCCCCCGGTGTGTG
AGAAAAAGACCCCAAAGGCCCCCCCAAAAAGGGTCCCAAGGGAGGTTTGGCCCCCGGACAGAAAAACCAAAAAATTTTTTTTTTTTTTTTTCCCCCCCCCCCCCCCCCCCCCCCCCCCCC
CGGCAGTTTTCAACAAGCACTTCCACATCTGGGCCGGGCGCTTCGCCTACCTGGCCGGCGTGGTGCAGTGCTACCGGGGGCTGGAGCTGGTGTCGAGCGACGACAACCTCGTGCTGTCCG
CGGGAGACGGCTTGGACCTCGAG
GCGAGATAGCCCCTGTGGGATGCCTTGTTTCGCCGTGTGTTTTTTGTAATGTTCCGTCATGTGGATATGGCCCCAGTGGGATGCGTCCGCGTGCTGC
TGTTCGTGTGTTTTGTACGTGCGGAGTGTGTGGACATGGCCGCTGGGGATTTCGTTTTTGCCTTGTCTTCGCAAGCTCGACATCACCTCAACAACCTGCTGCTGCTGACACGGAACTGGC
GTTACGTACGACGCGCTTTGTTGTTGGCGTTCGGAGCAGATATCGGGAGCTTCGGCGTGTTCAGGGATGTCGGCTTCCCGATATGGTTCGCGTTAGTCGGCTTGGGCTTCTTGGTCTTGG
AGACTCGGAAGCAGTATCGCAG
GTACGGTACGAGCTGTTGCTGCTATTCGTGGTGTAGGTGTTGTTGTTGCTGCTGTCGGTGGTGGTGGTGGTGATGTTGCTGTTGTTGTTGTTGTTGGT
GGTGGTGGTGGTGATGTTGCTGCTGCTGTTGTTGTTGGCGTCTTGGTATTGGTGTGATGTTGGCGCCGCCGTTCTCGTCATGGCCGTTGTCCTTGTCGTAGTCGTAGTCGTAGTTGCAAC
CGTAGTCGTTCTTGTTACCGGCGTTAGCTTGACTGCGTCTTAGCGCAAAACCCGCGTTTTGGCAGTGACGCGATCTATACATGGACACGCTTCTTGTGAGGCGGCGGATTTGCCTCCGTC
GTCGTCGCCCATTATGCTGCTCGCCGAAACAACCAAGGTGCTAGATACTGGTATGTGAGTACTGTATGTAGGGATTTTCGAAAGCGAAAGCCGAAGCTTACATCTATGAAGCTTTGGAAG
AGCCCCCACGAAGGGGAGAGGGGGGGGCTCCGGGACAGGCGACGTTCTTTCTGGCCTTGGCTCGACCAGAGCAGCGACCAAGGAGCTCCGGAAGAGAGGGAAAACGCGTAGTATCGTAAG
GTTGCGAGTCAGCGCACAGGGGTGCGATCGATTTTCAAAGTTTACTTTCTCCACAGCCAGAAGGGTGCGAGCGCCAACTGATACTTCTCAGGAAAGCTCTCGATGAGATATTTCCAAAAC
CACCATTTTCGACAGTGGTAACCTCCTGGTTGTGGAGAAATTACGCTTAGAAAATCGCTCGCACCCCTGTGCGCTGACTCGCAACGTTGCGGTAGGCTAGCGTAGCACGATTGCTGCTTT
TATCTCCCCTCAAGAAAGGAAGGAATACATTTAAAAAAAAATGCTAACCCTCGTCGAAATCACCGCTGCCGTATTTTGCGCTGCAACGTTTGGAGAGCGAGCTCCGTTTGTTTGCTTGTG
TCTTGTGAAGCTCGTGTACGCTTCGTTTTTGTTTCCCTTTATTTGTTTTTTGGTAACAATCTCTACCACTACATACTCCTGAGTACTGTTTTTTTTTTTTTGCGTATTGTTCCTTTCCAT
CCTTTATTCATTGTCCTGTTCTTCCTTTTTTTTTTTCTCTATCATCCTTCTTTATGGTGATATCAGGTGTACTTCCGCAAGGGCGCGGCCAACCTGTGCGGGTGCGTCGAGCTCATCAAC
GAGGAGTACACGGGAGAGAAGGGGGACGACGGCGAAAAAGTGGAGGACCGGCTCGTGCCTAGGACGGAGGCGCTACCCTTGTACACCGTAGAGGAGTTCAACGACAAG
GTGCGGTGATGC
TGCTGTTGCTCGATTTATTCGGGGGATACATATTTTGGTCGCGGTGTTGTAAGGGGCTCAGGCGTGTTGGGGGGGAGGGGGGGGGGGGGGGGGGCTCCTTTTTNNNNNNNNNNNNNNNNN
NNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNN
NNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNN
NNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNN
NNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNN
NNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNN
NNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNN
NNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNN
NNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNN
NNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNN
NNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNN
NNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNN
NNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNN
NNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNTCGTGGTTGGTTACATCGAGGAGAAGAGGAGGAAGGAGATGAGGAGGAGGAGGAGGAGGAAAAAG
ACAAGAGCGTGAGAAAGGCGGGAGCAGCAGGGGAGGAGACGCAAGCCGCCGCCGGCGGCAAGGAACCTAATCGGGCTTTCCGCATCGCCGGGAAGGCGGTATTGATGCAGGCGAGGGGGA
CTGCTGGCCGCACGTTCGACCTG
GTGAGAGTTTTCGTGTATTTTGTTGTTGTTGTTGTTGTTGTTGTCGTTGTTTGCTTCTGGCCTGTGGTATTATGCATGTTGCAAAACAACCGAGGAA
AGACGCGTGTCTCTGTGTGTGTCTGCTATGTACTGTAGCGTTGTTTTATTCGTTGCTGCGAAAGTACTTTGCGTATCCCGGAGTCGGTTGCGGCAACATGTTGCAATGAAAAAAATGTAT
TACGTACTTACTTGCAAAACCTTGTTACTCGTTGTGGCGAAAGTGCTTTGCGTACGGGAGGCGGTTGCGGCTGCATCTCGACAAGGTGGTACTGAAAATGGTGGTTACACGTACAGCAGA
AGCCTCGCTTGGTCTCAAGCCGCAAGCCGCAAACAACGCTGTCGAGAGACCCTGAACCATCATCAACATTGTAGCCCCCATCCCGCGAACAACGCTTGCATGCTTGCTTGCCGTCCTAAC
CTTTGCCGCCCCGTTCCTCCCCTCCCCTCCCCAGGAGAAACCAAGGCGGCCAAGGCCAACGAACTCGCTGTCGTTCCCGACGGCCCTTCGGCGCCCTCAGCCGGAGCGGGAGTCACGACG
TTCGGTGTGCCCGTGGTCTCG
GTAAGCGAGCGGTGCCGTTTGTTGTGGCTAAGTACCATGGCTTTTTTTGGAGATACACCCTTAAGGAGGGTACAATCTCCCCCTACCTGTAACGTCTGT
CAAAGACGATTCCCTGTTTTCACGCTGGTGTTGTTGTTGTGGTGGTGGTTGTTGTTGTTGTGGTGGTGGTGGTGGTGTTGTTGTTGTTGTGGTGGTGGTGTTGTTGTTGTGGTGGTGGTG
TTGTTGTTGTTGTTGTGGTGGTTGTGGTGTTGTTGTTGTGGTGGTGGTGTTGTTGTTGTGGTGGTGGTGGTTGGTGTCGTCTGTCCAGGAGACAAAAGGCAGGCAGTCCAGCTCCATCCC
CAGAGACAAGCCGTTTTTCTTCTTCCGCGCAGGAAGCCAAAGCTACGCATCGTGCTGTCGTTGCTCACCCGTATACCCCCCCCGACGGGTGAGGTCAATGGTCTAACGACTACGGCATGC
ACGACTATGAGTACCCCCCACCCCCCACCCCCCACCCCCGTATCTCCTCCGTGTTTCCCCCCCACACTTTCTTGCAGCCCCTGGTGCTGGTCCCGCGGCGTTGAAGCGGACGTGGTCCAG
CAAGAAGCTGCTGCAACGGTTCCACGTCTGCCCCCTTCTGTTCCGCGAGAAGATGGGCACGGACAGCCCGATTGGCCGCGGCCTGCTCTTCACCAAGCGACCGACGTACCGCTACATCTT
CTCGTGCCCCGGGCAGGCTCAGGCCCTG
GTAAGCCAACCTATAATCTGCATCTGTCTATTTGGACATACATATTTTCATGTGTGTTGTTCTCTTCTTTTTTTTTCGTGTAGACGTGGTCC
CGGTGTAGCGGTCCATCGTGTCAAATGTTGTTACCGCTATTGAGGGCATCGCCGCTGAACCCGATTGGCAGTAGGAGGTGCAGCCACGGTAGTTTGAAACGATATTGACGGCATCCCGCT
GAACAGGGGACCGCTAAACCGGGACCACGACTCTACCTCGGAAGTGTAACACAGCAATCGTCCCCATCCACGGATGAGTCGTTTTGTCCTTTTTTTTTTTTTCGTTGCAATTGGCGTTCC
GAATCGGCGAACAAAAGTATGTAGCGGTCAATCCCCCTCCTTCCTCTACCCTTCCTGCATTTTTCTCCTGTTTACCTTATTGCCCCCTCCTACTCGCCTTGCGTTCTACTTCCCCCCCCC
CCCCCCCCCTTGGGGCAACGTCGGCTTTGCCGCCCAAACAGGTGTCGAAACAATCCACGGGGTTTGCCACTTCCACATGCCCGGTCAGGTTCCAGGAAAGGTGCGACGTTTTTGGTTGTT
CTGTACTGCAGTCCTCCTCGGTTGTTGACACCTTTTTTTCACAGCTATGGATAAAAAAGCAAAATTCACCACGCACCCTAAAAGGAAGGTGGGGCACGAAAACAAAATAAAAAGAAGAAA
AAAGCCCCAGTGTTTACCCCCCTCTTTTGTGACCCTGGATGGTCGTAGTTGTACGAAAGAAATTCACTGAAAAGTGTAGTACTCTACATACACTGAAGAGTGTAGTACACGTTTCACCAC
GAAAAACGGGGGGACTTTCGCCATACCGGTAGTAAATAGGACGAAAACACACACAAAACACATGGTTTCTCATAGAGTGTACAATATAACATACTTACTTAGCCCAAGGGATCTGTGCGG
CCAGAACGAGGGTCGTCTCAACACCATTTCAAGACACGGGCATGTTTCCGATGATTGTCGCGTTTGCTGCCTGCTGCTGCTGCGGCGGCGGAGGCGCTTTGCCCTGACTGGTGCTGTGCA
ATATCTACACAACGATAAAGAAAGACACTCCCTGCGTTCATCGAGATGCTACAGTATCCCAATCGAAGGGATGCGTGTTGAGTGAAGTATTTACAGCAAATCAGTTAGAGAGTCGAGGGG
ACACCCGCTGCTGGATGATACAACTGCTGTCCCATGATCGCTCTTCGGTGACGGTGCTCCCTCCCTTTCTGACCCTTTCCCCGTTGCGTCCGGCTGGGTGGCCACCCGCCCGGCTCTGCC
TGCCATAAACGCAGGGGGCGTGATCCAGAGGGCCTACAACGCGTACGCCGTGCGGGTTCAGGGATTTGTCGACGGCAAGGACGGGAGCGGCAAGGGGGCGACTCCTCCCAGAGTCGTGCC
CGCGCAGGAGACCTCGGAGGGGGTGCTGTGCATTGAGATGAGGATCCGGCTTTACCACGACGGGGCCATGAGCCAACTTCTGGAGAAACTGTCGAAG
GTAAGGTTTGTATTTGTGTTGTC
TCGTCTGTTCTGTCTATCTCCTTGGTGTATTTTTTTGTTTTTGTCTGTCTGTCTGTCTGTCTGTCTGTCTGTCTGTCTGTCTGTCTGTCTGTCTGTCTGTCTGCCTGTCTGCCGGTGTAT
AGCACTCCCCGTAATCCTGGCATTGGTGAAAAATCAACCAATATCTGTGTTTCTCGTCGCTCTGCCTTGTTGCGACTTTTTTTCCCCGCGTTCTTCCGTGGTGATTTCCTCGTGGACCCT
CTCGTGTTGCTTCTACGTTCGTTTCCTCTCTCCTTTCTCCTCTTCTCGTGACGGTGGCTGCATACCATACCACACGTGCACACAGGAGACACGGACAATCCGGCGATCCAGCTGCAGGGC
CCCTTCATCATCACCAAGCTCGTGCCGCCGCCCGCCCACCGCAACGTCGTCATGATCGCCGCCGGCACCGGCGTCAATCCGA
GTACGTTGCGTAGCAGTTTTTTTGTGCCTTGGCAATGT
TTTTGGTGGCTGTTTTTTTCCTGGGTAAGGGCTGCCTCGTGTAAAAAAAAAAATGCGAAACAAGCCAGGGAAAATTGTCTGACACATGCTGGAGAAAGGGGCCTTGAATGCCCTCTTGTT
TTACCTGGGGCATGGTTCTTAATTTCTTGTGTGAAGGTAGAGGTATCCCAAGATGAACTATTCGTACGACAAAGTAGGGTTTAGTGCTGGCCTCTTGGCTGAGCACCTTTTCTTGTAGTC
AAGTTTCTTTGGCCATGGGTATTTTTTTTAGTCTTGAGGTTCTTAACCCCCTTGTTGAGGTCGTGGCGCGCTCTGTGCTATGCAACACCCCGAAAATTACCGGCACCATCTTTGTCTCCT
ACGACCTTCACGGAGCCCCCCCCCCCCCCCGCGCGCGCCGTGTGACCCTATCCTTTGTCGCTCGGCTTGTGTTTCCTTGTGTGTCGGGTGACACTCAAACAACACCGTACTTGTCTTCCG
CCAAATGTTTCCGGGACGACTTCCGTGGGTGACCATCGCGAACACGGGTCAACCACTCCTCACCTCGACAACCTGGCCATGGTATGGAATTCGAACATGGGAATCTCTCCTCGTGGACAC
GTACCCTCGACTGCAACACGTGCGCCGGGCTTGAAAATTCGTGTGATCATCATCACCGATGCTTCCAATCAACATCGACCTCAACACACACGCGCACACGATGGTCGACTTGAATGCAGT
GTGGTGCAACAAATTCGAGACTACCTCGCCCTTCCCAGGTACAGCAGTCTACCGTGCGTGTCCTTCCGTTGCGGAGAAGATTTTTGCTTACCGGGGTGCCGTGTTTTTCTGTAGGAGTAG
TAGTGGCTGCGAATGGTTCTTCTCTCCTATGCATGCCTTTTGGTTTGCTGATGGGCGCAGATATGTGTGTAATCGTTTGCCTTGGGTGTTGACCGATTATTTGATTTCTAGAAAGGCGTT
CCCTTTTGGGTAGTGGTTGGCCTGCTGTGGCCACGGAAGGTGTCCGCGCCAGCTGGACCATTCCGGGCATTAGAAGGTACCTCCAAGCACAGACCGAACGATATTTTATGTGAAAAATAC
AACCACAACAACCAATACTGAAACAACAAAAACGTTCATCCTCTTGTCGGTTCTTGTTGCACAATGCTTGCTGTGCTTGCTGTGCTTGCTATGCTTGCTATGCTTGCTATGCTTGCTATG
CTTGCTATGCTTGCTATACTTGCTATACTAAACGAACTCAATCATGAATCAATCGTTGTGCTTATCAATCAATCGTCGTGCTTACGAAACAGGGGGACCAGGCTCACTCTACAAGGTCCC
GGCTGTGCCTGATCTGGCAGAGCATGAGTGAGGCGGAGCTATACGGGTCAGAAGAGATTACGGAGATGCAG
GCGAGCTTTCCTTTTGTTTCGTTCGTTCTGTTGTGCCCCGCAGGCTGTG
GCATTTGGCCCCGTAAGCTTCCACGCGGCGAGCGTTGCGCGGGATCGACAGTTTTGGGAACTTGGTGTTTTCAACGGTTTACGCTGTGCTCGGCTCCGGCGCAAAGCAGCAACAAAACTA
CTGCCCATGATTGCGAAGCTCGCCCCTTTCTTCTCCACGGGCATAGGTTAGGTTATGAACGACAGTGACATGCGCGCGAATGTGCCTCATCCACGCGTCCTTCGAAGTACGATCATCTCG
CAAGAACTCGGGAGACCCTTTTGTTTCGTGAGAACGTTTTTTTTCATTCGTGTTTGATGTTCATGCCCCTTTCTGAAATGGGTCAGAAGGCCGGACTGCACCAGCTGCACGTTTTGTTCT
GTTTTTGTACAGGTATTGATATGTGTCGTGCAGCAAACCACCAGGTGCAGCAGGGAGGTAGACACAGAACAGCAGCAACAGCGCCAAGAACGACGGTCAAATCTCTCGACATAGACCTGC
TTAATTTCAAACCCTGTTACTTGGTGTCCAGGACACACGAGGAGCGTCTCTACTGCCAACACTCACGGCTCACGGCTATTGCAGGCGCTACACAACGAACGTTCGCCGCGGCATCCACGC
CACCTTGGGTCGTTTTTTTTTTTTAGTCACGGGTGCCCCCCCGCGCCGCCACTTCTTTTTTTTTCAGGCGCCAAGAGCAAGGGCCTTCTGGAAGTGATTGTTCTCGTGAGCGGGGACCAG
AGGCGTCGTAACGTTCCCGGGGCAGCTTTCCGCAGGGGCAAGAAGATGATGAGCAAGGCAATGGCGATGGTCTCCCCCGTTAGCAGCTCTCCGAGCAGCTCGGTCATCGCCATGCCAGCC
GGGAG
GTGGGTGGGTGGGGAGAAGATGCTGGCGTTTGGTTTGGTAGTATCGTTTTTCGTTGTCGTTGTCGTTGTCGTTGTCGTTGTCGTTGTCGTTGTCGTTGTCGTTGTCGTTGTCGTT
GTCGTTGTTGCTGTTGTTGTCGTTGTTGTCGTCGTCGTCGTTGGTGGTGGTGGTGTTGTCGTTGTTGTTGAGGTCGGCGGTCTGCACGACAAGTTTCGTTCGGTTCGGATCAAGTAGTGT
GTCGAAATCACAACGCAGGATTTAATTTTTCGAGCGACCCTCCCCGCCCAAAGGCACCAGCCACAAGAACAAGCCTGAAGAACAATCGTGGGTCCGGTTCTTGCTTTTCCCGTGCACGTG
CCCGTCCCATCAACCATCCCGCTACCGGTCAGGCGCCTCCCAAGGTCTACGATATGAGCCCGAACCGTCCGGATGATGAAGAACGTTCCTCGGGAGGCATTGCTGGGCACAAACGTGGCA
GGCGGAGATCTGATCAG
GCGAGCAAAAAAAGTTTTGTCTTACCTTTGTTGTTTTTCCTTCCTTTTTTTTTTTATCTTCTCTACGCCGGCCTGGTGGTCTCATTCTCCTTGGTTACGCTGT
TGTCTGGAAGATCCAGCGGTTGGTTGTAGGGCAAGGAGGCCTCGTGTGCATGCGTCCCCCTCCCCCCCCCTCCCCCCGCAGTAGGGGTAGGAACGGTGCTCCGTCTCGAAAGGGACGCGT
GGTCAATGGTCAAGTGACTAAGGCAAGCAACAAATGAGTACGCCCCCCCACCCCCCAAGTCGGGGAANNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNN
NNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNTAGAGCCTGGTCCGGTGA
GAACCCGGCGTGCTCCGTCCCGTCTTTGTAGGGATGCGAGGTCAATGGTCTAACGACTACGGCATGCACGACTATGGAGTCACACCCCCCCCCCCCCCCCCTTTTTGTGGGGGGCTGACG
ATGAAATGCTATATGTTTCCTTGCTTGTAAGAGGGCTGGCCGTTCTTTCGACGAAGTTACATGTACGCATTGAAGGCAGTCTACATATACCATGACAAACGTGAGGCGGTATCCCCTCGT
CGAAAATAGCGGAGAGGTTGAATCGGCAGCGCGTCGATTTCGAGTACAAGCGGGAAATAGAGGCAATCCTTCTCCTCGGAAGGGGTGCCAGGTCAGGTCAACGGCCTACGTGATCCAGCA
GCTGAGCGCCCTGTTGAACCCTCGAGCCGTTTCCATTTCCTGGGCACTTTTTTCATCGCGGCGTTGCCCCGCTCGGTTGTTGGTGTCCTCTCAAACGGGGGGAGGAAGGTACCGATTCAA
CCTCACCGCCACCGCTGCCTGTAAGAGCTGGCTTGCAAGGCTCTTTACTCACGACTATTTTATTTTATTTTCTCGACAAATCCTTTCAGCAACTTTTTGGTTTCCGATAACTACATTTAC
ATTGTAAGACGACCAGATTTTTTCTTGTTTTTTTTTTTTGCGTGGGGTAACAGCAAGTAGTATCTCACTTTCTTTCTTTCTGTGTGTGTGTTCACGTGTATTTTTTCATGGAAGCAGAAG
CCACTGGCGGCAAATCCGTGGGAAAAGTTCCAAGACGTCAAGTNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNN
NNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNN
NNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNN
NNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNN
NNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNN
NNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNN
NNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNN
NNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNN
NNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNN
NNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNN
NNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNN
NNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNN
NNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNCAACGGGGTCGCTTGATCGAGTGGGATGCTGGTCCATGATG
CCCCCCCCCCCCCCCCGCTCTTACAACGCGAAGATTTGCGCCAAACCCAAGCATATCCAATCGTATCGTCCTACTTCGTTGTGTCTATATACAAAGTAGCGTCAAATCTGATCAAATCAC
ACCTTTTTTTTTCTTTGTCCAGTCTGGTCGTGGCCCCCCCAAGTCTCAATGCACCACGTCTGCATTCATTCAATCTCGTGTCGTGTTCGAATCCGATGCGGTCATTCCATTCCAGGTGTG
GTGGTTTCCGGTCCGAGCGTGTTCGTGTTCTACGTGGAGACTATCCTGGCAGAGATGGGAGTGCCTTCCGAAGCAATCGTCTTCCTCGACTAA

Retrieve as FASTA  
CDS
Send to BLAST
.........1.........2.........3.........4.........5.........6.........7.........8.........9.........0.........1.........2
ATGACGGACGTCTTCTTGACGTCGACGGCCGGGCTGTCGGCGAAGAGCTCCCTCTTCATCGCGTGGGGCCAGCTGCTGACCTACGACCTGGCCCTCACGGTGGCCAACGGGACCGAGTCC
CTCGACGTGCCTTGCAACGACGCCGACCAAAACGGGGGGATAGACGTGTGGTGTCCCCTAGGTGCCGCCTCGGAGGATATTCCCTTCTCCAG
ATCGGACGCCGCGGAGGGAACGGACGGC
GTGCGCAGCCCCATCAACTACGCGTCCTCGTACATCGACCTGGACTTCGTGTACGGGAGAAGCGCGGAGGAGGCAGAGCTCCTGAGGACCATGGAAGACGGTTTCATGAACGTCACGGAC
AGTGGAGTGCCCTTCCAGAACGAGGACGGAACGTGGCTG
GTGGCGCCGGACATGGGCTTCGAGGGAGACGAGGACATTTTCCAGGCCTGCCGGGGATGGACGATCGCCATCTTTCAGCAC
GTGACGCAGAACGACTTCCTCATCAGGCTTCTCGGCATCACTCTGACGGACCTGGGCCTCGCCATGTACCTTGGCAGCGACGACGACTTCACGTCCACATACGACTCGGAGGGCGCGCAC
AGGAGATCGCGTAG
AGGGAGGAGGCTCTATGTTTCTTCGAACGACTACAACACGACGATCAACGCAGCCGCCGACGCGTTCACCGTCACCGCCGGAGGCGCGGCGTTCGAGTCGGCCCTG
CCGGGCACCGTGCGCATTGTGTCGGACGG
GTATGTCTCAACCGACGACGATAACGTCGAGCTGAACGTCGCGAGCGCCGACATGGCCGGGATCTTCGCGAGAAACAATGTCGCCGACGTC
CTTCGCGGCGCCGTGCTCTCGCCGGCCCTGACGGTGGACGCCTACTACTCCCCCGTGGTGTCCAACCTGTCCCCGCTGTTCAAGCTCCCGGTCGACGGCGTCCAGAGGGGGCGGGACCAC
GGGCTGCCCTCGTACAACGGTGCCCGAGAG
GCGTTCGGCCTGGACCCGGCGACGACGTTCGAGGACGTCTCCGACGATGCCGACCTGGCCTCGCGCCTGTCCGACGCGTACGGCGGGGAC
ATCAACGGCCTCGACGCCTTTACGGGGGCGCTGGCGGAGGGGACGCACTCGAGCACGGGGGGCGTGCTGGGGGACCTCCTCGTCGCCGCGTGGTCGGACCAGCTCACCCGCTCCATCGCG
GGGGACAG
GTTCTATCACCTACACGCCCGTTACATGGAGAACGTGGCCAACACGACGCTCATGGATGTCATAGGCCGCGTCACCAACGCCACGGACCTACCGCTCTCCGTATTTCAGGCG
CCCAGCATCACCGTCTGCGACGGAGGGTGCGCCGGGGACGGAGACGGCATCGCCGTATTGTCGGACAACTTCGAGCTAGAGTGGGAG
GAACTGGAGGACGACCAGATGGCCATCACGTTC
CGGTGCAAGGATCTCGGGACGTCGGGATGGATGGGGGTAGGGTGGGGAGGGCTGACGATGGAGCTGGCCCAG
GATTTTATCATCTGCGAGATCACCGACGAGAGCACTGCTTCCTGCACA
GATCGTGCTTACACCACCGAGCGCGAGGCACCCCCCCTGGACTCCGCGGGCGAGACCTCCCTTAACTTCACCGACCTCTCCATGGAGGACGGGTGGACGTCCGTCACCTTCCTGCGAGAC
CGGGGGGCGTTCGATGACCAGGACTACGACTTGGGTTCG
GACATCGACAACGCCGCGGACACTCTGATGATATACGCCTACCGAGAGGGGGAGGGTATCGGGCAGCACCCGAACGGAAAC
CGCGGGGCCGCCACGGTCAACTTCGCCACCGGCAACGTGGAGGCGGAGTGCGACGACGACGACTTCGTGCTCCTGCACGGGGCCCTGATGCTTGTCGCGTGGATGGTGCTGGCACCCGTG
GGCATCTACTACGTCAG
GTACCGCAAGGGGGAGAGGGTGAAGTGGGCCGGTTTCGAGTGGTTCGAGATGCACCAGGAGATCATGATCGTCGCTTCCGAGGCCGTGCTCCCTCTCGGGTTC
AACAAGCACTTCCACATCTGGGCCGGGCGCTTCGCCTACCTGGCCGGCGTGGTGCAGTGCTACCGGGGGCTGGAGCTGGTGTCGAGCGACGACAACCTCGTGCTGTCCGCGGGAGACGGC
TTGGACCTCGAG
ATCGGGAGCTTCGGCGTGTTCAGGGATGTCGGCTTCCCGATATGGTTCGCGTTAGTCGGCTTGGGCTTCTTGGTCTTGGAGACTCGGAAGCAGTATCGCAGGTACTTC
CGCAAGGGCGCGGCCAACCTGTGCGGGTGCGTCGAGCTCATCAACGAGGAGTACACGGGAGAGAAGGGGGACGACGGCGAAAAAGTGGAGGACCGGCTCGTGCCTAGGACGGAGGCGCTA
CCCTTGTACACCGTAGAGGAGTTCAACGACAAG
GAAGAGGAGGAAGGAGATGAGGAGGAGGAGGAGGAGGAAAAAGACAAGAGCGTGAGAAAGGCGGGAGCAGCAGGGGAGGAGACGCAA
GCCGCCGCCGGCGGCAAGGAACCTAATCGGGCTTTCCGCATCGCCGGGAAGGCGGTATTGATGCAGGCGAGGGGGACTGCTGGCCGCACGTTCGACCTG
GAAACCAAGGCGGCCAAGGCC
AACGAACTCGCTGTCGTTCCCGACGGCCCTTCGGCGCCCTCAGCCGGAGCGGGAGTCACGACGTTCGGTGTGCCCGTGGTCTCG
CCTGGTGCTGGTCCCGCGGCGTTGAAGCGGACGTGG
TCCAGCAAGAAGCTGCTGCAACGGTTCCACGTCTGCCCCCTTCTGTTCCGCGAGAAGATGGGCACGGACAGCCCGATTGGCCGCGGCCTGCTCTTCACCAAGCGACCGACGTACCGCTAC
ATCTTCTCGTGCCCCGGGCAGGCTCAGGCCCTG
GTCGAAACAATCCACGGGGTTTGCCACTTCCACATGCCCGGTCAGGTTCCAGGAAAGGGCGTGATCCAGAGGGCCTACAACGCGTAC
GCCGTGCGGGTTCAGGGATTTGTCGACGGCAAGGACGGGAGCGGCAAGGGGGCGACTCCTCCCAGAGTCGTGCCCGCGCAGGAGACCTCGGAGGGGGTGCTGTGCATTGAGATGAGGATC
CGGCTTTACCACGACGGGGCCATGAGCCAACTTCTGGAGAAACTGTCGAAG
GACACGGACAATCCGGCGATCCAGCTGCAGGGCCCCTTCATCATCACCAAGCTCGTGCCGCCGCCCGCC
CACCGCAACGTCGTCATGATCGCCGCCGGCACCGGCGTCAATCCGA
TGGTGCAACAAATTCGAGACTACCTCGCCCTTCCCAGGGACCAGGCTCACTCTACAAGGTCCCGGCTGTGCCTG
ATCTGGCAGAGCATGAGTGAGGCGGAGCTATACGGGTCAGAAGAGATTACGGAGATGCAG
GCCAAGAGCAAGGGCCTTCTGGAAGTGATTGTTCTCGTGAGCGGGGACCAGAGGCGTCGT
AACGTTCCCGGGGCAGCTTTCCGCAGGGGCAAGAAGATGATGAGCAAGGCAATGGCGATGGTCTCCCCCGTTAGCAGCTCTCCGAGCAGCTCGGTCATCGCCATGCCAGCCGGGAG
GCCT
CCCAAGGTCTACGATATGAGCCCGAACCGTCCGGATGATGAAGAACGTTCCTCGGGAGGCATTGCTGGGCACAAACGTGGCAGGCGGAGATCTGATCAG
GTGGTGGTTTCCGGTCCGAGC
GTGTTCGTGTTCTACGTGGAGACTATCCTGGCAGAGATGGGAGTGCCTTCCGAAGCAATCGTCTTCCTCGACTAA

Retrieve as FASTA