Entry information : EsilGOX01 (Esi_0012_0104)
Entry ID 16940
Creation 2021-01-30 (Christophe Dunand)
Last sequence changes 2021-01-30 (Christophe Dunand)
Sequence status complete
Reviewer Not yet reviewed
Peroxidase information: EsilGOX01 (Esi_0012_0104)
Name (synonym) EsilGOX01 (Esi_0012_0104)
Class Glycolate Oxidase    [Orthogroup: GOX001]
Taxonomy Eukaryota Phaeophyceae Ectocarpaceae Ectocarpus
Organism Ectocarpus siliculosus    [TaxId: 2880 ]
Cellular localisation N/D
Tissue type
Inducer
Repressor
Best BLASTp hits
Perox score E-value EsilGOX01
start..stop
S start..stop
EsilGOX03 431 1.53e-150 11..394 23..404
PpaGOX03 425 8.09e-149 25..394 5..368
PpaGOX02 422 8.18e-148 25..394 5..368
MpGOX01 421 3.92e-147 28..394 10..369
Gene structure Fichierperl './assets/cgi-bin/draw_exon.pl' '16940' 'complement(join(565654..565794,568668..568808,569898..570032,570559..570648,571064..571270,571839..572005,573347..573465,574452..574569,579752..579818))' Exons
ExonStart..EndSize ExonStart..EndSize ExonStart..EndSize ExonStart..EndSize
N° 1 579752..579818 65 N° 2 574452..574569 116 N° 3 573347..573465 117 N° 4 571839..572005 165
N° 5 571064..571270 205 N° 6 570559..570648 88 N° 7 569898..570032 133 N° 8 568668..568808 139
N° 9 565654..565794 139  
complement(join(565654..565794,568668..568808,569898..570032,570559..570648,5710 64..571270,571839..572005,573347..573465,574452..574569,579752..579818))


exon

Literature and cross-references EsilGOX01 (Esi_0012_0104)
Protein ref. GenBank:   CBN74053.1
DNA ref. GenBank:   FN647877.1 (579818..565654)
Protein sequence: EsilGOX01 (Esi_0012_0104)
Sequence Properties
first value : protein
second value (mature protein)
Length (aa):   %s   394
PWM (Da):   %s   42122.39  
PI (pH):   %s   8.36
Sequence
Send to BLAST
Send to Peroxiscan
.........1.........2.........3.........4.........5.........6.........7.........8.........9.........0.........1.........2
MNDPMTAPVGPAQDTSKAPEAAGWEPVNVREFERHAQLMLSKNAFDYYASGANDMVTLRENRAAFNRLRLRPRILRDVSMVDTSTSVLGQKISSPICIAPAMQRMAHDSGECATAGAAAK
AGALMTLSSWSTTSLEDVAKAGGPGGARWFQLYVY
DRKITEQLVKRALAAGYTALAVTVDTPVLGRREADMRNRFKLPEHLTMGNFVSAGGAHASGTKDGGNDSGLAAYVASLIDRTLDW
NDIKWLRTICGSMK
IVVKGVMTAEDAAESVRQGVDGIWVSNHGARQLDTTPATIEVLPEVVAAVSGRCEIYLDGGICRGTDVFKALALGAKAVFIGRPVLWGLAHSGEEGVSKVLKLLHD
ELVMALQLTGCTRVSSASRSMVTHQTSYYSKL*

Retrieve as FASTA  
Remarks
DNA
Send to BLAST
.........1.........2.........3.........4.........5.........6.........7.........8.........9.........0.........1.........2
ATGAACGATCCCATGACCGCGCCGGTGGGCCCCGCCCAGGATACCTCCAAGGCTCCGGAAGCCGCAGGTCAGTGAGGAGGCACGTCCGGCTCGCTCTCGCGCAGCATGATCCAAGAGCCG
TCATACATAACATACTGCTGTAGGTGGAACACATACATATTTCGGCCGAGCAAGTATGATATGATGACCCTAGGGCCCTGCCATTTTGGTCGGGGCGTTGTGCCTGTAGGGACGTGGGGT
TTTATGAGAGGACCTCCCCCTCGGCTTGAGATGCCTTATCCTATTGTACGCCAGGCATCGATAGAGGTTTGGGCATGACTTACGCACGCAAATTCCTAATGAAAATTGGGCCGCTTTTTT
CCACAGCAGTGTTCAAGGAGACGCCGGCACACGCACCAACCCCTGTTGTTATCATACGTCATACGCGGGGTCAGACAGATGTGCTAGCCCCCTGTGCTGTTCTGTTAGACACCTGGAAGT
ATTTTGATGTTCGTAGTGTGGAGCTAGGTTTGGTCACAGCGTAGGACGGAGCAGCGAGTAGAGCGGCCTATTCGATCACCCCGCTCTTGCTGTATACCTGTGTCCATGTGTGGGTGCTGC
AACGTGTGTATGTGTGTCACTCGCGGTACCTAAATAGGAGGCTATCGCCATTTGCCCCAGATGCATGACCGTGGTTGTACTGCAGAATGAAGAGCAAAGCATCTCATTCACATTCCCAAC
AGCGTTATGTGCCAATTGCGACGTTGCGTTTCCAAACGGTAGTCTACCCCCCGTAGCGGTAGGTTGATCTTCTTCATGAAAGGTCCTGGGTTTTGTTCGGCATGCATACGCATATGCATC
ACGGCCCGTGGAGTCATATGGATTTGATGGACAACACCCCACACGTCATTCTCTCTCATATTGTATAAGCAGCATGTGCAGTTTGACTTCTGTGGGCTAGATGAACCAAAAAAAAAAATC
TGTACATTCTCCACACGCGGGGCCTTGGCGTCGGTAGACATGACTGTCGAAAGCCTTGATGGTGGCACGAACCGTTGATCATTCACATTAAATGATTGCCGTCCGTTGTGAAGGGTACAC
GCTGATTTATCGTTCCCGGGGGGTGTCCTACAACGAAGGTCGCTTTCGACCAAGGAACACGAACATTTTGTCGAATGTATGCTACGGCTGCACATATCCAAATCTATCGTTACGGAAGAT
CGTTGTACGTTGTACGGCTTTTGAACACTGCAATAGAAGAGTTCACCGACGCTGAACCATTCAACCATTCACAGCAGTTACTGCCCCAAGGTTCCTCACAGGCCACCACAGCAGTTACAA
TAAACGGTTATTTTGTCGAATGAACATATGTTCATACCTGCACTGCACAAACCTATCTTCGCCCCTCAGAATACCGCTTTCAGCAAGAAATACGACCAACCGCGTTCAGCAATCCATGAC
GAACGAACGGACGTCGTTTTCGACCAACGAGGAAGAAATTTCTGTCGGATAGTGCTGTACCCCTGCACAAGTTTTGCTTCTCTTCCACCTAAGAAATATCGTCTTCAACAGCCGCCTTCG
GCAAGCGCGCGGTTCCGGAACAAGGGCAAAGAAAGGTCCCCGGCGGGGGTTCCGTGGGGAGAAACAAGCACCTATCACACACCCCGGGTGCAAGAAGCGCGCCTGGAGGAGGGCACCTGG
TCGAGACCTGCTTGCAGAAGAAGCAGCAGTAGTAGCCTGCTGTGCATAAGATGCGGCAGCATCGGGGTAGAGGGAGACAACCGGCAAGCACCGCGCGGCCATACTTTGCACCGCGCGGCC
TACTTTGCAACACCCCGTAGCTACAACTCCAGGCCCATACTGCTGTAGCGTGATGTGTCAAGCGATCGCGAAATGAAGGGTGTACAATCACCCAGGCGACCGGGCGACCTCACCCCCGCG
CACGTGACAAACAACACGGCAAGCAGAGTATGAGCGACTTCTGACTTGCCCCCACGTCATAGCAATTGGAGTTTACGTTAGAACAACAAAGGCGGGTGTACCGACAGCTACTGACCTGGA
GAGTTTTCTCTTCACGCCTTTCCTGCCCAATTGCAGGAGACATCCCAACTACCGCAGGCGCCACGCTAGCTGGGCAACCCTGCTCATCGCTATCATCGCTAGGGTCCTTGTGGCGGGGAA
TACGGTCTTCGGCGCACGAGCGCTTCGCCATCGGACGACTAATCTCCCGGAGAACACTCCCGGAGAACAGTCGTCCGATGGCGAAGCCCTCGCGCCGGCGACCGTACTCCTCGCATCGAG
GACCCTAGCGATGATAGCGACGAAGTAGGGTTGCCTAGCTAACGTCACGCCTGCGGTAGTTGGGTTGTCTCCTGCACTTGGGAGGAAAGGCGAGAGAGAAAACTCTCCACGTCAGCAGCT
GTCGGTACACCTGCTTTTGTTGTCCCCCGCGAGAGTTTTATCATCGGGTGCGCTGCGACATATGGCCACAAGGTTTGGTTCGGACGGCACAGGGTTGCCGTTGACCTCTCTAAACATTCG
AGACAGGGTGTCGGGGACAACGTTGAGGTTTTCCGGCAAGTGTTTGACAGTGAAATCGCAGTTCTGGAGTGCGATAGCCCAGCGCGTTAACATCGTGGACGTGTCTTGCATGTGGTACAG
ATACGTGAGGGCTTGGTGATCGGTGCCACACGTAAAGCGATGGCCCCAAAGGTATGGGCCCCAGTGGGTGAGGGCCCAGACGACCGCAGTGCGTCCTTTCATGGTAGCGCTGTACTGGCG
TTGACCTCGAGAAAAACATTTGCTATGAAACGCGACGATATCGAGTTCTTCACCGTCGGCGCTGTCGCGGGATGGCTGAGCGAGGAAAGCGTCTACGCCTTGTTCAGAGACGTCGGTGTG
AAGGAAGGGCACATGTTCCCTTTCAGACGATGTGTGGTACAGCTCACGGCTGGCGCACAAACAAACCCAATCTTTCTTTCCTGTCACTTGTGTCCACGACGCGTCTAAATTGACGGTTCC
GTCGAAGACTAACCCGTGAGTAACACCACAGTGGACACAGCTACTGGTGTCTCTGACGGGTCGCTTCCATGGTTGCAATCCCGTCCGACCTTGCCAAATGCGCCGCTCAAATCTATCCCT
CCGGCCGCCGTATCCCTTCGAGCAGCGAACGGGTCGCCGATACAATAGAGGTCATGGTTTTCATTGTTTCCATGAAAACTTTGCGGTATCACCCGTACCGTCACCGCCCTTGTCGTCCCT
TCCCTAGGACCCGATTCCATACTGAACAGTGCCATGTCGGATTTTGGTGCGATCTTTGATTGGGAACAACAAACTTCTTGTCAATGGAGCCGCGGACAGAGTGGGCGACGGGGGTGTAGA
TGGCCGAGGAAGGGTCGTTGGGGGGGGGGGGGTACCCTTGGATTTGGGGGTGCCTCCGGACGGGGGATCGCGGATTCGGCGATGACGGGGGGATCGAGGCTCTAGGCGTTACTGCCGGGG
GACAGTTGCCCGTCGTTGTTGAGGTTTGAGAGGGGGGTTGTCCTGCTCGGAAGACATCGTCAGAGATAGGACACAGGTTATAACCCTGCCGCTGACGCCAGATATAAGGTTTAGCTATCT
CAGACCGAAAAGAAGGGGTTGAACCTTCTCNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNN
NNNNNNNNNNGGAAACTTCCGTTCGGGGGGGGGGGGGGGGCGAGTGTACAGACAGACTTGCGGGACAGGAGTAGTTATCTAGGAAGAACCGACGGTGGTCTCTAGGAAGAGCCACAGGCG
GTCTCTAGGAAGAGCCAACGGTGGTCTCTGGGAAGAGAGGGGAATAAGAAGAAGAGGAGGGACAGAGTAGGGGCGAAGAGGGGGGCGCACAGCACACCCCTCCTCGGCGCTGACGACGCA
CAACCAACCTGACGAGGCGGTTACACAGATGCAGCAATCTGCGCTGGCGTGCTGACGCTGCCGGGGAATCTCCCCGGGCGTGAGAGGACAAACAACCAGCCGGACGGTTGCAGTATGCGC
GGTAGACGGGCATGGGCATATGTACAGTGACGTAACAGACGCGCGCGGACCGCACGGAAAGATCCGCAGAACCGGCGGAGGAACAGTTGGACGAAGAGCCTCGAGTGGAGGGTGTATGGC
AGCAAGATCAGAGACTCCCATTTAGCAACTTCGATCGTGCATAGACGAGGGGGGCGTACGCGGGGTGTATAGCAACACGGCTAGCAGAGTATAGCGACTTCGGTACGCCTACGTACCTCC
ATTTTTCGTGCCGAATTGGGATTCGTCTGTAGCCCATTTATTCTGAATTCGTCGACAGCCCATTTATACTGTTACACTGCTGTTGTATTTTTGCTCTTAGCCTAGGATCATTTTAGTTAT
AGTGTTGGATTTTGGATGGTGCCTCCAGCTCCGGCCTGTGGTCCCACCCATGATTTCCTGGCCGTGTCTTCTGTTCGGGAGCGTAGCGGCCCGCCTGGGCGGGGTGATGGCGCCCGTTCC
ATCTTGCGCACAAGCTTGATCTCTTTCAGGGATCCTATTATAGCAGGGTGTCAGACCACACGCCTTCACAGTTTGGTAGTTTGCGTGTGAAAGCAAAGCGGAGGCTAACGGGGTCCCCTT
AAGCGGCGAGACACGGAGACCCGGAGACCCGGGCGGGACAGGCCCCCGGAAGCNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNN
NNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNCAGAGACTTGGAGAAGGGGTCGTGACAAGCTACGAGTCTCGGAGACCCGGAGACTCGAAGCTTGTCACGACCCCGTTAGCGTCCTCT
TCGTTTTCACACGCAAGCTACCAAACTGTCAAGGGGTGTGGTCTGACACCATGCTAGGAGCCGTGAAAGAGATCGGGTTTGTGTACAAGATGAACGGGCGCCATCACCCCGCCCAGGCGC
CCAGGCGGGCCGTTCCCGTTTCTTCTGGAAGTAGGTCTCGACCAGAGGTGCCCCCCCGGGCGCGCTTCTTGCACACCGGGGTATGCATGACATGAGCACCGAGGCCATTCGTTTCCCCGA
CGGAACCCCCATCAAAACATCGTCGCATTCAACCATCGAGTACATACATACGCACCCGATGTCACTTGTGCTTGTTACGTACGTCACAGGTGTTGGGAGCCTGTCAACGTGCGCGAGTTC
GAGCGCCATGCCCAGCTCATGCTGTCCAAGAATGCCTTCGATTACTACGCCAGCGGGGCCAACGACATGGTCACCTTGAGGGAAAACAG
GTGAGCGGGGGGGGGGGGCGTAGATGCCCGC
GCTGTGGAGGTAGTATTCATAGCCGTGCATGTCGTACTATGGGGGGGAGGGGGAAGAAACTGATGCCCACATCACGGTGTTGAGCAGAACGGCAGTCTTGTGTGCTGTTGGAGCGCGTTG
GATGCCAATCCTCGGAACACGCTTGGCCGATGCCGGTTCTTGAGCAAACGCGCCCCGGCGGCCGCAAGCCGGCGTCTGGCTGAGCTTTGGTCGGCACGTCCGAAGGGTTTTTCCGTGCGT
GGATGGCGTTGTCGAGGACGAAACTGTTCTCGCTACAACATGTAACTGTCAAACCATGTCGCAACCCTCGACCATATTTTACCATCATGTGGTACGTGTGTCTCCTGAAACAACACGCCT
CCGAGTATGTATGTCTGTGACGCCCAACGTCAGCAGGATAATCCATCCGACTGGACTCATCGAAGTCCTCTCTGTCCGAAGGGGAGGACTATCACCCCCAATGTTTTCTTATCCGCACAC
CTACCCATGACGCCTAGGCAACCAATCCATACCAATGGTATGGGTGTCGCCATAACCGTGGCCATTTGGCTTCGCTACATGGTTTGACCTGATGTCGTTTGCGGCAACCCTGTTTTTTTT
TTTTTTCTTACGAAAACAAAAAAATAGGATCTCTATCCACAGATATTTGAGTGTCAAGGAAGGATAAATCAGCAATGAATTAAAAGTGTTTGTGTAAACAGCAGTAGACTTGGCAGAGTT
TTTTCCATCAAGTAATCATGTTGGGTAGATGCAACAATCGGGGAGGTTGGGTTCGGGATAGGGAATAGGGTCTGCGTACCCCCTCCCTGGTCTATTTCCTCAGATCCTTAATTTCTCGCG
AAGTCGGCCCTTGTGTACAACAACCTGCCTGTGGTCCTACCGCTGTTATCCGAGCAACATACTATCTACAACCAGCGACTTCGGCTATTATTATTCACAAAAAATGAATCCACAGGGGGC
GGCGTTCAACCGGCTCCGGCTTCGGCCCCGCATTCTCCGAGACGTGAGCATGGTGGACACGTCTACCTCGGTCCTGGGGCAGAAGATCTCTTCTCCTATCTGCATCGCCCCCACCG
GTGA
GAGCTACTGTTGTAGTTGTCTGGAATAAAACACTGTGTTGGATGTTGGGTAGCGGGTTCTTCAGGCAAAGGTTGAAGGTTGACCCCAAGACCTTGACCTTGGTCATGAACGAGTGCAGAA
GGGGGGTCGATCGCCCTGCTATTAACACATATGTACCAACATGTAGTGTGTGAAGACAACAAACAACATGATACACGCCGCCTAGATCTCCGCATGTCCTCGATCCCTGCTATAAGCACA
TGCCAGCATTTGTTGTGAAGACAGCAACAGCATGATACACGCCGCTTAGATCTCCGCGTGTCCGTATCTTGTTGCTGTGTACACCTTACAGGAACAGCCCCCCTGCTTGGTACCAACTAG
ATACGAACTTTTGATTCCTCTCGTCTTCTCGCACAACATGGCCGTAGTAACCTCAAGCCCCAAATCTGGATCTTCCATCGTTCCAATTCAAACAAATCTCTCAAGGGGAAGTGGTCCCGT
CGAAGTCGTCGATCTCCCAATGGAACTCCATCTTGAATGATTTCAGACTATGGACTTCTTTTGGGAGATGGATGAATTCGAAGGGACAATATTTCTTTGGTAGATTGGTTTCACGTTGGA
ACGATCGATCATCCAGGTTGTGATTTGAGGTTCTCACTTTTGCGATAAGATGAGAAGGAGCACGACGAAGATCGCACCTAGTAACGAGGAGGGGGTCTGTTCGTTCATGTAGTTGTGCAC
TCCACCAAGATACGGGCCTGCGGAGATCTTACGGGGGTTTCGTGTATCATGTTGGTCTGTGTTTCTGTTATAGAAGGGCTATCGATCCCCCCTTCTACCGTGCGTGGGCCAGCGTGCCCC
CTCCACTGGTTCGCAACCAAGGTCCTGAATCAACCTACACCTGACATTGGACACACGTATATTTTTGTTTTATTGTTGTCTCTATGGTTACGTCGCGGCCAGTCCGATGTAGAATCTGTG
AGAACGGTTGTGACACTGTAGGCGAGCGACACTCTTGCACGAGAATGGAGCGGTGGAATAACCAGACTGGAGCGGCGGAATAACCAGACCATCAGCCCCAACGAAGAGCCCCATCGAACG
CCCGCTCCACAATTGTAACTGCAGCAATTTTGATGCTCCAGTTTACGCCAACAACAGAAAAACGCAGCTGTTTCTTTCATGTGTTCCTTCTCAACATGGGTAGATTTCGGACTTTTTCGT
CGCGTCCGCAATAAACCGAAAAGCACGTGTTTTTATTCGTGTTCTCTCTGTAAACTCAAAACACACGCCTCTGTCTGCGCAGCAGCACTAACAACAAAAATGGTACGTGATGAATTCGAT
AAAATGCACCACCGCAGCGCGATGCAGCGCATGGCCCACGACTCGGGCGAGTGTGCCACCGCCGGCGCCGCAGCCAAAGCGGGCGCCCTAATGACCCTTAGCTCTTGGTCCACCACCTCT
CTCGAGGACGTCGCCAAGGCCGGGGGGCCCGGGGGTGCTCGCTGGTTCCAGCTCTACGTCTACAAG
GTGTGCGCATGTGTTTTTTTTTTTTTTTAGCATCGACCGCGCTAACCAGCTGAT
TATTGAAAAGACGTCAGTCTTTGTGCGGTTGAAAACACAAAACAATTCTGAATGCCATTAACTAGGCGTGTGTGTGTTGTTCTATTTCTTGACATGCCATCTGAATATTAAATTGTGTAC
TTGCCCCGGTTTGCTCGTGTGACTGACGCTCAAGGCCCTGTTTCCTTTTTTGTTTTGTGCCGAAAGAGTAGCGCGCGCTCACGATGTGCACCGTTGATGTGAAACGAACAGCAGCCATCG
CTGCTCAAATCTGCCTTGTTATCGTTTGCTGCTTCTAACGCCCTCACTTGATCGCCGTCGGCGAACCACAAATTTACAATATTCGATCAAAAACATGGATAAAGGAGGAAAAAGCAACGC
TGATTATTTATGTTTTTCATATCCTCTCCACGGCAACGTTTCCTAAACAAGTAAACGCCCCATATGATGCCTCGCCCTCCCCAGGACGCGTGATCAGTGCTCTAAACGCACGCACAACAT
CCCCTTCTGTTACATAATCACCGGCGTCGAACAGGAGACCGCAAGATCACGGAGCAGCTGGTCAAGCGCGCCCTCGCGGCCGGATACACCGCCCTCGCCGTCACCGTCGACACGCCCGTC
CTCGGCAGGCGGGAGGCCGACATGCGCAACCGCTTCAAGCTGCCGGAGCACCTGACGATGGGCAACTTCGTGTCTGCTGGCGGCGCCCACGCCTCGGGGACTAAGGACGGGGGGAACGAC
TCG
GTGAGATTTTTTTTTCTTTTCTGTTCTTACACGATCACTTTTTTTTTTTTCTTCTCTTACAGAAACACTTTTTTTTTCTTGAAATTCACTTCTTTTTTTTTCTTCTCACAAACACAA
CAAATACACCTGTTTGTGTCAGGCCTCTGGATACGGCGGTTGAAGCTTTTCGGGAGTCTTTTTTTTTTTTCGTTGACATGATATATCCATCTCTATCCATAGATACGGGTCTAGGATGAA
TTTGAACGTGAAATCATAGCGTTAGTGGAGAGGCACTGTGGTTTAGTACCTGTGATTATTTTCAGTATCCTTGCTGTTGTGCTTGACGATATTTTTGTCGTTATGGTTTCTGCCCTTGCT
GTTGTGCTTGGTGCCCGTCGTTCCCCCCGGCACCTACCGTACCTATGGCATACCTCAGGGGGCCTCGCAGCGTACGTGGCAAGCCTTATCGACAGGACGTTGGACTGGAACGACATCAAG
TGGTTGCGCACGATCTGCGGCAGCATGAAG
GTGAGAATCGATACTGAGACAGGTGCACGCAGGCACGGTGTAAAAGTACCGTGGATGGTGTTCGTGGTTGGAAGACATGACAGATTTAAC
CCTGGCCCTGTGCGGGGACAACCCTCCATCCTCTGTAGGACGCGTGGTCAATGGCCTAACCGCTAAGGCATGCGAAACTTGAGTACACCCTCCGTACCTCCTCCCTCATGCCAGCCAGAT
CGTGGCGAAGGGCGTTATGACCACGGAAGACGTCATGTCTGTGGCCGAAAGACGTCACATGTGGTCGTAAAACGTGTACCACCCCACCCCACCCCCCCTCCACCCNNNNNNNNNNNNNNN
NNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNN
NNNNNNNNNNNNNNNNACCCCCCCCCCCCCCCCCCCCCCCATCCCCTCCTACCGCGTATCCTCCCATGCCAACCAGATATCGTGGTGAAGGGCGTTATGACCGCGGAAGACGCTGCGGAA
TCTGTTCGCCAGGGCGTGGACGGCATCTGGGTCTCCAACCACGGTGCTCGGCAGCTGGACACCACGCCGGCTACCATCGAGGTTCTCCCAGAG
GTGTGTGTGCGCGCGATCTGACCGAAC
CCCCTTAACGGGAAATTGTGTTTTTTTTAGGAAGTACTGTAGAACATGGAGGGGCTACGTAGACCCTATTCCCTTAGCCCTAACCCTAACCCTGACCGAACTTCCCCTATTGCATGCTTT
ATCCAACATGTTTACTTCATACAAAAACCCTGAGAAGGCTACTGATCTGACCAGTAGTGCTTGCCCCTCCGTTGAAAAATAAGGTTCCCGGAATATTCTCGTTGTTGTTCTGGTTCTGTT
GCACAAACCTTCAGTAGTGCGGCCCTAGCCTTCCACCGTATAGCAACCTAGCGCTACAATAACTGGCAGAGACGGTTGAAATGTATTTGAAAATGTATGCAGTATTTTTTTAATATTACA
TGTACACTTAAAGAGAAGTTGGTTTTTTTGTTCGAGAATACTGCTGTACACAAACAATTGCAGAAACACCACCTGTTCGAGATATGACCACGCTCGAGGGGGAAGTCTTGTCATTTTTTT
CTTTGGGTTTTGGCGGTACGGTATTTTGGTACCGTTTCAGCTGTAGCAGAAGATGGAAGGCAGTGCACTGCATGCTCCAAGAGATTGCGGCACGAAGAAGAGAGGAGGAGATGCAGACAG
CTGCGCTCTTCATACTCGCTGCCACACAGACGTTCCCTACAATTCATAGCGGTGACAAGTATGCCAGCGTGTTTTGGTTTCGGCCGTTCGCGGTACGGCAGCCGTAGGACCTCCCTCTAC
TTGGCTCGATGTTGGAACACACTATAATAATAGTAGTAGTTCCTTGAAAATATAAAAAAAAAAACTGCCTGTCGGTATGGGGAACCCGCCCCCCCCCTCTCTTTTAACCTACAGTACAGT
GCCGGCCTTTGCAAAAAAACGATTTTTTAGATTGACCAGCCATCAATTTTGACTGCGGTCATGGATTTCGACGTGTACGTGTTGCCCATGATAGCGTTGGCAAGACCCAGTACTATCAGT
CAACGCCGCAGTTTTGTCAATCGTCGTGGTGGCGTCTTGCCCCGTACCGGCCCGCTTCAATTCGTGTCTTATTTTCTATGATACCTCTTTTCTTTGTCGCAGGTGTGGTCGCGGCGGTGA
GCGGTCGGTGCGAGATTTACCTGGACGGAGGCATCTGCCGGGGCACGGACGTGTTCAAGGCGCTCGCTCTCGGCGCCAAGGCTGTCTTCATCGGACGACCGGTTCTTTGGGGTCTCGCGC
ACAGC
GTAAGTTAAAAGACTAGTTGTGCTGCTGTCATCATTGTATTTATTCAACACAACGTGGACACACACGTGCTCTGGGGTCTCGCGCACAGCGTAAGTAAAAGACTAATAGTGCTGC
TGTCATCATTATATTTGTTCAACACAACAGCGTAAGTAAAAGACTAATAGTGCTGCTGTCATCATTATATTTGTTCAACACAACGTGGACACACACACACCTGCCCGTTGTTTCGGCTTG
TCCACACAACCTGTTGCTTTTGAGCACATGGCTGCTTGGTGTCACACATGTATATAGGGAAGTAGATAAAAGTCGGTATGCCCACCCCTTTCTCGGACAAAAACGAGTACTCGAAAAACC
AGCTCGCGTGGACGCCGTAGGCGACCGCCAAACGCCTGAAAGCCCCATTCTGGCACCTGAACAGCATAGCCAGGGACCATACACAGCACGGGCATCAATTTTTCCGACTTTCAGGCACAT
CTTTGGCCCAAATACCGCAGCAACGCACAAGGCCTCAACGCAGTGCCTGCCCACTTTTGTCCCCACTTCTGTCCATTTTTCTACTCATTTTTTATCCGAATTGTACCCACGTTTGTCCAC
TCCCGGACATCGAAAATCCACCTCCACGTCACCGCACGGCAGAACATCGAATAGCCTCGGACCTTCCCAACACCGCACCCGTGGCATATTTCGTATCCATGACATCGTTCGCAAACAAAA
TACCGACTGCGGACCGCGGCGGTGGAACATGTCAGAACATGTCAGACCCCTTCGGTTTCCAACGGCAACCACATGGCACGGAAAAAAATACCCGGTAAATACCGCACTCACGGCACGCCA
CAACAAATTTGCGTGACACACAGCGGACAACGAGCCAACGGATCGCGCATACAGATCAAGTACAACCGGACAAACCCACACACTACACACACACACACATGACACACACCACCCACGATT
GCACCGACTCGTGAGGGCAGCATGTCAAGGTAGTTTTGCCCTGCAGGGCGCCCGCCCCTCTGGGGCAAGCACGGGCACACCTCCGTCACTTGGGTGTTGGCCCCATGGTTGTGGGTGCGA
CTTTTGCTTCATCTCAAGGACTTCAAAATGGGCTTCGAACGCGGATTACATGAACGCGGATTCTAGGCAGCTAGAAGGCGACCGAGGTGGAACTACCGTAGAGCATGTGGCCTGCACCGG
GCAGGCCGAAAAGAAGAAGAAGCACATCAACCGCAGGTGTGTGGAGGAGGAAGCGGTGGCGATTGTGAACTCCAACAACAGCATTAGCGCGCCGGAACTCGCGAAGACGGTTCGGTTGGC
TNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNGAAGACGGGGCGGTT
GGCTTAAGCGTCTTCGCGAGTGCCGGCGGCGGTTGGCTTAAGCGTCTTCGCGAGTTCCGGCGCGCTAATGCTGTTGTTGGAGTTCACAATCGCCACCGCTTCCTCCTCCACACACCTGCG
GTTGATGTGCTTCTTCTTCTTTTCGGCCTGCCCGGTGCAGGCCACATGCTCTAATTCCACCTCGGTCACCTTCCAGCTGCCGTCCCGTACCTTGCACGCCTGGGCACGTACCGGGCAGCC
GGATGTCGCCTTCCCGGTGATGATGACCCCGGTGCAGCGGTAGAGCTTGCCCCTACCGCCATTGACCCTGGGGTCTAAGCGGCACTGCCTAGAATCCGCGATCGTGCGTCGTTCGAAGCC
CATTTTTGAAGTCCTTGGACGAAGCAAAAGTTGCACCCACAACCACGGGGCCAACACCAATTGACGGAGGTGTGCCCGCGCTTGGCCCTGCACGTGGGTGGTGTGTGTGTGATGTGCGGG
TTGTTCTCTGTTCCGGTACTTGATCTGTATGCGCGACCCGTTGTCTCGTTGCACGCGTTTGTGGCGTGCCGTGAGTGCGCTACTTGCCGGGTATCTTGCCGTGCCGCGAGGTTGTGTGGT
TGCCGTTGGATACCGAAGGGATCTGACATATTCTGACATGTTTCACCGCCGCGGTATTTTTGTTTGCGAAAGATGCCATGAGTACGGTGTTGAGCAGGGGTTAAGGTTATGGTCCAAGGC
TATTCTCGATGTTCTGCCGTGCGGTGACGTGGAGGTGGATTTTCAAAGCGGACAAAAGTTGGTACAATTCGGACGAATGGGTGGGTACACAAATGGACAGAAGGTGGGCACATGAAAAAA
TACAACTGTACTTCTAGAAGCCGGTGGGTATACCCACTTTTGTCTATTTCCGTTTTATGTACATTGTCTTGACTTGATTGGTTGTTGTTGTTTAGTCGGCTGGGATCTCGCGCACATCGT
AAGTTAAAGCAGCGGTACAGCCTACCATTGGTTTCTTTTCAACTTTGTTTTTGGCTTGTATTGTTGTTGTCATTGCTGCTGTTGTTTTTGTTCTTTCTACCGGTTCTCTGGGGTCTTGCG
CACAGAGCAACTTGACGGGGCAGACTGCCAATAGACTTCATCAACGTTGTTGTTCTTGATGTTGTTGTTTGAACCAGTCGACCGGTACCTCGCGCACAGCGGGAGCAAACACAGCGGAAG
CAGAAACTATTTAGCCCCCTGTCCCCCCATCCCCAACCCTCCCTCCATTCACGTCCAAACCCCAACCATTGCTTCTCGGTCCTTGCAAAGACTGTGTTTCTGTAAAATTGACCCGTATAG
TCGGAAGACGGTACATGTAGTGACAATGTCAATGTCAACGAAACGCCAACGACACGATCGTCAAAACCCTCCCTGTGCTTTCCTTTCTGTCGATCAACGATCTGGTTGGGTGTACTAGGG

GGCGAGGAAGGAGTGTCGAAGGTCCTCAAGCTCTTGCACGACGAGCTTGTCATGGCCCTTCAGCTCACGGGCTGCACACGGGTCAGCTCGGCCTCCCGCTCCATGGTCACCCACCAGACG
TCGTACTACTCCAAGCTCTAA

Retrieve as FASTA  
CDS
Send to BLAST
.........1.........2.........3.........4.........5.........6.........7.........8.........9.........0.........1.........2
ATGAACGATCCCATGACCGCGCCGGTGGGCCCCGCCCAGGATACCTCCAAGGCTCCGGAAGCCGCAGGTTGGGAGCCTGTCAACGTGCGCGAGTTCGAGCGCCATGCCCAGCTCATGCTG
TCCAAGAATGCCTTCGATTACTACGCCAGCGGGGCCAACGACATGGTCACCTTGAGGGAAAACAG
GGCGGCGTTCAACCGGCTCCGGCTTCGGCCCCGCATTCTCCGAGACGTGAGCATG
GTGGACACGTCTACCTCGGTCCTGGGGCAGAAGATCTCTTCTCCTATCTGCATCGCCCCCACCG
CGATGCAGCGCATGGCCCACGACTCGGGCGAGTGTGCCACCGCCGGCGCCGCAGCC
AAAGCGGGCGCCCTAATGACCCTTAGCTCTTGGTCCACCACCTCTCTCGAGGACGTCGCCAAGGCCGGGGGGCCCGGGGGTGCTCGCTGGTTCCAGCTCTACGTCTACAAG
GACCGCAAG
ATCACGGAGCAGCTGGTCAAGCGCGCCCTCGCGGCCGGATACACCGCCCTCGCCGTCACCGTCGACACGCCCGTCCTCGGCAGGCGGGAGGCCGACATGCGCAACCGCTTCAAGCTGCCG
GAGCACCTGACGATGGGCAACTTCGTGTCTGCTGGCGGCGCCCACGCCTCGGGGACTAAGGACGGGGGGAACGACTCG
GGCCTCGCAGCGTACGTGGCAAGCCTTATCGACAGGACGTTG
GACTGGAACGACATCAAGTGGTTGCGCACGATCTGCGGCAGCATGAAG
ATCGTGGTGAAGGGCGTTATGACCGCGGAAGACGCTGCGGAATCTGTTCGCCAGGGCGTGGACGGCATCTGG
GTCTCCAACCACGGTGCTCGGCAGCTGGACACCACGCCGGCTACCATCGAGGTTCTCCCAGAG
GTGGTCGCGGCGGTGAGCGGTCGGTGCGAGATTTACCTGGACGGAGGCATCTGCCGG
GGCACGGACGTGTTCAAGGCGCTCGCTCTCGGCGCCAAGGCTGTCTTCATCGGACGACCGGTTCTTTGGGGTCTCGCGCACAGC
GGCGAGGAAGGAGTGTCGAAGGTCCTCAAGCTCTTG
CACGACGAGCTTGTCATGGCCCTTCAGCTCACGGGCTGCACACGGGTCAGCTCGGCCTCCCGCTCCATGGTCACCCACCAGACGTCGTACTACTCCAAGCTCTAA

Retrieve as FASTA