Entry information : EhuxCP01
Entry ID 8393
Creation 2011-05-17 (Christophe Dunand)
Last sequence changes 2011-05-17 (Christophe Dunand)
Sequence status complete
Reviewer Not yet reviewed
Last annotation changes 2015-08-04 (Christophe Dunand)
Peroxidase information: EhuxCP01
Name EhuxCP01
Class Catalase peroxidase    [Orthogroup: CP002]
Taxonomy Eukaryota Noelaerhabdaceae Emiliania
Organism Emiliania huxleyi    [TaxId: 2903 ]
Cellular localisation N/D
Tissue type N/D
Inducer N/D
Repressor N/D
Best BLASTp hits
Perox score E-value EhuxCP01
start..stop
S start..stop
BnaCP 520 2.03e-176 55..655 66..766
PcapCP02 497 2.91e-168 46..647 44..683
PsojCP03 494 1.97e-167 38..647 39..683
PsojCP 491 4.47e-166 56..646 54..682
Gene structure Fichierperl './assets/cgi-bin/draw_exon.pl' '8393' 'join(191662..191825,192555..193071,193229..193521,193652..193787,193867..194550,194654..194890)' Exons
ExonStart..EndSize ExonStart..EndSize ExonStart..EndSize ExonStart..EndSize
N° 1 191662..191825 162 N° 2 192555..193071 515 N° 3 193229..193521 291 N° 4 193652..193787 134
N° 5 193867..194550 682 N° 6 194654..194890 235  
join(191662..191825,192555..193071,193229..193521,193652..193787,193867..194550, 194654..194890)


exon

Literature and cross-references EhuxCP01
DNA ref. JGI genome:   scaffold_41 (191662..194890)
EST ref. GenBank:   GE184550.1 [3' end]
Protein sequence: EhuxCP01
Sequence Properties
first value : protein
second value (mature protein)
Length (aa):   %s   676
PWM (Da):   %s   70919.35  
PI (pH):   %s   6.82
Sequence
Send to BLAST
Send to Peroxiscan
.........1.........2.........3.........4.........5.........6.........7.........8.........9.........0.........1.........2
MQAAASTAPAILAAARRHAPARKRGCRQRSRAACVVIRAAPGPLFDVTAAPGAGRVDWHAVKSDLLDLFSRSQSVWPADYGTYAPFFVRLAWHNSGSYRVADGRGGAEGGRQRFDPERSW
QDNTNLDKARNLLAPLKLKHGPALSWGDLFILSGTVAIEAMGGPVLGFCGGRIDDQDGSDSLALGPSPEQRAFHPCPVNGNCTPPLGAVAVGETKSLIYVDPEGHM
VPDPVRSAADVRWT
FAGMAMNDTETVALIGGGHAFGKTHGACPAGAGALPRDDPANPWPGLCGSGRAADAFTSGIEGPWTTRPTRWDNEYFHNVLRA
GLAAGEEPLLAPSAFPEQTSAPTQRVVMMTSDLALAY
DPSFRAAA
AFASSPAALDSAFAAAWYKLVTRDMGPHARCVGPWVPPPQPFQFPLPKPPPEPVDYAEVWAALHKGADPLLPAADAAALAYGCAASYRYTDNQGGCDGARIRFSPEVDFPAN
AGALEAIGKLRGVKKAFGARLSWADLITLAGHAAVAPHSGPMPFCGGRTDATDGAGSRWLQPLSAPSSAAYFKEHPTGLTAREVLAVWAAASRAVAATRRPPADFFRWVALATPTE
VAEM
PADLRVVHDDAELREAALDFASDPAAHKALLAKAWPRLMSLGRFDGPASSPCDDPAATLMLPPTDAPAPRASSH*

Retrieve as FASTA  
Remarks Complete sequence from genomic (5 introns) and 3 ESTs. Incorrect prediction from JGI. Also detected in scafold_829
DNA
Send to BLAST
.........1.........2.........3.........4.........5.........6.........7.........8.........9.........0.........1.........2
ATGCAAGCTGCCGCCTCCACCGCACCAGCGATACTGGCCGCGGCTCGCCGCCACGCGCCGGCTCGGAAGAGGGGCTGTCGGCAGCGCAGCAGAGCGGCATGCGTCGTCATTCGAGCAGCA
CCAGGTCCCCTCTTCGACGTGACGGCCGCTCCCGGAGCCGGCCG
CGGTGCGCGCCCGACCCACGCCCCCTCTCGCACCCCCCCGATCCCCCTGCGCGGCGCCCCCTCCACTGCGCGCCGG
CTCGAGCGCCTATTGCCAAATCGCGACTTTCGAGCGCAGTGCGTGCGCCCCACACGTCACGCGAGCGCAGGAGAGTTTGCGGCAGAGCAGTACCGTGTACTAGCCGGAGGCATCGCGCCG
AAAACCCCGCGCATTCCGCCCCAACAGAGGCTGCAGGAGGCAGATGGCCAATAGCGACGATCGTTACGAAATGTTGCGGGACCGGGAGCTCTCCAAGCTGTCTTGGGTGGTCAAAGAATT
TTTCCCCGTATCGGACCGAAGGTCCTCTTCAGTGTGGAAGAAGCGACAAGGGAAAGAACGACGAGAAGGGGGCGACCGCGAGGTGGCATGAAGGTCGCAGCGAGAATCGGCGAGAGACTG
CGAAGGCGTATAAGGACAACCCGGACCCAAGACGCACAAGCTTTGTTATTATTGTTAGTTAGTTAGGAAGGTGAAGAACACAGCGGACTGGGGGTGGACGCGAGAAGCTTGAGAGGCCTT
CCCGAGAAATCGCGCGCGCCGCGAAGGCACTCGGAACACTGAACGACCACCCCCATTCATGCTCCCTCTACTCTCGCCCGTCGCCGCCTCGCTTTTGTATCGCCTCCCTCGACGGCACGC
CTCTCCCGCTCAACCACCCACCGCTGCGCTCTCGTCCGCCGGCGGACGCGGCGAGTGTGGACTGGCACGCCGTCAAGAGCGACCTGCTCGACCTCTTCTCGCGCTCTCAGTCCGTCTGGC
CGGCGGACTACGGCACCTACGCTCCCTTTTTTGTGCGCCTCGCTTGGCACAATAGTGGCAGCTACCGCGTCGCCGATGGGCGCGGCGGCGCCGAGGGCGGCCGGCAGCGCTTCGACCCTG
AGCGCTCTTGGCAGGACAACACCAATTTGGATAAGGCGAGGAACCTGCTCGCCCCGCTCAAGCTCAAGCACGGCCCCGCGCTGTCCTGGGGCGACTTGTTCATCCTCTCTGGCACCGTCG
CGATCGAGGCGATGGGCGGGCCCGTCCTCGGCTTTTGCGGCGGCAGGATCGATGACCAGGACGGCTCCGACTCCCTCGCGCTCGGCCCTTCGCCCGAGCAGCGAGCCTTCCACCCGTGCC
CCGTCAACGGCAACTGCACGCCGCCGCTCGGCGCGGTTGCCGTGGGCGAAACTAAGAGCCTGATCTACGTCGACCCGGAGGGCCACATGGGG
GGGTGAGCTCTTGCCATGCTAGCTCTTG
CCATGCTAGCTGTGTCACCTTGGAGCCGTCATGGACACGCTGCCCCTTCGCACTCGCTGCCCCTCCCGACTCGCTGCTCCTCCGACACCGCCTCAAATCCCGCCCGACACCGCCTCGAAT
CCCGCCTCCAGGTGCCGGACCCGGTCCGGTCGGCGGCCGACGTGCGATGGACCTTTGCGGGCATGGCCATGAACGACACAGAGACGGTTGCGCTCATCGGCGGCGGGCACGCGTTTGGCA
AGACGCACGGAGCGTGCCCTGCCGGCGCGGGCGCTCTCCCGCGCGACGACCCGGCCAACCCGTGGCCGGGCCTCTGCGGCTCTGGCCGCGCCGCCGACGCCTTCACCTCGGGCATCGAGG
GCCCGTGGACCACGCGGCCAACGCGGTGGGACAACGAGTACTTCCACAACGTGCTGCGTGCTGG
GGGTGGGAGAAGCACCGCGGGCCGGGAGGCGCGTGGCAGTGGAGGAGCGCGCCGCC
TGGCGAGGACCCTCCCCGCTCGCCTGGCCCTCGGCTCCTCTCGGCTCACCTCCACGCGGTTGTGCCATCTCGGCAGGCTCGCGGCAGGAGAGGAGCCGCTGCTTGCCCCGTCCGCCTTCC
CGGAGCAGACCAGCGCGCCGACCCAGCGCGTGGTGATGATGACGTCGGATTTGGCCCTGGCGTATGACCCGTCCTTCCGCGCCGCCGCCGAG
AGGCGTCTCGCCGCGAGCCCGAGGGTCC
GAGATCACACACTAATGGGTACGCTGCACACCCGAGAGATCGCTCGAGACTAGGCGTTCGCCTCGAGCCCCGCGGCTCTCGACAGCGCCTTTGCGGCGGCCTGGTACAAGCTGGTGACGC
GGGACATGGGGCCGCACGCGCGCTGTGTCGGGCCATGGGTGCCACCCCCGCAGCCTTTCCAGTTCCCGCTGCCCAAGCCGCCGCCGGAGCCGGTCGATTATGCGGAGGTCTGGGCAGCAT
TGCACAAAGGCGCGGACCCGCTCCTGCCGGCTGCCGACGCGGCCGCGCTCGCGTACGGTTGCGCGGCCTCGTATCGGTACACAGACAACCAGGGCGGCTGCGACGGCGCTCGCATCCGCT
TCTCCCCCGAGGTCGACTTTCCTGCGAACGCTGGCGCACTGGAGGCGATTGGCAAGCTTCGCGGCGTCAAGAAGGCGTTCGGTGCGCGCCTCTCGTGGGCCGATCTCATCACCCTCGCGG
GACACGCCGCCGTCGCGCCTCACTCGGGTCCGATGCCGTTTTGCGGCGGCCGCACCGACGCAACCGACGGCGCGGGCTCGCGCTGGCTTCAGCCGCTGTCGGCGCCGTCGTCGGCCGCCT
ACTTCAAGGAGCACCCGACGGGCCTCACCGCCAGGGAGGTGCTCGCTGTGTGGGCGGCAGCAAGCCGCGCGGTGGCGGCGACGCGCCGGCCGCCGGCCGATTTCTTCCGGTGGGTGGCCC
TCGCTACGCCCACCGAG
AGGTGCGCGCGGCCTGGTGGCCCCCGTCACGCACCCCCCGTCACCGCCCGTTTTTCCCTCGCCCGCCCCCCCCCCCCCCTTCACCGCCCACTCCTCCCTCGCC
AGGTGGCAGAGATGCCCGCCGATCTGCGCGTGGTCCACGACGACGCGGAGCTGCGCGAGGCGGCCCTCGACTTCGCCTCCGACCCGGCCGCGCACAAGGCTCTGCTCGCGAAGGCGTGGC
CGCGACTGATGTCTCTTGGCCGATTCGACGGCCCCGCCTCCAGCCCGTGCGACGACCCTGCGGCGACTCTGATGCTTCCGCCGACCGACGCGCCCGCGCCGCGCGCCTCATCCCACTAA

Retrieve as FASTA  
CDS
Send to BLAST
.........1.........2.........3.........4.........5.........6.........7.........8.........9.........0.........1.........2
ATGCAAGCTGCCGCCTCCACCGCACCAGCGATACTGGCCGCGGCTCGCCGCCACGCGCCGGCTCGGAAGAGGGGCTGTCGGCAGCGCAGCAGAGCGGCATGCGTCGTCATTCGAGCAGCA
CCAGGTCCCCTCTTCGACGTGACGGCCGCTCCCGGAGCCGGCCG
TGTGGACTGGCACGCCGTCAAGAGCGACCTGCTCGACCTCTTCTCGCGCTCTCAGTCCGTCTGGCCGGCGGACTAC
GGCACCTACGCTCCCTTTTTTGTGCGCCTCGCTTGGCACAATAGTGGCAGCTACCGCGTCGCCGATGGGCGCGGCGGCGCCGAGGGCGGCCGGCAGCGCTTCGACCCTGAGCGCTCTTGG
CAGGACAACACCAATTTGGATAAGGCGAGGAACCTGCTCGCCCCGCTCAAGCTCAAGCACGGCCCCGCGCTGTCCTGGGGCGACTTGTTCATCCTCTCTGGCACCGTCGCGATCGAGGCG
ATGGGCGGGCCCGTCCTCGGCTTTTGCGGCGGCAGGATCGATGACCAGGACGGCTCCGACTCCCTCGCGCTCGGCCCTTCGCCCGAGCAGCGAGCCTTCCACCCGTGCCCCGTCAACGGC
AACTGCACGCCGCCGCTCGGCGCGGTTGCCGTGGGCGAAACTAAGAGCCTGATCTACGTCGACCCGGAGGGCCACATGGGG
GTGCCGGACCCGGTCCGGTCGGCGGCCGACGTGCGATGG
ACCTTTGCGGGCATGGCCATGAACGACACAGAGACGGTTGCGCTCATCGGCGGCGGGCACGCGTTTGGCAAGACGCACGGAGCGTGCCCTGCCGGCGCGGGCGCTCTCCCGCGCGACGAC
CCGGCCAACCCGTGGCCGGGCCTCTGCGGCTCTGGCCGCGCCGCCGACGCCTTCACCTCGGGCATCGAGGGCCCGTGGACCACGCGGCCAACGCGGTGGGACAACGAGTACTTCCACAAC
GTGCTGCGTGCTGG
GCTCGCGGCAGGAGAGGAGCCGCTGCTTGCCCCGTCCGCCTTCCCGGAGCAGACCAGCGCGCCGACCCAGCGCGTGGTGATGATGACGTCGGATTTGGCCCTGGCG
TATGACCCGTCCTTCCGCGCCGCCGCCGAG
GCGTTCGCCTCGAGCCCCGCGGCTCTCGACAGCGCCTTTGCGGCGGCCTGGTACAAGCTGGTGACGCGGGACATGGGGCCGCACGCGCGC
TGTGTCGGGCCATGGGTGCCACCCCCGCAGCCTTTCCAGTTCCCGCTGCCCAAGCCGCCGCCGGAGCCGGTCGATTATGCGGAGGTCTGGGCAGCATTGCACAAAGGCGCGGACCCGCTC
CTGCCGGCTGCCGACGCGGCCGCGCTCGCGTACGGTTGCGCGGCCTCGTATCGGTACACAGACAACCAGGGCGGCTGCGACGGCGCTCGCATCCGCTTCTCCCCCGAGGTCGACTTTCCT
GCGAACGCTGGCGCACTGGAGGCGATTGGCAAGCTTCGCGGCGTCAAGAAGGCGTTCGGTGCGCGCCTCTCGTGGGCCGATCTCATCACCCTCGCGGGACACGCCGCCGTCGCGCCTCAC
TCGGGTCCGATGCCGTTTTGCGGCGGCCGCACCGACGCAACCGACGGCGCGGGCTCGCGCTGGCTTCAGCCGCTGTCGGCGCCGTCGTCGGCCGCCTACTTCAAGGAGCACCCGACGGGC
CTCACCGCCAGGGAGGTGCTCGCTGTGTGGGCGGCAGCAAGCCGCGCGGTGGCGGCGACGCGCCGGCCGCCGGCCGATTTCTTCCGGTGGGTGGCCCTCGCTACGCCCACCGAG
GTGGCA
GAGATGCCCGCCGATCTGCGCGTGGTCCACGACGACGCGGAGCTGCGCGAGGCGGCCCTCGACTTCGCCTCCGACCCGGCCGCGCACAAGGCTCTGCTCGCGAAGGCGTGGCCGCGACTG
ATGTCTCTTGGCCGATTCGACGGCCCCGCCTCCAGCCCGTGCGACGACCCTGCGGCGACTCTGATGCTTCCGCCGACCGACGCGCCCGCGCCGCGCGCCTCATCCCACTAA

Retrieve as FASTA