Entry information : MpPrx35(Mapoly0486s0001 / Mp3g18420)
Entry ID 16990
Creation 2021-03-03 (Christophe Dunand)
Last sequence changes 2021-03-03 (Christophe Dunand)
Sequence status partial
Reviewer Not yet reviewed
Last annotation changes 2021-04-20 (Christophe Dunand)
Peroxidase information: MpPrx35(Mapoly0486s0001 / Mp3g18420)
Name MpPrx35(Mapoly0486s0001 / Mp3g18420)
Class Class III peroxidase     [Orthogroup: Prx311]*
Taxonomy Eukaryota Viridiplantae Streptophyta Marchantiaceae Marchantia
Organism Marchantia polymorpha    [TaxId: 3197]
Cellular localisation N/D
Tissue type N/D
Inducer N/D
Repressor N/D
Best BLASTp hits
Perox        Score   E-value     MpPrx35 start..stop   S start..stop
MpPrx[P]34   511     0           1..263                1..263
MpalPrx38    474     4.08e-171   22..263               23..264
MpPrx38      457     2.87e-164   1..262                1..263
MpalPrx05    313     4.79e-107   26..260               47..281
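As a reading aid for the hit table above, here is a minimal sketch that parses each row into a record and reports how many residues of MpPrx35 and of the subject each alignment covers. The column meanings (hit name, score, E-value, MpPrx35 range, subject range) are taken from the table header; the names `Hit`, `parse_hit` and `span` are hypothetical helpers, not part of any database tool.

```python
from typing import NamedTuple, Tuple


class Hit(NamedTuple):
    name: str
    score: int
    evalue: float
    query_range: Tuple[int, int]    # start..stop on MpPrx35
    subject_range: Tuple[int, int]  # start..stop on the hit (S)


def parse_range(text: str) -> Tuple[int, int]:
    """Turn '22..263' into (22, 263)."""
    start, stop = text.split("..")
    return int(start), int(stop)


def parse_hit(line: str) -> Hit:
    """Parse one whitespace-separated row of the hit table."""
    name, score, evalue, query, subject = line.split()
    return Hit(name, int(score), float(evalue),
               parse_range(query), parse_range(subject))


def span(rng: Tuple[int, int]) -> int:
    """Number of residues covered by an inclusive start..stop range."""
    return rng[1] - rng[0] + 1


# Rows copied from the table above.
ROWS = """
MpPrx[P]34 511 0 1..263 1..263
MpalPrx38 474 4.08e-171 22..263 23..264
MpPrx38 457 2.87e-164 1..262 1..263
MpalPrx05 313 4.79e-107 26..260 47..281
"""

hits = [parse_hit(line) for line in ROWS.strip().splitlines()]

for hit in hits:
    print(hit.name, span(hit.query_range), span(hit.subject_range))
```

The ranges are treated as inclusive, so `1..263` counts as 263 residues.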
Literature and cross-references MpPrx35(Mapoly0486s0001 / Mp3g18420)
Protein sequence: MpPrx35(Mapoly0486s0001 / Mp3g18420)
Sequence Properties
First value: protein; second value (in parentheses): mature protein
Length (aa): 274 (250)
PWM (Da): 29121.76 (26747.5)
PI (pH): 7.34 (7.34)
Peptide Signal: cut: 25 range: 25-274
Sequence
*.........1 .........2 .........3 .........4 .........5 .........6 .........7 .........8 .........9 .........0 .........1 .........2
MALTTAMLSS VVLALFSLVA VGQAQLSTTF YRNSCPTALT IVRQQVDSIL ASNPNLAGGL QRLHFHDCFV RGCDASVLLF SNSSTNQEKD AQPNQNSLRG FQQIDQVKSA LEAACPGVVS
CADILAIVAR DATVKAGGQT WPVLLGRRDG TVSLASEALA ALPSPLLNLG QLIQNFAAVG LNASDMIVLS GAHTFGRARC GPVLNRLYNF NGVNGQTDPS MDSTLAANLK KQCPPNDVTT
IITMDSSPNV FDRTYFKQVI NREDFSPQML HSPP
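
The length figures in the Sequence Properties can be checked against the sequence itself. The sketch below assumes that "cut: 25 range: 25-274" means the mature protein starts at residue 25 (1-based) and runs to the end of the partial sequence; that reading is an interpretation of the entry, not something the entry spells out. The variable names are illustrative only.

```python
# Protein sequence copied from this entry; spaces are display grouping only.
PROTEIN = (
    "MALTTAMLSS VVLALFSLVA VGQAQLSTTF YRNSCPTALT IVRQQVDSIL ASNPNLAGGL "
    "QRLHFHDCFV RGCDASVLLF SNSSTNQEKD AQPNQNSLRG FQQIDQVKSA LEAACPGVVS "
    "CADILAIVAR DATVKAGGQT WPVLLGRRDG TVSLASEALA ALPSPLLNLG QLIQNFAAVG "
    "LNASDMIVLS GAHTFGRARC GPVLNRLYNF NGVNGQTDPS MDSTLAANLK KQCPPNDVTT "
    "IITMDSSPNV FDRTYFKQVI NREDFSPQML HSPP"
).replace(" ", "")

# Assumption: "cut: 25" is the 1-based position of the first mature residue.
CUT = 25

signal_peptide = PROTEIN[:CUT - 1]  # residues 1..24
mature = PROTEIN[CUT - 1:]          # residues 25..274

print(len(PROTEIN))     # 274
print(len(mature))      # 250
print(signal_peptide)   # MALTTAMLSSVVLALFSLVAVGQA
```

Under that assumption the two lengths agree with the "274 (250)" reported in the entry.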

Remarks Partial sequence from genomic DNA; the end of the last exon is missing. The Phytozome prediction is incorrect. The partial sequence is repeated twice. A specific peptide was found in the cell wall proteome.
DNA
.........1 .........2 .........3 .........4 .........5 .........6 .........7 .........8 .........9 .........0 .........1 .........2
ATGGCGTTGA CAACAGCAAT GCTCTCGTCC GTCGTCTTGG CCCTCTTCTC GTTGGTAGCT GTCGGGCAGG CGCAGCTGTC CACAACCTTC TACAGAAATT CCTGCCCTAC AGCATTGACC
ATAGTACGAC AACAGGTTGA CAGCATCCTT GCATCAAATC CAAATCTTGC CGGGGGCCTG CAGCGCCTTC ATTTCCATGA CTGCTTCGTC CGAGTAAGTT TACCTGATAT TTCTAGCCTC
ACATCAAGGA TTGACATTCT AATTATGATA AATTTTCTAC ATCATCTGAT ATGTTGACTC CATGAAGCAT ATATCCATTG AAAATTTTGT AGCGGAATTC CTGATACAGC TAACGGGCTT
CAGAATCCCA GCAGAATGTC TTCTATAGTG TTCACTTGTC ATGAGATCTT GTCAGTCAGC CTCATGGCTT GAAAACACAT TTCTTTCTAC ATTATTTCAA CGGCTTGCCT CCAGGGGAGC
ATTGTTCTAG AAGCTAGAAA AGATCTTTCT GATGCCAAAC ACCAGCTGAC ATGAGGGCGA TTTTATCTTT TACCATTCAG GGCTGTGATG CATCCGTACT TTTGTTCTCG AACTCCTCAA
CGAATCAAGA GAAGGACGCC CAACCAAATC AAAACTCCTT GAGAGGATTC CAGCAAATTG ATCAAGTTAA ATCCGCCCTT GAGGCCGCAT GCCCGGGTGT TGTCTCTTGC GCAGATATAC
TGGCCATCGT AGCACGTGAT GCCACTGTCA AGGTATGTCT AACATAGGAA AAGTTCCTAT TTCACGATGA ATTTCATCGA TTCTTTGCTC ACTGAAGGCT CCATTGAGCA CGAATAGAAT
CAGACTTAGT GATGGGATCT AATCAGCTGA TTTGCGTTCT TCAAATCGAT CCTGGGATTA AGAACTACCC CTTGTAATGA CACAACAGGC AGGAGGTCAA ACCTGGCCCG TCCTTCTCGG
ACGCAGGGAT GGGACGGTGT CATTAGCGTC GGAGGCCCTG GCGGCGCTCC CATCCCCACT CCTGAACCTC GGTCAGCTGA TTCAAAATTT TGCTGCTGTT GGACTCAACG CATCGGACAT
GATCGTCCTC TCAGGTAATC AACTTGTCTC CAGCTCTTCA GTCCAGTCCA GTTTACATTC AGCGTCATAG TGATCTGAGG GAGAACGACA GCACTCAGAA TGGACTTGTA TATTGCGCCG
GAGTGGCGTG AAGAGGCAGA TCTCTTTGTT TTCTCTTCGT GGCGGTTTCT GTAATCTGGG ACAGCATGCA GAGTAGAAAC CGTGTGCAGT GCAGAGTGGT CATCAAAAGC ATGAGACTCA
ATCTCAGACG TGTTTCACTT GTCCGTTCGA TCAGGAGCGC ACACCTTCGG CAGAGCACGC TGCGGTCCCG TTCTCAACCG ATTGTACAAC TTCAATGGAG TAAATGGACA AACAGACCCG
AGCATGGACT CGACTTTGGC AGCCAACTTG AAGAAGCAGT GCCCTCCCAA CGATGTCACG ACGATTATCA CCATGGATTC TTCTCCAAAC GTGTTCGACC GCACGTACTT CAAGCAGGTC
ATAAACAGGG AGGACTTTTC ACCTCAGATG CTGCACTCGC CACCG
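
The DNA shown here is genomic and considerably longer than the 3 × 274 nucleotides needed to encode the protein, so it presumably includes non-coding stretches and cannot simply be translated end to end. The start of the sequence, however, can be checked directly: the sketch below translates the first 120 nucleotides with the standard genetic code and compares the result with the first 40 residues of the protein sequence in this entry. The helper `translate` and the constant names are hypothetical, written for this illustration only.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def translate(dna: str) -> str:
    """Translate DNA in frame 1; a trailing partial codon is ignored."""
    dna = dna.replace(" ", "").upper()
    usable = len(dna) - len(dna) % 3
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, usable, 3))


# First 120 nucleotides of the genomic sequence in this entry.
DNA_START = (
    "ATGGCGTTGA CAACAGCAAT GCTCTCGTCC GTCGTCTTGG CCCTCTTCTC GTTGGTAGCT "
    "GTCGGGCAGG CGCAGCTGTC CACAACCTTC TACAGAAATT CCTGCCCTAC AGCATTGACC"
)

# First 40 residues of the protein sequence in this entry.
PROTEIN_START = "MALTTAMLSSVVLALFSLVAVGQAQLSTTFYRNSCPTALT"

print(translate(DNA_START))
print(translate(DNA_START) == PROTEIN_START)  # True
```

Extending the check beyond this opening stretch would require exon coordinates, which the entry does not list.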
