Entry information: PtroDuOx02 (DUOX2)
Entry ID 5853
Creation 2007-10-10 (Marcel Zamocky)
Last sequence changes 2010-11-23 (Myriam Duval (Scipio))
Sequence status complete
Reviewer Christophe Dunand
Last annotation changes 2016-02-11 (Christophe Dunand)
Peroxidase information: PtroDuOx02 (DUOX2)
Name (synonym) PtroDuOx02 (DUOX2)
Class Dual oxidase [Orthogroup: DuOx001]
Taxonomy Eukaryota Metazoa Chordata Mammalia Hominidae Pan
Organism Pan troglodytes (chimpanzee) [TaxId: 9598]
Cellular localisation N/D
Tissue type N/D
Inducer N/D
Repressor N/D
Best BLASTp hits
Perox       Score   E-value   PtroDuOx02 start..stop   S start..stop
HsDuOx02    3152    0         1..1547                  1..1548
CfaDuOx02   2756    0         1..1547                  1..1571
BtDuOx02    2714    0         1..1547                  1..1545
SscDuOx02   2702    0         1..1547                  1..1545
Gene structure
Exons
Exon  Start..End     Size    Exon  Start..End     Size    Exon  Start..End     Size    Exon  Start..End     Size
1     20196..20269   72      2     19844..19929   84      3     19411..19575   163     4     18625..18812   186
5     18244..18442   197     6     17482..17648   165     7     17020..17080   59      8     16798..16894   95
9     16259..16349   89      10    15893..15995   101     11    15161..15324   162     12    14419..14594   174
13    13707..13825   117     14    13194..13331   136     15    12890..13003   112     16    12487..12689   201
17    12005..12190   184     18    10502..10727   224     19    10322..10415   92      20    8160..8356     195
21    7572..7641     68      22    7122..7205     82      23    6417..6595     177     24    6029..6259     229
25    5736..5835     98      26    4657..4706     48      27    4262..4389     126     28    3886..4039     152
29    1679..1911     231     30    1288..1446     157     31    787..942       154     32    414..542       127
33    1..123         121
complement(join(1..123,414..542,787..942,1288..1446,1679..1911,3886..4039,4262..4389,4657..4706,5736..5835,6029..6259,6417..6595,7122..7205,7572..7641,8160..8356,10322..10415,10502..10727,12005..12190,12487..12689,12890..13003,13194..13331,13707..13825,14419..14594,15161..15324,15893..15995,16259..16349,16798..16894,17020..17080,17482..17648,18244..18442,18625..18812,19411..19575,19844..19929,20196..20269))
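The location string can be checked against the protein length with a few lines of Python. This is a minimal sketch, not part of the database record: the helper name `parse_exons` is hypothetical, and it assumes the coordinates are 1-based and inclusive, as in GenBank feature locations. Under that assumption the exon lengths sum to 4644 nt, which is 3 x (1547 + 1), i.e. 1547 codons plus a stop codon. The Size column of the exon table appears to report end - start - 1, so each listed value is two smaller than the inclusive length computed here.

```python
import re

# Location string copied from the gene structure entry
# (assumed 1-based, inclusive coordinates).
LOCATION = (
    "complement(join(1..123,414..542,787..942,1288..1446,1679..1911,"
    "3886..4039,4262..4389,4657..4706,5736..5835,6029..6259,6417..6595,"
    "7122..7205,7572..7641,8160..8356,10322..10415,10502..10727,"
    "12005..12190,12487..12689,12890..13003,13194..13331,13707..13825,"
    "14419..14594,15161..15324,15893..15995,16259..16349,16798..16894,"
    "17020..17080,17482..17648,18244..18442,18625..18812,19411..19575,"
    "19844..19929,20196..20269))"
)


def parse_exons(location):
    """Return (start, end) pairs in transcript order (hypothetical helper)."""
    pairs = [(int(a), int(b)) for a, b in re.findall(r"(\d+)\.\.(\d+)", location)]
    # complement(...) places the gene on the reverse strand, so the first
    # exon of the transcript is the last interval listed in the join.
    if location.startswith("complement("):
        pairs.reverse()
    return pairs


exons = parse_exons(LOCATION)
lengths = [end - start + 1 for start, end in exons]  # inclusive lengths

print(len(exons))         # number of exons
print(exons[0], lengths[0])  # first exon of the transcript
print(sum(lengths))       # total coding length in nucleotides
```

Reversing the interval list for a `complement(...)` location reproduces the exon numbering used in the table, where exon 1 is 20196..20269.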



Literature and cross-references PtroDuOx02 (DUOX2)
Literature unpublished
Protein ref. GenBank: XP_510367.2
DNA ref. GenBank: NC_006482.2 (42285634..42265366)
mRNA ref. GenBank: XM_510367
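The chromosomal range in the DNA reference can be related to the local coordinates of the gene structure. The sketch below is illustrative only: it checks that the listed range spans 20269 bases, the same as the largest coordinate in the exon location, and then maps local positions to chromosomal ones under the assumption (not stated in the record) that local position 1 corresponds to the lower chromosomal coordinate. The function name `local_to_chrom` is hypothetical.

```python
# Values copied from the "DNA ref." line and the gene structure location.
CHROM_START, CHROM_END = 42285634, 42265366  # listed in descending order
LOCAL_MAX = 20269                            # largest coordinate in the join

# Inclusive length of the chromosomal range.
span = abs(CHROM_START - CHROM_END) + 1


def local_to_chrom(pos, lowest=min(CHROM_START, CHROM_END)):
    """Assumed mapping: local position 1 sits at the lowest chromosomal coordinate."""
    return lowest + pos - 1


print(span == LOCAL_MAX)
# Chromosomal coordinates of the first exon of the transcript (20196..20269 locally).
print(local_to_chrom(20196), local_to_chrom(20269))
```

The descending order of the listed range is consistent with the `complement(...)` location, since both indicate a gene on the reverse strand.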
Protein sequence: PtroDuOx02 (DUOX2)
Sequence Properties (first value: protein; second value, in parentheses: mature protein)
Length (aa): 1547 (1525)
PWM (Da): 175116.97 (172797.7)
Transmb domain: o599-621i1040-1062o1077-1099i1146-1168o1183-1205i1218-1240o (o577-599i1018-1040o1055-1077i1124-1146o1161-1183i1196-1218o)
PI (pH): 8.2 (8.20)
Peptide Signal: cut: 23 range: 23-1547
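The mature-protein values follow from the signal peptide cut site by simple arithmetic. As a hedged sketch (the function name `shift_topology` is hypothetical, and reading "cut: 23" as "the mature protein starts at residue 23" is an assumption that happens to reproduce the listed numbers), removing the first 22 residues gives the mature length and shifts every transmembrane boundary by the same offset:

```python
import re

# Values copied from the sequence properties.
FULL_TOPOLOGY = "o599-621i1040-1062o1077-1099i1146-1168o1183-1205i1218-1240o"
FULL_LENGTH = 1547
CLEAVAGE_SITE = 23  # assumed: first residue of the mature protein


def shift_topology(topology, offset):
    """Subtract `offset` from every residue number in a topology string."""
    return re.sub(r"\d+", lambda m: str(int(m.group()) - offset), topology)


signal_length = CLEAVAGE_SITE - 1              # residues removed with the signal peptide
mature_length = FULL_LENGTH - signal_length
mature_topology = shift_topology(FULL_TOPOLOGY, signal_length)

print(mature_length)
print(mature_topology)
```

The `o` and `i` letters are left untouched; they presumably mark outside and inside loops, as in TMHMM-style topology strings.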
Sequence
.........1.........2.........3.........4.........5.........6.........7.........8.........9.........0.........1.........2
MLRARPEALMLLGALLTGSLDPSGNQDALSLPWEVQRYDGWFNNLRHHERGAGCRLQRRVPANYADGVYQALEEPQLPNPRRLSNAATRGIAGLPSLHNRTVLGVFFGYHVLSDVVSVETPGCPAEFLNIRIPPGDPVFDPDQR
GDVVLPFQRSRWDPETGRSPSNPRD
ANQVTGWLDGSAIYGSSHSWSDALRSFSGGQLASGPDPAFPRDSQNPLLMWAAPRPPPPGQNGPRGPFGAERGNREPFLQALGLLWFRYHNLWAQ
RLARQHPDWEDEELFQHARKRVIATY
NIAVYEWLPSFLQKTLPEYTGYRPFLDPSISPEFVVASEQFFSTMVPPGVYMRNASCHFRKVLNKGFQSSQALRVCNNYWIRENPNLNSTQEVN
ELLLGMASQISELEDNIVVEDL
DYWPGPGKFSRTDYVASSIQRGRDMGLPSYSQALLAFGLDIPRNWSDLNPNVDPQVLEATAALYNQDLSQLELLLGGLLESHGDPGPLFSAIVLDQFV
RLRDGDRYWFENTR
GLFSKKEIEDIRNTTLRDVLVAVINIDPSALQPNVFVWHGAPCPQPKQLTTDGLPQCAPLTVLDFFEGSSPGFAITIIALCCLPLVSLLLSGVVAYFRGRERKKLQ
KKVKESVKKEAAKDGVP
AMEWPGPKERSSPIIIQLLSDRCLQVLNRRLTVLRVVQLQPLQQVNLILSNNRGCRTLLLKIPKEYDLVLLFSSEEERGAFVQQLRDFCMRWALGLHVAEMSE
KELFRKAVTKQQRERILEIFFRHLFA
QVLDINQADAGTLPLDSCQKVREALTCELSRAEFAESLGLKPQDMFVESMFSLADKDGNGYLSFREFLDILVVFMGSPEDKSRLMFTMYDLDEN
GFLSKDEFFTMM
RSFIEISNNCLSKAQLAEVVESMFRESGFQDKEELTWEDFHFMLRDHDSELRFTQLCVKGGGGGGGIRDIFKQNISCRVSFITRTPGERSHPQGLGPPAPEAPELGGP
GLKKRFGK
KAAVPTPRLYTEALQEKMQRGFLAQKLQQYKRFVENYRRHIVCVAIFSAICVGVFADRAYYGFASPPSDIAQTTLVGIILSRGTAASVSFMFSYILLTMCRNLITFLRETFL
NRYVPFDAAVDFHRWIAMAAVVLA
ILHSAGHAVNVYIFSVSPLSLLACVFPNVFVNDGSKLPQKFYWWFFQTVGMTGVLLLLVLAIMYVFASHHFRRRSFRGFWLTHHLYILLYALIIHG
SYALIQLPTFHIYFLVPAIIYGGDKLVSLSRKKVEISVVKAELLPS
GVTYLQFQRPQGFEYKSGQWVRIACLALGTTEYHPFTLTSAPHEDTLSLHIRAVGPWTTRLREIYSSPKGNACA
GYP
LYLDGPFGEGHQEWHKFEVSVLVGGGIGVTPFASILKDLVFKSSLGSQMLCKKIYFIWVTRTQRQFEWLADIIREVEENDHQDLVSVHIYVTQLAEKFDLRTTMLYICERHFQKVLN
RSLFTGLRSITHFGRPPFEPFFNSLQEVHPQ
VRKIGVFSCGPPGMTKNVEKACQLVNRQDRAHFMHHYENF*

Remarks Complete sequence from genomic DNA (chromosome 15, 30 introns). No EST. Isolate="Yerkes chimp pedigree #C0471 (Clint)". Isoform 2.
DNA
.........1.........2.........3.........4.........5.........6.........7.........8.........9.........0.........1.........2
ATGCTCCGTGCAAGACCAGAGGCACTGATGCTCCTGGGAGCTCTTCTGACTGGATCCCTGGATCCATCGGGCAAGTATCAGGCTCCTCTAGCGGCGGGGTGTTCCCCAGGATCCTCTGGG
AGGTGGGGCGGGGAGAGGTGCGGCAAGCGGCTCCCTGAGACTGGAAGGTCATTTCGCCGTGCAGCTCAGCGGGATGGGAAACTTCCCATTGCGGCCCGACACTTGGGTCCGGTTAGGGGC
GCTCCGCGAGCTGGGGAAGGACTGGCCAAGGCCTTCGTTGCTCGGGAGGGGTAGCTGGGAGCGTAGTGCTGAGGAGGCCCTTCTCTGTGCCCACAGGCAGTCTCAGGACGCACTCTCACT
GCCCTGGGAAGTGCAGCGCTATGACGGCTGGTTTAACAACCTGAGGCACCACGAGCGTGGTGCTGTTG
GTGCGTTCTGGGGGCCCGGGCGTGCTGGGGCCGTGGCTCGCGAAGGGCCGGG
GCGCGAAAGGCCCTGAGCGGGGAATCTGCGGGGAACACGCGCCCAGCAGCTCCGCTGCCTACACAGCTCAATCTTATGCGCTCCCGGGGCCAAGAGACCCTTGAGGGAAGGTTCTGTCAG
TGAAGTGGGATGGGGGTTGAGGGAGGCTTAGGGCGAGGTTTGGGGGATCCTAGGGGATGGAGTGCTTAGACAGAGCCCCGCTCCCTGCCTCCGCAGGCGCTGCCGGTTGCAGCGCCGCGT
ACCAGCCAATTACGCCGACGGTGTGTATCAGGCTCTGGAGGAGCCGCAGCTGCCCAACCCGCGCCGGCTCAGCAACGCAGCCACGCGGGGCATAGCCGGCCTGCCGTCGCTCCACAACCG
CACCGTACTGGGGGTCTTCTTTG
GTGAGGGCAAAGGGGGAGACCAGTGGGGTTGATGTGGCGCTCTGCTCAGCCTGGGGGAGGGGCCAGATCCCGTCTGCGAGTCCACAGGAGACTCATC
CGACTCCCAACCACTTCCTCTCTAAGCAGCACTTCGAGACTGCCTTCATCTCGGAGAGATTTTGGGATGTTGATACAGAGATATTTGCTCTGTATCTAACCTTTCTCTTACGCCTTACTC
CAAACTAGGGGTGTCACTGGACCCCCATTATAGCTCTTGCGAACTGAGCTCCCCAGCCACCGCTCTCCTCACCGTGTGTTTGTAACCATTTTACCTCCCCCTAGCCCAGAGGGAGGAGGA
CTGACTTGGGGTACCCCTACCTAAATTATATCATTTTGATTCTCACAACAGCTTTATGAATTGAGTAGGAAGGGAACTCACTATAGTCTTACTTTGCAGATTAGAAAATTGAGGCTCCCT
GGGGCTAACGTGCAGAGCTGGTGGCGGAGCTGGCACTGGGACCTCTGTCTTTTGGCTCCTGAGGACACTTGGAGGCCGCCCTGGCCGTGGGGAGGGCGCAATACGGACGGTTTGTCACCT
ATTTGCGCCCCATGCCCGCAGGCGCTACCATGTTCTTTCCGACGTGGTGAGCGTGGAAACGCCCGGCTGCCCCGCCGAGTTCCTCAACATCCGCATCCCACCTGGAGACCCCGTGTTCGA
CCCCGACCAGCGCGGGGACGTGGTGCTGCCCTTCCAGAGGAGCCGCTGGGACCCCGAGACCGGACGGAGTCCCAGCAACCCCCGGGACCTG
GTGAGGCGGGGAAGGCGGCGGGAAGGGGC
CGCACCCCAGCCAGGTGGGGCCTGGGCTTCGGGCCTGGCAGGGCCTGGAGGGGAGAGGCGCCCACTCCCCAGCCGCGGACACCCGCCGGGCCCCGGCCTTCCCTGGCCCGCCGCCGCCCA
TCGGCCCGGGCTCACCCGCCGCGTGCCCCGCAGGCGCCAACCAGGTGACGGGCTGGCTGGACGGCAGCGCCATCTATGGCTCCTCGCACTCCTGGAGCGACGCGCTGCGGAGCTTCTCGG
GGGGACAGCTGGCGTCGGGGCCCGACCCCGCTTTCCCCCGAGACTCGCAGAACCCCCTGCTCATGTGGGCGGCGCCCCGACCCCCGCCACCGGGGCAGAACGGGCCCCGGGGGC
GTACGG
GAGGCCACAGNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNN
NNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNN
NNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNN
NNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNCTGTA
CGGTGAGCCCCCAGGGACGGGACGGGGCCGGCTGGGGGTCTGCGAGTGTGGACTCCCCCGATCACGCTACCGCTCATCTCCTCCCCCGCGCCCCCCACGTCGGATGCAGCCCCTTCGGGG
CAGAGAGAGGGAACCGGGAACCCTTCCTGCAGGCGCTGGGCCTGCTCTGGTTCCGCTACCACAACCTGTGGGCGCAGAGGCTGGCCCGCCAGCACCCAGACTGGGAGGACGAGGAGCTGT
TCCAGCACGCACGCAAGAGGGTCATCGCCACCTACCAG
GTCAGCCGTCCGCGCCCCGCGACGTCCTCCCTTCCGCGTGCAAGCCCACGGGAGACTCCGCTGCCCCACGGAGCTCCCCATC
TGTGGACAACCGCCACCCAGAAACCCCTCCCCAGACAGCCGAGGTCTAGGGAAGCCCCTGTAAATGATAGGGAGGCACGCGCTGTTTATAGGAGAAATCTGGCTGGTGATGACTATTTAT
CACCTCCCCACCCCCCACTCCCTCAAATCCCCTGGTTCCTTATGGGGACAGGCCTCACACTGCTCCTGTCTGAGTTGCTTCTCCCATGATTGACCCTTCCTGGTCCTCATCTCCACACGG
AAGCCGTCCTTGGGCTCAGACCCTTCCAGGCCCCACGCCATCCAACTCGTGCCTCCCCTCGCCCCTCTCTGCCCCTCAGAAAACATCGCTGTGTATGAGTGGCTGCCCAGCTTCCTGCAG
AAAACACTCCCGGAGTATACAG
GTGAGGGAGCGGGGAAGGAGGACACCTGTGCGGAGAATCCTGCGGGGAAGGAGACAGGTGCCTGTGATGGGAGGATGTGGAGGCAAGGAGCCTGTCTC
CCCATCATCACCGTCTCCTTCCTGCAGGAGATACCGTCCTTTCCTAGACCCCAGCATCTCCCCGGAATTTGTGGTGGCCTCTGAGCAGTTCTTCTCTACCATGGTGCCCCCTGGTGTCTA
CATGAG
GTGAGGGAGGGGTTGGCAGAGAGGGGGCACCACACTAAGAAAGGTGCAGAATGAGCTGCCTTGGGGGCTGGGGCCTTTCACACTCCTTCGCAGTTTCACTAGAGAAGGGGAAGC
AAAAATTTGGGGCCCTGAAACAGAACCCTGGGGTAAGATGTGTAGGCTTAGTAGGGAAATCTCCCCAGCTCTCCTAAGGGCTGAAATTTGGTGGCTGGGTGTAGGATTTGTCTAGCAGCT
GGGTCATTCCCTTCCCTCCTCTCCCCACCCTACCTGGACTAGGAGCGCACTCTATCTTCAGTAAACGCACATCGCCAAATCTCTGCCGTGTTCAAGGAAGTTCCTGGGCCACTGCTCAAT
CCTAGTGAACCCCCACTGAGTCCCTCAGCCCACTCAACCCCATCTTTGATTCTTCTCCAAATTCCCTCACCACATCCTTTGTTCTCAATTTCAGAAAAATGCCAGCTGTCATTTCCGGAA
GGTCCTGAACAAGGGTTTTCAAAGCTCCCAAGCTCTCAGGGTCTGCAACAACTACTGGATTCGGGAG
GTCAGACTGGGGTCAGGGTCAGGGGAAGATGGGTCAAGGTCAGTCTCTTCACA
CAGGCTGGGAAAAGCAACAATTCCAGTTCTTGAGTGTTGCTGCCCCAGGTTCATGGGAGATGAAGGGTAGAGGAAATTATCCGGGGACAACAGCTCAAGAGAGTCTGGGACTGGCCAAGG
GCCCTTTGTCCTGGGGTACTAAAGTGGTCCAGGCTGAGAGAGACCGAGTTTTGGGTAGAGGCCTATCTTGAGTGCATTGTTTACTTCCAGAAAATCCCAATCTGAACAGTACCCAGGAGG
TGAATGAGCTGCTGCTGGGAATGGCCTCCCAGATTTCGGAGCTGGAGGACAACATCGTGGTTGAAGATCTGAGGG
GTGAGCTCAGAGCCAGAAGGGGTGGATGGTAAGGGACCAGGAAGC
CTGAGGATCCCTCTGGGTTCATCAGTAGCAGACCTAGGGCACTCACGATGCAGGAATACAGATACAAACACAGACTTCAAGGAGCCATAAGAAGAAAATGAACTTTGAACTTTTTTTTTT
TTGCTTAATTTACACTTTTGTGTTGCTTTTATTTTAAATTAACTATGTCTGAGTATTGGGGAGGGGTGGTACCACAATCTCTTTGGTGCTTAGTTAGGTCTCTAAAGGTTTTCATTCAGC
CCTGTCTTGAAACTAGGGAGGCATAGGACAGGGAATTTACTGCTTGGTGACTAGAACAGTCTTGAGTCTTAGAGGAAGGGTCTTACTGGAAAAACTGGCTCTGATCACACATGGTGACAT
TGGCCTTGCAGCAAGGCAAGGTCAGCATGGGCACAGATCTCATGTGGATCACTGGGGGTAGCCAGGAGGGAAGAACCGTAGTGCCAATAGTCAGATACAAGGCTGCAGGGCAAAAAAGGA
TGGAGAGGACAAAGCCCATAGTCTAGTCTTCTCTCCTCTTCAGATATTACTGGCCTGGCCCTGGCAAATTCTCCCGTACAGACTATGTGGCCAGCAGCATCCAACGTGGCCGAGATATGG
GGCTGCCCAGCTATAGCCAGGCCCTGCTGGCCTTTGGGCTGGACATCCCAAGGAACTGGAGTGATCTCAACCCTAATGTGGACCCCCAG
GTCAGGAATAGTAATGATAATAATGGCAGCT
AAAGCTTTCCTATGGCAATACTGTTCCAAACCCTTTACATATGTTGACTCATTTAATCTACATAATAATATTGTAAGGTATGTATAATAATTTTCCCCATTTTACTGATGACATAGTTGG
TAAATCATAGAGGAGGGACTTAAATTCAGCCATCTGATTCCAGAATATATTCCTAACCACTGCATTGTACCATTCCTGCAGGGTGGCTTCTGGGTTGGGTGCCATTGTCCTGTTGCTGCA
GGGTCCCACCCCAAGGGCTGTGTGCCCCTGAAGCTGCTAATCATTGAGGTCAGGCAGGCCGGTGATGGTCACAGGATATGGTCCTAGGGCACCTGACCGTGGTCTTACCGTGGGTAGGGA
CACACTGATCCTTCCACCAGACTTGTCCTGCCTGAGGGGGCTTGCCTAAGAAGAGGAAATCAGGCCTGAGCAGCAAGCCAGGCAGCGGCTGGGGTCCTGTGTCTGAGGGATGGGGCAACA
GTGGCTGCCCTCCGCAGCAAACATACGCTCACCCCTTACTCCTGTGGTGCCCCAGGTGTGCTGGAGGCCACAGCTGCCCTGTACAACCAGGACCTATCCCAGCTAGAGCTGCTCCTTGGG
GGGCTCCTGGAGAGCCATGGGGACCCTGGACCCCTGTTCAGTGCCATTGTCCTCGACCAGTTTGTACGGCTGCGGGATGGTGACCGCTACTGGTTTGAGAACACCAGGAATGG
GTAAGGC
TTGCCTGGGCCCCCACCTCAGACTGCTCCTCAGCCTGAGCCCCAGACCCTCTGTCTGGCCTTAGACAGCCCCTATGAGCCCTTGATTCCCAGTCAGCCCACCACACCCTTCCCAACCCCT
CTGGGTCTCTCTTTTTTTCTTCTCTTTTCTTTTCTTTTTCTTTTTCTTTTTTCTTTTCTTTTTTTTTTTTTTTTTTTTTGAGGCAGAGTCTAGCTTTGTCACCCAGGCTGGAGTGCAGTG
GCGTGATCTTGGCTCAATGCACCCTCCACCTCCCAGGTTCAAGTGATTCTCCTGCATCAGCCTCCCGAGTAGCTAGGATTAGAGGCATGCACCAACATGCCCAGCTAATTTTTTTTAAAA
ATATTTTTAGTAGAGATGGGGTTTCACCATGCTGGTCAGGCTGGTCTCGAACTCCTGACCTCAAGTGATCCACTCGCCTTGGCCTCCCAAAGTGTTGGGATTACAGGCATGAGCCACTGC
ACCCAGCCCCTCTGGGTCTCTTTTCTCACCTGGGTCCTTGGGCCTGGGGTTGCTGGAGGCCTGCATCCCCTTCCCATCCCAGTGACTTCTACTTCCTCCAACTTAGGCGCTGTTCTCCAA
GAAGGAGATTGAAGACATCCGAAATACCACCCTGCGGGACGTGCTGGTCGCTGTTATCAACATTGACCCCAGTGCCCTGCAGCCCAATGTCTTTGTCTGGCATAAAG
GTGAGTGCCCTGG
GAGAACACAAGTGAGTGACAGTGGCCAGAGAAGGATCAAGATTGAGGGTGCGGGGGAATCACTTGGTGCTGTCCAGGGAGGCAGGCACCTTCTGTGTTGGGCTAGGAGGCCTGCATTTGG
CTGGCTCCCACAGCAGGGACCTCAACTAGCACACAAGCTACACCCTACAGTCAAGAAGGGGTGGATGGGGTAGATGCCAAGAGACAGGAAATGAATGGGGACTTTTTGAGGGAGACAGTT
TCAGGGAGGTGGGCCTGGGGAAGACAGATGATATCTTGGTCCTTTATAGGATAGAGGGGAAAGAGGTCTGGCCACATAGCGGGATCCTCAGACTTTGAGGTCTTCCCTGCCCTCTCCCTC
AGGTGTGCACCCTGCCCTCAACCTAAGCAGCTCACAACTGACGGCCTGCCCCAGTGCGCACCCCTGACTGTGCTTGACTTCTTTGAAGGCAGCAGCCCTGGTTTTGCCATCACCATCATT
GCTCTCTGCTGCCTTCCCTTAG
GTGAGCTCTTAGGCAGCCTCTCTGCAGACTGGCCCTGCCCCTCATTTCCTGCTGGCCTGAGGGGCTGGCTATTTGGTACCGTTTGAGACCAGGCTCAA
GGAACCTCTGGAAGGGAGGGGCCATAGCCTAAGCCACAGTGAAGCTCTAGGCGAGGGGCTCCCTCCTCACTGTTCCTTCTGATCCGCTTCAGTGTGAGTCTGCTTCTCTCTGGAGTGGTG
GCCTATTTCCGGGGCCGAGAACGCAAGAAGCTACAAAAGAAAGTCAAAGAGAGCGTGAAGAAGGAAGCAGCCAAAGATGGAGTGCCAG
GTGAGAAGGGGCTGGGCAGAGGAGGGAGGAGG
GACGGAGGAGGGGAGAGACAGGAGTCTGGGAAAAAGAACCAAGTTACAGAGTGAGAGGAAAGCCAAGGCACCTTTAGGGCGCCTGCTCAGACTCACAGAGGAATTGACCTGAAGGCGGGG
ACCTGGGGACATCTGCTGAACTACCCGGCCCAATTATCCCTTCCCCAGCGCGATGGAGTGGCCAGGCCCCAAGGAGAGGAGCAGTCCCATCATCATCCAGCTGCTGTCAGACAGGTGTCT
GCAGGTCCTGAACAGGCGTCTCACTGTGCTCCGTGTGGTCCAGCTGCAGCCTCTGCAGCAGGTCAACCTCATCCTGTCCAACAACCGAGGATGCCGCACCCTGCTGCTCAAGATCCCTAA
GGAGTACGACCTG
GTATGGCTCGTCCTGCCTCCCCAGCCTGGGCTGCCCTCACACGACTCCATTATCACAAGCGAGGCCACCCTATCCTCAGCTACAGAGCTCACCTATGACAGCTGATG
CTGGGGAGAGGGGCTCCTTTCAGAGGCCCCCAGACACAACCTGACCCCCTTCGTCCACACACCTGGCCCCAGCCTGGATGGATGGGGAGGAGTTTTCTCTCCTCCCCTCAACCCAAGATC
CATTGAGGGGAGGCTGAAGCAGAAGGTCCAGCGAGCTCCCTGCATCAGTGCCGCCTTCCTCCCACCCAGGTGTGCTGCTGTTTAGTTCTGAAGAGGAACGGGGCGCCTTTGTGCAGCAGC
TACGGGACTTCTGCATGCGCTGGGCTCTGGGCCTCCATGTGGCTGAGATGAGTGAGAAGGAGCTATTTAGGAAGGCTGTGACAAAGCAGCAGCGGGAACGCATCCTGGAGATCTTCTTCA
GACACCTTTTTGCTCAG
GTGCCATGATCTGTGCCTTTTGGAGATGGGTCCAGCCCCAGAAATGGAGGAAACCTGGGCTGCATAGAACGCCCCTGTGGGTGAACTAAGCTTCCGCTCTATG
GCCTGGAGAGAAATAGCCTTGTTTGAATCCTGGCATTGCCACTTTACTTAGCTCTGTGACCTTAGGCAAGTCACATTATCGCTGTTCTGTATCTCTGTTTCCTCATCTATAAAACAGTGA
TGAAAACTGTATCCATCCCATTGCATTGTTGTGAGGATTCGGTGAGATCGTCTACATGAGTGGTACACAGAGGTTGGCCTCTGGACCCGAGAGCATCAGCCTCACCTGGGAACATGTTAG
AAATGCACCTACCCAGTTAGACTGAACCAGGAACTCTGTGGGTGGGGCCTGGCAATCTGTGTTTTAACAAGCTCCCCAGATGATTCGGATACACTCTAATGTTTGAAAAACATTGTTTTA
TGTACAGTGCTTATTGGCCCAAGTGCCAGGTATGTTGCAGACATTTAACAAACGGTTGTGGCCAGGCGCAGTGGCTCATGCCTATAATCCCAGCACTTTGGGAGGCTGAGGCGGGCAGAT
CACCTAAGGTCAGGAGTTCGAGACTAGCATGGCCAACATGGTGAAACCCCATCTCTACTAAAAATACAAAAATTAGCTGGACATGGTGGCTCACACCTGTAATCCCAGCTACTTGGGAGG
CTAAGGCAGGAGAATCGCTTGAACCCGGGTGGCAGAGGTTGCAGTGAGCCAAGATCATGCCACTGCACTCCAGCCTGGGTGACAGAGCAAGACTCCATCTCAAAAAAACAAACAAAAAAC
AATAAATGGTTGTTACATGTGACTTTTAAACTTTTTGTGCAATGGGCAAATCATAGGCACATGGCAGCCTTATCTGAATTGGCAAGAGAGCACAGCCCCAGCACCTTCCTGCCTGTCTAC
CACCATGTCTCTACATCTTCTGTCCCCAGTATAGGCTCTCTCACTTTCCATTCCCCTTAACTTTGCCCTTCCCCTTCCCTACCCCAGCACCATGCCCACTGCATGAAGTTCCCGGTTCTT
GGGCCCAGGGAGAAATGGGCAGGCTGCTAGAGATTTGATTCCCCCGTCTATAGGACAACAGAGGCCCCAGTCAGTATATCTAAGGATCAGGAGAACCATCAGAGTTTAGCCTTTCTGATT
TGGACTTTGGGGAGATATGAAGGGTCACTGAACTGCTTCCAGCATAGGCTTCACCTCCTTCTCTTTCCCTCCCTCTGCTGCTGCCCGAGTGCAGGTGTGCTGGACATCAACCAGGCTGAT
GCAGGGACCCTGCCCCTGGACTCCTGCCAGAAGGTGCGGGAGGCCCTGACCTGCGAGCTGAGCAGGGCCGAGTTTGCTGAGTCCCTGGGCCTCAAGCCCCAGGACATGTTTGTGGAGTCC
ATGTTCTCTCTGGCTGACAAGGATGGCAATGGCTACCTGTCCTTCCGAGAGTTCCTGGACATCCTGGTGGTCTTCATGAAAG
GTAGGGGGCTGGGAGGTGGCAGGCTATCCAAGAATCCA
GGGGTCTTTCAGCAAGGAGATGACCTGCATTCCCTTTTTTCTTCCCAGGCGCTCCCCAGAGGATAAGTCCCGTCTAATGTTTACCATGTATGACCTGGATGAGAATGGCTTCCTCTCCAA
GGACGAATTCTTCACCATGATGAG
GTATGGGGTGTGCCTTTCTAATCCTGAGATTTCCTGGTGTGTTTCAAACAGGAAAACAGGTCCAGTCAGAGGAGGGCTGGCAAACAGGCTATGCGG
TCATCTGTGCTGAGAGGTGGCCTAAACACTACATCCTAAACTCTCAGAGCATCCACCTTCAAATATTTACCTGACTGGCTCCTGCCTCTGGGAGAGTCTCTGTCTGGACTGTCAACACCA
GCCAGAAAAGCCTCCCTAGTTAAAAAACGAAAAAAAAAAAAACCCAACACCAATATGGCCAACGACAAAAATCCACTAATCCCTTTTGGATGCCCTTGGATCTTTGTGAACTATTTTACG
GCATGCCCAACACCGTGCTCTACCCAGTGAAACAGTGAATGGATGACCTTGGTTGCTGCTCCGATATTCATCACCATGATAGTCAGGAAGAGAAACGTGGAAGGCTTCTCCATATTCCAA
CCATTCTTTCTTCCTGCATCTTAAGCCCTTTCTGGTTTTGTTGTGCCGGTAAAAAAAACAGCTTTGTGCTTCTCATTCCTGAAGACAATGAATGCGTCAGTAACACAGCTCCCCTCCATG
CCATAAGGGCAGGGCTTGTTCCCTGTTGAATCCAGCTTCTCTACACTGTGATTGGCACAGGGCAGGCATTTCATACATAACTAACTGAGTAAGACAAAATGAAATAAGTGAGCAAATGAA
TACAAAGTATAGATGTAACAGCCCACATTATTTCAATTTTTCTATCCTGTTAAGCCTTAACATTGCTTTAAGCATTCCCCTTAACTGCTACATTCCTCATATGGTCCAGATACCCCAACT
GGACAAGGGCTTCTGAAAGGGCAAAGCTATTGTAGTCTGTACCTCACTGGGTATGTCATTGCAGGCCCAGCCCGAGGTGAGGCTCAAGGGAATCTAGGAGAGGGTCTCTGCCTCCAAGGG
CTGAGTTCCCACCTTCTTTTTTTGGTTTGTTTGTTTTGAGACGGAGTCACTCTGTCACCCAAGCTGGAGTGCAGTGCCACGATCTCGGCTCACTGCAACCTCCACCGCCCAGGTTCAAGT
GATTCTCCTGCCTCAGCCTCCCAAGTAGCTGGAATTACAGGCATGTGCCACCACGTCCAGCTAATTTTTGTATTTTTAGTAGAGATGGGGTTTCACCATGTTGGCCAGGCTAGTCTCAAA
CTCCTGACCTCAGGGGATCAGCCCGCCTCAGCCTCCTAAAGTGCTAGGGTTACAGGCGTGAGCCACCACACCTGGTGAATTCCCACCTTCTTGTCCTGTAACTCAGCGTGTATCTTGCTT
ACTGTCGGTGGGACGATGTTTTTAATGTGATGGCTGTGCTTGCTGTGTTTTATGGGCCCAGCATGGCACAGCATTGCTGCTGGACCTACACAAACTTGCATAGTCTGTCTTTTCTGTCCT
GAGGCACAACCTATGAATAGAGCTTGGCTACTGCAGGTGCCACTGTGGGTGCTATCAGGTTGGGCATGGAGACGCTCCCGCCTGTGCCCCGGGGTGTTGGCACAAGGAAGCAGCAGCATT
GCAGCTAGTTCCCCTCCCTGGCACCTGGCTGCCTGGTGCCCCCACTGGACTATGAAAGGGGGAACCCAGGGGTGATATGGGAGGCATCAACAGAAGAGAGTGGACAGAGAGCCTGCCACG
AGAGAGGGCCATGCACACCCTGGACACACCCCTGCACTCAGTGGACTATCTTCTCAGTTGTAGATGCCCCCTGTTTGAGGGCTGCTTTCTCTGATTGGTCAAGGTCACTTTCAATTCTGT
TCTGCCTTTTGAGTCCATGGCTACCCCACCCAGCTTAGTGTCAGCTGCAGGCTGGATGAGCACCTTCTCAGTGCCATTTCCCGGGTCACTGGTAACCACATTAGATAATCTGGGGCCTGC
CCCACCCTGCACCTACCCAAGCCTGACCTTGCTGGGTGACAGGCTGCTGTGTCTCTGGTCCTCCTCCAGATATCCTTCATCGAGATCTCCAACAACTGCCTGTCCAAGGCCCAGCTGGCC
GAGGTGGTGGAGTCCATGTTCCGGGAGTCGGGATTCCAGGACAAGGAGGAGCTGACATGGGAGGATTTTCACTTCATGCTGCGGGACCATGACAGCGAGCTCCGCTTCACGCAGCTCTGT
GTCAAAGGTGGAGGTGGAGGTGGAAGTG
GTGAGTGTGTGAGGAATGGTTGGTGTCAGGGAGGGGGGGCGTGTCCTCAAAATGAGAGTTCCTGGTGGGCAGGACCCAGGTCTTACTCTTTT
CTGAGTCCTTGGTACCTAGTATAGAACCAAGCACGTGTAATTGGGATGGCACATGTAGGTGTGCAAGTTGTCTAATGCACAATGATACCCACTGAGGTCAAGGAGCAGGTTGAAATCTAT
CCTACACACTGCTCAAAGCCACCGGCATTGGTCTAAGATATATCTGGCCAGAGGAAGAGGCATTGTTTCTTTTACACAAAGGACCTGCAGGGTTGCATCCACCTAAGAGGACGTCCCCTT
TCTTGTGCAAAGTTGCCACATTGTCTGCCCTGTGTACAGTGAGTGCTTAGCCTAGGGAATGAGGGAAACAGGACTCGAGTCAGAGATCTGGACATGACTTTCCCAGAAGGGAGGAGGGCA
CAGTCTCCCATCCTACCCCACTGCCCTTGTGAGGAAGCCAGTCCTGCCTCTTGTTCTCTTCTCTAGGTGTATTAGAGATATCTTTAAACAAAACATCAGCTGTCGAGTCTCGTTCATCAC
TCGGACACCTGGGGAGCG
GTGAGCAGGAATGGGGCTCTGGCAGGTTGGCCTGGCTGAGCCCCCTGCAGAGAAATGAAGGGAGTAGGACTGGCTGATCAGCCCCTGGTAAAATCAGGCATT
TGCCCTTTGAAAGTAGCTCTTGGTAGCACAAACATTCCAGCTGCCTCTCTCACCCTATGCTGCTCGGATGCTTGGCTCTCTCCCTGCTGCTCCAGGCCAGAATCATTCTACAAAACAAAT
CATGAGATCCTATTAATTCATTTTGCGCCTTGCCCCCTGCCTGGTACCAGGAGCCACTCCCTACCTCTACCCCATGCTCTGCCCAGGGAGTTGTTCTCCTGGCTGCAAAGACAAGGGGAG
AACAGCCCCATTTCTTTTTCTCAGCTCTCCCACCCCCAGGGACTGGGGCCCCCTGCCCCAGAAGCCCCAGAGCTGGGAGGCCCTGGGCTGAAGAAGAGGTTTGGCAAAAAGTGAGTGTCT
CCCAAATCCCTGGGCCCAAAGAGACATGGAGAGAAGTCTTAGGGTCCCTAGGCCCCACCCGCATATCCTTGACATATAAGCGACCATCCCGAGTCTCATTCCATTTGTTCCTGACCTGAC
TGAGAGGTTATAGTGTTGAATGACTTTTCATCCTCTTCCAACCTCTGCACCCCATTCTTCAGGCAAGGGTCCTGGCTCAACAGGATGATAGTAAGGAGTCTCCTGGCTCCTGCCTGCTTT
GGGCACAGCCTTGAGGCCTGTGCTGGGATCAGGAAGAAAGAAGGTTAAAACAGACAGGAGGAGGGGGAGTAGCAGGGAGACAGTGAGTGGGTGGATGGAGCAAAGACAGAAGTAAAGGGT
TGGAGGAGGAAGAAGCCCCCAGATTGCTTTTTTCCCTTCATCTGTTGTGGCCCATCCTGATGCCTGCCAGATCCCCAGGTCACCTTTCATGGAGTGGCATTAAGGGAAGGCCAGAGGGCC
CTTACCACACTGCTGCCCTGCCTCCCCTTGCTATAGGGGGCAGCAGTGCCCACTCCCCGGCTGTACACAGAGGCGCTGCAAGAGAAGATGCAGCGAGGCTTCCTAGCCCAAAAGCTGCAG
CAGTACAAGCGCTTCGTGGAGAACTACCGGAGGCACATCGTGTGTGTGGCAATCTTCTCGGCCATCTGTGTTGGCGTGTTTGCAGATCGTGCTTACT
GTAAGAGTTCCAGGCTGTGGGCA
GTGGGTAGGGAGCAGGCTCTGACCCTTGGAGAGGAGTGGAAAGCCCTCTGATCCTAAGAGTCTGCATGGGAGAGCCCAGGGCTCGGGACCTTGGCCACCTGTGCCAAGCTGATGTAACCT
CACTCCGGCCCCAGACACTATGGCTTTGCCTCGCCACCCTCGGACATTGCACAGACCACCCTCGTGGGCATCATCCTGTCACGAGGCACGGCGGCCAGCGTCTCCTTCATGTTCTCTTAT
ATCTTGCTCACCATGTGCCGCAACCTCATAACCTTCCTGCGAGAGACTTTCCTCAACCGCTATGTGCCTTTTGATGCCGCAGTGGACTTCCACCGCTGGATCGCCATGGCTGCTGTTGTC
CTGGCCA
GTACGTGACTCCCAGGCTTCTTCCTCTTTGCTTCTTCCTCTTTGCTGCAGCACCCTGGGTCTAGTTGGGGGAAACAGTGGGGAGATGGAACTCCTTATACCTCCATCTCTCCT
CCCTATGCCTCCTCTCTCCCTCAGGATCGGAGGTAGAGTCTGTCCTGGTTGGCATCTCTAACAGGGTCTGTCTCTTCCAGTTTTTTGCACAGTGCTGGCCACGCAGTCAATGTCTACATC
TTCTCAGTCAGCCCACTCAGCCTGCTGGCCTGTGTATTCCCCAACGTCTTTGTGAATGATGG
GTCAGTTCTGGGGAAGGTTTCTCCTGGGACTCATAGGGTGGGCCCAAGGGTATAATAG
AAAAAGAAATAGGCAGGGCACAATGGCTCACACGTGTAAGCCCAACACTTTGGAAGTCTGAGGCAGGAGGATTGCTTGAGGCCAGGAGTTCCAGACAAGCCTGAATAACAAAGTGAGACT
CCATCTGTACAAAAAGTAAAAAGATTAGCGGGGTATTGTGGTACACATCTGTAGTCCCAGCTATTCAGGAGGCTGAGGCAGGAGGATTGCTTGAGCCCAGGAGTTTTAGGTTGCAGTGAG
CCATGATCAGTACCACTGCATTCCAGCCTGGGTGACAGAGCAAGACTCTGTCTTGAAAGAAAAAAAAAAAGAAAAGAAAAAGAAAAAAGAAATAGACAGTCCCAGACACTCAGCAAGAAG
CTCAGTGCTAAGCTAGTCCCCTGGGGGAAGCTGAAAGGTAAATTTCTTGCCCTCAAGAAGGAAGCTGGCTGTGATTGGCCAGGAAAGGTGTTTGGGAGATGAAGTCAGAGACTCTTTCTC
ATAGATACCTGACACACAGCTTGCCATCTCTGCCTTCTATCCATTCACTGGACAGACATTTATGCAGCATGTCCCATGTGTCTACCCAGATGCCAGAGTAGGGATGTCAAGATACATGCA
GTCATTCAACAACTACTTACCGAGATTTGCTGTGTGCCTAGTATTGTTCTAAGCCTAGAGATAGAGCAGTGAATGAAACAAAAATCCCTGCCCTCATGGAGCTTACAGAATAATGAACCA
GGGACTCAAGGAAGCAGGGGTCAACCACAGTAAGAGAGTTCAGGATAACACAGAAGGAAAAATTGCTGAGTACAGAGGGTGGAACCATGGAAGCCTGGGTGCCGAGTTCCTTGGAGGGAT
CCCTGGCCAACTGGGTGGAGGCCCCATACCCTCAGCAGTCAGGGCCAGCAGGAAAGGAGTCATGCTGTGTTGTGACAGTGCTGGAGCCCCTCCTGCCCCAGCCAGAGGCTCAAGGTGTCT
TTGCCCCCCAGGTGTCCAAGCTTCCCCAGAAGTTCTATTGGTGGTTCTTCCAGACCGTCCCAGGTAGGAAACGTGGGACCTGGGGGTTCTGTCTGAGGACTTCTGCTTTTGTCTCCATCT
CTCTGTGAATACTCACTTGTCTATACTGGCCATGGGGTCTGTTTCTAGCCTTCAGGACAAGCCCCAGCTCCAATCCCTCCAGGCAGGCTTTTCTGGGGGCCCTGGAGGGATGAGAGAGGA
AGGAGGGAAGCATAGGGGAATGTGCTGTTGTCTTTTCAGCCCAAGCTGAAGTCCTGAGACTACTCACTGGCCCTGTCCTCCTGCCCCCAGGTGTATGACAGGTGTGCTTCTGCTCCTGGT
CCTGGCCATCATGTATGTCTTCGCCTCCCACCACTTCCGCCGCCGCAGCTTCCGGGGCTTCTGGCTGACCCACCACCTCTACATCCTGCTCTATGCCCTG
GTGAGGGACTTCCCTGGGCC
AGCCCATGGAACAGGGAGCTCAGGATGGGACAGGAAGGTGAAAGAGGGAGAATTGGATCCAAGATCTCAGAATGAGACTTTGAGATTTAAGACCCCAGACCTCAGCCCTATCTCCCCAGC
ACAGGCCTAGTGCCTGGGCAAGAGGGGATGCCGGGCAGGGGCCTGGCTGGGCCTGAGTTGTACTAACTGGCCGTGTCTCCAGCTCTCATCATCCACGGCAGCTATGCTCTGATCCAGCTG
CCCACTTTCCACATCTACTTCCTGGTCCCGGCAATCATCTATGGAGGTGACAAGCTGGTGAGCCTGAGCCGGAAGAAGGTGGAGATCAGCGTGGTGAAGGCGGAGCTGCTGCCCTCAG
GT
ATCAGGCCCAGCCTGATCTGGGTCGGGAGCGACAGAGGCCAAATCTTCAGACATGAGGAGACAGTTCACCAGGCCTCCTGACCCCATCACTGCCTCTGACTCTGTCTCCAAAAACAACAA
GAAAAAAAAACCACTCTCGGGGGTTCCTGAAGGTTTCCTGATAGAAGTAGACCCAGAAAGGGCTGTGCTTAGCTCCCAGGAGACTTGCAGATGGTGAGAAGTGACCCTGAGAAGAGGTGG
CTGAACGTGCATACAGAGGGGTTTGAGGATGGGAAAGGGCCCCACATGTGTGGCTTGGGTGCAGGGGAAGTGCAGGGCAGGAAGCCACATATGCCTGTCCCATTCCTTCTCTCAGAGACA
GGCAAATGCCCAGATTGCCAGTCTGTTGTTGAGAGTCAGTGTTGGCCAAAGTCGGGATTGGTATCATCATAGAGGGTGGCAAAAGATGATTTTATGTATCGCAGGACTGTCTAACCTTCA
GGACTCATGTTGTAAAAAAATTAATCTCATTGCAATGTTATTTCAATTGAGATTACTTAAGGACAAAATCTCAGAGTGGTGTTAGTATGCCTTTCTACTCTCCAGCACTTGCTGATCTCC
TTTTTCAATGAAGAGATCAGGCCTGAGGCTCAGCTATGGTACAAACAGTATCCAGCTAAAATTTGGTAACAAAATATGTTGTCTTCTATGTAGGACACATGATACTGGTTTTCCACTTAC
CTTAGCAATAAAGTTCCCTGCCAAGATAAATTGAATTGGAAAAGTTAGTCAATTTAAAGAAAAGTGTTAGATAAATGATAGTGCAGGAGGTGTGTAGAAAGGTAACCCTCAAATGGAGGT
CTGGTAGCCACTGAGATGGTCCCTGGTGAGCCTGAGTCCCTTTACCCAGCAGGTTGGTATTCAAGGCAGTGGTATGAGGCCTGCGCCTTCCATGGGAAGAGGGAGTAGAGAGGAGGAGAG
GGGCTGGAACAGGGGAGCAGAGACACCCAGCTCTTGCATTACCTGGGAAACAGAGAAGGCGACCCTCCTACTCCCAGCCCCAGGAGCCCCGCTTGCTCAGAACATGTGACTNNNNNNNNN
NNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNN
NNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNN
NNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNN
NNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNN
NNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNN
NNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNN
NNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNAGCCCCAGGAGCCCCGCTTGCTCAGACCATGTGACTACTCCCCAGGCCTCAGGGGTGTGAAGAGGACAGGTGCCTGTCAGTCCTCAG
GTACCAGGAGCAGGCCTTCTCATCTGTGCTTTTCCCTGTCTATGACCTCCAGGAGAGTGACCTACCTGCAATTCCAGAGGCCCCAAGGCTTTGAGTACAAGTCAGGACAGTGGGTGCGGA
TCGCCTGCCTGGCTCTGGGGACCACCGAGTACCACCCCTTCACACTGACCTCCGCGCCCCATGAGGACACACTCAGCCTGCACATCCGGGCAGTGGGGCCCTGGACCACTCGCCTCAGGG
AGATCTACTCATCCCCAAAAGGCAATGCCTGTGCTGGATACCCAAAG
GTGCCCGTCACTGGGAACCCTGCTTCCGGGCCTCTGGCACTGGCAGAGGATCTCTGCCCTTCCCTATCCTGAG
ACTAGAAGCTCCAGCCGTCCCAAAGCCAGCCTGGGAGAGGACCGGGGTGCCTCAGAAAAGACTAGGATGTTCTGTATCCTCCCTCTGCCTGTGTCTCCGTTTCTGGTCTCAGAGCTGGGG
CAGGGTCAGGCTCATTTCATCTCCCCCCTCTCTTGGCAGCTCTGTACCTTGATGGACCGTTTGGAGAGGGCCATCAGGAGTGGCATAAATTTGAGGTGTCAGTGTTGGTGGGAGGGGGCA
TTGGGGTCACCCCCTTTGCCTCCATCCTCAAAGACCTGGTCTTCAAGTCATCCTTGGGCAGCCAAATGCTGTGTAAGAAG
GTGAGCATCCCTTCCTCATTCATCAAATGGGGCATAGGTG
GCCGAATTGTGACCCGCATCAAGTGGTGGATCATGAGAGAAAGCTCCTGGCTCCAGGAACTGAGTCTGAAGGGGTCATTCTTACCCAGTGGTTGAGATGCCAAACTTGGAGGGAAGTTGG
TGGTATAGCCAGAAGGGCCTCTGCTGGGACCTGTCAGTTGGAAGCCTGGGATCAGGCTGGTGGGTCCTGCCACAGCTTTGGTGTCTGCAGGTGGTCTGGGGCTTCCCAGCCTCTCAGGTG
AAGGCACCTGGGATCTAGGGAGGCTGAACTGAGCTGGGTCCTGATCTCCAGCCCTGTGTCCCCAGATATCTACTTCATCTGGGTGACACGGACCCAGCGTCAGTTTGAGTGGTTGGCTGA
CATCATCCGAGAGGTGGAGGAGAACGACCACCAGGACCTGGTGTCTGTGCACATTTATGTCACCCAGCTGGCTGAGAAGTTCGACCTCAGGACCACCATGCTA
GTATGTCAGGGCCCGCC
AGGCAGGGCAACTTGGTGGGCAGATGGATTGGCAGCGTAAGGCAGCATAGCCAGGGCAGGTGGGTGGACGGCCAGGCTGAGCTGGCAGGAGGCACAGAGCTGATGGCCTGATCCTCAGCC
TCCAGCTCCCTCCCCTCCCCATTCTCTGTCTCTTGGGCTATGTGGGCTGGCTCGGGCTGAGTGCTGGCCCTGACTGTCTTTGGTCTGACCTGCCCCTGTGCCCCCAGTATACATCTGCGA
GCGGCACTTCCAGAAAGTGCTGAACCGGAGTCTGTTCACGGGCCTGCGCTCCATCACCCACTTTGGCCGTCCCCCCTTCGAGCCCTTCTTCAACTCCCTGCAGGAGGTCCACCCACAG
GT
CAGTCCCACTCCCTCCCACCCTGGGACTCTGGCCTTCTCCTGCCAGGACATCCTGGCCCTGAAGCACCCTGCCGCTCTTTTCTGAGCAGAGAACTCCACCCGCTTGCCTGGCCCCAGGAT
GAGGTCAGCTGTTAAAGGGGGACTTCCACCCCCTCCACGTTAAGCCTCTTCCTCAAGGCCTGGGCTTGAAGCCCTAGTCATTCCAGCCAGGCTCAGGAAGCAGCTTTTCCCAAGGAGAGT
GAGCACCTTTAGGCTGCAGGCCCCTCTCTCTCTCTAATCTCCTGACAGGTGTGCGCAAGATCGGGGTGTTCAGCTGCGGCCCTCCAGGAATGACCAAGAATGTAGAGAAGGCCTGTCAGC
TCGTCAACAGGCAGGACCGAGCCCACTTCATGCACCACTATGAGAACTTCTGA

CDS
.........1.........2.........3.........4.........5.........6.........7.........8.........9.........0.........1.........2
ATGCTCCGTGCAAGACCAGAGGCACTGATGCTCCTGGGAGCTCTTCTGACTGGATCCCTGGATCCATCGGGCAATCAGGACGCACTCTCACTGCCCTGGGAAGTGCAGCGCTATGACGGC
TGGTTTAACAACCTGAGGCACCACGAGCGTGGTGCTGTTG
GCTGCCGGTTGCAGCGCCGCGTACCAGCCAATTACGCCGACGGTGTGTATCAGGCTCTGGAGGAGCCGCAGCTGCCCAAC
CCGCGCCGGCTCAGCAACGCAGCCACGCGGGGCATAGCCGGCCTGCCGTCGCTCCACAACCGCACCGTACTGGGGGTCTTCTTTG
GCTACCATGTTCTTTCCGACGTGGTGAGCGTGGAA
ACGCCCGGCTGCCCCGCCGAGTTCCTCAACATCCGCATCCCACCTGGAGACCCCGTGTTCGACCCCGACCAGCGCGGGGACGTGGTGCTGCCCTTCCAGAGGAGCCGCTGGGACCCCGAG
ACCGGACGGAGTCCCAGCAACCCCCGGGACCTG
GCCAACCAGGTGACGGGCTGGCTGGACGGCAGCGCCATCTATGGCTCCTCGCACTCCTGGAGCGACGCGCTGCGGAGCTTCTCGGGG
GGACAGCTGGCGTCGGGGCCCGACCCCGCTTTCCCCCGAGACTCGCAGAACCCCCTGCTCATGTGGGCGGCGCCCCGACCCCCGCCACCGGGGCAGAACGGGCCCCGGGGGC
CCTTCGGG
GCAGAGAGAGGGAACCGGGAACCCTTCCTGCAGGCGCTGGGCCTGCTCTGGTTCCGCTACCACAACCTGTGGGCGCAGAGGCTGGCCCGCCAGCACCCAGACTGGGAGGACGAGGAGCTG
TTCCAGCACGCACGCAAGAGGGTCATCGCCACCTACCAG
AACATCGCTGTGTATGAGTGGCTGCCCAGCTTCCTGCAGAAAACACTCCCGGAGTATACAGGATACCGTCCTTTCCTAGAC
CCCAGCATCTCCCCGGAATTTGTGGTGGCCTCTGAGCAGTTCTTCTCTACCATGGTGCCCCCTGGTGTCTACATGAG
AAATGCCAGCTGTCATTTCCGGAAGGTCCTGAACAAGGGTTTT
CAAAGCTCCCAAGCTCTCAGGGTCTGCAACAACTACTGGATTCGGGAG
AATCCCAATCTGAACAGTACCCAGGAGGTGAATGAGCTGCTGCTGGGAATGGCCTCCCAGATTTCGGAGCTG
GAGGACAACATCGTGGTTGAAGATCTGAGGG
ATTACTGGCCTGGCCCTGGCAAATTCTCCCGTACAGACTATGTGGCCAGCAGCATCCAACGTGGCCGAGATATGGGGCTGCCCAGCTAT
AGCCAGGCCCTGCTGGCCTTTGGGCTGGACATCCCAAGGAACTGGAGTGATCTCAACCCTAATGTGGACCCCCAG
GTGCTGGAGGCCACAGCTGCCCTGTACAACCAGGACCTATCCCAG
CTAGAGCTGCTCCTTGGGGGGCTCCTGGAGAGCCATGGGGACCCTGGACCCCTGTTCAGTGCCATTGTCCTCGACCAGTTTGTACGGCTGCGGGATGGTGACCGCTACTGGTTTGAGAAC
ACCAGGAATGG
GCTGTTCTCCAAGAAGGAGATTGAAGACATCCGAAATACCACCCTGCGGGACGTGCTGGTCGCTGTTATCAACATTGACCCCAGTGCCCTGCAGCCCAATGTCTTTGTC
TGGCATAAAG
GTGCACCCTGCCCTCAACCTAAGCAGCTCACAACTGACGGCCTGCCCCAGTGCGCACCCCTGACTGTGCTTGACTTCTTTGAAGGCAGCAGCCCTGGTTTTGCCATCACC
ATCATTGCTCTCTGCTGCCTTCCCTTAG
TGAGTCTGCTTCTCTCTGGAGTGGTGGCCTATTTCCGGGGCCGAGAACGCAAGAAGCTACAAAAGAAAGTCAAAGAGAGCGTGAAGAAGGAA
GCAGCCAAAGATGGAGTGCCAG
CGATGGAGTGGCCAGGCCCCAAGGAGAGGAGCAGTCCCATCATCATCCAGCTGCTGTCAGACAGGTGTCTGCAGGTCCTGAACAGGCGTCTCACTGTG
CTCCGTGTGGTCCAGCTGCAGCCTCTGCAGCAGGTCAACCTCATCCTGTCCAACAACCGAGGATGCCGCACCCTGCTGCTCAAGATCCCTAAGGAGTACGACCTG
GTGCTGCTGTTTAGT
TCTGAAGAGGAACGGGGCGCCTTTGTGCAGCAGCTACGGGACTTCTGCATGCGCTGGGCTCTGGGCCTCCATGTGGCTGAGATGAGTGAGAAGGAGCTATTTAGGAAGGCTGTGACAAAG
CAGCAGCGGGAACGCATCCTGGAGATCTTCTTCAGACACCTTTTTGCTCAG
GTGCTGGACATCAACCAGGCTGATGCAGGGACCCTGCCCCTGGACTCCTGCCAGAAGGTGCGGGAGGCC
CTGACCTGCGAGCTGAGCAGGGCCGAGTTTGCTGAGTCCCTGGGCCTCAAGCCCCAGGACATGTTTGTGGAGTCCATGTTCTCTCTGGCTGACAAGGATGGCAATGGCTACCTGTCCTTC
CGAGAGTTCCTGGACATCCTGGTGGTCTTCATGAAAG
GCTCCCCAGAGGATAAGTCCCGTCTAATGTTTACCATGTATGACCTGGATGAGAATGGCTTCCTCTCCAAGGACGAATTCTTC
ACCATGATGAG
ATCCTTCATCGAGATCTCCAACAACTGCCTGTCCAAGGCCCAGCTGGCCGAGGTGGTGGAGTCCATGTTCCGGGAGTCGGGATTCCAGGACAAGGAGGAGCTGACATGG
GAGGATTTTCACTTCATGCTGCGGGACCATGACAGCGAGCTCCGCTTCACGCAGCTCTGTGTCAAAGGTGGAGGTGGAGGTGGAAGTG
GTATTAGAGATATCTTTAAACAAAACATCAGC
TGTCGAGTCTCGTTCATCACTCGGACACCTGGGGAGCG
CTCCCACCCCCAGGGACTGGGGCCCCCTGCCCCAGAAGCCCCAGAGCTGGGAGGCCCTGGGCTGAAGAAGAGGTTTGGCAAA
AA
GGCAGCAGTGCCCACTCCCCGGCTGTACACAGAGGCGCTGCAAGAGAAGATGCAGCGAGGCTTCCTAGCCCAAAAGCTGCAGCAGTACAAGCGCTTCGTGGAGAACTACCGGAGGCAC
ATCGTGTGTGTGGCAATCTTCTCGGCCATCTGTGTTGGCGTGTTTGCAGATCGTGCTTACT
ACTATGGCTTTGCCTCGCCACCCTCGGACATTGCACAGACCACCCTCGTGGGCATCATC
CTGTCACGAGGCACGGCGGCCAGCGTCTCCTTCATGTTCTCTTATATCTTGCTCACCATGTGCCGCAACCTCATAACCTTCCTGCGAGAGACTTTCCTCAACCGCTATGTGCCTTTTGAT
GCCGCAGTGGACTTCCACCGCTGGATCGCCATGGCTGCTGTTGTCCTGGCCA
TTTTGCACAGTGCTGGCCACGCAGTCAATGTCTACATCTTCTCAGTCAGCCCACTCAGCCTGCTGGCC
TGTGTATTCCCCAACGTCTTTGTGAATGATGG
GTCCAAGCTTCCCCAGAAGTTCTATTGGTGGTTCTTCCAGACCGTCCCAGGTATGACAGGTGTGCTTCTGCTCCTGGTCCTGGCCATC
ATGTATGTCTTCGCCTCCCACCACTTCCGCCGCCGCAGCTTCCGGGGCTTCTGGCTGACCCACCACCTCTACATCCTGCTCTATGCCCTG
CTCATCATCCACGGCAGCTATGCTCTGATC
CAGCTGCCCACTTTCCACATCTACTTCCTGGTCCCGGCAATCATCTATGGAGGTGACAAGCTGGTGAGCCTGAGCCGGAAGAAGGTGGAGATCAGCGTGGTGAAGGCGGAGCTGCTGCCC
TCAG
GAGTGACCTACCTGCAATTCCAGAGGCCCCAAGGCTTTGAGTACAAGTCAGGACAGTGGGTGCGGATCGCCTGCCTGGCTCTGGGGACCACCGAGTACCACCCCTTCACACTGACC
TCCGCGCCCCATGAGGACACACTCAGCCTGCACATCCGGGCAGTGGGGCCCTGGACCACTCGCCTCAGGGAGATCTACTCATCCCCAAAAGGCAATGCCTGTGCTGGATACCCAAAG
CTG
TACCTTGATGGACCGTTTGGAGAGGGCCATCAGGAGTGGCATAAATTTGAGGTGTCAGTGTTGGTGGGAGGGGGCATTGGGGTCACCCCCTTTGCCTCCATCCTCAAAGACCTGGTCTTC
AAGTCATCCTTGGGCAGCCAAATGCTGTGTAAGAAG
ATCTACTTCATCTGGGTGACACGGACCCAGCGTCAGTTTGAGTGGTTGGCTGACATCATCCGAGAGGTGGAGGAGAACGACCAC
CAGGACCTGGTGTCTGTGCACATTTATGTCACCCAGCTGGCTGAGAAGTTCGACCTCAGGACCACCATGCTA
TACATCTGCGAGCGGCACTTCCAGAAAGTGCTGAACCGGAGTCTGTTC
ACGGGCCTGCGCTCCATCACCCACTTTGGCCGTCCCCCCTTCGAGCCCTTCTTCAACTCCCTGCAGGAGGTCCACCCACAG
GTGCGCAAGATCGGGGTGTTCAGCTGCGGCCCTCCAGGA
ATGACCAAGAATGTAGAGAAGGCCTGTCAGCTCGTCAACAGGCAGGACCGAGCCCACTTCATGCACCACTATGAGAACTTCTGA
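The relation between the CDS and the protein sequence can be spot-checked by translating a few codons with the standard genetic code. The sketch below is illustrative and self-contained: the `translate` helper is hypothetical, the codon table is the standard code built in T, C, A, G order, and only the first nine and last seven codons of the CDS above are used rather than the full sequence.

```python
# Standard genetic code, codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(cds):
    """Translate a DNA string codon by codon; '*' marks a stop codon."""
    cds = "".join(cds.split()).upper()  # drop line breaks and spaces
    usable = len(cds) - len(cds) % 3    # ignore a trailing partial codon
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, usable, 3))


first_codons = "ATGCTCCGTGCAAGACCAGAGGCACTG"  # start of the CDS
last_codons = "CACCACTATGAGAACTTCTGA"         # end of the CDS

print(translate(first_codons))  # MLRARPEAL
print(translate(last_codons))   # HHYENF*
```

Both fragments match the corresponding ends of the protein sequence listed in this entry, including the terminal stop marked with `*`.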

cDNA
.........1.........2.........3.........4.........5.........6.........7.........8.........9.........0.........1.........2
TTGGAAGTCG CGCGGGACCC CTTTTATAGC AGCGTGGGCG ACGTGCCACA CGGGTGTCCC AGCCCAGGGG CTGGTCTGAG CTGGAAGAGG TTATGCAAAT AAGGGCCCCA CCTCCACAGC  AGGAGGGTGA GCCCTAGGTC CAGATGCTCA CACTGGCGCA GGTCTGTCCT GAGCCGACAC CTGCACAGTG GCGAGACCAA GGACCCAGAG AGAAAGGTGA GAGTGCAGCC GGGGAGGCTA  AGGATCGGCG GAGCTGGAAG AGTGAGGGTG AAGGCAAGAA GTAGAGCACA GAAGCAAAGA TTTTAAGAGG AAAGAAGACA TCTGAACCCA ACACCACCCT AAACTACAGG CTGCAGGGTT  GGCATGCTCC GTGCAAGACC AGAGGCACTG ATGCTCCTGG GAGCTCTTCT GACTGGATCC CTGGATCCAT CGGGCAATCA GGACGCACTC TCACTGCCCT GGGAAGTGCA GCGCTATGAC  GGCTGGTTTA ACAACCTGAG GCACCACGAG CGTGGTGCTG TTGGCTGCCG GTTGCAGCGC CGCGTACCAG CCAATTACGC CGACGGTGTG TATCAGGCTC TGGAGGAGCC GCAGCTGCCC  AACCCGCGCC GGCTCAGCAA CGCAGCCACG CGGGGCATAG CCGGCCTGCC GTCGCTCCAC AACCGCACCG TACTGGGGGT CTTCTTTGGC TACCATGTTC TTTCCGACGT GGTGAGCGTG  GAAACGCCCG GCTGCCCCGC CGAGTTCCTC AACATCCGCA TCCCACCTGG AGACCCCGTG TTCGACCCCG ACCAGCGCGG GGACGTGGTG CTGCCCTTCC AGAGGAGCCG CTGGGACCCC  GAGACCGGAC GGAGTCCCAG CAACCCCCGG GACCTGGCCA ACCAGGTGAC GGGCTGGCTG GACGGCAGCG CCATCTATGG CTCCTCGCAC TCCTGGAGCG ACGCGCTGCG GAGCTTCTCG  GGGGGACAGC TGGCGTCGGG GCCCGACCCC GCTTTCCCCC GAGACTCGCA GAACCCCCTG CTCATGTGGG CGGCGCCCCG ACCCCCGCCA CCGGGGCAGA ACGGGCCCCG GGGGCCCTTC  GGGGCAGAGA GAGGGAACCG GGAACCCTTC CTGCAGGCGC TGGGCCTGCT CTGGTTCCGC TACCACAACC TGTGGGCGCA GAGGCTGGCC CGCCAGCACC CAGACTGGGA GGACGAGGAG  CTGTTCCAGC ACGCACGCAA GAGGGTCATC GCCACCTACC AGAACATCGC TGTGTATGAG TGGCTGCCCA GCTTCCTGCA GAAAACACTC CCGGAGTATA CAGGATACCG TCCTTTCCTA  GACCCCAGCA TCTCCCCGGA ATTTGTGGTG GCCTCTGAGC AGTTCTTCTC TACCATGGTG CCCCCTGGTG TCTACATGAG AAATGCCAGC TGTCATTTCC GGAAGGTCCT GAACAAGGGT  TTTCAAAGCT CCCAAGCTCT CAGGGTCTGC AACAACTACT GGATTCGGGA GAATCCCAAT CTGAACAGTA CCCAGGAGGT GAATGAGCTG CTGCTGGGAA TGGCCTCCCA GATTTCGGAG  CTGGAGGACA ACATCGTGGT TGAAGATCTG AGGGATTACT GGCCTGGCCC TGGCAAATTC TCCCGTACAG ACTATGTGGC CAGCAGCATC CAACGTGGCC GAGATATGGG GCTGCCCAGC  TATAGCCAGG CCCTGCTGGC CTTTGGGCTG GACATCCCAA GGAACTGGAG TGATCTCAAC CCTAATGTGG ACCCCCAGGT GCTGGAGGCC ACAGCTGCCC TGTACAACCA GGACCTATCC  
CAGCTAGAGC TGCTCCTTGG GGGGCTCCTG GAGAGCCATG GGGACCCTGG ACCCCTGTTC AGTGCCATTG TCCTCGACCA GTTTGTACGG CTGCGGGATG GTGACCGCTA CTGGTTTGAG  AACACCAGGA ATGGGCTGTT CTCCAAGAAG GAGATTGAAG ACATCCGAAA TACCACCCTG CGGGACGTGC TGGTCGCTGT TATCAACATT GACCCCAGTG CCCTGCAGCC CAATGTCTTT  GTCTGGCATA AAGGTGCACC CTGCCCTCAA CCTAAGCAGC TCACAACTGA CGGCCTGCCC CAGTGCGCAC CCCTGACTGT GCTTGACTTC TTTGAAGGCA GCAGCCCTGG TTTTGCCATC  ACCATCATTG CTCTCTGCTG CCTTCCCTTA GTGAGTCTGC TTCTCTCTGG AGTGGTGGCC TATTTCCGGG GCCGAGAACG CAAGAAGCTA CAAAAGAAAG TCAAAGAGAG CGTGAAGAAG  GAAGCAGCCA AAGATGGAGT GCCAGCGATG GAGTGGCCAG GCCCCAAGGA GAGGAGCAGT CCCATCATCA TCCAGCTGCT GTCAGACAGG TGTCTGCAGG TCCTGAACAG GCGTCTCACT  GTGCTCCGTG TGGTCCAGCT GCAGCCTCTG CAGCAGGTCA ACCTCATCCT GTCCAACAAC CGAGGATGCC GCACCCTGCT GCTCAAGATC CCTAAGGAGT ACGACCTGGT GCTGCTGTTT  AGTTCTGAAG AGGAACGGGG CGCCTTTGTG CAGCAGCTAC GGGACTTCTG CATGCGCTGG GCTCTGGGCC TCCATGTGGC TGAGATGAGT GAGAAGGAGC TATTTAGGAA GGCTGTGACA  AAGCAGCAGC GGGAACGCAT CCTGGAGATC TTCTTCAGAC ACCTTTTTGC TCAGGTGCTG GACATCAACC AGGCTGATGC AGGGACCCTG CCCCTGGACT CCTGCCAGAA GGTGCGGGAG  GCCCTGACCT GCGAGCTGAG CAGGGCCGAG TTTGCTGAGT CCCTGGGCCT CAAGCCCCAG GACATGTTTG TGGAGTCCAT GTTCTCTCTG GCTGACAAGG ATGGCAATGG CTACCTGTCC  TTCCGAGAGT TCCTGGACAT CCTGGTGGTC TTCATGAAAG GCTCCCCAGA GGATAAGTCC CGTCTAATGT TTACCATGTA TGACCTGGAT GAGAATGGCT TCCTCTCCAA GGACGAATTC  TTCACCATGA TGAGATCCTT CATCGAGATC TCCAACAACT GCCTGTCCAA GGCCCAGCTG GCCGAGGTGG TGGAGTCCAT GTTCCGGGAG TCGGGATTCC AGGACAAGGA GGAGCTGACA  TGGGAGGATT TTCACTTCAT GCTGCGGGAC CATGACAGCG AGCTCCGCTT CACGCAGCTC TGTGTCAAAG GTGGAGGTGG AGGTGGAAGT GGTATTAGAG ATATCTTTAA ACAAAACATC  AGCTGTCGAG TCTCGTTCAT CACTCGGACA CCTGGGGAGC GCTCCCACCC CCAGGGACTG GGGCCCCCTG CCCCAGAAGC CCCAGAGCTG GGAGGCCCTG GGCTGAAGAA GAGGTTTGGC  AAAAAGGCAG CAGTGCCCAC TCCCCGGCTG TACACAGAGG CGCTGCAAGA GAAGATGCAG CGAGGCTTCC TAGCCCAAAA GCTGCAGCAG TACAAGCGCT TCGTGGAGAA CTACCGGAGG  CACATCGTGT GTGTGGCAAT CTTCTCGGCC ATCTGTGTTG GCGTGTTTGC AGATCGTGCT TACTACTATG GCTTTGCCTC GCCACCCTCG GACATTGCAC AGACCACCCT CGTGGGCATC  
ATCCTGTCAC GAGGCACGGC GGCCAGCGTC TCCTTCATGT TCTCTTATAT CTTGCTCACC ATGTGCCGCA ACCTCATAAC CTTCCTGCGA GAGACTTTCC TCAACCGCTA TGTGCCTTTT  GATGCCGCAG TGGACTTCCA CCGCTGGATC GCCATGGCTG CTGTTGTCCT GGCCATTTTG CACAGTGCTG GCCACGCAGT CAATGTCTAC ATCTTCTCAG TCAGCCCACT CAGCCTGCTG  GCCTGTGTAT TCCCCAACGT CTTTGTGAAT GATGGGTCCA AGCTTCCCCA GAAGTTCTAT TGGTGGTTCT TCCAGACCGT CCCAGGTATG ACAGGTGTGC TTCTGCTCCT GGTCCTGGCC  ATCATGTATG TCTTCGCCTC CCACCACTTC CGCCGCCGCA GCTTCCGGGG CTTCTGGCTG ACCCACCACC TCTACATCCT GCTCTATGCC CTGCTCATCA TCCACGGCAG CTATGCTCTG  ATCCAGCTGC CCACTTTCCA CATCTACTTC CTGGTCCCGG CAATCATCTA TGGAGGTGAC AAGCTGGTGA GCCTGAGCCG GAAGAAGGTG GAGATCAGCG TGGTGAAGGC GGAGCTGCTG  CCCTCAGGAG TGACCTACCT GCAATTCCAG AGGCCCCAAG GCTTTGAGTA CAAGTCAGGA CAGTGGGTGC GGATCGCCTG CCTGGCTCTG GGGACCACCG AGTACCACCC CTTCACACTG  ACCTCCGCGC CCCATGAGGA CACACTCAGC CTGCACATCC GGGCAGTGGG GCCCTGGACC ACTCGCCTCA GGGAGATCTA CTCATCCCCA AAAGGCAATG CCTGTGCTGG ATACCCAAAG  CTGTACCTTG ATGGACCGTT TGGAGAGGGC CATCAGGAGT GGCATAAATT TGAGGTGTCA GTGTTGGTGG GAGGGGGCAT TGGGGTCACC CCCTTTGCCT CCATCCTCAA AGACCTGGTC  TTCAAGTCAT CCTTGGGCAG CCAAATGCTG TGTAAGAAGA TCTACTTCAT CTGGGTGACA CGGACCCAGC GTCAGTTTGA GTGGTTGGCT GACATCATCC GAGAGGTGGA GGAGAACGAC  CACCAGGACC TGGTGTCTGT GCACATTTAT GTCACCCAGC TGGCTGAGAA GTTCGACCTC AGGACCACCA TGCTATACAT CTGCGAGCGG CACTTCCAGA AAGTGCTGAA CCGGAGTCTG  TTCACGGGCC TGCGCTCCAT CACCCACTTT GGCCGTCCCC CCTTCGAGCC CTTCTTCAAC TCCCTGCAGG AGGTCCACCC ACAGGTGCGC AAGATCGGGG TGTTCAGCTG CGGCCCTCCA  GGAATGACCA AGAATGTAGA GAAGGCCTGT CAGCTCGTCA ACAGGCAGGA CCGAGCCCAC TTCATGCACC ACTATGAGAA CTTCTGAGCC TGTCCTCCCT GGCTGCTGCT TCCAGTATCC  TGCCTTCCCT TCTGTGCACC TAAGTTGCCC AGCCCTGCTG GCAATCTCTC CATCAGAATC CACGTTGGGC CTCAGCTGGA GGGCTGCAGA GCCCCTCCCA ATATTGGGAG AATATTGACC  CAGACAATTA TACAAATGAG AAAAGGCAGG AGACTATGTT CTACAATTGC AGTGCATGAT GATTATAAGT CCACCTATTT ATCAACAGCA CCGTTCCTGC AGCCCTCCAG CCTTCCTGCC  CTTAGCAAGT GCACAACCAG TCAGGATCTC CCAAAGAAGA TAAAGACCAC TCCTCACCCC AGCTCAAGCC ATGGCAGGCG TGGCAAGCAA AGTGGGGAGG AGACAGTCCC TGCTTGTGAC  
AAGTGTGGAG GTGAAAAGGT ACAATAGTGC TTGTCTCCAA TAGCTCCCCA CATCTCTAAT TGACTTCCAC AAAATCGATG CATTGCTTTG GTATTTGCTT GGCCTGACAT TTGAGGGAGG  AGGAGGCCGG GATCCTCTGG CTGAGAATCT CCTCAGAGCC CAGTGCAGAA GCTGTGATGC TTAGAACCTG GACAGCCCGA CTGCCTCAAC TCTGTCTCCA GGTCTATTCC CTCTGGCTCC  AAAAGGAGCA GCCCTACTTC CACCCCTTCC CGTCCCCAAA GTGTCAGCAA CTTTGAGGAG GGCACCAGGA AACAAAGATG CCTCCCCAAT CCTGATATTC TTGATGTCAC CAGTGATACC  CACTGCCCTG ACCCCTGGGC AGGCCCCTCT CTGCATCTAC TGGAGTGGTC CCTGGGCTCT GGGGCTGAAG GATTCCAGCC TCTCTGCCAG ATATTCAGAA CTCGCTCTCA ATTCACCTCT  TCCACAAGAG TTGGGTGACC AGCTGTCCTA GTTTGCCCAG GACTCTCCCT GTTTTAGCAC TGAAAGTCTC TTGCCCCAGG AAACCCCATC AGTCCCAGGC AGATTGGGAC AGCTGGTCAC  CTTATGCAAG AGCCAGCCTG AAACATCCCC TCCATACTCA GCTCTTTAAC TTTTCCTTTT TCATTGGGCT CTTTCCTAAA AAGCTAAGCT GTAAAATATT TTACATCGAG GTATAATAAA  TAATCATGTA CATGTTTTAC CACCACCCAG GTCAAGACAT AGAACATTTC AACATTTCCA TCACCCCAGA AACTCCCCTT GTACCCCCTT CCACTTCGTC TCCCCTAGCT CCTAGAAGCA  ACCACTGATG TGATTTCTAC CAAATCCAGT TTTGGTCCTA CTAAATGTAC TCTTTTGAGA CTGGCCTCTT TCACTCGCCA TAATGCCTTT GTAATTCATC CATGCTGTTG TGTGTATCAG  CAGTTTGTTC CTTTTCATTG CTGAGTAGTA TTCCATTGTA GAGATGTACC ACAGTTTGTT TATTCTTCTG TTGATGGACA TTTGGGTTGT TTCTAATTTT GAATGATTAT AAATAAAAAT 
TCTGTGAGTG TTCTTGTACC TA 
