RAW FORMAT

This format is similar to text/plain, except that all whitespace is removed and only alphabetic characters are accepted. Spaces and TAB characters are ignored. An input sequence containing any other character (such as digits or punctuation) is rejected as erroneous.
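
The rules above can be sketched in Python as follows. This is only an illustration, not the server's actual code: the function name parse_raw is hypothetical, and treating "alphabetic" as ASCII letters only is an assumption.

```python
def parse_raw(text: str) -> str:
    """Return the sequence with whitespace removed, or raise ValueError.

    Hypothetical helper: assumes "alphabetic" means ASCII letters A-Z/a-z.
    """
    # Drop every whitespace character (spaces, TABs, line breaks).
    compact = "".join(text.split())
    # Collect any character that is not an ASCII letter.
    bad = sorted({ch for ch in compact if not (ch.isascii() and ch.isalpha())})
    if bad:
        raise ValueError(f"non-alphabetic characters in sequence: {bad}")
    return compact
```

For instance, parse_raw("ELRL RYC\tAPA") would return the sequence with the space and TAB removed, while an input containing a digit would raise ValueError.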

The following is an example of a protein sequence in raw format.

ELRLRYCAPAGFALLKCNDADYDGFKTNCSNVSVVHCTNLMNTTVTTGLLLNGSYSENRT
QIWQKHRTSNDSALILLNKHYNLTVTCKRPGNKTVLPVTIMAGLVFHSQKYNLRLRQAWC
HFPSNWKGAWKEVKEEIVNLPKERYRGTNDPKRIFFQRQWGDPETANLWFNCHGEFFYCK
MDWFLNYLNNLTVDADHNECKNTSGTKSGNKRAPGPCVQRTYVACHIRSVIIWLETISKK
TYAPPREGHLECTSTVTGMTVELNYIPKNRTNVTLSPQIESIWAAELDRYKLVEITPIGF
APTEVRRYTGGHERQKRVPFVVQSQHLLAGILQQQKNL LAAVEAQQQMLKLTIWGVK

HOW TO CONVERT FASTA TO RAW FORMAT

The following example is a protein sequence in FASTA format.

>Example1 envelope protein
ELRLRYCAPAGFALLKCNDADYDGFKTNCSNVSVVHCTNLMNTTVTTGLLLNGSYSENRT
QIWQKHRTSNDSALILLNKHYNLTVTCKRPGNKTVLPVTIMAGLVFHSQKYNLRLRQAWC
HFPSNWKGAWKEVKEEIVNLPKERYRGTNDPKRIFFQRQWGDPETANLWFNCHGEFFYCK
MDWFLNYLNNLTVDADHNECKNTSGTKSGNKRAPGPCVQRTYVACHIRSVIIWLETISKK
TYAPPREGHLECTSTVTGMTVELNYIPKNRTNVTLSPQIESIWAAELDRYKLVEITPIGF
APTEVRRYTGGHERQKRVPFVVQSQHLLAGILQQQKNL LAAVEAQQQMLKLTIWGVK

To convert FASTA format to raw format, simply delete the first line (the description line that begins with ">"). What remains is the protein sequence in raw format.
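
For more than a handful of sequences, the same step can be scripted. The sketch below assumes a single-record FASTA input whose first line starts with ">"; the function name fasta_to_raw is hypothetical and is not part of any of the tools listed on this page.

```python
def fasta_to_raw(fasta: str) -> str:
    """Drop the FASTA description line and return the bare sequence.

    Hypothetical helper: assumes exactly one record in the input.
    """
    lines = fasta.strip().splitlines()
    if not lines or not lines[0].startswith(">"):
        raise ValueError("expected a FASTA description line starting with '>'")
    # Join the remaining lines, removing any spaces or TABs inside them.
    return "".join("".join(line.split()) for line in lines[1:])
```

Applied to the example above, this would discard the ">Example1 envelope protein" line and return the six sequence lines joined into one string.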

 

HOW TO CONVERT BLAST OUTPUT TO RAW FORMAT

1. NIH Converter Server

2. Readseq

3. Bioperl
