Polyprotein Cleavage Prediction
Polyprotein cleavage data are provided to facilitate the analysis and comparison of viral polyprotein chains at the level of individual mature products.
The dataset contains FASTA sequences corresponding to cleavage products derived from reference viral polyproteins selected from the ICTV Virus Metadata Resource (VMR). Cleavage sites were predicted by sequence similarity between viruses belonging to the same genus or displaying high sequence similarity. Experimentally validated cleavage information recorded in UniProtKB/Swiss-Prot was used as the reference for these predictions.
FASTA headers follow the format:
<UniProt AC>_<Chain ID> <UniProtKB virus name>; <TaxID>; <Cleavage range>; <Cleavage product name> Example: >A0A0C4W3V9_011 Jutiapa virus; 64299; 2197-2219; Peptide 2k TQIDTTLAIFVHSMLLFVGMVVA
On each virus family page, the complete dataset or selected subsets can be downloaded using the search filters. The download includes both FASTA and CSV files containing the filtered results.
Funded by NIH* through the Pathogen Data Network
*This resource is supported as a whole or in part by the National Institute Of Allergy And Infectious Diseases of the National Institutes of Health under Grant n U24AI183840, awarded to the SIB Swiss Institute of Bioinformatics.