Global protein expression feature annotation
Format
A data frame with 1379979 rows and 8 variables:
protein_idcharacter, RefSeq identifier in format
[accession].[version], e.g., NP_001004415.1. Note: contaminants should be identified with the following nomenclature: "Contaminant_XXXX"redundant_idscharacter, pipe-separated list of additional RefSeq identifiers with redundant peptide-matching sequences to
protein_idthat cannot be resolved by inference due to lack of unique peptidesis_contaminantlogical, whether
protein_idis a contaminantpeptide_scoredouble, MSGF+ SpecE Value or Spectrum Mill score (
1/bestScore)sequencecharacter, peptide sequence determined by MSGF+ or Spectrum Mill analysis of LC-MS/MS features
organism_namecharacter, organism whose protein collection database was used for LC-MS/MS peptide identification, e.g., Rattus norvegicus
tissuecharacter, tissue abbreviation, one of TISSUE_ABBREV
assaycharacter, assay "PROT"
Details
PROT feature annotation is only available via download from Google Cloud Storage:
https://storage.googleapis.com/motrpac-rat-training-6mo-extdata/feature-annot/PROT_FEATURE_ANNOT.rda.
You can use MotrpacRatTraining6mo::load_feature_annotation() to download and return this file.