Protein ubiquitination feature annotation
Format
A data frame with 21436 rows and 12 variables:
protein_idcharacter, RefSeq identifier in format
[accession].[version], e.g., NP_001004415.1. Note: contaminants should be identified with the following nomenclature: "Contaminant_XXXX"redundant_idscharacter, pipe-separated list of additional RefSeq identifiers with redundant peptide-matching sequences to
protein_idthat cannot be resolved by inference due to lack of unique peptidesis_contaminantlogical, whether
protein_idis a contaminantpeptide_scoredouble, MSGF+ SpecE Value or Spectrum Mill score (
1/bestScore)sequencecharacter, peptide sequence determined by MSGF+ or Spectrum Mill analysis of LC-MS/MS features
organism_namecharacter, organism whose protein collection database was used for LC-MS/MS peptide identification, e.g., Rattus norvegicus
ptm_idcharacter, concatenation of
protein_idand the modified amino acid residue (e.g., serine residue, 20 denoted as S20s), with "-" as a separatorptm_peptidecharacter, concatenation of
ptm_idand peptide with "-" as a separator. Values in this column are unique.confident_scoredouble, score indicating confidence of modified amino acid residue localization by LC-MS/MS fragmentation as calculated by Ascore algorith (0-1000) or Spectrum Mill (0-1)
confident_sitelogical, whether a modified site was confidently localized by LC-MS/MS fragmentation
tissuecharacter, tissue abbreviation, one of TISSUE_ABBREV
assaycharacter, assay "UBIQ"
Details
UBIQ feature annotation is only available via download from Google Cloud Storage:
https://storage.googleapis.com/motrpac-rat-training-6mo-extdata/feature-annot/UBIQ_FEATURE_ANNOT.rda.
You can use MotrpacRatTraining6mo::load_feature_annotation() to download and return this file.