repfdr state assignments — GRAPH_STATES • MotrpacRatTraining6moData

repdfr state assignments (up, down, or null) for each training-regulated feature (5% FDR) for each sex at each time point, which specify node assignments for each differential feature. Missing values indicate that the repfdr posterior probabilities did not meet the cutoff for that feature.

Usage

GRAPH_STATES

Format

A data frame with 34244 rows and 10 variables:

feature: character, unique feature identifier in the format 'ASSAY_ABBREV;TISSUE_ABBREV;feature_ID' only for training-regulated features at 5% IHW FDR. For redundant differential features, 'feature_ID' is prepended with the specific platform to make unique identifiers. See REPEATED_FEATURES for details.
ome: character, assay abbreviation, one of ASSAY_ABBREV
tissue: character, tissue abbreviation, one of TISSUE_ABBREV. Note that VENACV, OVARY, TESTES, were not included in the graphical representation of differential features due to missing groups (e.g., females trained for 1 week).
feature_ID: character, MoTrPAC feature identifier
state_1w: character, state (1, up-regulated; 0, null; -1, down-regulated) of the feature in each sex (F, females; M, males) at the 1-week training time point, relative to sex-matched untrained animals
state_2w: character, state of the feature in each sex at the 2-week training time point
state_4w: character, state of the feature in each sex at the 4-week training time point
state_8w: character, state of the feature in each sex at the 8-week training time point
path: character, assigned states from weeks 1-8, separated by "->". This represents a feature's full path through the graph. NA if the state at any of the four time points is NA.
tissue_path: character, assigned states from weeks 1-8, separated by "->". This represents a feature's full path through the graph. NA if the state at any of the four time points is NA.

Details

Given the posteriors Pr(h|z_i) computed using repfdr::repfdr() where h is a configuration vector in -1,0,1^8 (specifying the 8 analyzed groups, 4 time points in males and females), and z_i is the vector of z-scores of analyte i, we assign analytes to "states". A state is a tuple (s_m,j, s_f,j), where s_m,j is the differential abundance state null, up, or down (0,1, and -1 in the notation above, respectively) in males at time point j, and s_f,j is defined similarly for females (at time point j). Thus, we have nine possible states in each time point. For example, assume we inspect analyte i in time point j, asking if the abundance is up-regulated in males while null in females. Then, we sum over all posteriors Pr(h|z_i) such that males are up-regulated and females have 0. If the resulting value is greater than 0.5, then we say that analyte i belongs to the node set S(s_m,j, s_f,j). Thus, we use S(s_m,j, s_f,j) to denote all analytes that belong to a state (s_m,j, s_f,j). Then, for every pair of states from adjacent time points j and j+1 we define their edge set as the intersection of S(s_m,j, s_f,j) and S(s_m,j+1, s_f,j+1). Thus, thenode sets edge sets together define a tree structure that represent different differential patterns over sex and time.