repdfr
state assignments (up, down, or null) for each
training-regulated feature (5% FDR) for each sex at each time point,
which specify node assignments for each differential feature.
Missing values indicate that the repfdr
posterior probabilities
did not meet the cutoff for that feature.
Format
A data frame with 34244 rows and 10 variables:
feature
character, unique feature identifier in the format 'ASSAY_ABBREV;TISSUE_ABBREV;feature_ID' only for training-regulated features at 5% IHW FDR. For redundant differential features, 'feature_ID' is prepended with the specific platform to make unique identifiers. See REPEATED_FEATURES for details.
ome
character, assay abbreviation, one of ASSAY_ABBREV
tissue
character, tissue abbreviation, one of TISSUE_ABBREV. Note that VENACV, OVARY, TESTES, were not included in the graphical representation of differential features due to missing groups (e.g., females trained for 1 week).
feature_ID
character, MoTrPAC feature identifier
state_1w
character, state (1, up-regulated; 0, null; -1, down-regulated) of the feature in each sex (F, females; M, males) at the 1-week training time point, relative to sex-matched untrained animals
state_2w
character, state of the feature in each sex at the 2-week training time point
state_4w
character, state of the feature in each sex at the 4-week training time point
state_8w
character, state of the feature in each sex at the 8-week training time point
path
character, assigned states from weeks 1-8, separated by "->". This represents a feature's full path through the graph. NA if the state at any of the four time points is NA.
tissue_path
character, assigned states from weeks 1-8, separated by "->". This represents a feature's full path through the graph. NA if the state at any of the four time points is NA.
Details
Given the posteriors Pr(h|z_i) computed using repfdr::repfdr()
where h is a configuration
vector in -1,0,1^8 (specifying the 8 analyzed groups, 4 time points in males and females),
and z_i is the vector of z-scores of analyte i, we assign analytes to "states".
A state is a tuple (s_m,j, s_f,j), where s_m,j is the differential abundance
state null, up, or down (0,1, and -1 in the notation above, respectively) in males
at time point j, and s_f,j is defined similarly for females (at time point j).
Thus, we have nine possible states in each time point.
For example, assume we inspect analyte i in time point j, asking if the abundance is
up-regulated in males while null in females. Then, we sum over all posteriors Pr(h|z_i)
such that males are up-regulated and females have 0.
If the resulting value is greater than 0.5, then we say that analyte i belongs to the
node set S(s_m,j, s_f,j). Thus, we use S(s_m,j, s_f,j) to denote all analytes
that belong to a state (s_m,j, s_f,j).
Then, for every pair of states from adjacent time points j and j+1 we define their
edge set as the intersection of S(s_m,j, s_f,j) and S(s_m,j+1, s_f,j+1).
Thus, thenode sets edge sets together define a tree structure that represent different
differential patterns over sex and time.