Skip to contents

repdfr state assignments (up, down, or null) for each training-regulated feature (5% FDR) for each sex at each time point, which specify node assignments for each differential feature. Missing values indicate that the repfdr posterior probabilities did not meet the cutoff for that feature.

Usage

GRAPH_STATES

Format

A data frame with 34244 rows and 10 variables:

feature

character, unique feature identifier in the format 'ASSAY_ABBREV;TISSUE_ABBREV;feature_ID' only for training-regulated features at 5% IHW FDR. For redundant differential features, 'feature_ID' is prepended with the specific platform to make unique identifiers. See REPEATED_FEATURES for details.

ome

character, assay abbreviation, one of ASSAY_ABBREV

tissue

character, tissue abbreviation, one of TISSUE_ABBREV. Note that VENACV, OVARY, TESTES, were not included in the graphical representation of differential features due to missing groups (e.g., females trained for 1 week).

feature_ID

character, MoTrPAC feature identifier

state_1w

character, state (1, up-regulated; 0, null; -1, down-regulated) of the feature in each sex (F, females; M, males) at the 1-week training time point, relative to sex-matched untrained animals

state_2w

character, state of the feature in each sex at the 2-week training time point

state_4w

character, state of the feature in each sex at the 4-week training time point

state_8w

character, state of the feature in each sex at the 8-week training time point

path

character, assigned states from weeks 1-8, separated by "->". This represents a feature's full path through the graph. NA if the state at any of the four time points is NA.

tissue_path

character, assigned states from weeks 1-8, separated by "->". This represents a feature's full path through the graph. NA if the state at any of the four time points is NA.

Details

Given the posteriors Pr(h|z_i) computed using repfdr::repfdr() where h is a configuration vector in -1,0,1^8 (specifying the 8 analyzed groups, 4 time points in males and females), and z_i is the vector of z-scores of analyte i, we assign analytes to "states". A state is a tuple (s_m,j, s_f,j), where s_m,j is the differential abundance state null, up, or down (0,1, and -1 in the notation above, respectively) in males at time point j, and s_f,j is defined similarly for females (at time point j). Thus, we have nine possible states in each time point. For example, assume we inspect analyte i in time point j, asking if the abundance is up-regulated in males while null in females. Then, we sum over all posteriors Pr(h|z_i) such that males are up-regulated and females have 0. If the resulting value is greater than 0.5, then we say that analyte i belongs to the node set S(s_m,j, s_f,j). Thus, we use S(s_m,j, s_f,j) to denote all analytes that belong to a state (s_m,j, s_f,j). Then, for every pair of states from adjacent time points j and j+1 we define their edge set as the intersection of S(s_m,j, s_f,j) and S(s_m,j+1, s_f,j+1). Thus, thenode sets edge sets together define a tree structure that represent different differential patterns over sex and time.