RRBS experimental and quantification QC metrics for DNA methylation (METHYL) data
Format
A data frame with 416 rows and 75 variables:
viallabelcharacter, sample identifier
vial_labeldouble, sample identifier
2D_barcodedouble, sample barcode
Speciescharacter, species
BIDinteger, biospecimen ID
PIDdouble, participant ID, one per animal
Tissuecharacter, tissue description
Sample_categorycharacter, study sample ("study") or reference standard ("ref)
GET_sitecharacter, which Genomics, Epigenomics, and Transcriptomics (GET) site performed the assay, "Stanford" or "MSSM" (Icahn School of Medicine at Mount Sinai)
DNA_extr_plate_IDinteger, DNA extraction plate ID
DNA_extr_datecharacter, DNA extraction date
DNA_extr_protocolcharacter, DNA extraction protocol
DNA_extr_robotcharacter, robot used for DNA extraction
DNA_concdouble, DNA concentration (ng/uL)
A280/260double, 280/260 ratio
A260/230double, 260/230 ratio
Lib_prep_datecharacter, library preparation date
Lib_DNA_massdouble, DNA mass used for library prep (ng)
Lib_DNA_voldouble, volume of the library (uL)
lambda_DNA_massdouble, spiked-in Lambda DNA mass (ng)
Lib_robotcharacter, robot used for library prep
Lib_kit_vendorcharacter, library prep vendor
Lib_kit_typecharacter, library prep kit
Lib_kit_IDcharacter, library kit ID
Lib_batch_IDcharacter, library prep batch ID
Lib_index_1character, i7 index
Lib_index_2logical, i5 index
Lib_adapter_1character, Trueseq i7 index with 16 bp index
Lib_adapter_2character, include customized Metseq primer
Lib_UMI_cycle_numinteger, number of bases of UMI
Lib_adapter_sizeinteger, the total size of the two adapters
Lib_concdouble, DNA concentration for the library (ng/uL)
Lib_frag_sizeinteger, average library fragment size
Lib_molaritydouble, library molarity (nM)
Seq_platformcharacter, sequencing platform
Seq_dateinteger, sequencing date
Seq_machine_IDcharacter, serial number of the sequencing machine
Seq_flowcell_IDcharacter, flow cell ID
Seq_flowcell_runinteger, flow cell run
Seq_flowcell_lanecharacter, flow cell lane
Seq_flowcell_typecharacter, flow cell type, e.g., "S4"
Seq_lengthinteger, read length
Seq_end_typeinteger, 1=single-end, 2=paired-end
reads_rawinteger, number of raw read pairs
pct_adapter_detecteddouble, percent of reads with adapter detected
pct_trimmeddouble, percent of trimmed reads from the initial trimming
pct_no_MSPIdouble, percent of reads with no MSPI present among the trimmed reads
pct_trimmed_basesdouble, percent of bases that were trimmed
pct_removeddouble, percent of reads that were removed due to adapter trimming or no presence of MSPI
readsinteger, number of read pairs in the trimmed FASTQ files
pct_GCdouble, percent GC content in the trimmed FASTQ files
pct_dup_sequencedouble, percent of duplicated sequences in trimmed FASTQ files
pct_phixdouble, percent of phix reads in trimmed FASTQ files
pct_chrXdouble, percent of reads mapped to chromosome X
pct_chrYdouble, percent of reads mapped to chromosome Y
pct_chrMdouble, percent of reads mapped to the mitochondrial genome
pct_chrAutodouble, percent of reads mapped to autosomal chromosomes
pct_contigdouble, percent of reads mapped to contigs
pct_Uniqdouble, percent of uniquely mapped reads
pct_Unaligneddouble, percent of unaligned reads
pct_Ambidouble, percent of ambiguously mapped reads
pct_OTdouble, percent of mapped reads aligned to the original top stand
pct_OBdouble, percent of mapped reads aligned to the original bottom stand
pct_CTOTdouble, percent of mapped reads aligned to the complementary to original top strand
pct_CTOBdouble, percent of mapped reads aligned to the complementary to original bottom strand
pct_umi_dupdouble, PCR duplication rate assessed using UMIs (Unique Molecular Identifiers)
pct_CpGdouble, global CpG methylation level based on the deduplicated data
pct_CHGdouble, global CHG methylation level based on the deduplicated data
pct_CHHdouble, global CHH methylation level based on the deduplicated data
lambda_pct_Uniqdouble, percent of uniquely mapped reads to lambda
lambda_pct_Ambidouble, percent of ambiguously mapped reads to lambda
lambda_pct_umi_dupdouble, PCR duplication rate assessed using UMIs (Unique Molecular Identifiers)
lambda_pct_CpGdouble, global CpG methylation level based on the deduplicated data
lambda_pct_CHGdouble, global CHG methylation level based on the deduplicated data
lambda_pct_CHHdouble, global CHH methylation level based on the deduplicated data
Seq_batchcharacter, unique identifier for sequencing batch