RNA-seq experimental and quantification QC metrics for transcriptomic (TRNSCRPT) data
Format
A data frame with 935 rows and 82 variables:
viallabelcharacter, sample identifier
vial_labeldouble, sample identifier
2D_barcodedouble, sample barcode
Speciescharacter, species
BIDinteger, biospecimen ID
PIDdouble, participant ID, one per animal
Tissuecharacter, tissue description
Sample_categorycharacter, study sample ("study") or reference standard ("ref)
GET_sitecharacter, which Genomics, Epigenomics, and Transcriptomics (GET) site performed the assay, "Stanford" or "MSSM" (Icahn School of Medicine at Mount Sinai)
RNA_extr_plate_IDcharacter, RNA extraction plate ID
RNA_extr_datecharacter, RNA extraction date
RNA_extr_concdouble, RNA concentration (ng/uL)
RINdouble, RNA Integrity Number
r_260_280double, 260/280 ratio
r_260_230double 260/230 ratio
Lib_prep_datecharacter, library preparation date in MM/DD/YYYY format
Lib_RNA_concdouble, RNA concentration used for library prep (ng/uL)
Lib_RNA_volinteger, RNA volume used for library prep (uL)
Lib_robotcharacter, robot used for library prep
Lib_vendorcharacter, library prep vendor
Lib_typecharacter, library prep type
Lib_kit_idcharacter, library prep kit ID
Lib_batch_IDcharacter, library prep batch ID that distinguished different sample processing batches
Lib_barcode_wellcharacter, well
Lib_index_1character, i7 index
Lib_index_2character, i5 index
Lib_adapter_1character, Truseq I7 index with 16bp index
Lib_adapter_2character, Truseq I5 index with 8bp index
Lib_UMI_cycle_numinteger, number of bases of UMI
Lib_adapter_sizeinteger, total size of the two adapters
Lib_frag_sizeinteger, average library fragment size (bp)
Lib_DNA_concdouble, DNA concentration of original stock of the library (ng/uL)
Lib_molaritydouble, library molarity (nM)
Seq_platformcharacter, sequencing platform
Seq_dateinteger, sequencing date, YYMMDD format
Seq_machine_IDcharacter, serial number of the sequencer
Seq_flowcell_IDcharacter, flow cell ID
Seq_flowcell_runinteger, flow cell run
Seq_flowcell_lanecharacter, flow cell lane
Seq_flowcell_typecharacter, flow cell type, e.g., S4
Seq_lengthinteger, read length
Seq_end_typeinteger, 1=single-end, 2=paired-end
Phasecharacter, study phase, "PASS1B-06"
Seq_batchcharacter, unique identifier for sequencing batch
reads_rawdouble, number of read pairs in the raw FASTQ
pct_adapter_detecteddouble, percent of reads with adapter detected
pct_trimmeddouble, percent of reads that were trimmed
pct_trimmed_basesdouble, percent of bases that were trimmed
readsdouble, number of read pairs in the trimmed FASTQ files
pct_GCdouble, percent GC content in trimmed FASTQ files
pct_dup_sequencedouble, percent of duplicated sequences in trimmed FASTQ files
pct_rRNAdouble, percent of rRNA reads in trimmed FASTQ files
pct_globindouble, percent of globin reads in trimmed FASTQ files
pct_phixdouble, percent of phix reads in trimmed FASTQ files
pct_picard_dupdouble, PCR duplication assessed by Picard’s tool MarkDuplicate
pct_umi_dupdouble, PCR duplication rate assessed using UMIs (Unique Molecular Identifiers)
avg_input_read_lengthdouble, average input read length
uniquely_mappeddouble, number of uniquely mapped reads
pct_uniquely_mappeddouble, percent of uniquely mapped reads
avg_mapped_read_lengthdouble, average input mapped length
num_splicesdouble, number of splices
num_annotated_splicesdouble, number of annotated splices
num_GTAG_splicesdouble, number of GT/AG and CT/AC splices
num_GCAG_splicesdouble, number of GC/AG and CT/GC splices
num_ATAC_splicesdouble, number of AT/AC and GT/TA splices
num_noncanonical_splicesdouble, number of non-canonical splices
pct_multimappeddouble, percent of reads that multimapped
pct_multimapped_toomanydouble, percent of reads that multimapped too many times
pct_unmapped_mismatchesdouble, percent of unmapped reads due to mismatches
pct_unmapped_tooshortdouble, percent of unmapped reads due to shortness
pct_unmapped_otherdouble, percent of unmapped reads for other reason
pct_chimericdouble, percent chimeric reads
pct_chrXdouble, percent of reads mapped to chromosome X
pct_chrYdouble, percent of reads mapped to chromosome Y
pct_chrMdouble, percent of reads mapped to the mitochondrial genome
pct_chrAutodouble, percent of reads mapped to autosomal chromosomes
pct_contigdouble, percent of reads mapped to contigs
pct_codingdouble, percent of bases mapped to coding
pct_utrdouble, percent of bases mapped to untranslated region
pct_intronicdouble, percent of bases mapped to introns
pct_intergenicdouble, percent of bases mapped to intergenic
pct_mrnadouble, percent of bases mapped to mRNA
median_5_3_biasdouble, median 5' to 3' bias