ATAC-seq feature annotation
Format
A data frame with 1209773 rows and 16 variables:
assay: character, assay abbreviation, one of ASSAY_ABBREV
assay_code: character, assay code used in data release. See MotrpacBicQC::assay_codes.
feature_ID: character, MoTrPAC feature identifier
chrom: character, chromosome: 1-20, X, or Y
start: double, base pair of feature start
end: double, base pair of feature end
width: integer, width of feature in base pairs
chipseeker_annotation: character, annotation from ChIPseeker::annotatePeak()
custom_annotation: character, a version of the ChIPseeker annotations with many corrections. Values include: "Distal Intergenic", "Promoter (<=1kb)", "Exon", "Promoter (1-2kb)", "Downstream (<5kb)", "Upstream (<5kb)", "5' UTR", "Intron", "3' UTR", "Overlaps Gene", where "Overlaps Gene" means the feature has a non-zero overlap with either the start or end of the gene but was not otherwise assigned an annotation.
distanceToTSS: double, minimum distance from either end of the feature to the transcription start site
relationship_to_gene: double, distance from the closest edge of the feature to the start or end of the closest gene, whichever is closer. A value of 0 means there is non-zero overlap between the feature and the gene. A negative value means the feature is upstream of geneStart. A positive value means the feature is downstream of geneEnd. Note that geneStart and geneEnd are strand-agnostic, i.e. geneStart is always less than geneEnd, even if the gene is on the negative strand (geneStrand == 2).
ensembl_gene: character, Ensembl gene ID from release 96 of the Rattus norvegicus gene annotation
geneStart: integer, base pair start of gene; strand-agnostic, meaning geneStart is always less than geneEnd
geneEnd: integer, base pair end of gene; strand-agnostic, meaning geneStart is always less than geneEnd
geneLength: integer, length of gene in base pairs
geneStrand: integer, 1 (forward strand) or 2 (reverse strand)
Details
ATAC feature annotation is only available via download from Google Cloud Storage:
https://storage.googleapis.com/motrpac-rat-training-6mo-extdata/epigen-rda/ATAC_FEATURE_ANNOT.rda.
You can use MotrpacRatTraining6mo::load_atac_feature_annotation() to download and return this file.
This table was generated using MotrpacRatTraining6mo::get_peak_annotations().
relationship_to_gene is the shortest distance between the feature and the start or end of the closest gene.
It is 0 if the feature has any overlap with the gene.
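The sign convention for relationship_to_gene can be illustrated with a minimal sketch in base R. This is not the code used by MotrpacRatTraining6mo::get_peak_annotations(). The helper name relationship_to_gene_sketch and the toy coordinates are hypothetical, and the sketch assumes that start, end, geneStart, and geneEnd are inclusive coordinates on the same chromosome.

```r
# Hypothetical helper illustrating the documented sign convention.
# Assumes inclusive coordinates and strand-agnostic gene bounds
# (geneStart < geneEnd), as described for this table.
relationship_to_gene_sketch <- function(start, end, geneStart, geneEnd) {
  ifelse(
    end < geneStart,
    end - geneStart,          # feature lies before geneStart: negative
    ifelse(
      start > geneEnd,
      start - geneEnd,        # feature lies after geneEnd: positive
      0                       # any overlap with the gene: zero
    )
  )
}

# Toy features around a made-up gene spanning 500-900
toy <- data.frame(
  start     = c(100, 950, 450, 600),
  end       = c(200, 1000, 550, 700),
  geneStart = c(500L, 500L, 500L, 500L),
  geneEnd   = c(900L, 900L, 900L, 900L)
)

toy$relationship_to_gene <- with(
  toy,
  relationship_to_gene_sketch(start, end, geneStart, geneEnd)
)
print(toy)
```

In this toy table the first feature ends 300 bp before geneStart and gets -300, the second starts 50 bp after geneEnd and gets 50, and the last two overlap the gene and get 0.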
custom_annotation fixes many issues with the ChIPseeker annotation (v1.22.1).