Mapping between various rat and human gene identifiers for orthologs as reported by RGD (2020-01-10).
Format
A data frame with 21461 rows and 12 variables:
RAT_SYMBOL
character, official rat gene symbol
NAME
character, gene name
RAT_GENE_RGD_ID
integer, rat RGD gene ID
RAT_NCBI_GENE_ID
integer, rat NCBI/Entrez gene ID
RAT_ENSEMBL_ID
character, rat Ensembl gene ID
RAT_UNIPROT_ID
character, semicolon-separated list of rat UniProt gene IDs
RAT_OLD_SYMBOL
character, semicolon-separated list of rat deprecated or alias gene symbols
HUMAN_ORTHOLOG_SYMBOL
character, human official gene symbol
HUMAN_ORTHOLOG_RGD
character, human RGD gene ID
HUMAN_ORTHOLOG_NCBI_GENE_ID
character, human NCBI/Entrez gene ID
HUMAN_ORTHOLOG_ENSEMBL_ID
character, human Ensembl gene ID
HUMAN_ORTHOLOG_SOURCE
character, human ortholog source
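Because \code{RAT_UNIPROT_ID} and \code{RAT_OLD_SYMBOL} hold semicolon-separated lists, they usually need to be split before matching on individual identifiers. The following is a minimal sketch of one way to do that in base R. The vector \code{uniprot} and the accessions in it are made-up placeholders, not values taken from the table.

```r
# Hypothetical values imitating a semicolon-separated column such as
# RAT_UNIPROT_ID; these are placeholders, not real entries from the table.
uniprot <- c("P00001;Q00002", "P00003", NA)

# Split each entry on the literal ";" separator.
# The result is a list with one character vector per row.
ids <- strsplit(uniprot, ";", fixed = TRUE)

# Number of elements per row (a missing entry stays a single NA)
lengths(ids)
```

Using \code{fixed = TRUE} treats the separator as a literal string rather than a regular expression.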
Source
<https://ftp.ebi.ac.uk/pub/databases/gencode/Gencode_human/release_39/gencode.v39.metadata.EntrezGene.gz>
<https://ftp.ebi.ac.uk/pub/databases/gencode/Gencode_human/release_39/gencode.v39.metadata.HGNC.gz>
<https://ftp.ebi.ac.uk/pub/databases/gencode/Gencode_human/release_39/gencode.v39.chr_patch_hapl_scaff.basic.annotation.gtf.gz>
<https://download.rgd.mcw.edu/pub/data_release/RGD_ORTHOLOGS.txt> (2020-01-10)
<https://download.rgd.mcw.edu/data_release/RAT/GENES_RAT.txt> (2020-11-15)
Details
This map was compiled from several external sources. GENCODE metadata and annotation files were used to map between human Ensembl transcript IDs, Entrez IDs, GENCODE IDs, and Ensembl gene IDs (Frankish et al., 2021). RGD files were used to map between human and rat gene symbols as well as between various rat gene identifiers (Smith et al., 2020).
There is one row per unique combination of \code{RAT_SYMBOL}, \code{RAT_GENE_RGD_ID}, \code{RAT_NCBI_GENE_ID}, \code{RAT_ENSEMBL_ID},
\code{RAT_UNIPROT_ID}, \code{HUMAN_ORTHOLOG_SYMBOL}, \code{HUMAN_ORTHOLOG_RGD}, \code{HUMAN_ORTHOLOG_NCBI_GENE_ID}, and \code{HUMAN_ORTHOLOG_ENSEMBL_ID},
so some genes correspond to multiple rows. For example, there are 24 rows corresponding to \code{RAT_SYMBOL == "Tnf"} because 3 \code{RAT_ENSEMBL_ID}s correspond to Tnf and 8 \code{HUMAN_ORTHOLOG_ENSEMBL_ID}s correspond to TNF.
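The row multiplication described above can be illustrated with a small stand-in table. This is only a sketch: \code{toy} is a hypothetical data frame built here for illustration, and its Ensembl IDs are placeholders rather than real identifiers. It shows how 3 rat IDs crossed with 8 human IDs produce 24 rows, and how to collapse them when only the symbol-level mapping is needed.

```r
# Toy stand-in for the ortholog table (placeholder IDs, not real data):
# every combination of 3 rat Ensembl IDs and 8 human Ensembl IDs.
toy <- expand.grid(
  RAT_ENSEMBL_ID = paste0("RAT_ID_", 1:3),
  HUMAN_ORTHOLOG_ENSEMBL_ID = paste0("HUMAN_ID_", 1:8),
  stringsAsFactors = FALSE
)
toy$RAT_SYMBOL <- "Tnf"
toy$HUMAN_ORTHOLOG_SYMBOL <- "TNF"

# One row per unique ID combination: 3 * 8 rows for a single gene
n_rows <- nrow(toy)

# Keep only the symbol columns and drop duplicates to get
# a one-row-per-gene-pair mapping
symbol_map <- unique(toy[, c("RAT_SYMBOL", "HUMAN_ORTHOLOG_SYMBOL")])
n_pairs <- nrow(symbol_map)
```

Selecting only the columns needed for a given analysis before calling \code{unique()} avoids counting the same gene pair more than once.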