Question: How do I obtain gene names from ENSG values?
0
gravatar for nrickett
13 months ago by
nrickett0
United Kingdom
nrickett0 wrote:

Hi

I have performed DGE using DESEQ2 (input htseq-count, following mapping by BWA-MEM) and have ended up with a list of differentially expressed genes in ENSG format. I was hoping there is a way to convert these into the actual human gene names.

Thanks very much for any help.

rna-seq galaxy • 1.1k views
ADD COMMENTlink modified 13 months ago by Jennifer Hillman Jackson25k • written 13 months ago by nrickett0
1
gravatar for Jennifer Hillman Jackson
13 months ago by
United States
Jennifer Hillman Jackson25k wrote:

Hello,

Three options:

1 - Use a reference annotation dataset in the analysis that includes the target gene names as an attribute. This is "gene_name" if using Cuffdiff and a GTF file; some other tools require gene_id (GTF) or Gene (if in GFF3 format). iGenomes is a good source if your genome/version is included (down the tar file locally, unpack it, then just upload the genes.gtf dataset to Galaxy).

2 - Find a file that maps the two IDs and use the tool Join two Datasets side by side on a specified field. The UCSC tables named like "xref" are one choice (if your genome is supported there): http://genome.ucsc.edu

3 - This will not work for your case, but for others wanting to map IDs, the tool UniProt ID mapping and retrieval is often helpful.

Hope that helps! Jen, Galaxy team

ADD COMMENTlink written 13 months ago by Jennifer Hillman Jackson25k
Please log in to add an answer.

Help
Access

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.
Powered by Biostar version 16.09
Traffic: 172 users visited in the last hour