FastQC form setting and usage

Heads up! This is a static archive of our support site. Please go to help.galaxyproject.org if you want to reach the Galaxy community. If you want to search this archive visit the Galaxy Hub search

Latest

Open

RNA-Seq

ChIP-Seq

SNP

Assembly

Forum

Home

Welcome to Galaxy Biostar! User support for Galaxy! about • faq • rss

Log In

Sign Up

Question: FastQC form setting and usage

0

4.2 years ago by

ginna • 0

United States

ginna • 0 wrote:

What does Contaminant list mean? For an example when I select the settings for the FastQC:Read QC tool there is a drop down box and my reference genome is listed there. Should I select it?

forms manuals usage fastqc tools • 1.7k views

ADD COMMENT • link •

modified 4.2 years ago by fubar ♦ 1.1k • written 4.2 years ago by ginna • 0

0

4.2 years ago by

Jennifer Hillman Jackson ♦ 25k

United States

Jennifer Hillman Jackson ♦ 25k wrote:

Hello,

The FastQC manual is linked from the tool form, which is the best source for usage details.

However, I can let you know what I have used this for: screening out known artifact from the analysis. For example, if in a public dataset an earlier run of FastQC revealed an overrepresented sequence that was identified as likely being an adaptor, or if the description of the data contains an adaptor. Use a tabular formatted file: column 1 an identifier, column 2 a nucleotide string. The underlying tool may also accept a fasta file, but not in the Galaxy wrapped version, that I know of.

Hopefully this helps, Jen, Galaxy team

ADD COMMENT • link modified 4.2 years ago • written 4.2 years ago by Jennifer Hillman Jackson ♦ 25k

0

4.2 years ago by

fubar ♦ 1.1k

Australia

fubar ♦ 1.1k wrote:

The help text on the tool form is about all you'll find anywhere but an example with some explanation is here: https://github.com/csf-ngs/fastqc/blob/master/Contaminants/contaminant_list.txt

The choices you see in the fastqc tool are the tabular datasetsl from your local history as defined by the tool xml:

<param name="contaminants" type="data" format="tabular" optional="true" label="Contaminant list" help="tab delimited file with 2 columns: name and sequence. For example: Illumina Small RNA RT Primer CAAGCAGAAGACGGCATACGA"/> </inputs>

Choosing the reference genome as the contaminant sequences list would probably be a very bad idea :)

ADD COMMENT • link written 4.2 years ago by fubar ♦ 1.1k

Please log in to add an answer.

Similar posts • Search »

Tool form input file selection and batch processing methods
If i give more that one data set simultaneously for variant analysis, that will give the variant ...
Trackster Error
Hello, I am trying to look at an output from Tophat using Trackster, but I keep getting the foll...
Incorrect Permissions When Uploading Files
Hello, I think there is a problem with the way permissions are set when uploading files. As admi...
Dynamic Tool Parameter Lists based on an Input File
What is the best way (or any way) to generate dynamic Galaxy tool fields based on data inside a s...
Bowtie2 Select Reference Genome; eliminating false positive variants
Three questions: 1) When specifying a reference genome, what are the advantages and disadvantages...
Data upload to Galaxy Fails
Hi! I am trying to upload a tab-delimited file for LEfSe on Galaxy. However, all my attempts ge...
Tool Config File: Multi-Select For Files?
Dear all, My tool does accept multiple input-files, but there are normally bunches of them, so t...
Problem with Reference genome from history in Tuxedo
Hi, 1- I am using Tuxedo for tomato. I run the workflow from the shared data. In the TopHat sec...
Selecting multiple datasets after filtering
Something changed about the multiple dataset selection control. I used to be able to filter for a...
Problems With Color Settings In Visualizations
Hi, I am working with some saved visualizations, and finding that the color settings are not wor...
Are there maximum Penalty Scores in BWA-MEM?
Apologies if this is a simplistic question, I am new to Galaxy. I am working on a variant callin...
History & Visualization List Empty
Hi there, I've set up a local galaxy installation that so far is working very nicely. However on...
list collection output - format set from input
I have an input, that can be fasta,fastqsanger,fastqillumina: <param name="fastq_input1" typ...
Cuffdiff Question About Using An Unspecified (?) Database/Build
Hello! I have an RNA-Seq project which consists of 5 samples from the species tr...
CuffMerge Fails to accept hg19 human Sequence Data
TopHat 2.1.1 and Cufflinks appear to work properly. After the tool cufflinks, the next tool in m...
Error when running Bismark with pair end/Full parameter list
Hi, When trying to launch a Bismark with bowtie2 I get the following error: Error executing too...
Cutadapt I don't see my list of adapter
Hello, I have a problem with Cutadapt, I would like use a list of adapter, I have copied my list...

Content

Help

About
FAQ

Access

RSS
Stats
API

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.

Powered by Biostar version 16.09

Traffic: 177 users visited in the last hour