Question: Identifying Tags - Galaxy Question
0
D. A. Cowart • 30 wrote:
Hello,
I need to perform an action (or series of actions) on an 454 dataset
using
Galaxy, and have not been able to figure out the necessary steps, even
after looking through the toolbar expressions and using custom search.
My file is a fasta and has the standard format:
CTGAGTCAGGTCAACAATCATAAGATATTGGCACCATGTACCTGTGGTTCTCGTTTCC
ATGTTA
CTGAGTCAGGTCAACAATCATAAGACATCGGCTCTCTATATTTAATATTGGT
Each of the 100,000 sequences within this file contains a specific
tag,
which is the first 8 nucleotides.
There are 19 tags total. I would like to identify these tags and add
an
identifier of the tag to the sequence name.
Therefore, if I am looking for the first tag (CTGAGTCA), the output
would
look like:
*CTGAGTCA*GGTCAACAATCATAAGATATTGGCACCATGTACCTGTGGTTCTCGTTTCC
ATGTTA
Is it possible to achieve this using Galaxy? If possible, could you
kindly
suggest tools to use.
Thank you in advance,
Dominique Cowart
ADD COMMENT
• link
•
modified 5.2 years ago
by
Jennifer Hillman Jackson ♦ 25k
•
written
5.2 years ago by
D. A. Cowart • 30