Question: How To Filter The Sequences Containing Not[Atcg] Character?
    
    0
        
师云 • 110 wrote:
Hi Jen,As the title, I have a [fasta] file that obtained from a [gtf]
file,
and I want to get the output like this to filter the sequences that
contain any not[ATCG] character?
I have a large of sequences to filter. I thought a way that firstly
convert the file to [interval] file, and secondly SELECT the line not
matching the patten /\t[ATCGatcg]*[^ATCGatcg]/.Am I right? Or there is
a one-step way ?
            
    ADD COMMENT
    • link
    
    
        
        •
        
            
            
        
    
    
            
            modified 5.0 years ago
            
                by 
Jennifer Hillman Jackson ♦ 25k
            
            •
        
        written 
        5.0 years ago by 
师云 • 110
    
            