Seqanswers Leaderboard Ad

**lethalfang** · 10-23-2013, 02:27 PM

Originally posted by gevielr View Post

I'm looking for a script that I could use to remove all homopolymer reads from my 100 bp PE reads. These are unusually overabundant in my sample.

I can't seem to find a script that will do this. I've tried to use fastx_clipper, but defining the adapter as a homopolymer, and that didn't work (defined adaptor too long?).

Any ideas?

I wrote a script in python3 that does that. You may find that useful. If you're interested, email me [email protected]

**gevielr** · 10-23-2013, 03:38 PM

Great, thanks!! I'll shoot you an email.

**Wallysb01** · 10-23-2013, 04:00 PM

You could also use dust via prinseq. You might have to play around with the scoring to get a sense of the type of reads you're losing, but it should become pretty clear when you're dumping only very low complexity stuff, like say trinucleotide repeats and shorter.

**bfantinatti** · 07-31-2014, 10:37 AM

Dust

I am using DUST to perform this task.
But DUST do not remove the reads with low complexity. It put the low complexity bases in lowercase.
I don't know how to remove those lowercase reads after running DUST.

How do you guys do to remove those reads after running DUST?

Topics	Statistics	Last Post
Expanding the Horizons of Cellular Research with the Single Cell Atlas by seqadmin Started by seqadmin, Today, 11:49 AM	0 responses 11 views 0 likes	Last Post by seqadmin Today, 11:49 AM
Genetic Variants and Diabetes Risk in Childhood Cancer Survivors by seqadmin Started by seqadmin, Yesterday, 08:47 AM	0 responses 16 views 0 likes	Last Post by seqadmin Yesterday, 08:47 AM
Cancer Metastasis: A Deep Dive into Cellular Plasticity by seqadmin Started by seqadmin, 04-11-2024, 12:08 PM	0 responses 61 views 0 likes	Last Post by seqadmin 04-11-2024, 12:08 PM
Proteogenomic Profiles Offer New Clues in Prostate Cancer by seqadmin Started by seqadmin, 04-10-2024, 10:19 PM	0 responses 60 views 0 likes	Last Post by seqadmin 04-10-2024, 10:19 PM

Seqanswers Leaderboard Ad

Announcement

discarding homopolymer reads

Comment

Comment

Comment

Comment

Latest Articles

ad_right_rmr

News