**dnusol** · 01-26-2011, 06:52 AM

Hi, I found this useful page about this issue.

9th Discussion-28 October 2010 - BioWiki

http://bioinfo-core.org/index.php/9th_Discussion-28_October_2010

HTH

Dave

**frymor** · 01-26-2011, 10:35 PM

Thanks for the tip.
It is a good page that summarizes the different plots produced by FastQC, something a lot of people were looking for in a different thread.

BTW, can anyone tell me a good way to remove duplicate reads from the equation?
Running FastQC, I get a lot of duplicated reads (see attachment).

As I am looking for differentially regulated genes, I am not sure whether I should exclude the duplicated reads or not, but I would like to try and see what I get when doing so.

Q: Can anyone tell me how to filter duplicated reads from the SAM files, or from the FASTQ files before the Bowtie run?

Q: When going for differential expression, is it right to exclude the duplicates as well, or do I need to keep them?

Thanks

Assa

Attached Files

duplication_levels.pdf (42.6 KB, 48 views)

**jp.** · 12-06-2013, 12:18 AM

Did you get an answer?
If so, would you like to share it here?
Thank you.

**frymor** · 12-06-2013, 01:00 AM

No, I didn't get any response to the questions I posted.

I am not sure, though, how important the duplication rate is at this step. I'm using tophat2 with the option to exclude all duplicated reads, so I am not worried about duplication in the original FASTQ file.

I hope I am thinking in the right direction.

**anamika** · 12-06-2013, 01:42 AM

Sangenix

SangeniX: A comprehensive, automated, scalable and user friendly NGS data analysis suite

Sangenix has a module for duplicate removal.

Give it a try : http://www.sangenix.com/

**frymor** · 12-06-2013, 01:44 AM

Let me know when it is freeware.

**vineet jha** · 12-06-2013, 02:00 AM

Sangenix

A beta version is available. You can contact us via the contact page at http://www.sangenix.com/contactus.aspx

**dpryan** · 12-06-2013, 02:03 AM

Removing the duplicates could be done with the samtools rmdup command (you could alternatively use MarkDuplicates from Picard). This is generally not needed for RNA-seq, since a certain amount of duplication is both expected and desired for highly expressed genes (i.e., many or most of these probably aren't PCR duplicates).
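
To make the idea of position-based duplicate removal concrete, here is a minimal Python sketch. It is only an illustration of the concept, not how samtools or Picard actually implement it. The function name `remove_position_duplicates` is made up for this example, and it assumes single-end reads in plain SAM text, treats reads with the same reference, leftmost position, and strand as duplicates, and keeps the one with the highest MAPQ. Real tools are more careful (e.g., about clipping and paired-end reads).

```python
from typing import Dict, Iterable, List, Tuple


def remove_position_duplicates(sam_lines: Iterable[str]) -> List[str]:
    """Keep one alignment per (reference, position, strand).

    Simplified illustration: single-end reads only, leftmost POS is used
    as-is (no adjustment for soft clipping), and the read with the highest
    MAPQ wins. Headers come first, then kept alignments in order of first
    appearance of their position, then unmapped reads.
    """
    headers: List[str] = []
    unmapped: List[str] = []
    # key -> (mapq, line); dicts preserve first-insertion order
    best: Dict[Tuple[str, int, bool], Tuple[int, str]] = {}

    for raw in sam_lines:
        line = raw.rstrip("\n")
        if not line:
            continue
        if line.startswith("@"):
            headers.append(line)  # header lines pass through untouched
            continue

        fields = line.split("\t")
        flag = int(fields[1])
        if flag & 0x4:
            unmapped.append(line)  # unmapped reads have no position to compare
            continue

        key = (fields[2], int(fields[3]), bool(flag & 0x10))  # RNAME, POS, reverse strand
        mapq = int(fields[4])
        if key not in best or mapq > best[key][0]:
            best[key] = (mapq, line)

    return headers + [line for _, line in best.values()] + unmapped


if __name__ == "__main__":
    example = [
        "@HD\tVN:1.0\tSO:coordinate",
        "r1\t0\tchr1\t100\t30\t4M\t*\t0\t0\tACGT\tIIII",
        "r2\t0\tchr1\t100\t40\t4M\t*\t0\t0\tACGT\tIIII",   # same position as r1, higher MAPQ
        "r3\t16\tchr1\t100\t30\t4M\t*\t0\t0\tACGT\tIIII",  # same position, opposite strand
    ]
    for out_line in remove_position_duplicates(example):
        print(out_line)
```

Note that a filter like this would collapse every read stacked on the same start position, which is exactly why it tends to throw away real signal from highly expressed genes in RNA-seq.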
