Seqanswers Leaderboard Ad

**Michael.Ante** · 04-30-2014, 04:35 AM

Cufflinks might have a problem, if your data inheres a 5' or a 3' bias.
Did you supplied the inner mate pair distance as parameters to Cufflinks? You can derive it from the library QC plots (Bioanalyzer, Tapestation, etc.).

The easiest sanity check is to view your data in a genome browser (e.g. IGV) and have a look at your mark gene.

Cheers,
Michael

**tinkering** · 04-30-2014, 09:19 AM

Thank you Michael. Yes, IGV indicates that mark gene is highly expressed, over 4000 reads. Right now I guess it is the problem of a default configuration of cufflinks. Cufflinks/cuffdiff etc have a maximum number of fragments that can fall within a locus. If a locus has more than this maximum, it is skipped. The threshold is configurable via the --max-bundle-frags option.

I will check if that gene will be picked up after increasing the --max-bundle-frags.

Chan

**Michael.Ante** · 05-01-2014, 11:49 PM

Hi Chan,

I fear, that this is not the crucial point. Per default, Cufflinks' and Cuffdiff's parameter max-bundle-frags is set to 1,000,000 fragments per locus.

Here are a view checks you can make to pin-point the problem:
Compare Cufflinks' estimated inner-mat-pair distance from the log-files with the library size distribution. Denote, that you add to the "inner-mat-pair distance" the length of both reads and the adapter length.

Compare a view highly abundant genes from Cufflinks' output with the IGV browser or the actual read count of these loci.

Use a small subset of your data to run the Tuxedo-pipeline with only the read 1 set. And compare the mark gene's abundance.

Use RSeQC to check your alignment for the "read coverage over gene body". It'll give you an hint for coverage biases, which might confuse Cufflinks.

**tinkering** · 05-03-2014, 05:37 PM

Thanks!

After setting the --max-bundle-frags parameters as 10,000,000, the mark gene was assembled by cufflinks. I checked the expression abundance in IGV with big-wig files, the number ranges from 3000-4000+. That mark gene has 6000nt of CDS. That means > 1,000,000 reads mapping to that gene, so if using the default value of "--max-bundle-frags", that mark gene will be skipped by cufflinks.

Topics	Statistics	Last Post
Gene Misexpression in the Healthy Human Population by seqadmin Started by seqadmin, 07-25-2024, 06:46 AM	0 responses 9 views 0 likes	Last Post by seqadmin 07-25-2024, 06:46 AM
New Method for Rapid Genetic Diagnosis of Mendelian Disorders by seqadmin Started by seqadmin, 07-24-2024, 11:09 AM	0 responses 26 views 0 likes	Last Post by seqadmin 07-24-2024, 11:09 AM
Advancing Nanopore Technology for Portable Sensing Devices by seqadmin Started by seqadmin, 07-19-2024, 07:20 AM	0 responses 160 views 0 likes	Last Post by seqadmin 07-19-2024, 07:20 AM
New RNA-Based Gene Writing Technology Achieves Precise Gene Integration by seqadmin Started by seqadmin, 07-16-2024, 05:49 AM	0 responses 127 views 0 likes	Last Post by seqadmin 07-16-2024, 05:49 AM

Seqanswers Leaderboard Ad

Announcement

Cufflinks did not assembly a mark-gene ! Any solution?

Comment

Comment

Comment

Comment

Latest Articles

ad_right_rmr

News