Seqanswers Leaderboard Ad

Collapse

Announcement

Collapse
No announcement yet.
X
 
  • Filter
  • Time
  • Show
Clear All
new posts

  • NOTEST on Cuffdiff output

    I am new to the Tuxedo Suite and decided to start by going through the workflow in the Nature Protocols publication. All seemed to be going well until I got to the point of differential expression analysis with Cuffdiff. There, the output file gene_exp.diff had NOTEST in the status column for almost all genes. I looked at the Cufflinks manual, which indicated that this output can result from not having enough alignments for testing in any given gene. I then used Samtools to be sure the .bam files that were generated by TopHat had a number of mapped reads that was comparable to the Nature Protocols example. The results, for all accepted_hits.bam files suggested that the number of mapped reads was very similar to what was expected according to the protocol. An example is below:

    $ samtools idxstats C2_R3_thout/accepted_hits.bam

    2L 23011544 4591000 0
    2R 21146708 5029664 0
    3L 24543557 4112826 0
    3R 27905053 5317642 0
    4 1351857 194612 0
    M 19517 0 0
    X 22422827 4157222 0

    I then tried to use the Cuffdiff option -c, to reduce the minimum number of alignments needed at a locus from 10 to 5, but the resulting gene_exp.diff file still had NOTEST in the vast majority of genes.

    In case it helps, the Cuffdiff command that I used was as follows:

    $ cuffdiff -o diff_out -b ./Drosophila_melanogaster/Ensembl/BDGP5.25/Sequence/Bowtie2Index/genome.fa -p 8 -L C1,C2 -u merged_asm/merged.gtf ./C1_R1_thout/accepted_hits.bam,./C1_R2_thout/accepted_hits.bam,./C1_R3_thout/accepted_hits.bam ./C2_R1_thout/accepted_hits.bam,./C2_R2_thout/accepted_hits.bam,./C2_R3_thout/accepted_hits.bam

    If any additional information is necessary for you to help me troubleshoot this issue, please let me know.

  • #2
    I got the similar issue here. Any idea?

    Comment

    Latest Articles

    Collapse

    • seqadmin
      Current Approaches to Protein Sequencing
      by seqadmin


      Proteins are often described as the workhorses of the cell, and identifying their sequences is key to understanding their role in biological processes and disease. Currently, the most common technique used to determine protein sequences is mass spectrometry. While still a valuable tool, mass spectrometry faces several limitations and requires a highly experienced scientist familiar with the equipment to operate it. Additionally, other proteomic methods, like affinity assays, are constrained...
      04-04-2024, 04:25 PM
    • seqadmin
      Strategies for Sequencing Challenging Samples
      by seqadmin


      Despite advancements in sequencing platforms and related sample preparation technologies, certain sample types continue to present significant challenges that can compromise sequencing results. Pedro Echave, Senior Manager of the Global Business Segment at Revvity, explained that the success of a sequencing experiment ultimately depends on the amount and integrity of the nucleic acid template (RNA or DNA) obtained from a sample. “The better the quality of the nucleic acid isolated...
      03-22-2024, 06:39 AM

    ad_right_rmr

    Collapse

    News

    Collapse

    Topics Statistics Last Post
    Started by seqadmin, 04-11-2024, 12:08 PM
    0 responses
    17 views
    0 likes
    Last Post seqadmin  
    Started by seqadmin, 04-10-2024, 10:19 PM
    0 responses
    21 views
    0 likes
    Last Post seqadmin  
    Started by seqadmin, 04-10-2024, 09:21 AM
    0 responses
    16 views
    0 likes
    Last Post seqadmin  
    Started by seqadmin, 04-04-2024, 09:00 AM
    0 responses
    46 views
    0 likes
    Last Post seqadmin  
    Working...
    X