Seqanswers Leaderboard Ad

Collapse

Announcement

Collapse
No announcement yet.
X
 
  • Filter
  • Time
  • Show
Clear All
new posts

  • cuffdiff complains SAM file being not sorted, when I am sure they are right.

    Hello all,

    I am running cuffdiff with different gtf files but with the same set of sam files.
    For example, they look like this.
    cuffdiff -b hg19.fa --max-bundle-frags 4000000 -o lincRNAs lincRNAs.combined.gtf a.sam b.sam c.sam
    cuffdiff -b hg19.fa --max-bundle-frags 4000000 -o codingGenes coding.txt.gtf a.sam b.sam c.sam, where a.sam, b.sam, and c.sam are from different tissues.

    In running them, I have successfully run the latter, but I got the following error for the former:
    --------------------------------------------------------------------------------
    [08:24:24] Loading reference annotation.
    [08:24:24] Inspecting maps and determining fragment length distributions.

    Error: this SAM file doesn't appear to be correctly sorted!
    current hit is at chrX:61774, last one was at chrM:16495
    Cufflinks requires that if your file has SQ records in
    the SAM header that they appear in the same order as the chromosomes names
    in the alignments.
    If there are no SQ records in the header, or if the header is missing,
    the alignments must be sorted lexicographically by chromsome
    name and by position.
    --------------------------------------------------------------------------------

    But the thing is sam files cannot be wrong for the former running, because I am using the same set of the sam files. As it happens in loading reference annotation, I am thinking this might be related to lincRNAs.combined.gtf in the example, but actually this file is a result of cuffcompare (as reflected in the name), another package in cuff serires tools. So, I cannot think this file is incompatible with cuffdiff.

    I am wondering where I have to look into to address this issue.

    Thanks a lot for your advices in advance.
    HJ.

  • #2
    The error message sounds pretty straightforward, is it incorrect about the order of your headers?

    Try using Picard to sort, if you aren't already.

    Comment

    Latest Articles

    Collapse

    • seqadmin
      Genetic Variation in Immunogenetics and Antibody Diversity
      by seqadmin



      The field of immunogenetics explores how genetic variations influence immune responses and susceptibility to disease. In a recent SEQanswers webinar, Oscar Rodriguez, Ph.D., Postdoctoral Researcher at the University of Louisville, and Ruben Martínez Barricarte, Ph.D., Assistant Professor of Medicine at Vanderbilt University, shared recent advancements in immunogenetics. This article discusses their research on genetic variation in antibody loci, antibody production processes,...
      11-06-2024, 07:24 PM
    • seqadmin
      Choosing Between NGS and qPCR
      by seqadmin



      Next-generation sequencing (NGS) and quantitative polymerase chain reaction (qPCR) are essential techniques for investigating the genome, transcriptome, and epigenome. In many cases, choosing the appropriate technique is straightforward, but in others, it can be more challenging to determine the most effective option. A simple distinction is that smaller, more focused projects are typically better suited for qPCR, while larger, more complex datasets benefit from NGS. However,...
      10-18-2024, 07:11 AM

    ad_right_rmr

    Collapse

    News

    Collapse

    Topics Statistics Last Post
    Started by seqadmin, Today, 11:09 AM
    0 responses
    22 views
    0 likes
    Last Post seqadmin  
    Started by seqadmin, Today, 06:13 AM
    0 responses
    20 views
    0 likes
    Last Post seqadmin  
    Started by seqadmin, 11-01-2024, 06:09 AM
    0 responses
    30 views
    0 likes
    Last Post seqadmin  
    Started by seqadmin, 10-30-2024, 05:31 AM
    0 responses
    21 views
    0 likes
    Last Post seqadmin  
    Working...
    X