Seqanswers Leaderboard Ad

Collapse

Announcement

Collapse
No announcement yet.
X
 
  • Filter
  • Time
  • Show
Clear All
new posts

  • cuffdiff complains SAM file being not sorted, when I am sure they are right.

    Hello all,

    I am running cuffdiff with different gtf files but with the same set of sam files.
    For example, they look like this.
    cuffdiff -b hg19.fa --max-bundle-frags 4000000 -o lincRNAs lincRNAs.combined.gtf a.sam b.sam c.sam
    cuffdiff -b hg19.fa --max-bundle-frags 4000000 -o codingGenes coding.txt.gtf a.sam b.sam c.sam, where a.sam, b.sam, and c.sam are from different tissues.

    In running them, I have successfully run the latter, but I got the following error for the former:
    --------------------------------------------------------------------------------
    [08:24:24] Loading reference annotation.
    [08:24:24] Inspecting maps and determining fragment length distributions.

    Error: this SAM file doesn't appear to be correctly sorted!
    current hit is at chrX:61774, last one was at chrM:16495
    Cufflinks requires that if your file has SQ records in
    the SAM header that they appear in the same order as the chromosomes names
    in the alignments.
    If there are no SQ records in the header, or if the header is missing,
    the alignments must be sorted lexicographically by chromsome
    name and by position.
    --------------------------------------------------------------------------------

    But the thing is sam files cannot be wrong for the former running, because I am using the same set of the sam files. As it happens in loading reference annotation, I am thinking this might be related to lincRNAs.combined.gtf in the example, but actually this file is a result of cuffcompare (as reflected in the name), another package in cuff serires tools. So, I cannot think this file is incompatible with cuffdiff.

    I am wondering where I have to look into to address this issue.

    Thanks a lot for your advices in advance.
    HJ.

  • #2
    The error message sounds pretty straightforward, is it incorrect about the order of your headers?

    Try using Picard to sort, if you aren't already.

    Comment

    Latest Articles

    Collapse

    • seqadmin
      Quality Control Essentials for Next-Generation Sequencing Workflows
      by seqadmin




      Like all molecular biology applications, next-generation sequencing (NGS) workflows require diligent quality control (QC) measures to ensure accurate and reproducible results. Proper QC begins at nucleic acid extraction and continues all the way through to data analysis. This article outlines the key QC steps in an NGS workflow, along with the commonly used tools and techniques.

      Nucleic Acid Quality Control
      Preparing for NGS starts with isolating the...
      02-10-2025, 01:58 PM
    • seqadmin
      An Introduction to the Technologies Transforming Precision Medicine
      by seqadmin


      In recent years, precision medicine has become a major focus for researchers and healthcare professionals. This approach offers personalized treatment and wellness plans by utilizing insights from each person's unique biology and lifestyle to deliver more effective care. Its advancement relies on innovative technologies that enable a deeper understanding of individual variability. In a joint documentary with our colleagues at Biocompare, we examined the foundational principles of precision...
      01-27-2025, 07:46 AM

    ad_right_rmr

    Collapse

    News

    Collapse

    Topics Statistics Last Post
    Started by seqadmin, 02-07-2025, 09:30 AM
    0 responses
    72 views
    0 likes
    Last Post seqadmin  
    Started by seqadmin, 02-05-2025, 10:34 AM
    0 responses
    113 views
    0 likes
    Last Post seqadmin  
    Started by seqadmin, 02-03-2025, 09:07 AM
    0 responses
    87 views
    0 likes
    Last Post seqadmin  
    Started by seqadmin, 01-31-2025, 08:31 AM
    0 responses
    48 views
    0 likes
    Last Post seqadmin  
    Working...
    X