Unconfigured Ad

Collapse
X
 
  • Filter
  • Time
  • Show
Clear All
new posts
  • jdjax
    Member
    • Dec 2010
    • 23

    #16
    Originally posted by DZhang View Post
    You should check the header information of your bam file. One way to do it is to convert bam to sam using samtools, then check the top portion of the sam files. (e.g., using 'more your.sam'). Let us know what you see in the header.

    I am away from the campus network, and do not have access to the server right now. When I back on campus I will post that. Thanks for the help.
    jdjax
    Ph.d. Student
    Åarhus University

    Comment

    • jdjax
      Member
      • Dec 2010
      • 23

      #17
      So when looking at the SAM file header line : @HD VN:1.0 SO:coordinate


      I also noticed that the rest of the file does not look uniform. The first line there is '@SQ [tab] SN: value [tab] LN: value'. However the following lines are not the same: '@SQ [tab] SN: value LN:value', where there is no tab between the SN and LN value. Through out the header I see some lines that contain '@SQ [tab] SN: value [tab] LN: value' and '@SQ [tab] SN: value LN:value'.

      I thought SAM files are supposed to be uniform and have a tab between each column? If I am correct, how do I fix my SAM header?

      Thank you again for your help.
      Last edited by jdjax; 08-15-2011, 03:00 AM.
      jdjax
      Ph.d. Student
      Åarhus University

      Comment

      • ribozyme
        Junior Member
        • Oct 2008
        • 3

        #18
        Originally posted by jdjax View Post
        DZhang,

        I did not sort the sam. I am just testing these programs out so I did not use any options for tophat or cufflinks. Tophat made a file accept_hits.bam. I used that file as input for the cufflinks.

        My cufflinks command was just: cufflinks accepted_hits.bam

        I also want to more descriptive about errors I am recieveing in the hopes of figuring this problem. This is what the error stated:

        cufflinks: /usr/lib64/libz.so.1 : no version information available
        Warning: BAM header too large
        File accepted_hits does not appear to be a valid BAM file, trying SAM
        Inspecting reads and determining fragment length distribution.
        SAM error on line 2880: CIGAR op has zero length
        SAM error on line 3240: CIGAR op has zero length
        SAM error on line 3464: CIGAR op has zero length
        SAM error on line 5063: CIGAR op has zero length
        SAM error on line 30750: CIGAR op has zero length
        SAM error on line 51722: CIGAR op has zero length

        This continues with increasing line numbers until it reaches the end of the file.
        I have also checked /usr/lib64/libz.so.1 and it is in /usr/lib64

        libz.so.1 -> libz.so.1.2.3

        is what is present in on the server.

        Again thanks for your input. I appreciate any help. =)
        I am facing the same problem. Any Help Will be appreciated.

        Comment

        • DZhang
          Senior Member
          • Jun 2010
          • 177

          #19
          Here is what is said about input file format for cufflinks on its website:

          Note the use of the custom tag XS. This attribute, which must have a value of "+" or "-", indicates which strand the RNA that produced this read came from. While this tag can be applied to any alignment, including unspliced ones, it must be present for all spliced alignment records (those with a 'N' operation in the CIGAR string).

          The SAM file supplied to Cufflinks must be sorted by reference position. If you aligned your reads with TopHat, your alignments will be properly sorted already. If you used another tool, you may want to make sure they are properly sorted as follows:

          sort -k 3,3 -k 4,4n hits.sam
          I suspect two things might be the cause: properly sorting the sam/bam file or the CIGAR specification. Tophat will remove these potential problems so I would really recommend giving it a try at least as one of the trouble shooting steps.

          Comment

          • jdjax
            Member
            • Dec 2010
            • 23

            #20
            After some serious problem solving we figured out what we need to do to make Bowtie output work with cufflinks.
            When working with Bowtie you get a UNsorted SAM file, so you first have to coverted it to a BAM file, then sort the file, then convert it back to a SAM file. Then when you have a sorted SAM file cufflinks does not bring up an error.

            I have no idea why this occurs - all I care about is that is works.

            Good Luck.
            jdjax
            Ph.d. Student
            Åarhus University

            Comment

            • DZhang
              Senior Member
              • Jun 2010
              • 177

              #21
              Hi jdjax,

              Thank you for the update and working solution. Did you try "sort -k 3,3 -k 4,4n hits.sam" as suggested by the cufflink website? It it does, it will save you lots of time (and disk space) to convert sam/bam/sam.

              Comment

              Latest Articles

              Collapse

              • SEQadmin2
                Nine Things a Sample Prep Scientist Thinks About Before Sequencing
                by SEQadmin2


                I’m not a sequencing expert. I’m a purification scientist who uses NGS to evaluate workflows my group develops. With this perspective, we think about the sample first and the NGS workflow second. The sequencer is an exceptionally honest reporter, but it can only report on what you give it, so whether you get clean, interpretable data from an NGS workflow is largely determined before you begin.


                Here are nine questions we think about, in roughly the order they matter, before...
                Yesterday, 07:11 AM
              • SEQadmin2
                From Collection to Sequencing: Why Sample Preparation and Preservation Define Sequencing Data
                by SEQadmin2


                Data variability is still an issue in sequencing technologies despite the advances in reproducibility and accuracy of these platforms. But the problem does not originate in the sequencing itself, but in the previous steps, before the sample reaches the sequencer.


                The first step is collection, followed by preservation and sample preparation for analysis. Most scientists overlook those steps, but not being careful might just be skewing the experiment’s results.
                ...
                06-02-2026, 10:05 AM
              • SEQadmin2
                Single-Cell Sequencing at an Inflection Point: Early Impacts of New Platforms and Emerging Trends
                by SEQadmin2


                With the launch of new single-cell sequencing platforms in 2026, the field stands at an exciting inflection point. This article surveys the most impactful advances in the field and discusses how they’re reshaping research in cancer, immunology, and beyond.


                Introduction

                Single-cell sequencing technologies have undergone remarkable advances over the past decade, transitioning from low-throughput experimental approaches to highly scalable platforms capable of...
                05-22-2026, 06:42 AM

              ad_right_rmr

              Collapse

              News

              Collapse

              Topics Statistics Last Post
              Started by SEQadmin2, 06-17-2026, 06:09 AM
              0 responses
              20 views
              0 reactions
              Last Post SEQadmin2  
              Started by SEQadmin2, 06-09-2026, 11:58 AM
              0 responses
              38 views
              0 reactions
              Last Post SEQadmin2  
              Started by SEQadmin2, 06-05-2026, 10:09 AM
              0 responses
              44 views
              0 reactions
              Last Post SEQadmin2  
              Started by SEQadmin2, 06-04-2026, 08:59 AM
              0 responses
              49 views
              0 reactions
              Last Post SEQadmin2  
              Working...