Seqanswers Leaderboard Ad

Collapse

Announcement

Collapse
No announcement yet.
X
 
  • Filter
  • Time
  • Show
Clear All
new posts

  • Cufflinks with solexa data

    Has anyone used cufflinks for solexa RNA-Seq data?

    Te reads are mapped using eland_rna algorithm and used export2sam.pl to convert the eland mapping into sam format.

    I get the following error when I try to use cufflinks.

    Code:
    $ ./cufflinks /test/lib.sam 
    Counting hits in map
    Error: this SAM file doesn't appear to be correctly sorted!
            current hit is at RM:-1, last one was at (null):0
    Please let me know the steps involved from solexa eland mapping data. Thanks.

  • #2
    I would assume that the SAM file is indeed unsorted, as the error message says. If so, you can use the samtools package to sort the SAM file (I think you'll first have to convert it to BAM using "samtools view" before sorting, and then convert back) and try again.

    Alternatively (and easier), you can sort the Eland output file (or the SAM file) on the command line using a command like

    sort -o infile.bed -k 1,1 -k 2,2n infile.bed

    (if the chromosome is in column 1 and the starting coordinate is in column 2, that is ... if not, you have to substitute the correct column numbers for "1" and "2" above)

    Comment


    • #3
      possibly your sam file requires sorted by samtools . I did not have experience. good luck.

      Comment


      • #4
        Use the following command to sort your SAM files. I experienced the same problem and this fixed it. The code is from the Cufflinks manual, no need for samtools or anything else.

        Code:
        sort -k 3,3 -k 4,4n hits.sam > hits.sam.sorted

        Comment


        • #5
          Hi
          Thanks for the response. I did sort the sam file as per the manual but I still get the error as below:

          Code:
          Counting hits in map
          Error: this SAM file doesn't appear to be correctly sorted!
                  current hit is at mm9:-43, last one was at (null):0
          My reference genome chromosome names are c1,c2,c3...etc and splice junction reads starts with mm9. This is according to the eland mappingwhich includes mapping data against genome which includes c1,c2,c3 etc.. and the splice juctions generated (mm9)

          Please let me know how to overcome this problem. Thanks.

          Comment


          • #6
            I would try aligning the reads with TopHat instead. I have no experience with the Eland RNA-seq alignment program, but I doubt TopHat would do a much worse job.

            Comment


            • #7
              Hi,
              HAs anyone tried converting solexa eland output into sam format and use cufflinks successfully? I would erally appreciate very much for any help. Thanks.
              Regards

              Comment

              Latest Articles

              Collapse

              • seqadmin
                Essential Discoveries and Tools in Epitranscriptomics
                by seqadmin


                The field of epigenetics has traditionally concentrated more on DNA and how changes like methylation and phosphorylation of histones impact gene expression and regulation. However, our increased understanding of RNA modifications and their importance in cellular processes has led to a rise in epitranscriptomics research. “Epitranscriptomics brings together the concepts of epigenetics and gene expression,” explained Adrien Leger, PhD, Principal Research Scientist on Modified Bases...
                Yesterday, 07:01 AM
              • seqadmin
                Current Approaches to Protein Sequencing
                by seqadmin


                Proteins are often described as the workhorses of the cell, and identifying their sequences is key to understanding their role in biological processes and disease. Currently, the most common technique used to determine protein sequences is mass spectrometry. While still a valuable tool, mass spectrometry faces several limitations and requires a highly experienced scientist familiar with the equipment to operate it. Additionally, other proteomic methods, like affinity assays, are constrained...
                04-04-2024, 04:25 PM

              ad_right_rmr

              Collapse

              News

              Collapse

              Topics Statistics Last Post
              Started by seqadmin, 04-11-2024, 12:08 PM
              0 responses
              44 views
              0 likes
              Last Post seqadmin  
              Started by seqadmin, 04-10-2024, 10:19 PM
              0 responses
              43 views
              0 likes
              Last Post seqadmin  
              Started by seqadmin, 04-10-2024, 09:21 AM
              0 responses
              38 views
              0 likes
              Last Post seqadmin  
              Started by seqadmin, 04-04-2024, 09:00 AM
              0 responses
              55 views
              0 likes
              Last Post seqadmin  
              Working...
              X