Seqanswers Leaderboard Ad

Collapse

Announcement

Collapse
No announcement yet.
X
 
  • Filter
  • Time
  • Show
Clear All
new posts

  • tophat with mixed paired end reads

    Hi,

    I have a file with mixed paired end reads (50bp and 70bp). My insert size is 400bp. Did anyone try running tophat with mixed paired end reads? If so, what would be the tophat -r option (will it be 300bp or 260bp or an average 280bp)?

    Thank you in advance.

    Nirmala

  • #2
    In general, I don't think mixing different read lengths is recommended, although I see that 1.1.2 is supposed to support variable length reads, but I haven't tried it and I am not exactly sure what this means, i.e. does it mean you can mix the 27bp with the 100bp as the manual explicitly advised against before (and still advises against), or it only works well with a more limited range of variation.

    How many reads do you have? I would try running it on mixed reads, with an -r of 280, and if the output is anomalous (Cufflinks can't assemble anything, you're missing a lot of junctions, etc.), switch to something like this: you could trim all reads to 50, map, get junctions, map all 70s separately, get the junctions again, add the annotation, and them map the 50s and 70s separately against the resulting set of junctions, and then join the output.

    Comment


    • #3
      Tophat output bed file

      I have some naive question in terms of the output from TopHat.

      track name=junctions description="TopHat junctions"
      chr start end name score strand thickst thickend xxx blockcount blocksize
      test_chromosome 180 402 JUNC00000001 46 + 180 402 255,0,0 2 70,52 0,170
      test_chromosome 349 550 JUNC00000002 38 + 349 550 255,0,0 2 51,50 0,151



      Question: 1) what kind of score (here 46 and 38) is a good indicative of splicing junction
      2) the block count is the read counts on the two sides of a junction? e.g. 70, 52 means 70 reads on the left and 52 reads on the right of the junction
      3) what is the first number 0 in the block size 0,170. In this case I assume that the block size is 170 nt. And therefore the gap (intron) between this junction is 52 based on the calculation 402 - 180 = 222 and then 222 -170 = 52.

      Comment

      Latest Articles

      Collapse

      • seqadmin
        Essential Discoveries and Tools in Epitranscriptomics
        by seqadmin




        The field of epigenetics has traditionally concentrated more on DNA and how changes like methylation and phosphorylation of histones impact gene expression and regulation. However, our increased understanding of RNA modifications and their importance in cellular processes has led to a rise in epitranscriptomics research. “Epitranscriptomics brings together the concepts of epigenetics and gene expression,” explained Adrien Leger, PhD, Principal Research Scientist...
        04-22-2024, 07:01 AM
      • seqadmin
        Current Approaches to Protein Sequencing
        by seqadmin


        Proteins are often described as the workhorses of the cell, and identifying their sequences is key to understanding their role in biological processes and disease. Currently, the most common technique used to determine protein sequences is mass spectrometry. While still a valuable tool, mass spectrometry faces several limitations and requires a highly experienced scientist familiar with the equipment to operate it. Additionally, other proteomic methods, like affinity assays, are constrained...
        04-04-2024, 04:25 PM

      ad_right_rmr

      Collapse

      News

      Collapse

      Topics Statistics Last Post
      Started by seqadmin, Today, 11:49 AM
      0 responses
      12 views
      0 likes
      Last Post seqadmin  
      Started by seqadmin, Yesterday, 08:47 AM
      0 responses
      16 views
      0 likes
      Last Post seqadmin  
      Started by seqadmin, 04-11-2024, 12:08 PM
      0 responses
      61 views
      0 likes
      Last Post seqadmin  
      Started by seqadmin, 04-10-2024, 10:19 PM
      0 responses
      60 views
      0 likes
      Last Post seqadmin  
      Working...
      X