Seqanswers Leaderboard Ad

Collapse

Announcement

Collapse
No announcement yet.
X
 
  • Filter
  • Time
  • Show
Clear All
new posts

  • How does inner distances(--mate-inner-dist and --mate-std-dev) work in tophat?

    I am confused about the inner distance setting. From the manual file, it should be set to (fragementlength-2*readslength , eg: 300-2*50=100). But if the distance counting is based on genome location, then the distance between the pairs should be (fragementlength-2*readslength+inserted_introns_length). Does anybody know how tophat manage the intron insertion?

  • #2
    I think that the mate inner distances is set in order to detect whether the mates are in different exons etc. That is, if the distance is much larger/smaller than the 200 (300-2*50) that the software has been told to expect, something interesting might be going on.

    Comment


    • #3
      I'm trying different settings of inner distance for my experiments. Anybody out there with the answer as to which value is good?

      Comment


      • #4
        Hi snp_analyser. I'm afraid there is no good value - the inner distances depend on the sizes of your fragments and the lengths of your reads, which are experiment specific.

        We typically use a tool like Bowtie to help us find our fragment sizes empirically. We run a paired-end alignment with Bowtie, using default parameters for -I and -X. We then examine the output to see, in general, how far apart the reads in a pair as aligned. This indicates the mate inner distance.

        In terms of terminology, the "gap" or "inner distance" is the distance between the aligned reads (not counting the reads themselves). The "insert size", on the other hand, includes the reads themselves, so can be thought of as the "fragment size".

        If you look at Bowtie output, the alignment position of a read is the position of the first base in the read (from the perspective of the forward reference strand). This means that if you subtract the alignment positions of the two reads in a pair, the result is actually "inner distance" + "read length". So you will need to subtract the read length to get the inner distance.

        You should probably write (or find) a script to do this for you to ensure you examine enough pairs to get a representative feel for the value. Our data typically shows a normally distributed inner distance.

        Comment


        • #5
          Thanks for the detailed reply!

          Comment

          Latest Articles

          Collapse

          • seqadmin
            Recent Advances in Sequencing Analysis Tools
            by seqadmin


            The sequencing world is rapidly changing due to declining costs, enhanced accuracies, and the advent of newer, cutting-edge instruments. Equally important to these developments are improvements in sequencing analysis, a process that converts vast amounts of raw data into a comprehensible and meaningful form. This complex task requires expertise and the right analysis tools. In this article, we highlight the progress and innovation in sequencing analysis by reviewing several of the...
            05-06-2024, 07:48 AM
          • seqadmin
            Essential Discoveries and Tools in Epitranscriptomics
            by seqadmin




            The field of epigenetics has traditionally concentrated more on DNA and how changes like methylation and phosphorylation of histones impact gene expression and regulation. However, our increased understanding of RNA modifications and their importance in cellular processes has led to a rise in epitranscriptomics research. “Epitranscriptomics brings together the concepts of epigenetics and gene expression,” explained Adrien Leger, PhD, Principal Research Scientist...
            04-22-2024, 07:01 AM

          ad_right_rmr

          Collapse

          News

          Collapse

          Topics Statistics Last Post
          Started by seqadmin, Yesterday, 06:57 AM
          0 responses
          11 views
          0 likes
          Last Post seqadmin  
          Started by seqadmin, 05-06-2024, 07:17 AM
          0 responses
          16 views
          0 likes
          Last Post seqadmin  
          Started by seqadmin, 05-02-2024, 08:06 AM
          0 responses
          19 views
          0 likes
          Last Post seqadmin  
          Started by seqadmin, 04-30-2024, 12:17 PM
          0 responses
          24 views
          0 likes
          Last Post seqadmin  
          Working...
          X