Seqanswers Leaderboard Ad

Collapse

Announcement

Collapse
No announcement yet.
X
 
  • Filter
  • Time
  • Show
Clear All
new posts

  • Abberant junctions by tophat

    I have tried to align my paired end RNA-Seq reads to the genome using Tophat. I ran a sample dataset from the SRA (SR018268_1 and _2) and the data looked fine. However, when I run my datasets, I get a lot of spurrious junctions. In the attached example, I show the junctions and coverage for one sample. All the exons map beautifully and have coverage > 200X, but the junctions between exons were not determined for almost all of these exons are not joined and the majority of "junctions" (>80%) in the dataset are intergenic (or intragenic) even with low coverage. For exampl, the far left junction is supported by 92 reads, the middle by 83, and the right by 2.

    I have tried to manipulate the alignment parameters such as -r set to either 165 or 41. These correspond to 230 bp DNA identified from the bioanalyzer minus the inner distance alone (230-35-35=165) or including the primer sequences (230-35-35-119=41). This didn't really change things much.

    So my questions are:
    1) Why aren't these junctions being called by tophat?
    2) Why would the junction on the right show up?
    3) How do I get past this?
    Attached Files

  • #2
    How long are these reads?

    Comment


    • #3
      these are 2 x 35 bp reads. I also don't know if this matters, but my mapping qualities from the SAM files are:
      632465 0
      368741 1
      38907221 255
      1170126 3


      Does this matter?
      Last edited by RockChalkJayhawk; 05-05-2010, 01:47 PM. Reason: Update

      Comment


      • #4
        Also, when I was trying to figure all this out, I made a fastq file that was only 100K long to troubleshoot and I ran into another problem. There are instances where junctions (true ones) appear only when the small dataset is used and not when the full dataset is used. Otherwise, they are exactly the same.

        For instance, this figure shows no junctions when this gene is sequenced 33,333 times, but by subselecting and mapping with only 45x coverage, most of the exons are joined together.

        Where did they go in the full analysis?

        I am using the following code:
        Code:
        tophat -r 41 -p 6 --solexa1.3-quals hg19 sequence_1 sequence_2
        Attached Files

        Comment

        Latest Articles

        Collapse

        • seqadmin
          Genetic Variation in Immunogenetics and Antibody Diversity
          by seqadmin



          The field of immunogenetics explores how genetic variations influence immune responses and susceptibility to disease. In a recent SEQanswers webinar, Oscar Rodriguez, Ph.D., Postdoctoral Researcher at the University of Louisville, and Ruben Martínez Barricarte, Ph.D., Assistant Professor of Medicine at Vanderbilt University, shared recent advancements in immunogenetics. This article discusses their research on genetic variation in antibody loci, antibody production processes,...
          11-06-2024, 07:24 PM
        • seqadmin
          Choosing Between NGS and qPCR
          by seqadmin



          Next-generation sequencing (NGS) and quantitative polymerase chain reaction (qPCR) are essential techniques for investigating the genome, transcriptome, and epigenome. In many cases, choosing the appropriate technique is straightforward, but in others, it can be more challenging to determine the most effective option. A simple distinction is that smaller, more focused projects are typically better suited for qPCR, while larger, more complex datasets benefit from NGS. However,...
          10-18-2024, 07:11 AM

        ad_right_rmr

        Collapse

        News

        Collapse

        Topics Statistics Last Post
        Started by seqadmin, Today, 11:09 AM
        0 responses
        22 views
        0 likes
        Last Post seqadmin  
        Started by seqadmin, Today, 06:13 AM
        0 responses
        19 views
        0 likes
        Last Post seqadmin  
        Started by seqadmin, 11-01-2024, 06:09 AM
        0 responses
        30 views
        0 likes
        Last Post seqadmin  
        Started by seqadmin, 10-30-2024, 05:31 AM
        0 responses
        21 views
        0 likes
        Last Post seqadmin  
        Working...
        X