Seqanswers Leaderboard Ad

Collapse

Announcement

Collapse
No announcement yet.
X
 
  • Filter
  • Time
  • Show
Clear All
new posts

  • Indels in Sequence Reads

    Hi everyone,

    I am using paired end 100 bp reads on an illumina hiseq platform to find chromosomal rearrangements. I can find them just fine with the paired end data since the pairs are flagged, but I need to find the junctions within the sequence reads themselves. I've tried BWA, Bowtie2, Tophat2, SNAP, and ELAND and none of these report junctions except for Tophat2, but Tophat2 is not suitable since I'm sequencing DNA. I can see the junctions if I do not soft clip my reads, but it doesn't display the reads as junctions like it does when I use Tophat2 on RNA. Does anyone have any suggestions on how to get some of these programs to display split reads or filter them out into a separate file so I can find and analyze them effectively?

    The junctions are also more than just a few base pairs. They can vary from a few KB to hundreds of KB and a few are inter chromosomal rearrangements

  • #2
    Dear @YOLO69SWAG,

    The Subread aligner may help you to do this. If you run Subread with the '-J' option, it will mark those bases which can not be aligned together with other bases from the same read using 'S' in the CIGAR string (soft clipping). You can then identify the junction/fusion positions within reads using this information.

    Subread can be downloaded from http://subread.sourceforge.net

    Best wishes,

    Wei

    Comment


    • #3
      There are a number of programs designed specifically for detecting indels and structural variants. Check out the software wiki.

      Comment


      • #4
        Thanks Wei,

        I'm almost done aligning now using subread. I'll make another post this wknd with the result.

        Comment


        • #5
          Subread

          Hey Wei,

          I tried the program and was able to get it to align my reads. I can see the soft clipped bases and these are almost always my junction reads. This helps a lot, since I can see the junctions that I saw with my paired end information, but there's still a problem because the junctions are in many different locations. Going through each of these sites will take forever BLASTing manually.

          Woe is graduate school?

          Any more suggestions would be much appreciated.

          Comment


          • #6
            Dear @YOLO69SWAG,

            Thanks for letting me know it worked and I'm glad it helps.

            Could you elaborate a bit more about what you want to do with these fusion reads? This will let more people give you comments about your project. Do you want to get the exact fusion positions or do you want to know the gene which were fused together?

            Best wishes,

            Wei

            Comment


            • #7
              I'm trying to record and count these junction reads and the coordinates of the junction. They occur spontaneously in certain parts of the genome. I already detect them using paired end information, but being able to get the information on junctions within reads will let me quantify the presence of these junctions accurately. If I could get my DNA-seq reads to be read using a gapped aligner and have the junctions displayed like Tophat/bowtie does with RNA seq data that would be perfect.

              Comment

              Latest Articles

              Collapse

              • seqadmin
                Recent Advances in Sequencing Analysis Tools
                by seqadmin


                The sequencing world is rapidly changing due to declining costs, enhanced accuracies, and the advent of newer, cutting-edge instruments. Equally important to these developments are improvements in sequencing analysis, a process that converts vast amounts of raw data into a comprehensible and meaningful form. This complex task requires expertise and the right analysis tools. In this article, we highlight the progress and innovation in sequencing analysis by reviewing several of the...
                05-06-2024, 07:48 AM
              • seqadmin
                Essential Discoveries and Tools in Epitranscriptomics
                by seqadmin




                The field of epigenetics has traditionally concentrated more on DNA and how changes like methylation and phosphorylation of histones impact gene expression and regulation. However, our increased understanding of RNA modifications and their importance in cellular processes has led to a rise in epitranscriptomics research. “Epitranscriptomics brings together the concepts of epigenetics and gene expression,” explained Adrien Leger, PhD, Principal Research Scientist...
                04-22-2024, 07:01 AM

              ad_right_rmr

              Collapse

              News

              Collapse

              Topics Statistics Last Post
              Started by seqadmin, Yesterday, 06:35 AM
              0 responses
              15 views
              0 likes
              Last Post seqadmin  
              Started by seqadmin, 05-09-2024, 02:46 PM
              0 responses
              21 views
              0 likes
              Last Post seqadmin  
              Started by seqadmin, 05-07-2024, 06:57 AM
              0 responses
              18 views
              0 likes
              Last Post seqadmin  
              Started by seqadmin, 05-06-2024, 07:17 AM
              0 responses
              19 views
              0 likes
              Last Post seqadmin  
              Working...
              X