Unconfigured Ad

Collapse
X
 
  • Filter
  • Time
  • Show
Clear All
new posts
  • fanx
    Member
    • Sep 2012
    • 22

    Chimeric reads

    Hello, my previous post seems very quiet. Here I raise this question in a different way.

    I use Bowtie 2 to find many 454 reads are mapped >1 times. I believe these are chimeric reads due to the use of multiple stand displacement for initial amplification of minute amount DNA template. Is anyone aware of some tools that can split chimeric reads into single non-chimeric reads? Thanks a lot.
  • Torst
    Senior Member
    • Apr 2008
    • 275

    #2
    Doesn't it just mean that the read comes from a region of the genome that is repeated? So it aligns to all copies of the repeat?

    Comment

    • fanx
      Member
      • Sep 2012
      • 22

      #3
      Thanks. In fact, I know the problem is reads rather than the reference genome. I believe most reads are chimeric.

      Comment

      • Torst
        Senior Member
        • Apr 2008
        • 275

        #4
        UCHIME claims to be able to do some de novo chimera detection, or use bits of your known reference:

        Comment

        • fanx
          Member
          • Sep 2012
          • 22

          #5
          Trost, Thanks your point. I made a search prior to the post, which found UCHIME. I didn't investigate it in detail because I guess UCHIME may need a high coverage, such as PCR amplicons. However, my RNA-Seq data is from human blood that is complicated with RNA from human genes, virus and fungi etc at low coverage.

          One way I thought about is to split reads into 2 parts and then see if they turn to be single mapped reads. This spit can be done easily with current scripts but I worry about potential lose of information. To my knowledge, there is really no way to handle this issue.

          I am using 454 and I think I should switch to Illumina. Short reads might be helpful.

          Comment

          • bernardo_bello
            Member
            • May 2012
            • 49

            #6
            We have recently sequenced a bacterial transcriptome with 316 chip from IonTorrent (1.5 million sequences). After filtering low quality data and trimming adapters we noticed that only 51.33% sequences were mapped on reference genome. Looking for the unmapped sequences we can see that most of them are chimeric transcripts, so impossible mapping for them an also causing bias on results. Also many of the unmapped are sequences lacking homology in 20% of the starting sequence.

            I would like to know your opinion about it.
            Should I have to move to 454 or Illumina? Our Sequencing Department have no idea of why we have so many chimeras.

            Thank you, Bernardo

            Comment

            • fanx
              Member
              • Sep 2012
              • 22

              #7
              To answer your question, I need to know 1) is there any amplification step prior to the sequencing? 2) what's the aligner you used for mapping.

              Comment

              • bernardo_bello
                Member
                • May 2012
                • 49

                #8
                Hello fanx,

                Thank you for your response.

                1) Yes, there is a PCR step. We have used the hole transcriptome procedure described here 'Ion Total RNA-Seq Kit v2'

                2) I used BWA for Illumina in Galaxy with default settings.

                Comment

                • fanx
                  Member
                  • Sep 2012
                  • 22

                  #9
                  1, If there is a PCR step, chimeric reads are not unexpected. Many polymerases, especially those assuming high fidelity, have strand displacement activity. The occurrance of chimeric reads depends on both polymerases and protocols.

                  2, Some chimeric reads may not be authenic ones. In this situation, I usually increase mapping stringency and found many of "chimeric" reads became single-hit ones.

                  3, For remaining "true" chimeric reads, there are 2 ways to go. One is just to discard them (as shown in many previous publications where this issue is largely ignored). If your data has a profound depth, I don't think this will affect/bias your final result. The other way is to extract these chimeric reads only and do some trimming, re-aligned to see whether if they become single-hit reads, and finally combine all single hit reads for downstream analysis.

                  4, Finally, I assume you already done quality control prior to the align.

                  Comment

                  • bernardo_bello
                    Member
                    • May 2012
                    • 49

                    #10
                    >1, If there is a PCR step, chimeric reads are not unexpected. Many polymerases, especially those assuming high fidelity, have strand displacement activity. The occurrence of chimeric reads depends on both polymerases and protocols.

                    Ok, I think I'm wasting money.

                    >2, Some chimeric reads may not be authentic ones. In this situation, I usually increase mapping stringency and found many of "chimeric" reads became single-hit ones.

                    I'm mapping only >Q20 reads. I've seen were and how they are mapping and they have 100% similarity in both hits. Sometimes there are three hits for one read.

                    >If your data has a profound depth, I don't think this will affect/bias your final result.

                    I have 800.000 sequences mapped to a 2 Mb prokaryotic genome. My mean read length is about 150 bp.

                    >The other way is to extract these chimeric reads only and do some trimming

                    I would like to finish my PhD, trimming is not feasible! So many chimeras.

                    >4, Finally, I assume you already done quality control prior to the align.

                    Of course, only >Q20 and trimming low quality 3' region.

                    Thank you, Bernardo

                    Comment

                    • bernardo_bello
                      Member
                      • May 2012
                      • 49

                      #11
                      Sorry, I forgot to say that I have finally 25% mapping sequences (of 3 million). My reference sequence is a draft genome in 47 segments.

                      Comment

                      • bernardo_bello
                        Member
                        • May 2012
                        • 49

                        #12
                        FASTQC of unmapped reads

                        If you give me your email I can send you FASTQC output of unmapped reads to know your opinion.


                        Bernardo

                        Comment

                        • jshaik
                          Junior Member
                          • Jun 2011
                          • 6

                          #13
                          aligner from sanger

                          This aligner seem to address the issue of chimeric reads: http://www.sanger.ac.uk/resources/software/smalt/
                          I personally didnt try this yet but will try it next time I need to align something.

                          Comment

                          • bernardo_bello
                            Member
                            • May 2012
                            • 49

                            #14
                            Originally posted by jshaik View Post
                            This aligner seem to address the issue of chimeric reads: http://www.sanger.ac.uk/resources/software/smalt/
                            I personally didnt try this yet but will try it next time I need to align something.
                            Thanks, seems there is not an associated publication for SMALT. There is?

                            Comment

                            • jshaik
                              Junior Member
                              • Jun 2011
                              • 6

                              #15
                              No there is no publication associated with it. But people have compared it with other aligners in their works.

                              Comment

                              Latest Articles

                              Collapse

                              • SEQadmin2
                                Nine Things a Sample Prep Scientist Thinks About Before Sequencing
                                by SEQadmin2


                                I’m not a sequencing expert. I’m a purification scientist who uses NGS to evaluate workflows my group develops. With this perspective, we think about the sample first and the NGS workflow second. The sequencer is an exceptionally honest reporter, but it can only report on what you give it, so whether you get clean, interpretable data from an NGS workflow is largely determined before you begin.

                                Here are nine questions we think about, in roughly the order they matter, before...
                                06-18-2026, 07:11 AM
                              • SEQadmin2
                                From Collection to Sequencing: Why Sample Preparation and Preservation Define Sequencing Data
                                by SEQadmin2


                                Data variability is still an issue in sequencing technologies despite the advances in reproducibility and accuracy of these platforms. But the problem does not originate in the sequencing itself, but in the previous steps, before the sample reaches the sequencer.


                                The first step is collection, followed by preservation and sample preparation for analysis. Most scientists overlook those steps, but not being careful might just be skewing the experiment’s results.
                                ...
                                06-02-2026, 10:05 AM

                              ad_right_rmr

                              Collapse

                              News

                              Collapse

                              Topics Statistics Last Post
                              Started by SEQadmin2, 06-17-2026, 06:09 AM
                              0 responses
                              36 views
                              0 reactions
                              Last Post SEQadmin2  
                              Started by SEQadmin2, 06-09-2026, 11:58 AM
                              0 responses
                              99 views
                              0 reactions
                              Last Post SEQadmin2  
                              Started by SEQadmin2, 06-05-2026, 10:09 AM
                              0 responses
                              120 views
                              0 reactions
                              Last Post SEQadmin2  
                              Started by SEQadmin2, 06-04-2026, 08:59 AM
                              0 responses
                              113 views
                              0 reactions
                              Last Post SEQadmin2  
                              Working...