Unconfigured Ad

Collapse
X
 
  • Filter
  • Time
  • Show
Clear All
new posts
  • loba17
    Member
    • Sep 2011
    • 19

    Newbler Warning - Primer contamination

    Hello,
    I am using Newbler 2.6 to do a de novo cDNA assembly with 454 reads. The program is giving me a warning about possible primer contamination (i.e. TGTTTTTTTTTCT). I checked the assembly and found about 250 contigs (out of 20,000 in total) with the reported primer sequence. We used the MINT cDNA synthesis kit and the reported sequence seems to be part of the MINT kit primers. I could use the -vt flag with runAssembly and provide a fasta file with the primer sequences to trim the reads but would this be correct?

    Reason for not trimming:
    > the primer is part of the mRNA and therefore should not me removed - I will loose some information

    Reasons for trimming:
    > the primer sequence could lead to incorrect asemblies
    > the primer sequence might be part of the mRNA but not the protein - this region could cause false positives in blast searches

    I know that RNAseq data have this characteristic bias (e.g. random hexamer primer) but I think nobody is trimming the read because of it. I could assemble the reads without trimming and remove the contigs with the primer sequence not at the end.

    Is anybody willing to share his thoughts or experience on this? I would appreciate your help. Thanks!
  • maubp
    Peter (Biopython etc)
    • Jul 2009
    • 1544

    #2
    I had something similiar, and got better assemblies removing the MINT primer.

    Comment

    • loba17
      Member
      • Sep 2011
      • 19

      #3
      Dear maubp,

      thanks for the answer. I was wondering if you trimmed the reads or the contigs? Did you use the -v option or a different program for the read trimming? Would you mind specify "better assembly". How did you assign the quality improvement?
      Last edited by loba17; 06-14-2012, 02:22 AM.

      Comment

      • sklages
        Senior Member
        • May 2008
        • 628

        #4
        Originally posted by maubp View Post
        I had something similiar, and got better assemblies removing the MINT primer.
        same here, we had a lot of EST libs created with MINT system; we always removed primer sequences (as I would do for any other libs as well).

        Comment

        • maubp
          Peter (Biopython etc)
          • Jul 2009
          • 1544

          #5
          I think I tried both the -v option (for Newbler) and trimming the reads (for Newbler and MIRA).

          Without trimming the reads I got some very strange coverage patterns where at one end of a contig there was a MINT adapter that was overly represented. For a MIRA EST example see Figure 5 in Milne et al. 2012 http://dx.doi.org/10.1093/bib/bbs012

          Comment

          • loba17
            Member
            • Sep 2011
            • 19

            #6
            Dear sklages, dear maubp,

            thanks for your help.

            I was reading more about the trimming step. It seems that the -vt flag is the best way to proceed. People also recommend to use the -vs flag to remove rRNA sequences. For this part I could downloaded the RNAmmer fast file from the CBS website and used it with the -vs flag. This would, however, only cover prokaryotes. Any suggestion about eukaryotes rRNA sequences? I could get the ribosomal sequence from NCBI but I guess the file would be rather large.

            In addition, I also found references recommending the -urt flag. But this seems to be controversial. I tried it and it resulted in a large number of contigs ... three times more than before. Therefore I think it is best not to use it at least for my assembly. Are there other reasons (not) to use it?

            Comment

            • flxlex
              Moderator
              • Nov 2008
              • 412

              #7
              Regarding the '-urt' flag: it is supposed to give more complete transcripts. It could be that there are ore contigs per isogroup ('gene'), but that you would have to check. So, maybe the flag is actually usefull (I would compare data with and without it - not just looking at contig number - to make sure)

              Comment

              Latest Articles

              Collapse

              • SEQadmin2
                From Collection to Sequencing: Why Sample Preparation and Preservation Define Sequencing Data
                by SEQadmin2


                Data variability is still an issue in sequencing technologies despite the advances in reproducibility and accuracy of these platforms. But the problem does not originate in the sequencing itself, but in the previous steps, before the sample reaches the sequencer.


                The first step is collection, followed by preservation and sample preparation for analysis. Most scientists overlook those steps, but not being careful might just be skewing the experiment’s results.
                ...
                06-02-2026, 10:05 AM
              • SEQadmin2
                Single-Cell Sequencing at an Inflection Point: Early Impacts of New Platforms and Emerging Trends
                by SEQadmin2


                With the launch of new single-cell sequencing platforms in 2026, the field stands at an exciting inflection point. This article surveys the most impactful advances in the field and discusses how they’re reshaping research in cancer, immunology, and beyond.


                Introduction

                Single-cell sequencing technologies have undergone remarkable advances over the past decade, transitioning from low-throughput experimental approaches to highly scalable platforms capable of...
                05-22-2026, 06:42 AM
              • SEQadmin2
                Environmental Genomics in the Age of NGS: From Microbes to Conservation Strategies
                by SEQadmin2

                Studying ecosystems means dealing with complex, multi-species communities that are hard to observe at scale. This complexity, however, hides many important questions to be answered, from how biogeochemical cycles work and how climate change can affect species distribution to how conservation strategies can work best.


                Genomics, particularly since the expansion of NGS, has transformed ecosystem ecology. By sequencing environmental DNA, we can now assess biodiversity without direct...
                05-06-2026, 09:04 AM

              ad_right_rmr

              Collapse

              News

              Collapse

              Topics Statistics Last Post
              Started by SEQadmin2, 06-02-2026, 12:03 PM
              0 responses
              19 views
              0 reactions
              Last Post SEQadmin2  
              Started by SEQadmin2, 06-02-2026, 11:40 AM
              0 responses
              14 views
              0 reactions
              Last Post SEQadmin2  
              Started by SEQadmin2, 05-28-2026, 11:40 AM
              0 responses
              29 views
              0 reactions
              Last Post SEQadmin2  
              Started by SEQadmin2, 05-26-2026, 10:12 AM
              0 responses
              31 views
              0 reactions
              Last Post SEQadmin2  
              Working...