Seqanswers Leaderboard Ad

Collapse

Announcement

Collapse
No announcement yet.
X
 
  • Filter
  • Time
  • Show
Clear All
new posts

  • Post-demultiplex adaptor removal?

    If one has demultiplexed dual index reads on a MiSeq AND included "adaptor removal" as part of the demultiplexing (on instrument), should one also run the FASTQ files through an adaptor removal programme, or is this just overkill?

  • #2
    I would suggest running FastQC on the data. It is a program that measures a wide variety of quality metrics. That way you can see with your own eyes the data quality measures, including adapter sequence contamination.

    Comment


    • #3
      @cement_head: If there are no remaining adapters then all you lost is some time. For miseq datasets you would need less than 30 min to scan/trim data with bbduk.sh from BBMap. You can then be sure that there would be no extraneous sequences remaining in your data. Especially important if you were doing any de novo work.

      Comment


      • #4
        I suggest fastp to do automatic adapter trimming, read filtering and quality control. fastp is developed in C++ with multi-threading support, it's ultra-fast.

        fastp has following features:
        1, filter out bad reads (too low quality, too short, or too many N...)
        2, cut low quality bases for per read in its 5' and 3' by evaluating the mean quality from a sliding window (like Trimmomatic but faster).
        3, trim all reads in front and tail
        4, cut adapters. Adapter sequences can be automatically detected,which means you don't have to input the adapter sequences to trim them.
        5, correct mismatched base pairs in overlapped regions of paired end reads, if one base is with high quality while the other is with ultra low quality
        6, preprocess unique molecular identifer (UMI) enabled data, shift UMI to sequence name.
        7, report JSON format result for further interpreting.
        8, visualize quality control and filtering results on a single HTML page (like FASTQC but faster and more informative).
        9, split the output to multiple files (0001.R1.gz, 0002.R1.gz...) to support parallel processing. Two modes can be used, limiting the total split file number, or limitting the lines of each split file.
        10, support long reads (data from PacBio / Nanopore devices).

        fastp creates reports in both HTML and JSON format.

        HTML report: http://opengene.org/fastp/fastp.html
        JSON report: http://opengene.org/fastp/fastp.json

        fastp is an open source project at github: https://github.com/OpenGene/fastp
        OpenGene(Libraries and tools for NGS data analysis),AfterQC(Fastq Filtering and QC)
        FusionDirect.jl( Detect gene fusion), SeqMaker.jl(Next Generation Sequencing simulation)

        Comment

        Latest Articles

        Collapse

        • seqadmin
          Latest Developments in Precision Medicine
          by seqadmin



          Technological advances have led to drastic improvements in the field of precision medicine, enabling more personalized approaches to treatment. This article explores four leading groups that are overcoming many of the challenges of genomic profiling and precision medicine through their innovative platforms and technologies.

          Somatic Genomics
          “We have such a tremendous amount of genetic diversity that exists within each of us, and not just between us as individuals,”...
          05-24-2024, 01:16 PM
        • seqadmin
          Recent Advances in Sequencing Analysis Tools
          by seqadmin


          The sequencing world is rapidly changing due to declining costs, enhanced accuracies, and the advent of newer, cutting-edge instruments. Equally important to these developments are improvements in sequencing analysis, a process that converts vast amounts of raw data into a comprehensible and meaningful form. This complex task requires expertise and the right analysis tools. In this article, we highlight the progress and innovation in sequencing analysis by reviewing several of the...
          05-06-2024, 07:48 AM

        ad_right_rmr

        Collapse

        News

        Collapse

        Topics Statistics Last Post
        Started by seqadmin, Today, 06:55 AM
        0 responses
        12 views
        0 likes
        Last Post seqadmin  
        Started by seqadmin, 05-30-2024, 03:16 PM
        0 responses
        24 views
        0 likes
        Last Post seqadmin  
        Started by seqadmin, 05-29-2024, 01:32 PM
        0 responses
        27 views
        0 likes
        Last Post seqadmin  
        Started by seqadmin, 05-24-2024, 07:15 AM
        0 responses
        215 views
        0 likes
        Last Post seqadmin  
        Working...
        X