  • Does BBDuk work on concatenated fastq files

    I have around 200 samples and wanted to sequence each one to a depth of 50 million reads. That was not possible in a single run, so I sequenced all 200 samples 14 times. For each sample I then concatenated all the R1 files together and all the R2 files together, so I now have 200 R1 files and 200 R2 files.

    Can BBDuk deal with these files, given that each one is made up of the FASTQ output of 14 runs, with 14 sets of headers?

    Thank you in advance.

  • #2
    BBDuk will work fine, provided you concatenated the files in exactly the same order for both the R1 and R2 files.
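For anyone in the same situation, here is a minimal sketch of concatenating per-run files in a consistent order. The per-run naming scheme (`<sample>_run<NN>_R1.fastq.gz`) and the function name `concat_sample` are assumptions made for illustration, not something stated in this thread. Each R2 name is derived from its R1 name, so the two outputs cannot end up in different run orders.

```shell
# Sketch only: the per-run file names (<sample>_run<NN>_R1.fastq.gz) are hypothetical.
# Gzip files can be joined with plain `cat`; the result is still a valid gzip stream.
concat_sample() {
    sample=$1
    out1="${sample}_R1.fastq.gz"
    out2="${sample}_R2.fastq.gz"
    : > "$out1"
    : > "$out2"
    # The glob expands in sorted order, and each R2 name is derived from its
    # R1 name, so both outputs receive the runs in the same order.
    for r1 in "${sample}"_run*_R1.fastq.gz; do
        r2="${r1%_R1.fastq.gz}_R2.fastq.gz"
        if [ ! -f "$r1" ] || [ ! -f "$r2" ]; then
            echo "missing file for pair: $r1 / $r2" >&2
            return 1
        fi
        cat "$r1" >> "$out1"
        cat "$r2" >> "$out2"
    done
}

# Usage (hypothetical sample name):
#   concat_sample sampleA
```

The function stops with an error as soon as one file of a pair is missing, which is the case most likely to leave R1 and R2 out of sync.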



    • #3
      Adaptor trimming is not working

      Thank you for getting back to me. Sadly, the adaptor trimming is not working.

      This is what my script looks like:

      Ordered=t   # set to true to output reads in the same order as the input
      Ktrim=r     # once a reference kmer is matched in a read, that kmer and all the bases to its right are trimmed
      K=21        # kmer size
      Mink=8      # "mink" allows shorter kmers to be used at the ends of the read
      Hdist=2     # number of permitted mismatches

      # $adapters is not set anywhere in this snippet; stop early if it is empty
      : "${adapters:?set adapters to the adapter FASTA file}"

      # Loop over the R1 files directly instead of parsing the output of ls
      for R1 in *_R1.fastq.gz
      do
          Prefix=${R1%_R1.fastq.gz}

          bbduk.sh -Xmx128g in1="${Prefix}_R1.fastq.gz" in2="${Prefix}_R2.fastq.gz" out1="${Prefix}_clean_R1.fastq.gz" out2="${Prefix}_clean_R2.fastq.gz" ref="$adapters" ordered="$Ordered" ktrim="$Ktrim" k="$K" mink="$Mink" hdist="$Hdist" tpe tbo

      done

      Remember my R1 and R2 files consist of concatenated sequences from different runs. Do you think this could be the reason?

      Many thanks



      • #4
        As I said before, this should work without any problems as long as the files were concatenated in the same order AND the R1/R2 files had the same number of reads, in sync, to begin with. If things are not working, you need to make sure that the reads in your files are in sync. You can check that with a different BBTools program called "repair.sh".
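As a quick check before reaching for repair.sh, the sync condition described above can be sketched with standard tools. This is not how repair.sh works; it is an illustrative check that assumes four-line FASTQ records and read names that differ between mates only by a trailing `/1` or `/2` or by text after the first whitespace. The function names `read_names` and `pairs_in_sync` are made up for this example.

```shell
# Print the read name of every record in a gzipped FASTQ file, one per line.
# Assumes 4-line records. Drops the leading "@", anything after the first
# whitespace, and a trailing /1 or /2 so that mates compare equal.
read_names() {
    gzip -dc "$1" | awk 'NR % 4 == 1 {
        name = $1
        sub(/^@/, "", name)
        sub(/\/[12]$/, "", name)
        print name
    }'
}

# Succeed only if both files list the same read names in the same order.
# The name lists are compared by checksum and byte count (cksum), not
# line by line, so this reports whether a mismatch exists, not where.
pairs_in_sync() {
    sum1=$(read_names "$1" | cksum)
    sum2=$(read_names "$2" | cksum)
    [ "$sum1" = "$sum2" ]
}

# Usage (hypothetical file names):
#   pairs_in_sync sampleA_R1.fastq.gz sampleA_R2.fastq.gz && echo "in sync"
```

If the check fails, the files differ in read count or read order, which is the situation repair.sh is meant to address.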

        BBDuk needs very little memory, so there is no need to assign 128G for this job. 4G would be perfectly fine.
