Seqanswers Leaderboard Ad

Collapse

Announcement

Collapse
No announcement yet.
X
 
  • Filter
  • Time
  • Show
Clear All
new posts

  • #16
    When using non-quality-trimmed fastq for BBSplit, it worked just fine!

    My quality-trimming step uses BBDuk -- this command:
    Code:
    ./bbmap/bbduk.sh in=nonrRNA_bran11.fastq out=QualTrimmed_bran11.fastq qtrim=r trimq=10 overwrite=true
    Thoughts on how I should revise this to prevent an invalid file format error when moving to BBSplit?

    I really appreciate your helpful (and quick!) replies!

    Comment


    • #17
      That command looks fine. Perhaps you had a one time corruption with the file you got.

      Comment


      • #18
        Thank you for your help!

        I have just one more question regarding the output from BBSplit.

        -------------- Results ---------------

        Genome: 1
        Key Length: 13
        Max Indel: 20
        Minimum Score Ratio: 0.56
        Mapping Mode: normal
        Reads Used: 1829806 (416340827 bases)

        Mapping: 1630.950 seconds.
        Reads/sec: 1121.93
        kBases/sec: 255.28


        Read 1 data: pct reads num reads pct bases num bases

        mapped: 13.1904% 241358 13.0507% 54335231
        unambiguous: 9.5835% 175359 9.5952% 39948829
        ambiguous: 3.6069% 65999 3.4554% 14386402
        low-Q discards: 0.0000% 0 0.0000% 0

        perfect best site: 1.9436% 35564 1.8453% 7682929
        semiperfect site: 1.9468% 35623 1.8485% 7695929

        Match Rate: NA NA 96.0824% 52522434
        Error Rate: 75.3143% 204784 3.8796% 2120757
        Sub Rate: 73.8546% 200815 2.3134% 1264595
        Del Rate: 14.3653% 39060 0.6014% 328726
        Ins Rate: 16.4704% 44784 0.9649% 527436
        N Rate: 3.9234% 10668 0.0380% 20766

        Total time: 1715.910 seconds.
        Since I had three reference sequences, is the "Genome: 1" referring to the combined reference of all three that I listed under the "ref=" parameter?

        Code:
        ref=p.americana_genome.fasta,Blattabacterium_genome.fasta,Blattabacterium_plasmid.fasta

        Comment

        Latest Articles

        Collapse

        • seqadmin
          Best Practices for Single-Cell Sequencing Analysis
          by seqadmin



          While isolating and preparing single cells for sequencing was historically the bottleneck, recent technological advancements have shifted the challenge to data analysis. This highlights the rapidly evolving nature of single-cell sequencing. The inherent complexity of single-cell analysis has intensified with the surge in data volume and the incorporation of diverse and more complex datasets. This article explores the challenges in analysis, examines common pitfalls, offers...
          Today, 07:15 AM
        • seqadmin
          Latest Developments in Precision Medicine
          by seqadmin



          Technological advances have led to drastic improvements in the field of precision medicine, enabling more personalized approaches to treatment. This article explores four leading groups that are overcoming many of the challenges of genomic profiling and precision medicine through their innovative platforms and technologies.

          Somatic Genomics
          “We have such a tremendous amount of genetic diversity that exists within each of us, and not just between us as individuals,”...
          05-24-2024, 01:16 PM

        ad_right_rmr

        Collapse

        News

        Collapse

        Topics Statistics Last Post
        Started by seqadmin, Today, 08:18 AM
        0 responses
        8 views
        0 likes
        Last Post seqadmin  
        Started by seqadmin, Today, 08:04 AM
        0 responses
        10 views
        0 likes
        Last Post seqadmin  
        Started by seqadmin, 06-03-2024, 06:55 AM
        0 responses
        13 views
        0 likes
        Last Post seqadmin  
        Started by seqadmin, 05-30-2024, 03:16 PM
        0 responses
        27 views
        0 likes
        Last Post seqadmin  
        Working...
        X