Seqanswers Leaderboard Ad

Collapse
X
 
  • Filter
  • Time
  • Show
Clear All
new posts
  • Jerry_Zhao
    Member
    • Feb 2012
    • 20

    Problems about the Trimmomatic on bisulfite sequencing data sets

    Hi, All,

    I have some problems about using Trimmomatic on bisulfite sequencing data sets.


    For example, I downloaded the raw data of human methylome from Lister et al. 2009 Nature (SRR019072.sra), and got the FASTQ file by the sratoolkit.
    Next, I tried the Trimmomatic to trim the adaptors and other low-quality sequences.

    This is the command I used.
    java -classpath trimmomatic-0.20.jar org.usadellab.trimmomatic.TrimmomaticSE -threads 10 -phred64 SRR019072.fastq SRR019072.trim.fq ILLUMINACLIP:remove_adaptor_PCR.fa:2:40:15 LEADING:2 TRAILING:2 MINLEN:60

    Because they were using the single-end adaptors, the remove_adaptor_PCR.fa file is as follows:
    >Prefix_PCR_PRIMER_SEQUENCE/1
    AATGATACGGCGACCACCGAGATCTACACTCTTTCCCTACACGACGCTCTTCCGATCT
    >Prefix_PCR_PRIMER_SEQUENCE/2
    CAAGCAGAAGACGGCATACGAGCTCTTCCGATCT
    >ADAPTOR_SEQUENCE_B
    ACACTCTTTCCCTACACGACGCTCTTCCGATCT
    >ADAPTOR_SEQUENCE_A
    GATCGGAAGAGCTCGTATGCCGTCTTCTGCTTG

    This is the running results:
    ILLUMINACLIP: Using 1 prefix pairs, 2 forward/reverse sequences, 0 forward only sequences, 0 reverse only sequences
    Input Reads: 13074151 Surviving: 4569673 (34.95%) Dropped: 8504478 (65.05%)
    TrimmomaticSE: Completed successfully


    It seems that 65% of reads were dropped.
    Moreover, even 65% reads were dropped, the trimmed results still did not pass the FastQC, especially for the "Per base sequence content" and "Kmer Content".


    Therefore, I need your helps to figure out which parameter I used was not proper? Or the remove_adaptor_PCR.fa is not correct?
    Do we need specific parameters for bisulfite sequencing data?


    Many thanks and best regards,
    Jerry

Latest Articles

Collapse

  • seqadmin
    Pathogen Surveillance with Advanced Genomic Tools
    by seqadmin




    The COVID-19 pandemic highlighted the need for proactive pathogen surveillance systems. As ongoing threats like avian influenza and newly emerging infections continue to pose risks, researchers are working to improve how quickly and accurately pathogens can be identified and tracked. In a recent SEQanswers webinar, two experts discussed how next-generation sequencing (NGS) and machine learning are shaping efforts to monitor viral variation and trace the origins of infectious...
    03-24-2025, 11:48 AM
  • seqadmin
    New Genomics Tools and Methods Shared at AGBT 2025
    by seqadmin


    This year’s Advances in Genome Biology and Technology (AGBT) General Meeting commemorated the 25th anniversary of the event at its original venue on Marco Island, Florida. While this year’s event didn’t include high-profile musical performances, the industry announcements and cutting-edge research still drew the attention of leading scientists.

    The Headliner
    The biggest announcement was Roche stepping back into the sequencing platform market. In the years since...
    03-03-2025, 01:39 PM

ad_right_rmr

Collapse

News

Collapse

Topics Statistics Last Post
Started by seqadmin, 03-20-2025, 05:03 AM
0 responses
42 views
0 reactions
Last Post seqadmin  
Started by seqadmin, 03-19-2025, 07:27 AM
0 responses
52 views
0 reactions
Last Post seqadmin  
Started by seqadmin, 03-18-2025, 12:50 PM
0 responses
38 views
0 reactions
Last Post seqadmin  
Started by seqadmin, 03-03-2025, 01:15 PM
0 responses
194 views
0 reactions
Last Post seqadmin  
Working...