Seqanswers Leaderboard Ad

Collapse

Announcement

Collapse
No announcement yet.
X
 
  • Filter
  • Time
  • Show
Clear All
new posts

  • Trimmomatic settings for RNAseq analysis on hg19?

    Hi,

    I hope this is the right forum to post to, if not I apologize.

    I am new to RNAseq analysis, and currently undertaking a project doing differential gene expression analysis on human cell lines (disease vs. control). I am using Trimmomatic to perform trimming on the sample data, and I had some questions about the best parameters to use for the specific project I am doing.

    The sequencing was done with an Illumina NovaSeq machine using paired end sequencing and 10m reads per sample. After performing FastQC I can see that the "Illumina Universal Adapter" is over represented in most samples, and that the read length is 151bp.

    For reference here is the example posted on the Trimmomatic website:

    java -jar trimmomatic-0.39.jar PE input_forward.fq.gz input_reverse.fq.gz output_forward_paired.fq.gz output_forward_unpaired.fq.gz output_reverse_paired.fq.gz output_reverse_unpaired.fq.gz ILLUMINACLIP:TruSeq3-PE.fa:2:30:10:2:True LEADING:3 TRAILING:3 MINLEN:36​

    My main questions are:

    1) what are the best numeric values to use for the end of the ILLUMINACLIP argument (e.g., ILLUMINACLIP:TruSeq3-PE.fa:2:30:10:2:True)? And does this differ depending on what the project is about (e.g., why would this be different for a RNAseq project versus a DNA assembly project)?

    2) what would the best / most common values for the LEADING (remove low quality bases from the beginning), TRAILING (remove low quality bases from the end), and MINLEN (remove reads below a minimum length) arguments?

    I apologize if these are very basic questions. I am not sure what the best practices are for performing this type of QC, and curious how it differs between research projects and what the standard practices are.

    Thanks you for you help.

    Nathan




Latest Articles

Collapse

  • seqadmin
    Essential Discoveries and Tools in Epitranscriptomics
    by seqadmin




    The field of epigenetics has traditionally concentrated more on DNA and how changes like methylation and phosphorylation of histones impact gene expression and regulation. However, our increased understanding of RNA modifications and their importance in cellular processes has led to a rise in epitranscriptomics research. “Epitranscriptomics brings together the concepts of epigenetics and gene expression,” explained Adrien Leger, PhD, Principal Research Scientist...
    04-22-2024, 07:01 AM
  • seqadmin
    Current Approaches to Protein Sequencing
    by seqadmin


    Proteins are often described as the workhorses of the cell, and identifying their sequences is key to understanding their role in biological processes and disease. Currently, the most common technique used to determine protein sequences is mass spectrometry. While still a valuable tool, mass spectrometry faces several limitations and requires a highly experienced scientist familiar with the equipment to operate it. Additionally, other proteomic methods, like affinity assays, are constrained...
    04-04-2024, 04:25 PM

ad_right_rmr

Collapse

News

Collapse

Topics Statistics Last Post
Started by seqadmin, Yesterday, 11:49 AM
0 responses
15 views
0 likes
Last Post seqadmin  
Started by seqadmin, 04-24-2024, 08:47 AM
0 responses
16 views
0 likes
Last Post seqadmin  
Started by seqadmin, 04-11-2024, 12:08 PM
0 responses
62 views
0 likes
Last Post seqadmin  
Started by seqadmin, 04-10-2024, 10:19 PM
0 responses
60 views
0 likes
Last Post seqadmin  
Working...
X