Trimmomatic settings for RNAseq analysis on hg19?

nyb2007


01-12-2024, 12:46 AM

Hi,

I hope this is the right forum to post in; if not, I apologize.

I am new to RNAseq analysis and am currently working on a differential gene expression project on human cell lines (disease vs. control). I am using Trimmomatic to trim the sample data, and I have some questions about the best parameters for this specific project.

The sequencing was done on an Illumina NovaSeq using paired-end sequencing, with 10 million reads per sample. After running FastQC, I can see that the "Illumina Universal Adapter" is overrepresented in most samples and that the read length is 151 bp.

For reference here is the example posted on the Trimmomatic website:

java -jar trimmomatic-0.39.jar PE input_forward.fq.gz input_reverse.fq.gz output_forward_paired.fq.gz output_forward_unpaired.fq.gz output_reverse_paired.fq.gz output_reverse_unpaired.fq.gz ILLUMINACLIP:TruSeq3-PE.fa:2:30:10:2:True LEADING:3 TRAILING:3 MINLEN:36
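The same command is easier to read with its arguments grouped by role. The snippet below is only a dry-run sketch: it prints the arguments one per line and does not execute Trimmomatic. The file names are the placeholders from the website example, not real data.

```shell
#!/bin/sh
# Dry run: lay out the example Trimmomatic command and print it, one argument
# per line. Nothing is executed. File names are placeholders from the example.
set -eu

set -- \
  java -jar trimmomatic-0.39.jar PE \
  input_forward.fq.gz input_reverse.fq.gz \
  output_forward_paired.fq.gz output_forward_unpaired.fq.gz \
  output_reverse_paired.fq.gz output_reverse_unpaired.fq.gz \
  ILLUMINACLIP:TruSeq3-PE.fa:2:30:10:2:True \
  LEADING:3 TRAILING:3 MINLEN:36

# Remember how many arguments there are and which trimming step comes last.
n_args=$#
last_arg=
for arg in "$@"; do last_arg=$arg; done

printf '%s\n' "$@"
# To run it for real, replace the printf line with: "$@"
```

The two inputs are followed by four outputs (paired and unpaired for each direction) and then the trimming steps.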

My main questions are:

1) What are the best numeric values to use at the end of the ILLUMINACLIP argument (e.g., ILLUMINACLIP:TruSeq3-PE.fa:2:30:10:2:True)? And do they differ depending on the type of project (e.g., why would they be different for an RNAseq project versus a DNA assembly project)?

2) What would be the best / most common values for the LEADING (remove low-quality bases from the beginning), TRAILING (remove low-quality bases from the end), and MINLEN (remove reads below a minimum length) arguments?
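For context on question 1, my reading of the Trimmomatic manual is that the colon-separated ILLUMINACLIP fields are, in order: the adapter FASTA file, seed mismatches, palindrome clip threshold, simple clip threshold, and the optional minAdapterLength and keepBothReads. The sketch below only splits the example string into those fields. The shell variable names are my own labels, and the values are the ones from the website example, not recommendations.

```shell
#!/bin/sh
# Split an ILLUMINACLIP step into its colon-separated fields.
# Variable names are illustrative labels, not Trimmomatic option names.
# Assumes the adapter file path itself contains no colon.
step='ILLUMINACLIP:TruSeq3-PE.fa:2:30:10:2:True'

IFS=: read -r name adapters seed_mismatches palindrome_thresh simple_thresh min_adapter_len keep_both <<EOF
$step
EOF

printf 'step name                : %s\n' "$name"
printf 'adapter FASTA file       : %s\n' "$adapters"
printf 'seed mismatches          : %s\n' "$seed_mismatches"
printf 'palindrome clip threshold: %s\n' "$palindrome_thresh"
printf 'simple clip threshold    : %s\n' "$simple_thresh"
printf 'minAdapterLength         : %s\n' "$min_adapter_len"
printf 'keepBothReads            : %s\n' "$keep_both"
```

So the "numeric values at the end" I am asking about are the seed mismatches, the two clip thresholds, and the minimum adapter length.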

I apologize if these are very basic questions. I am not sure what the best practices are for this type of QC, and I am curious how they differ between research projects and what the standard practices are.

Thank you for your help.

Nathan
