FeatureCounts and E.coli Reference

mitchum20

Junior Member

Join Date: Mar 2018

Posts: 6
- Share
- Tweet
#1

FeatureCounts and E.coli Reference

06-07-2024, 05:57 AM

Hi all,

I am struggling to count Bowtie2-mapped reads using featureCounts and after three days of investigation I hope you can help me.

My workflow is simple.
Bacterial (E.coli) Illumina RNAseq data -> FASTQC -> Bowtie2 to custom E.coli strain reference -> Samtools subsampling -> FeatureCounts.

Bowtie2 Mapped Reads: 34,445,043
FeatureCounts: 2,672,611 (7.8%) Successfully assigned alignments

The call:
featureCounts -a reference.gtf -p -T 10 -M -O -t CDS -g ID -o output.txt mapped.bam

Why does featureCounts only annotate 7% of the reads, even though the entire genome is full of perfectly annotated genes (see attachment) ?
How to run featureCounts properly on bacterial RNAseq data?
Is there an alternative software to featureCounts?

Please have a look at the IGV attachment. Everything looks fine, I am very desperate how to debug this situation. I tried almost all combinations of featureCounts paramters. I used PROKKA and BAKTA genome annotations.

If you have any guess or idea how to come closer to the origin of the problem please let me know.

Here is the featureCounts output:
Assigned 2672611
Unassigned_Unmapped 0
Unassigned_Read_Type 0
Unassigned_Singleton 0
Unassigned_MappingQuality 0
Unassigned_Chimera 0
Unassigned_FragmentLength 0
Unassigned_Duplicate 0
Unassigned_MultiMapping 0
Unassigned_Secondary 0
Unassigned_NonSplit 0
Unassigned_NoFeatures 31772432
Unassigned_Overlapping_Length 0
Unassigned_Ambiguity 0

Best,
Michael

1 Photo
Tags: None
mitchum20

Junior Member

Join Date: Mar 2018

Posts: 6
- Share
- Tweet
#2

06-23-2024, 11:43 PM

I solved the problem. The issue was that most of the reads correspond to 23S rRNA which is only visible if you change the featureCounts type to rRNA.
Comment

Previous template Next

Genetic Variation in Immunogenetics and Antibody Diversity

by seqadmin

The field of immunogenetics explores how genetic variations influence immune responses and susceptibility to disease. In a recent SEQanswers webinar, Oscar Rodriguez, Ph.D., Postdoctoral Researcher at the University of Louisville, and Ruben Martínez Barricarte, Ph.D., Assistant Professor of Medicine at Vanderbilt University, shared recent advancements in immunogenetics. This article discusses their research on genetic variation in antibody loci, antibody production processes,...
- Channel: Articles
11-06-2024, 07:24 PM
Choosing Between NGS and qPCR

by seqadmin

Next-generation sequencing (NGS) and quantitative polymerase chain reaction (qPCR) are essential techniques for investigating the genome, transcriptome, and epigenome. In many cases, choosing the appropriate technique is straightforward, but in others, it can be more challenging to determine the most effective option. A simple distinction is that smaller, more focused projects are typically better suited for qPCR, while larger, more complex datasets benefit from NGS. However,...
- Channel: Articles
10-18-2024, 07:11 AM

Topics	Statistics	Last Post
ASHG 2024 Highlights – Part Two by seqadmin Started by seqadmin, Today, 11:09 AM	0 responses 24 views 0 likes	Last Post by seqadmin Today, 11:09 AM
ASHG 2024 Highlights – Part One by seqadmin Started by seqadmin, Today, 06:13 AM	0 responses 20 views 0 likes	Last Post by seqadmin Today, 06:13 AM
Seq-Scope Expands Possibilities for High-Resolution Gene Expression Analysis by seqadmin Started by seqadmin, 11-01-2024, 06:09 AM	0 responses 30 views 0 likes	Last Post by seqadmin 11-01-2024, 06:09 AM
New Model Aims to Explain Polygenic Diseases by Connecting Genomic Mutations and Regulatory Networks by seqadmin Started by seqadmin, 10-30-2024, 05:31 AM	0 responses 21 views 0 likes	Last Post by seqadmin 10-30-2024, 05:31 AM

Seqanswers Leaderboard Ad

Announcement

FeatureCounts and E.coli Reference

Comment

Latest Articles

ad_right_rmr

News