Seqanswers Leaderboard Ad

Collapse

Announcement

Collapse
No announcement yet.
X
 
  • Filter
  • Time
  • Show
Clear All
new posts

  • FeatureCounts and E.coli Reference

    Hi all,

    I am struggling to count Bowtie2-mapped reads using featureCounts and after three days of investigation I hope you can help me.

    My workflow is simple.
    Bacterial (E.coli) Illumina RNAseq data -> FASTQC -> Bowtie2 to custom E.coli strain reference -> Samtools subsampling -> FeatureCounts.

    Bowtie2 Mapped Reads: 34,445,043
    FeatureCounts: 2,672,611 (7.8%) Successfully assigned alignments

    The call:
    featureCounts -a reference.gtf -p -T 10 -M -O -t CDS -g ID -o output.txt mapped.bam

    Why does featureCounts only annotate 7% of the reads, even though the entire genome is full of perfectly annotated genes (see attachment) ?
    How to run featureCounts properly on bacterial RNAseq data?
    Is there an alternative software to featureCounts?

    Please have a look at the IGV attachment. Everything looks fine, I am very desperate how to debug this situation. I tried almost all combinations of featureCounts paramters. I used PROKKA and BAKTA genome annotations.

    If you have any guess or idea how to come closer to the origin of the problem please let me know.

    Here is the featureCounts output:
    ​Assigned 2672611
    Unassigned_Unmapped 0
    Unassigned_Read_Type 0
    Unassigned_Singleton 0
    Unassigned_MappingQuality 0
    Unassigned_Chimera 0
    Unassigned_FragmentLength 0
    Unassigned_Duplicate 0
    Unassigned_MultiMapping 0
    Unassigned_Secondary 0
    Unassigned_NonSplit 0
    Unassigned_NoFeatures 31772432
    Unassigned_Overlapping_Length 0
    Unassigned_Ambiguity 0​

    Best,
    Michael

  • #2
    I solved the problem. The issue was that most of the reads correspond to 23S rRNA which is only visible if you change the featureCounts type to rRNA.

    Comment

    Latest Articles

    Collapse

    • seqadmin
      Genetic Variation in Immunogenetics and Antibody Diversity
      by seqadmin



      The field of immunogenetics explores how genetic variations influence immune responses and susceptibility to disease. In a recent SEQanswers webinar, Oscar Rodriguez, Ph.D., Postdoctoral Researcher at the University of Louisville, and Ruben Martínez Barricarte, Ph.D., Assistant Professor of Medicine at Vanderbilt University, shared recent advancements in immunogenetics. This article discusses their research on genetic variation in antibody loci, antibody production processes,...
      11-06-2024, 07:24 PM
    • seqadmin
      Choosing Between NGS and qPCR
      by seqadmin



      Next-generation sequencing (NGS) and quantitative polymerase chain reaction (qPCR) are essential techniques for investigating the genome, transcriptome, and epigenome. In many cases, choosing the appropriate technique is straightforward, but in others, it can be more challenging to determine the most effective option. A simple distinction is that smaller, more focused projects are typically better suited for qPCR, while larger, more complex datasets benefit from NGS. However,...
      10-18-2024, 07:11 AM

    ad_right_rmr

    Collapse

    News

    Collapse

    Topics Statistics Last Post
    Started by seqadmin, Today, 11:09 AM
    0 responses
    24 views
    0 likes
    Last Post seqadmin  
    Started by seqadmin, Today, 06:13 AM
    0 responses
    20 views
    0 likes
    Last Post seqadmin  
    Started by seqadmin, 11-01-2024, 06:09 AM
    0 responses
    30 views
    0 likes
    Last Post seqadmin  
    Started by seqadmin, 10-30-2024, 05:31 AM
    0 responses
    21 views
    0 likes
    Last Post seqadmin  
    Working...
    X