Seqanswers Leaderboard Ad

Collapse

Announcement

Collapse
No announcement yet.
X
 
  • Filter
  • Time
  • Show
Clear All
new posts

  • Problems Counting SAM files with dexseq_count.py

    Hi all,

    I'm having problems counting my SAM files with "dexseq_counts.py". I have counted them before with a custom generated gtf file (I used Stringtie to generate it) and it was fine. Now I'm trying to use a new version of the gff file, where i have done some clean up, removing single exon transcripts. My problems is that when I try to count my files with the new gff file I get the following error:

    Traceback (most recent call last):
    File "dexseq_count.py", line 97, in <module>
    features[f.iv] += f
    File "_HTSeq.pyx", line 524, in HTSeq._HTSeq.GenomicArray.__getitem__ (src/_HTSeq.c:10761)
    File "_HTSeq.pyx", line 384, in HTSeq._HTSeq.ChromVector.__getitem__ (src/_HTSeq.c:7809)
    File "_HTSeq.pyx", line 348, in HTSeq._HTSeq.ChromVector._create_view (src/_HTSeq.c:7382)
    IndexError: Cannot subset to zero-length interval.


    This is my command:
    python dexseq_count.py GFF_test3.gff -p yes -r name -s reverse ../../trimmed/alignment/P0_1/P0_1_sort_name.sam P0_1_ExonCounts

    And this is how my gff file looks like:

    1 dexseq_prepare_annotation.py exonic_part 312155 312383 . + . transcripts TCONS_00000001+TCONS_00000002+TCONS_00000003; exonic_part_number 001; gene_id XLOC_000001
    1 dexseq_prepare_annotation.py exonic_part 312361 312383 . + . transcripts TCONS_00000002; exonic_part_number 002; gene_id XLOC_000001
    1 dexseq_prepare_annotation.py exonic_part 315197 315479 . + . transcripts TCONS_00000001; exonic_part_number 003; gene_id XLOC_000001
    1 dexseq_prepare_annotation.py exonic_part 315933 316737 . + . transcripts TCONS_00000001+TCONS_00000004; exonic_part_number 004; gene_id XLOC_000001
    1 dexseq_prepare_annotation.py exonic_part 316355 316737 . + . transcripts TCONS_00000001; exonic_part_number 005; gene_id XLOC_000001
    1 dexseq_prepare_annotation.py exonic_part 317591 317815 . + . transcripts TCONS_00000001; exonic_part_number 006; gene_id XLOC_000001
    1 dexseq_prepare_annotation.py exonic_part 321633 321756 . + . transcripts TCONS_00000001; exonic_part_number 007; gene_id XLOC_000001
    1 dexseq_prepare_annotation.py exonic_part 322916 323800 . + . transcripts TCONS_00000001; exonic_part_number 008; gene_id XLOC_000001
    1 dexseq_prepare_annotation.py exonic_part 382921 383126 . + . transcripts TCONS_00000005+TCONS_00000006; exonic_part_number 009; gene_id XLOC_000001
    1 dexseq_prepare_annotation.py exonic_part 383127 383127 . + . transcripts TCONS_00000005; exonic_part_number 010; gene_id XLOC_000001

    Since I haven't changed my SAM files or anything in my computer, I'm guessing my problem is in the gff. I thought it could be due to exons with just 1 nt, but i had more of those in the original file before and that wasn't a problem before. Anyone has an idea what could be causing the error? I'd appreciate any help I can get. I'm very lost right now.
    Thank you very much,

    Miriam

  • #2
    I would actually guess that you have a 0-length entry. A simple way to debug things like this is to subset the gff file until the problem goes away. You can then quickly find the region that's causing this and manually have a look.

    Comment

    Latest Articles

    Collapse

    • seqadmin
      Understanding Genetic Influence on Infectious Disease
      by seqadmin




      During the COVID-19 pandemic, scientists observed that while some individuals experienced severe illness when infected with SARS-CoV-2, others were barely affected. These disparities left researchers and clinicians wondering what causes the wide variations in response to viral infections and what role genetics plays.

      Jean-Laurent Casanova, M.D., Ph.D., Professor at Rockefeller University, is a leading expert in this crossover between genetics and infectious...
      09-09-2024, 10:59 AM
    • seqadmin
      Addressing Off-Target Effects in CRISPR Technologies
      by seqadmin






      The first FDA-approved CRISPR-based therapy marked the transition of therapeutic gene editing from a dream to reality1. CRISPR technologies have streamlined gene editing, and CRISPR screens have become an important approach for identifying genes involved in disease processes2. This technique introduces targeted mutations across numerous genes, enabling large-scale identification of gene functions, interactions, and pathways3. Identifying the full range...
      08-27-2024, 04:44 AM

    ad_right_rmr

    Collapse

    News

    Collapse

    Topics Statistics Last Post
    Started by seqadmin, 09-11-2024, 02:44 PM
    0 responses
    11 views
    0 likes
    Last Post seqadmin  
    Started by seqadmin, 09-06-2024, 08:02 AM
    0 responses
    145 views
    0 likes
    Last Post seqadmin  
    Started by seqadmin, 09-03-2024, 08:30 AM
    0 responses
    152 views
    0 likes
    Last Post seqadmin  
    Started by seqadmin, 08-27-2024, 04:40 AM
    0 responses
    161 views
    0 likes
    Last Post seqadmin  
    Working...
    X