Unconfigured Ad

Collapse
X
 
  • Filter
  • Time
  • Show
Clear All
new posts
  • virg4l
    Junior Member
    • May 2015
    • 7

    HTseq-count errors

    I put 10 .bam files (from Tophat) through the same samtools & htseq-count commands, and 3/10 didn't work (each with their own error) in htseq.

    $ samtools view -h -o sample.sam sample.bam

    $ htseq-count --stranded no -i gene_id sample.sam genes.gtf > sample.gene.counts

    The errors are:

    1) ...596437 GFF lines processed.
    Error occured when reading beginning of SAM/BAM file.
    [Exception type: StopIteration, raised in count.py:84]

    This one particularly confuses me since the program didn't choke on any of the other sam files that were prepared the same way.

    2) ...20000000 SAM alignment records processed.
    Error occured when processing SAM input (line 20080682 of file TH2/DEprep/BR2_N2In.sam):
    ('SAM line does not contain at least 11 tab-delimited fields.', 'line 20080682 of file TH2/DEprep/BR2_N2In.sam')
    [Exception type: ValueError, raised in _HTSeq.pyx:1276]

    3) ...35089676 SAM alignments processed.
    [Errno 5] Input/output error
    [Exception type: IOError, raised in count.py:212]


    Any suggestions for how I should fix these problems?
  • dpryan
    Devon Ryan
    • Jul 2011
    • 3478

    #2
    I wonder if you're having hard drive corruption issues.

    In any case, make sure you're using the most recent version of htseq-count. Also, check line 20080682 of the sam file mentioned and ensure it's not malformed.

    Comment

    Latest Articles

    Collapse

    • SEQadmin2
      Nine Things a Sample Prep Scientist Thinks About Before Sequencing
      by SEQadmin2


      I’m not a sequencing expert. I’m a purification scientist who uses NGS to evaluate workflows my group develops. With this perspective, we think about the sample first and the NGS workflow second. The sequencer is an exceptionally honest reporter, but it can only report on what you give it, so whether you get clean, interpretable data from an NGS workflow is largely determined before you begin.

      Here are nine questions we think about, in roughly the order they matter, before...
      06-18-2026, 07:11 AM
    • SEQadmin2
      From Collection to Sequencing: Why Sample Preparation and Preservation Define Sequencing Data
      by SEQadmin2


      Data variability is still an issue in sequencing technologies despite the advances in reproducibility and accuracy of these platforms. But the problem does not originate in the sequencing itself, but in the previous steps, before the sample reaches the sequencer.


      The first step is collection, followed by preservation and sample preparation for analysis. Most scientists overlook those steps, but not being careful might just be skewing the experiment’s results.
      ...
      06-02-2026, 10:05 AM

    ad_right_rmr

    Collapse

    News

    Collapse

    Topics Statistics Last Post
    Started by SEQadmin2, 06-26-2026, 11:10 AM
    0 responses
    16 views
    0 reactions
    Last Post SEQadmin2  
    Started by SEQadmin2, 06-17-2026, 06:09 AM
    0 responses
    49 views
    0 reactions
    Last Post SEQadmin2  
    Started by SEQadmin2, 06-09-2026, 11:58 AM
    0 responses
    108 views
    0 reactions
    Last Post SEQadmin2  
    Started by SEQadmin2, 06-05-2026, 10:09 AM
    0 responses
    125 views
    0 reactions
    Last Post SEQadmin2  
    Working...