Unconfigured Ad

Collapse
X
 
  • Filter
  • Time
  • Show
Clear All
new posts
  • VeenaV
    Junior Member
    • Jul 2010
    • 4

    Reads length and display in tview

    Hi,

    I had a couple of questions:

    (1) I used the below command to generate the pileup (I created a bowtie index for hg18 & then ran tophat on the sequence data against the UCSC gene annotation "tophat -G hg18.gff3 -i 30 -I 15000 ../bowtie-0.12.5/hg18 s_o_sequence.fastq", the resultant accepted_hits.sam output file was sorted and indexed).
    "samtools*pileup*-vcf*hg18.fa*accepted_hits_sorted.bam*>*accepted_hits_sorted.pileup"

    A couple of lines from accepted_hits_sorted.pileup [The number of reads covering chr1, co-ordinate 747408 (SNP quality score 140) is 28 as per the pileup output]:
    chr1 747408 A M 140 140 60 28 ,,c,ccc,,,,,,c,,,,,,,,,,cccc YWPNKONQQOTWSMHPSGQXXXGGHTJO
    chr1 748086 A M 22 22 60 11 ......^~,^~,^~c^~c^~, WYYXYXESTEI


    Below is a snapshot of what I see when I open tview ("samtools tview accepted_hits_sorted.bam hg18.fa") and navigate to chr1:747408 (the first dotted line appears underlined). I see just 2 reads displayed in the text alignment viewer after I navigate to chr1:747408. Shouldn’t there have been 28 reads?

    747411 747421 747431 747441 747451 747461 747471 747481 747491 747501 747511 747521
    ACCCTGTCTATACTACCTGCCTGTCTAGCAGATCCACCCTGTCTATACTACCTGCCCATCCAGCAGGTCCACCCTGTCTACACTACCTCCCTGNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNN
    ....................................... .............C.........................
    .......................................
    .......................................
    ,,,,,,,,,,,,,c,,,,,,,,,,,,,,,,,,,,,,,,,


    (2) The read length for the sequence data is 33. However, some of the reads displayed in tview are much shorter. Why would that be?
    These are Illumina reads. I used tophat (command used is mentioned above) to generate the accepted_hits.sam from the fastq sequence file. Tophat internally runs the bowtie aligner.
    PS: I have other files wherein I ran BWA, but I haven’t tried viewing them yet using tview.

    Any suggestions/comments will be helpful.

    Thanks in anticipation,
    Veena

Latest Articles

Collapse

  • GATTACAT
    Reply to Nine Things a Sample Prep Scientist Thinks About Before Sequencing
    by GATTACAT
    Love this - good data definitely starts from good input, and poor input can only give relatively poor data. I particularly like the mention of Nanodrop/absorbance based methods for quantification. It's such a toss up if you'll get an accurate reading or what amounts to a randomly generated number, and a lot of library/sequencing related issues can be traced back to poor quant.
    07-01-2026, 11:43 AM
  • SEQadmin2
    Nine Things a Sample Prep Scientist Thinks About Before Sequencing
    by SEQadmin2


    I’m not a sequencing expert. I’m a purification scientist who uses NGS to evaluate workflows my group develops. With this perspective, we think about the sample first and the NGS workflow second. The sequencer is an exceptionally honest reporter, but it can only report on what you give it, so whether you get clean, interpretable data from an NGS workflow is largely determined before you begin.

    Here are nine questions we think about, in roughly the order they matter, before...
    06-18-2026, 07:11 AM

ad_right_rmr

Collapse

News

Collapse

Topics Statistics Last Post
Started by SEQadmin2, Yesterday, 11:08 AM
0 responses
7 views
0 reactions
Last Post SEQadmin2  
Started by SEQadmin2, 06-30-2026, 05:37 AM
0 responses
11 views
0 reactions
Last Post SEQadmin2  
Started by SEQadmin2, 06-26-2026, 11:10 AM
0 responses
20 views
0 reactions
Last Post SEQadmin2  
Started by SEQadmin2, 06-17-2026, 06:09 AM
0 responses
53 views
0 reactions
Last Post SEQadmin2  
Working...