Unconfigured Ad

Collapse
X
 
  • Filter
  • Time
  • Show
Clear All
new posts
  • slp
    Junior Member
    • Dec 2010
    • 9

    Novoalign query

    I am running into problems trying to run novoalign. We ran a 96 bp paired end exome sequencing exp. But the fastq files have reads ranging from 35-96bp (because of adapter trimming).
    Each fast file has about 10s of Millions of reads. But when I try to align the reads with the following command
    novoalign -d hg38.nix -f read1.fastq read2.fastq -o SAM > Out.sam

    only few hundred reads are processed out of the Millions of reads (Please see below). I am not sure whats going wrong here.

    Interpreting input files as Illumina FASTQ, Casava Pipeline 1.8 & later.
    # Index Build Version: 3.3
    # Hash length: 14
    # Step size: 2
    # Paired Reads: 259
    # Proper Pairs: 193 (74.5%)
    # Read Sequences: 518
    # Unique Alignment: 469 (90.5%)
    # Multi Mapped: 36 ( 6.9%)
    # No Mapping Found: 7 ( 1.4%)
    # QC Failures...
    # Read Length: 6 ( 1.2%)
    # Elapsed Time: 5.855 (secs.)
    # CPU Time: 12.41 (secs.)
    # Fragment Length Distribution
    # From To Count
    # 45 59 11
    # 60 74 18
    # 75 89 3
    # 90 104 33
    # 105 119 23
    # 120 134 19
    # 135 149 17
    # 150 164 9
    # 165 179 9
    # 180 194 7
    # 195 209 4
    # 210 224 3
    # 225 239 4
    # 240 254 3
    # 255 269 2
    # 270 284 1
    # 285 299 1
    # 300 314 2
    # 315 329 0
    # 330 344 0
    # 345 359 0
    # 360 374 0
    # 375 389 0
    # 390 404 1
    # 405 419 1
    # Mean 132, Std Dev 62.0
    # Done at Mon Nov 9 20:31:31 2015


    Any help is much appreciated.

Latest Articles

Collapse

  • SEQadmin2
    Nine Things a Sample Prep Scientist Thinks About Before Sequencing
    by SEQadmin2


    I’m not a sequencing expert. I’m a purification scientist who uses NGS to evaluate workflows my group develops. With this perspective, we think about the sample first and the NGS workflow second. The sequencer is an exceptionally honest reporter, but it can only report on what you give it, so whether you get clean, interpretable data from an NGS workflow is largely determined before you begin.

    Here are nine questions we think about, in roughly the order they matter, before...
    06-18-2026, 07:11 AM
  • SEQadmin2
    From Collection to Sequencing: Why Sample Preparation and Preservation Define Sequencing Data
    by SEQadmin2


    Data variability is still an issue in sequencing technologies despite the advances in reproducibility and accuracy of these platforms. But the problem does not originate in the sequencing itself, but in the previous steps, before the sample reaches the sequencer.


    The first step is collection, followed by preservation and sample preparation for analysis. Most scientists overlook those steps, but not being careful might just be skewing the experiment’s results.
    ...
    06-02-2026, 10:05 AM

ad_right_rmr

Collapse

News

Collapse

Topics Statistics Last Post
Started by SEQadmin2, Today, 05:37 AM
0 responses
5 views
0 reactions
Last Post SEQadmin2  
Started by SEQadmin2, 06-26-2026, 11:10 AM
0 responses
16 views
0 reactions
Last Post SEQadmin2  
Started by SEQadmin2, 06-17-2026, 06:09 AM
0 responses
50 views
0 reactions
Last Post SEQadmin2  
Started by SEQadmin2, 06-09-2026, 11:58 AM
0 responses
110 views
0 reactions
Last Post SEQadmin2  
Working...