Seqanswers Leaderboard Ad

Collapse

Announcement

Collapse
No announcement yet.
X
 
  • Filter
  • Time
  • Show
Clear All
new posts

  • Novoalign native report - Paired reads

    Hi,

    If anyone has worked with a report (native format) generated using novoalign, please help me with these doubts. The datasets used are Illumina paired reads.

    A) Below is the snapshot of a Novoalign report (native format) for Illumina paired reads.

    Code:
    @0:1:1:34:429 L GAAGNAAAAATAAAAGCATTAGNAGAAATTTGTACA IIII$IIIII&IIIIIIIIIII$IIIIIIIIIIIII U 14 91 >gi|9629357:1-9117 2177 F . 2308 R
    @0:1:1:34:429 R TNCTTATTAAGCNCTCTGAAATNNANNNNTTTTCTC I$IIIIIIIIII$IIIIIIIII$$'$$$$IIIIIII U 126 91 >gi|9629357:1-9117 2308 R . 2177 F 25A>G 36G>A
    With the help of the Novocraft Alignment suite pdf (section Output Formats, page 24), I was able to understand certain columns in the report but please help me identify what
    1. Aligned Sequence
    2. Aligned Offseet
    3. Pair Sequence
    4. Pair Offset
    5. Mismatches
    are, in the report.

    B ) I was also looking for the aligned reads' start and end positions. Is that information available in this report?

    C) At the end of the report are 3 columns given with data

    # Fragment Length Distribution
    # From To Count
    # 27 29 4
    # 30 32 30
    # 33 35 141
    # 36 38 696
    # 39 41 1136 ..............etc


    Does this mean that from positions 27 to 29, there are 4 reads and so on.

    D) Finally, here were the report statistics.

    # Paired Reads: 9686877
    # Pairs Aligned: 6253455
    # Read Sequences: 19373754
    # Aligned: 14102273
    # Unique Alignment: 14102068
    # Gapped Alignment: 875179
    # Quality Filter: 248607
    #Homopolymer Filter: 1306


    I understand that 2 times Paired Reads = Read Sequences. Please help me in understanding why 2 times Pairs Aligned < Aligned. Again if I add Gapped Alignment with Unique Alignment, I do not get Aligned.

    Please advice.

Latest Articles

Collapse

  • seqadmin
    Best Practices for Single-Cell Sequencing Analysis
    by seqadmin



    While isolating and preparing single cells for sequencing was historically the bottleneck, recent technological advancements have shifted the challenge to data analysis. This highlights the rapidly evolving nature of single-cell sequencing. The inherent complexity of single-cell analysis has intensified with the surge in data volume and the incorporation of diverse and more complex datasets. This article explores the challenges in analysis, examines common pitfalls, offers...
    Today, 07:15 AM
  • seqadmin
    Latest Developments in Precision Medicine
    by seqadmin



    Technological advances have led to drastic improvements in the field of precision medicine, enabling more personalized approaches to treatment. This article explores four leading groups that are overcoming many of the challenges of genomic profiling and precision medicine through their innovative platforms and technologies.

    Somatic Genomics
    “We have such a tremendous amount of genetic diversity that exists within each of us, and not just between us as individuals,”...
    05-24-2024, 01:16 PM

ad_right_rmr

Collapse

News

Collapse

Topics Statistics Last Post
Started by seqadmin, Today, 08:18 AM
0 responses
8 views
0 likes
Last Post seqadmin  
Started by seqadmin, Today, 08:04 AM
0 responses
10 views
0 likes
Last Post seqadmin  
Started by seqadmin, 06-03-2024, 06:55 AM
0 responses
13 views
0 likes
Last Post seqadmin  
Started by seqadmin, 05-30-2024, 03:16 PM
0 responses
27 views
0 likes
Last Post seqadmin  
Working...
X