Seqanswers Leaderboard Ad

Collapse

Announcement

Collapse
No announcement yet.
X
 
  • Filter
  • Time
  • Show
Clear All
new posts

  • Novoalign options

    Hello Group,

    I've been trying to find the optimal parameters for aligned bisulfite treated 76bp PE reads to a small reference that includes repeats. I'm confused with the option to set the fragment length and standard deviation. My question is: what exactly does the fragment length refer to? Is it the distance between mapped mates or does it include the reads.

    The library I am using isolated adapter fragments between 250-300bp (minus adapter = 131-181). Given 76 bp reads, some mates could overlap. If the fragment length refers to distance between aligned reads than this value could be negative. Anyone want to clarify?
    Thanks a lot!

  • #2
    HI, Just found your post. The fragment length mean and standard deviation are outer coordinates, it doesn't matter if you set them a bit higher than the real values.
    Colin

    Comment


    • #3
      So by outer coordinates you mean the most distal coordinates of an adapter-ligated molecule, i.e. average length of your library that you would see after PCR amplification on a gel?

      For the library I mentioned, I used 150bp as the mean fragment length since I assumed it was the distance between reads, and the alignment worked pretty well. Is it more computationally strenuous to have a larger fragment length setting?

      Comment


      • #4
        It's the length of the DNA fragment as mapped by the aligner and hence doesn't include any adapters. It's not the length of the gap between the two read alignments.
        If your gel includes PCR adapters then you need to adjust for this.

        It doesn't really affect computation as Novoalign adjusts to the fragment lengths it sees, but setting it too short may mean reads don't get paired properly.
        It's not a major issue as range for pairs is from 0 to mean + 6 standard deviations so it's usually enough.
        You can also run Novoalign on a few K reads and check the reported fragment length distribution. Use the -# option to limit the number of reads processed. e.g. -# 2K will map 2000 reads and stop.

        Comment


        • #5
          Great, that makes a lot of sense. I actually meant "reads" instead of "adapters." Of course in Illumina sequencing the read begins before the adapter since the sequencing primer anneals to the adapter... I'm glad you caught that. I appreciate all the help. Thanks a lot!

          Comment

          Latest Articles

          Collapse

          • seqadmin
            Recent Developments in Metagenomics
            by seqadmin





            Metagenomics has improved the way researchers study microorganisms across diverse environments. Historically, studying microorganisms relied on culturing them in the lab, a method that limits the investigation of many species since most are unculturable1. Metagenomics overcomes these issues by allowing the study of microorganisms regardless of their ability to be cultured or the environments they inhabit. Over time, the field has evolved, especially with the advent...
            09-23-2024, 06:35 AM
          • seqadmin
            Understanding Genetic Influence on Infectious Disease
            by seqadmin




            During the COVID-19 pandemic, scientists observed that while some individuals experienced severe illness when infected with SARS-CoV-2, others were barely affected. These disparities left researchers and clinicians wondering what causes the wide variations in response to viral infections and what role genetics plays.

            Jean-Laurent Casanova, M.D., Ph.D., Professor at Rockefeller University, is a leading expert in this crossover between genetics and infectious...
            09-09-2024, 10:59 AM

          ad_right_rmr

          Collapse

          News

          Collapse

          Topics Statistics Last Post
          Started by seqadmin, 10-02-2024, 04:51 AM
          0 responses
          8 views
          0 likes
          Last Post seqadmin  
          Started by seqadmin, 10-01-2024, 07:10 AM
          0 responses
          14 views
          0 likes
          Last Post seqadmin  
          Started by seqadmin, 09-30-2024, 08:33 AM
          0 responses
          18 views
          0 likes
          Last Post seqadmin  
          Started by seqadmin, 09-26-2024, 12:57 PM
          0 responses
          16 views
          0 likes
          Last Post seqadmin  
          Working...
          X