Announcement

Collapse
No announcement yet.
X
 
  • Filter
  • Time
  • Show
Clear All
new posts

  • Novoalign options

    Hello Group,

    I've been trying to find the optimal parameters for aligned bisulfite treated 76bp PE reads to a small reference that includes repeats. I'm confused with the option to set the fragment length and standard deviation. My question is: what exactly does the fragment length refer to? Is it the distance between mapped mates or does it include the reads.

    The library I am using isolated adapter fragments between 250-300bp (minus adapter = 131-181). Given 76 bp reads, some mates could overlap. If the fragment length refers to distance between aligned reads than this value could be negative. Anyone want to clarify?
    Thanks a lot!

  • #2
    HI, Just found your post. The fragment length mean and standard deviation are outer coordinates, it doesn't matter if you set them a bit higher than the real values.
    Colin

    Comment


    • #3
      So by outer coordinates you mean the most distal coordinates of an adapter-ligated molecule, i.e. average length of your library that you would see after PCR amplification on a gel?

      For the library I mentioned, I used 150bp as the mean fragment length since I assumed it was the distance between reads, and the alignment worked pretty well. Is it more computationally strenuous to have a larger fragment length setting?

      Comment


      • #4
        It's the length of the DNA fragment as mapped by the aligner and hence doesn't include any adapters. It's not the length of the gap between the two read alignments.
        If your gel includes PCR adapters then you need to adjust for this.

        It doesn't really affect computation as Novoalign adjusts to the fragment lengths it sees, but setting it too short may mean reads don't get paired properly.
        It's not a major issue as range for pairs is from 0 to mean + 6 standard deviations so it's usually enough.
        You can also run Novoalign on a few K reads and check the reported fragment length distribution. Use the -# option to limit the number of reads processed. e.g. -# 2K will map 2000 reads and stop.

        Comment


        • #5
          Great, that makes a lot of sense. I actually meant "reads" instead of "adapters." Of course in Illumina sequencing the read begins before the adapter since the sequencing primer anneals to the adapter... I'm glad you caught that. I appreciate all the help. Thanks a lot!

          Comment

          Latest Articles

          Collapse

          • seqadmin
            Advanced Methods for the Detection of Infectious Disease
            by seqadmin




            The recent pandemic caused worldwide health, economic, and social disruptions with its reverberations still felt today. A key takeaway from this event is the need for accurate and accessible tools for detecting and tracking infectious diseases. Timely identification is essential for early intervention, managing outbreaks, and preventing their spread. This article reviews several valuable tools employed in the detection and surveillance of infectious diseases.
            ...
            Yesterday, 01:15 PM
          • seqadmin
            Strategies for Investigating the Microbiome
            by seqadmin




            Microbiome research has led to the discovery of important connections to human and environmental health. Sequencing has become a core investigational tool in microbiome research, a subject that we covered during a recent webinar. Our expert speakers shared a number of advancements including improved experimental workflows, research involving transmission dynamics, and invaluable analysis resources. This article recaps their informative presentations, offering insights...
            11-09-2023, 07:02 AM

          ad_right_rmr

          Collapse

          News

          Collapse

          Topics Statistics Last Post
          Started by seqadmin, Yesterday, 08:12 AM
          0 responses
          14 views
          0 likes
          Last Post seqadmin  
          Started by seqadmin, 11-22-2023, 09:29 AM
          1 response
          46 views
          0 likes
          Last Post VilliamPast  
          Started by seqadmin, 11-22-2023, 08:53 AM
          0 responses
          30 views
          0 likes
          Last Post seqadmin  
          Started by seqadmin, 11-21-2023, 08:24 AM
          0 responses
          23 views
          0 likes
          Last Post seqadmin  
          Working...
          X