  • STACKS and PE Illumina Data (ddRAD)

    I have a dataset from a ddRAD library (Illumina HiSeq4K, 150PE). I'm playing around with STACKS, and noticed that it has some limitations when dealing with paired-end data from ddRAD datasets.

    Namely, it treats the two reads of each pair as separate, independent loci.

    This seems like a problem from a population-genomic point of view, since there is a base assumption (admittedly often violated) of independence between loci. With paired data (single-end ("SE") reads and their paired-end ("PE") mates), the two loci come from the same fragment, so they are very unlikely to be independent.

    What I have done so far is demultiplex each individual's data by inline barcode sequence(s) and apply a rough quality filter (sliding-window 15%, min quality score = 10). That leaves me with four files for each individual:

    One file of SE reads and one file of PE reads, in-phase (where both reads from the same fragment/cluster were kept and not discarded).

    One file of SE reads, whose PE counterpart has been discarded.

    One file of PE reads, whose SE counterpart has been discarded.
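The four-way split described above can be rebuilt from read headers alone. Here is a minimal sketch in Python, assuming Illumina-style FASTQ headers in which both mates share the same ID up to the first whitespace (or up to a trailing /1 or /2). The function names are illustrative and are not part of STACKS.

```python
def normalize_id(header):
    """Reduce a FASTQ header line to the fragment ID shared by both mates."""
    # Drop the leading "@" and anything after the first whitespace.
    read_id = header.lstrip("@").split()[0]
    # Older Illumina headers mark the mate with a trailing /1 or /2.
    if read_id.endswith(("/1", "/2")):
        read_id = read_id[:-2]
    return read_id


def partition_ids(se_headers, pe_headers):
    """Split fragment IDs into (in-phase, SE-only, PE-only) sets."""
    se_ids = {normalize_id(h) for h in se_headers}
    pe_ids = {normalize_id(h) for h in pe_headers}
    return se_ids & pe_ids, se_ids - pe_ids, pe_ids - se_ids
```

The first set corresponds to the two in-phase files, and the other two sets to the orphaned SE and PE files.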

    Only one read of a pair, SE or PE, needs to carry a SNP; the other can be discarded. Can STACKS keep track of this? Can it carry the read headers through assembly and SNP calling, and use them to work out which sequences form a pair?

    Or is that information lost? (I suspect that for the remainders - the latter two files described - it would be quite difficult to recover that information...)
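Whether STACKS retains this information is the open question here. If per-read locus assignments can be exported for each read set, though, the pairing could in principle be recovered outside STACKS. The sketch below assumes two hypothetical dicts that map a fragment ID to the locus its read was assigned to. How to build those dicts from STACKS output is not shown, and `link_loci` and `min_pairs` are made-up names.

```python
from collections import Counter


def link_loci(se_locus_by_read, pe_locus_by_read, min_pairs=2):
    """Return {(se_locus, pe_locus): n_fragments} for loci joined by read pairs.

    Both arguments are hypothetical dicts mapping a fragment ID to a locus ID.
    Reads present in only one dict (the orphaned files) add no support, which
    is why their pairing cannot be recovered this way.
    """
    support = Counter()
    for read_id, se_locus in se_locus_by_read.items():
        pe_locus = pe_locus_by_read.get(read_id)
        if pe_locus is not None:
            support[(se_locus, pe_locus)] += 1
    # Keep only locus pairs backed by at least min_pairs shared fragments.
    return {pair: n for pair, n in support.items() if n >= min_pairs}
```

Locus pairs returned this way could be treated as a single linked unit downstream, for example by keeping a SNP from only one member of each pair.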

    Alternatively, is it worth throwing caution to the wind, analysing the SE and PE reads as they are, and pooling all of those data at the end? Is there another approach here?

    Many thanks,
    Sean
