The khmer suite has a workflow for partitioning reads based on graph connectivity, which I want to test on a specific (huge) paired-end dataset.
However, from the documentation I am not sure how exactly it treats read pairs; the arguments that specify how read pairs should be handled seem to exist only for the normalization workflows.
For example, load-graph.py (the first step of the partitioning workflow) accepts one or more input files. If I supply multiple input files, will the script automatically assume paired-end reads in separate files (forward reads in the first file, reverse reads in the second)? And will it assume interleaved reads if I supply only a single input file?
In which form will the partitioned reads be written out? Automatically interleaved? Or possibly all forward reads first and then all reverse reads (which would require another processing step before the paired reads can be assembled)?
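For context, the command sequence I am planning to run looks roughly like this (the script names are the ones from the khmer partitioning documentation; the k-mer size, table size, and file names are placeholders for my actual data). My question is essentially whether this sequence keeps the pairing between reads_1 and reads_2 intact:

```shell
# Build the k-mer graph from the reads (here: two separate paired-end files).
# -k (k-mer size) and -x (hash table size) are placeholder values.
load-graph.py -k 32 -x 8e9 mygraph reads_1.fastq reads_2.fastq

# Partition the graph into connected components, then merge the partition maps.
partition-graph.py mygraph
merge-partitions.py mygraph

# Tag each read with its partition ID, then split the reads into
# per-partition group files.
annotate-partitions.py mygraph reads_1.fastq reads_2.fastq
extract-partitions.py myparts reads_1.fastq.part reads_2.fastq.part
```

As far as I can tell, nothing in these steps is pair-aware, which is what prompts the questions above.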