Truncating reads in fastq file

timydaley

Member

Join Date: Jun 2010

Posts: 26
- Share
- Tweet
#1

Truncating reads in fastq file

02-03-2014, 02:42 PM

I want to simulate shorter reads from a particular dataset. Say I want 50bp paired end reads from a 100bp paired end read data set while keeping the same insert size. Would I take the first 50 characters of the sequence and the score strings from each end? Or would I take the first 50 from the first end and the last 50 from the last end.

Extracting the characters is easy with a simple awk command. I'm just curious about the order.

Thank you very much.

Last edited by timydaley; 02-03-2014, 02:53 PM.
Tags: None
mastal

Senior Member

Join Date: Mar 2009

Posts: 666
- Share
- Tweet
#2

02-03-2014, 03:09 PM

the first 50 from the second reads of the pair, if you want to keep the same insert size, because the sequences are given from 5' to 3', so the first 50 bases are the ones from the end of the fragment.
Comment

Previous template Next

Topics	Statistics	Last Post
ASHG 2024 Highlights – Part Two by seqadmin Started by seqadmin, Today, 11:09 AM	0 responses 24 views 0 likes	Last Post by seqadmin Today, 11:09 AM
ASHG 2024 Highlights – Part One by seqadmin Started by seqadmin, Today, 06:13 AM	0 responses 20 views 0 likes	Last Post by seqadmin Today, 06:13 AM
Seq-Scope Expands Possibilities for High-Resolution Gene Expression Analysis by seqadmin Started by seqadmin, 11-01-2024, 06:09 AM	0 responses 30 views 0 likes	Last Post by seqadmin 11-01-2024, 06:09 AM
New Model Aims to Explain Polygenic Diseases by Connecting Genomic Mutations and Regulatory Networks by seqadmin Started by seqadmin, 10-30-2024, 05:31 AM	0 responses 21 views 0 likes	Last Post by seqadmin 10-30-2024, 05:31 AM

Seqanswers Leaderboard Ad