**gibberwocky** · 03-28-2013, 01:56 PM

I forgot to mention that after running the fastq-dump command I took a subset of each fastq file (the first 4,000,000 lines, i.e. 1,000,000 reads) to make checking the different command options quicker.

Code:

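# Keep the first 4,000,000 lines (1,000,000 reads at 4 lines per read) of each mate file
head -n 4000000 "${DATA}/$i/SRR081275_1.fastq" > "${DATA}/$i/${i}_1.fastq"
head -n 4000000 "${DATA}/$i/SRR081275_2.fastq" > "${DATA}/$i/${i}_2.fastq"
# Quick look at the first records of each subset
head "${DATA}/$i/${i}_1.fastq"
head "${DATA}/$i/${i}_2.fastq"
# Compress the subsets where they were written (-f overwrites an existing .gz)
bgzip -f "${DATA}/$i/${i}_1.fastq"
bgzip -f "${DATA}/$i/${i}_2.fastq"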

Also, I have NGS data from an Illumina GAIIx in the 1.8 format, which works fine in both Picard MarkDuplicates and GASV. The fastq reads for that data look like this:

@HWUSI-EAS1643R:30:FC:1:1:3636:1000 1:N:0:CGATGT
NTTCCTACTCCGACATGCTAAAGATCCATGAAACAGAGTGTTTGCCAACAAGTCCAGATTTTTACCAGGCTCATCTCTTCAGTTTCAAAGAATCAGTTTC
+
#***,/223/@:@@@@@@@@:<<<<@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@:@@@@@@22.2577577@@@@@@@@@@@228@@@@@@

**tmacme** · 12-25-2014, 10:14 PM

Did you check the BAM file header? What does the read name look like? I get a warning that READ_NAME_REGEX is unable to detect the read name when running MarkDuplicates with Picard tools, and the output shows 0 optical duplicates.
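
One way to see why a read name might trigger that warning is to test it by hand. The sketch below is not Picard's code. It assumes, as recent Picard documentation describes for the default READ_NAME_REGEX, that optical duplicate detection needs the last three ':'-separated fields of the read name to be numeric (tile, x, y). The function name `has_tile_xy` and the SRA-style name `SRR081275.1` are hypothetical and used only for illustration.

```shell
#!/usr/bin/env bash
# Hypothetical helper (not part of Picard): succeed and print tile/x/y if the
# last three ':'-separated fields of a read name are all numeric.
has_tile_xy() {
    local name="${1%% *}"      # drop the comment after the first space, if any
    local IFS=':'
    local -a f
    read -r -a f <<< "$name"   # split the name on ':'
    local n=${#f[@]}
    (( n >= 3 )) || return 1   # fewer than three fields: no coordinates
    local i
    for i in 3 2 1; do
        [[ ${f[n-i]} =~ ^[0-9]+$ ]] || return 1
    done
    echo "tile=${f[n-3]} x=${f[n-2]} y=${f[n-1]}"
}

# Read name from the GAIIx example earlier in the thread
has_tile_xy "HWUSI-EAS1643R:30:FC:1:1:3636:1000 1:N:0:CGATGT"
# prints: tile=1 x=3636 y=1000

# Hypothetical SRA-style name with no coordinates
has_tile_xy "SRR081275.1" || echo "no tile/x/y found"
```

If the names in your BAM look like the second case, the coordinates were not carried through from the SRA file, so a count of 0 optical duplicates would be expected rather than a sign of a broken run.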


SRA fastq-dump puzzle
