Unconfigured Ad

**Brian Bushnell** · 10-08-2014, 08:43 AM

Those are Illumina reads, and could be either ASCII-64 (old Illumina) or ASCII-33 (Sanger) format; most likely ASCII-64 but I can't tell from that read. It may be possible if you post some more reads (particularly if you can find a read with an 'N' base call).

**skmotay** · 10-08-2014, 08:46 AM

Here's one with several Ns:

@GWZHISEQ02:32

1YMKACXX:5:1101:5470:1986 1:N:0:ATCACG
CTGGATATCAATAATGCTCTCCNTAGGGATATTTCCCGCAAATTTGANNNN
+
CCCFFFFFHHHHHJJJJJJJJJ#3AGIJJJJJJJJJJJJJJJJJJJJ####

**Brian Bushnell** · 10-08-2014, 08:53 AM

That's strange, normally N should be Q0 (!) not Q2 (#), but it appears to be ASCII-33 (Sanger) data. I'm not sure why the reads are not mapping. You may want to BLAST some of them to a database like NT to make sure they come from the correct organism.

**GenoMax** · 10-08-2014, 08:57 AM

You should not need to "groom" the data if they are already Sanger formatted. Just choose the "pencil" edit icon against the name of the dataset and manually set the data type to "fastqsanger" under "datatype" tab.

You should do some QC/trimming though as that may be affecting your alignments.

**skmotay** · 10-08-2014, 08:58 AM

I'm wondering if maybe they need to be adapter-trimmed? They all failed Kmer in fastqc.

**Brian Bushnell** · 10-08-2014, 09:01 AM

In that case, probably yes! Though that's easiest to do if you know what kind of adapters were used.

**skmotay** · 10-08-2014, 12:40 PM

Originally posted by GenoMax View Post

You should not need to "groom" the data if they are already Sanger formatted. Just choose the "pencil" edit icon against the name of the dataset and manually set the data type to "fastqsanger" under "datatype" tab.

I changed the dataset type to fastqsanger, but Tophat2 and Trimmomatic are still not recognizing the files. I click the dropkey in either program (ex: RNA-Seq FASTQ file, forward reads) and there's nothing there.

Edit: This is true for either paired-end (which is correct for my data) or single-end options.

SOLUTION: I am dumb. Accidentally changed them to fastqCsanger

Topics	Statistics	Last Post
Engineered Protein Motor Takes Its First Steps Along DNA Track by SEQadmin2 Started by SEQadmin2, Today, 11:05 AM	0 responses 6 views 0 reactions	Last Post by SEQadmin2 Today, 11:05 AM
High-Resolution Sequencing Exposes Hidden Toxoplasma Diversity by SEQadmin2 Started by SEQadmin2, 07-02-2026, 11:08 AM	0 responses 27 views 0 reactions	Last Post by SEQadmin2 07-02-2026, 11:08 AM
New AI Model Captures Long-Range Genomic Signals to Improve RNA Splice Site Prediction by SEQadmin2 Started by SEQadmin2, 06-30-2026, 05:37 AM	0 responses 25 views 0 reactions	Last Post by SEQadmin2 06-30-2026, 05:37 AM
Large-Scale Protein Screen Uncovers Hidden Regulators of Alternative Polyadenylation by SEQadmin2 Started by SEQadmin2, 06-26-2026, 11:10 AM	0 responses 25 views 0 reactions	Last Post by SEQadmin2 06-26-2026, 11:10 AM

Unconfigured Ad

Inputting fastq files into Tophat2 without info on seq platform type

Comment

Comment

Comment

Comment

Comment

Comment

Comment

Latest Articles

ad_right_rmr

News