Seqanswers Leaderboard Ad

**Brian Bushnell** · 08-19-2014, 09:49 AM

Originally posted by Bacms View Post

Shouldn't reads always align to the reverse strand on the second file or am I getting this wrong? And if so what could have cause this to happen? I am just puzzled by the data since the pairs are always supposed to be forward-reverse right?

No, both reads have a 50% chance of aligning to both strands. The bias for read 1 is very strange. Perhaps you could share your command lines, which may be helpful.

Also, what organism is it? And is there a reason you are mapping with bowtie2 rather than an RNA-seq aligner, and mapping reads as single-ended rather than paired? Also, posting the FastQC report may help.

**Bacms** · 08-19-2014, 10:20 AM

Originally posted by Brian Bushnell View Post

No, both reads have a 50% chance of aligning to both strands. The bias for read 1 is very strange. Perhaps you could share your command lines, which may be helpful.

But with the new Illumina protocol you are supposed to get strand specific reads right? Or do you expect to get 50% even with strand specific?

Originally posted by Brian Bushnell View Post

Also, what organism is it? And is there a reason you are mapping with bowtie2 rather than an RNA-seq aligner, and mapping reads as single-ended rather than paired? Also, posting the FastQC report may help.

This is Chlamydomonas Reinhardtii and the reason for using bowtie is that as far as I can tell there is no way of using tophat/cufflinks as input to express but I may be wrong.
What is the best way to attach the report from fastqc as the attachment size is to small to attach.

**Brian Bushnell** · 08-19-2014, 10:28 AM

Originally posted by Bacms View Post

But with the new Illumina protocol you are supposed to get strand specific reads right? Or do you expect to get 50% even with strand specific?

My mistake, I did not notice you were mapping to the transcriptome. When mapping to the genome you would expect 50-50 because half the transcripts should be on each strand, but transcriptome mapping with a strand-specific protocol indeed should have read 1 map almost entirely to one strand and read 2 to the other, since they are all presented in the sense orientation.

What is the best way to attach the report from fastqc as the attachment size is to small to attach.

Hmmm... I think you can output it as a pdf which appears to have a 19MB size limit. Otherwise, just post the most relevant images individually, like base content, quality, and anything it fails.

P.S. And I still recommend you post your mapping command line; you should perform the mapping on both reads at once and it's not clear to me if you are doing that.

**Olaf Blue** · 08-19-2014, 10:28 AM

For ScriptSeq libraries, use –fr secondstrand. -fr secondstrand means that the strand being synthesized on the sequencer is the sense strand for Read 1.

Olaf

**Bacms** · 08-19-2014, 10:47 AM

[QUOTE=Brian Bushnell;147990]My mistake, I did not notice you were mapping to the transcriptome. When mapping to the genome you would expect 50-50 because half the transcripts should be on each strand, but transcriptome mapping with a strand-specific protocol indeed should have read 1 map almost entirely to one strand and read 2 to the other, since they are all presented in the sense orientation.

Ok yes I am aware that when you are aligning to the genome you should get ~50/50%

Originally posted by Brian Bushnell View Post

Hmmm... I think you can output it as a pdf which appears to have a 19MB size limit. Otherwise, just post the most relevant images individually, like base content, quality, and anything it fails.

I will double check the fastqc for an option to output in pdf

Originally posted by Brian Bushnell View Post

P.S. And I still recommend you post your mapping command line; you should perform the mapping on both reads at once and it's not clear to me if you are doing that.

Will do but since I am using a python script to perform the system commands I didn't have the individual commands for all steps. Here they are now:
#Run fastqc
Running fastqc v0.11.2
fastqc --outdir=../results/140526_I453_FCC4LT4ACXX_L1_Index1/ ../fastq/140526_I453_FCC4LT4ACXX_L1_Index1_1.fq ../fastq/140526_I453_FCC4LT4ACXX_L1_Index1_2.fq

#Running trimmomatic
java -jar trimmomatic-0.32.jar PE -threads 24 -trimlog trim_log.txt ../fastq/140526_I453_FCC4LT4ACXX_L1_Index1_1.fq ../fastq/140526_I453_FCC4LT4ACXX_L1_Index1_2.fq ../results/140526_I453_FCC4LT4ACXX_L1_Index1/140526_I453_FCC4LT4ACXX_L1_Index1_1P.fq ../results/140526_I453_FCC4LT4ACXX_L1_Index1/140526_I453_FCC4LT4ACXX_L1_Index1_1U.fq ../results/140526_I453_FCC4LT4ACXX_L1_Index1/140526_I453_FCC4LT4ACXX_L1_Index1_2P.fq ../results/140526_I453_FCC4LT4ACXX_L1_Index1/140526_I453_FCC4LT4ACXX_L1_Index1_2U.fq ILLUMINACLIP:TruSeq3-PE.fa:2:30:10 LEADING:20 TRAILING:20 SLIDINGWINDOW:4:20 MINLEN:40

#Align against the transcriptome using bowtie
bowtie2 -k 100 -p 22 --phred64 --un-conc ../results/140526_I453_FCC4LT4ACXX_L1_Index1/unmapped.fq -x Creinhardtii_281_v5.5.transcript -1 ../results/140526_I453_FCC4LT4ACXX_L1_Index1/140526_I453_FCC4LT4ACXX_L1_Index1_1P.fq -2 ../results/140526_I453_FCC4LT4ACXX_L1_Index1/140526_I453_FCC4LT4ACXX_L1_Index1_2P.fq -S ../results/140526_I453_FCC4LT4ACXX_L1_Index1/cDNA.bowtie

Topics	Statistics	Last Post
ASHG 2024 Highlights – Part Two by seqadmin Started by seqadmin, Today, 11:09 AM	0 responses 23 views 0 likes	Last Post by seqadmin Today, 11:09 AM
ASHG 2024 Highlights – Part One by seqadmin Started by seqadmin, Today, 06:13 AM	0 responses 20 views 0 likes	Last Post by seqadmin Today, 06:13 AM
Seq-Scope Expands Possibilities for High-Resolution Gene Expression Analysis by seqadmin Started by seqadmin, 11-01-2024, 06:09 AM	0 responses 30 views 0 likes	Last Post by seqadmin 11-01-2024, 06:09 AM
New Model Aims to Explain Polygenic Diseases by Connecting Genomic Mutations and Regulatory Networks by seqadmin Started by seqadmin, 10-30-2024, 05:31 AM	0 responses 21 views 0 likes	Last Post by seqadmin 10-30-2024, 05:31 AM

Seqanswers Leaderboard Ad

Announcement

Problems with mapping of scriptSeq of RNASeq

Comment

Comment

Comment

Comment

Comment

Latest Articles

ad_right_rmr

News