Unconfigured Ad

**GenoMax** · 11-23-2015, 06:20 AM

If this is illumina data then it looks like they must have been pre-trimmed. If that is the case go ahead and use them.

If not, then is this ion data?

**mastal** · 11-23-2015, 06:21 AM

Is it Illumina data? It looks like your reads have already been trimmed to remove adapters, and they should be fine as they are. Ask your service provider what trimming has been done to the data.

**paolo.kunder** · 11-23-2015, 06:28 AM

Yes this is Illumina NextSeq500 data,
I am a bit confused, why they should be of different length after trimming?
I analyzed many RNA Seq data of Hiseq and they are all (100%) the same length, I also downloaded 26 Human Tissued form Encode with Hiseq and they are all (100%) the same length.

**paolo.kunder** · 11-23-2015, 06:34 AM

and moreover, why should I have more than 300'000 reads 32bp long,

I have demultiplexed the data, with the following adapters

index
TCCGGAGA
CGCTCATT
ATTACTCG
GAGATTCC
ATTACTCG
TCCGGAGA
CGCTCATT
GAGATTCC
ATTACTCG
TCCGGAGA
CGCTCATT
GAGATTCC
ATTACTCG
TCCGGAGA
CGCTCATT
GAGATTCC
TCCGGAGA
CGCTCATT
ATTACTCG
TCCGGAGA
CGCTCATT
ATTACTCG

**GenoMax** · 11-23-2015, 06:35 AM

These may be bad libraries that had short inserts which results in read-through into adapter on the other end. Those would generally need to be trimmed.

**GenoMax** · 11-23-2015, 06:38 AM

Originally posted by paolo.kunder View Post

and moreover, why should I have more than 300'000 reads 32bp long,

I have demultiplexed the data, with the following adapters

index
TCCGGAGA
CGCTCATT
ATTACTCG
GAGATTCC
ATTACTCG
TCCGGAGA
CGCTCATT
GAGATTCC
ATTACTCG
TCCGGAGA
CGCTCATT
GAGATTCC
ATTACTCG
TCCGGAGA
CGCTCATT
GAGATTCC
TCCGGAGA
CGCTCATT
ATTACTCG
TCCGGAGA
CGCTCATT
ATTACTCG

Demultiplexing with these barcodes should have nothing to do with the length of the reads.

**paolo.kunder** · 11-23-2015, 06:39 AM

so is definitely a bad library preparation?
I have to be sure before going to complain to the service!

**GenoMax** · 11-23-2015, 06:43 AM

If these reads were pre-trimmed then they likely represent bad libraries. This may not necessarily indicate bad library preparation since the original samples themselves may be bad. One would need to look at the QC for samples and then libraries before concluding either one (or both) are bad.

**paolo.kunder** · 11-23-2015, 06:51 AM

these may help you?

Attached Files

**GenoMax** · 11-23-2015, 07:03 AM

This is outside my sphere of expertise so someone from experimental side of things will need to verify it authoritatively but it looks like these libraries have short inserts. Remember the size of the fragments include illumina adapters (~80 bp). Shorter fragments tend to cluster more efficiently.

Topics	Statistics	Last Post
New AI Model Captures Long-Range Genomic Signals to Improve RNA Splice Site Prediction by SEQadmin2 Started by SEQadmin2, Yesterday, 05:37 AM	0 responses 6 views 0 reactions	Last Post by SEQadmin2 Yesterday, 05:37 AM
Large-Scale Protein Screen Uncovers Hidden Regulators of Alternative Polyadenylation by SEQadmin2 Started by SEQadmin2, 06-26-2026, 11:10 AM	0 responses 16 views 0 reactions	Last Post by SEQadmin2 06-26-2026, 11:10 AM
Whole-Genome Sequencing Traces Faroe Islands Ancestry to a North Atlantic Founder Population by SEQadmin2 Started by SEQadmin2, 06-17-2026, 06:09 AM	0 responses 51 views 0 reactions	Last Post by SEQadmin2 06-17-2026, 06:09 AM
Sequencing the Two-Toed Sloth Genome Reveals Jumping Genes Tied to Its Extreme Metabolism by SEQadmin2 Started by SEQadmin2, 06-09-2026, 11:58 AM	0 responses 110 views 0 reactions	Last Post by SEQadmin2 06-09-2026, 11:58 AM

Unconfigured Ad

reads of different size - RNA SEQ

Comment

Comment

Comment

Comment

Comment

Comment

Comment

Comment

Comment

Comment

Latest Articles

ad_right_rmr

News