Announcement

Collapse
No announcement yet.
X
 
  • Filter
  • Time
  • Show
Clear All
new posts

  • alyamahmoud
    replied
    I am also very confused to why 50bp SE give higher alignment rate than 100bp PE ? I have done the experiment myself using 100 bp PE and compared to trimmed 50 bp SE.
    Thank you

    Leave a comment:


  • alyamahmoud
    replied
    @Simon and HESmith

    does your recommendation apply to poorly annotated genomes as well ?

    Leave a comment:


  • HESmith
    replied
    I'll second Simon's advice. You'll get very little additional information by paired-end sequencing (since, in essence, you're just counting each gene twice). Biological replicates provide much more information. In fact, I'd recommend triplicates at a minimum to provide statistical power to your analysis. If you index the samples, they can be sequenced in the same lane; the only additional cost would be preparing separate libraries (~$50 each for Illumina). Three biological replicates of 20 million reads each are much better than a single sample of 60 million.

    Leave a comment:


  • Simon Anders
    replied
    If you want to look for differential expression, sequencing depth is usually the most limiting factor. Hence, I would go for 50bp single-end and invest the money you saved into sequencing another lane (ideally, with a biological replicate).

    Longer reads are useful to see where splice sites are located, but for humans, we already know that quite well, and for yeast, there is hardly any splicing, anyway. Long reads don't help much for mapping, because the transcribed part of the genome is usually not that repetitive, and 50 bp is usually long enough to even distinguish most orthologs.

    Paired-end reads may or may not help to distinguish isoforms of the same gene, but, at least for yeast, this is unimportant, of course.

    Leave a comment:


  • efoss
    replied
    Thanks very much everyone. I'll be going with 50 bp paired reads.

    Eric

    Leave a comment:


  • dreesbl
    replied
    If you're sequencing an equal number of base pairs, I vote for paired end reads. I agree with dcfactor that there won't be much difference between estimates from 50 bp and 100 bp read data if you're counting the alignment hits per gene. You get more data for your sequencing buck because each aligned pair gives you information on not only the sequences covered by the reads but the region between them as well.

    50 bp of sequence in a mate pair can be more useful for read mapping than an extra 50 bp in the read itself. If you don't find what you're looking for the gene-level expression patterns, having paired end data leaves more avenues open for other analysis.

    [email protected]
    Spiral Genetics has 6 repositories available. Follow their code on GitHub.

    Leave a comment:


  • westerman
    replied
    Also, depending on the software you use, the RNA-seq module may not be able to use PE reads. At least without some extra work. Yes, CASAVA, I'm looking at you!

    I'll agree that for well annotated genomes 50 bp SE should be satisfactory.

    Leave a comment:


  • HESmith
    replied
    In that case, single end 50bp should be fine for obtaining gene counts.

    Leave a comment:


  • efoss
    replied
    Originally posted by HESmith View Post
    Hi Eric,

    Is your genome well-annotated? If not, and you plan to build gene models, then PE-50bp would be preferable.
    Yes, the genomes are well annotated. I'm working with human and yeast.

    Eric

    Leave a comment:


  • HESmith
    replied
    Hi Eric,

    Is your genome well-annotated? If not, and you plan to build gene models, then PE-50bp would be preferable.

    Leave a comment:


  • sarbashis
    replied
    An important application of paired read is that each pair suppose to come from same gene. So by using paired end read one can easily mark approx gene boundaries.

    Leave a comment:


  • dcfactor
    replied
    50 base-pair paired-end reads span a longer region of the transcript. Each read represents one end of a ~200-300 base-pair RNA fragment, compared to a 100 base-pair read which only gives you information about 100 bases. A larger fragment means you are more likely to span a splice junction, insertion, or deletion. Therefore 50 bp is preferable. Remember that with next gen sequencing technology, for the most part your read is only a "tag" that tells you where in the genome the fragment originated. As long as the "tag" is long enough to be unique (and 50 bp is for the most part) you are set.

    Leave a comment:


  • efoss
    started a topic 50 bp paired end reads vs. 100 bp single end reads

    50 bp paired end reads vs. 100 bp single end reads

    In an RNA seq experiment in which the goal is simply to count transcripts (vs., e.g., finding evidence for translocations or novel splice variants), is it better to have 100 base pair single end reads or 50 base pair paired end reads? (100 base pair single end reads seem better to me.)

    Thanks.

    Eric

Latest Articles

Collapse

  • seqadmin
    Advanced Methods for the Detection of Infectious Disease
    by seqadmin




    The recent pandemic caused worldwide health, economic, and social disruptions with its reverberations still felt today. A key takeaway from this event is the need for accurate and accessible tools for detecting and tracking infectious diseases. Timely identification is essential for early intervention, managing outbreaks, and preventing their spread. This article reviews several valuable tools employed in the detection and surveillance of infectious diseases.
    ...
    11-27-2023, 01:15 PM

ad_right_rmr

Collapse

News

Collapse

Topics Statistics Last Post
Started by seqadmin, 12-08-2023, 10:35 AM
0 responses
16 views
0 likes
Last Post seqadmin  
Started by seqadmin, 12-05-2023, 02:24 PM
0 responses
18 views
0 likes
Last Post seqadmin  
Started by seqadmin, 12-05-2023, 07:37 AM
0 responses
30 views
0 likes
Last Post seqadmin  
Started by seqadmin, 12-04-2023, 08:23 AM
0 responses
15 views
0 likes
Last Post seqadmin  
Working...
X