Seqanswers Leaderboard Ad

Collapse

Announcement

Collapse
No announcement yet.
X
 
  • Filter
  • Time
  • Show
Clear All
new posts

  • gringer
    replied
    Tablet has a "read concordance" mode for viewing strand direction on reads. Both pairs of reads that are consistent with a mapped direction following the reference sequence are coloured green (customisable), and reads that are consistent with the reverse complement of the reference sequence are coloured red.

    Leave a comment:


  • Camg
    replied
    Originally posted by sowmyai View Post
    Looks like Jim Robinson had it backwards for the color legends. Actually

    red=positive,
    blue=negative

    He has clarified here:
    https://groups.google.com/forum/#!to...lp/YiVzwnUuOZM
    I always thought that blue=positive and red=negative. See screen shots of my data in IGV.

    I used Illumina dUTP-type stranded library prep and fr-firststrand with Tophat2. I find several places that say dUTP libraries should use fr-firststrand with Tophat, for example: http://onetipperday.sterding.com/201...pe-to-use.html


    Is anyone able to clear this up?

    Thanks
    Attached Files

    Leave a comment:


  • sowmyai
    replied
    Looks like Jim Robinson had it backwards for the color legends. Actually

    red=positive,
    blue=negative

    He has clarified here:

    Leave a comment:


  • kmcarr
    replied
    Originally posted by cliffbeall View Post
    I am fairly sure fr-firststrand means read one (aka forward read or fr) corresponds to the reference genome sequence (firststrand). fr-secondstrand means read one corresponds to the reverse complement (secondstrand) of the reference genome. FR and RF would be a similar concept - but they are just making a shorthand where if the read one or forward read corresponds to the reference strand it maps to the left of the reverse read on a linear representation of the genome (FR) and if it corresponds to the reverse complement of the reference it maps to the right of the reverse (RF).

    I hope that makes sense -- you might need to go back to DNA/genome structure and how Illumina works to really understand the above.
    I'm afraid that this is incorrect.

    First 'fr' does not mean 'forward read'. What it means is the the two reads are arranged in the FR (Foward/Reverse) as opposed to the RF (Reverse/Forward) orientation. FR means that the read pairs are oriented with the 3' end point toward each other when aligned to the reference and RF means they are pointing away from each other.

    Code:
    FR:
    -------------------------------------------------------------------------
       ----------------->                              <-------------------
    
    RF:
    -------------------------------------------------------------------------
       <-----------------                               ------------------->
    Firststrand means that read #1 matches the first strand of cDNA generated during conversion of the single stranded mRNA to ds-cDNA. In other words the R1 is the reverse complement of the mRNA. Note that all of these reference landmarks are relative to transcript, not the genome. For Illumina strand specific, paired end mRNA-Seq reads the orientation of the reads relative to their original RNA is:

    Code:
      mRNA 5'-----------------------------------------------------------AAAAAAAAAAAA
                       R2 ------------>                <------------ R1

    Leave a comment:


  • cliffbeall
    replied
    Originally posted by ArghavanAri View Post
    Thanks for the information.
    But I still cannot understand the difference between fr-firststrand & fr-secondstrand in Tophat2 (or RF & FR in Hisat).
    Can anyone help me with that?
    I am fairly sure fr-firststrand means read one (aka forward read or fr) corresponds to the reference genome sequence (firststrand). fr-secondstrand means read one corresponds to the reverse complement (secondstrand) of the reference genome. FR and RF would be a similar concept - but they are just making a shorthand where if the read one or forward read corresponds to the reference strand it maps to the left of the reverse read on a linear representation of the genome (FR) and if it corresponds to the reverse complement of the reference it maps to the right of the reverse (RF).

    I hope that makes sense -- you might need to go back to DNA/genome structure and how Illumina works to really understand the above.

    Leave a comment:


  • ArghavanAri
    replied
    Thanks for the information.
    But I still cannot understand the difference between fr-firststrand & fr-secondstrand in Tophat2 (or RF & FR in Hisat).
    Can anyone help me with that?

    Leave a comment:


  • gringer
    replied
    As others have mentioned, you need to consider the direction of mapping as well as the read number. For a standard Illumina strand-specific sample preparation, the reverse read is the first read:
    • Reverse mapping, First read -- consistent with a gene model reading left to right on IGV
    • Reverse mapping, Second read -- consistent with a gene model reading right to left on IGV
    • Forward mapping, First read -- consistent with a gene model reading right to left on IGV
    • Forward mapping, second read -- consistent with a gene model reading left to right on IGV


    Just in case you weren't aware, Tablet (since v1.14.10.20) has a concordant / directional colouring mode that colours sequences appropriately.
    Last edited by gringer; 09-14-2015, 07:31 PM.

    Leave a comment:


  • cliffbeall
    replied
    The two genes are on different strands, you can see by the little chevrons in the annotation track at bottom, so that is expected.

    Leave a comment:


  • ArghavanAri
    replied
    Hi,

    I am trying to align and count the reads of paired-end strand-specific RNA-seq data using Hisat and htseq-count.
    This is how my IGV look like. https://www.dropbox.com/s/snpapn051b...ness.jpeg?dl=0
    (Color alignment by "first-of-pair strand") (view as pairs).
    I am confused now. Why all the read pairs are red for NFE2L2? and clur for HNRNPA3?
    I know that my data is strand specific, and that this color is showing the first pair read. So why both pairs have the same color?
    Also can anyone tell me what options I should use for Hisat (or Tophat2) and htseq-count?

    Thanks,
    Last edited by ArghavanAri; 09-14-2015, 07:18 AM.

    Leave a comment:


  • stanikay
    replied
    Thanx panos-ed..

    Leave a comment:


  • panos_ed
    replied
    According to this, directional and strand-specific are the same.

    Leave a comment:


  • stanikay
    replied
    So is directional RNA-seq the same as strand-specific RNA-seq? any clarification will be greatly appreciated.

    Thanx

    Leave a comment:


  • Dario1984
    replied
    Originally posted by Jim Robinson View Post
    red = neg strand (first read of the pair is no the neg strand)
    blue = pos strand
    grey = pair information not available, so "first of pair" is undefined
    If I have Illumina directional RNA-seq data, should the colours be the opposite or the same as the strand of the RefSeq gene, when colouring by the first read of the pair ? My colours are opposite to the RefSeq gene, but when isoforms are assembled by Cufflinks with --library-type fr-secondstrand, the isoforms are, as expected, on the same strand as the gene.

    Leave a comment:


  • Simon Anders
    replied
    No, even in a non-strand-specific library, the two reads of a pair should point towards each other (or away from each other if it was a library-prep protocol with circularization). You should check whether it is predominantly the first read that point in the sense direction of the gene it aligns to.

    Leave a comment:


  • panos_ed
    replied
    Thanks for the info Jim!

    So if most of the pairs in a contig contain reads that are like this:

    Code:
    =====>----------<=====
    (red)           (blue)
    then this means that sequencing was strand-specific.

    Am I right?

    What if both of them are red or blue?

    Leave a comment:

Latest Articles

Collapse

  • seqadmin
    Best Practices for Single-Cell Sequencing Analysis
    by seqadmin



    While isolating and preparing single cells for sequencing was historically the bottleneck, recent technological advancements have shifted the challenge to data analysis. This highlights the rapidly evolving nature of single-cell sequencing. The inherent complexity of single-cell analysis has intensified with the surge in data volume and the incorporation of diverse and more complex datasets. This article explores the challenges in analysis, examines common pitfalls, offers...
    06-06-2024, 07:15 AM
  • seqadmin
    Latest Developments in Precision Medicine
    by seqadmin



    Technological advances have led to drastic improvements in the field of precision medicine, enabling more personalized approaches to treatment. This article explores four leading groups that are overcoming many of the challenges of genomic profiling and precision medicine through their innovative platforms and technologies.

    Somatic Genomics
    “We have such a tremendous amount of genetic diversity that exists within each of us, and not just between us as individuals,”...
    05-24-2024, 01:16 PM

ad_right_rmr

Collapse

News

Collapse

Topics Statistics Last Post
Started by seqadmin, Today, 07:24 AM
0 responses
9 views
0 likes
Last Post seqadmin  
Started by seqadmin, Yesterday, 08:58 AM
0 responses
11 views
0 likes
Last Post seqadmin  
Started by seqadmin, 06-12-2024, 02:20 PM
0 responses
16 views
0 likes
Last Post seqadmin  
Started by seqadmin, 06-07-2024, 06:58 AM
0 responses
184 views
0 likes
Last Post seqadmin  
Working...
X