Seqanswers Leaderboard Ad

Collapse

Announcement

Collapse
No announcement yet.
X
 
  • Filter
  • Time
  • Show
Clear All
new posts

  • How should a strand-specific experiment look like in IGV?

    I have a paired-end, strand-specific dataset and used Trinity to assemble the contigs. I then mapped the reads back to the contigs to calculate transcript abundance and viewed the resulting bam file in IGV (don't have a reference genome).

    Is there a way that I can tell whether my dataset is really strand-specific just by looking at the picture in IGV? If I had single-end sequences I should expect to have reads only in one strand (i.e. pointing to the right). Am I right?

    But since I have paired-end reads, what the IGV picture should look like?

  • #2
    Hi,

    It depends on the library details, but you should could try "color by first-in-pair" from the popup menu. This will color by the strand of the template, rather than the read, at least for libraries I am familiar with. Also, igvtools has options to compute coverage by pair strand, although this isn't available in igv itself yet. Better strand support is on our radar.

    Jim

    Comment


    • #3
      Thanks Jim!

      I selected "first-of-pair" as you told me but can't find out what the colors mean (and couldn't find it either in IGV's manual).

      Can you please explain what the colors (red, blue and grey) mean?

      Comment


      • #4
        red = neg strand (first read of the pair is no the neg strand)
        blue = pos strand
        grey = pair information not available, so "first of pair" is undefined

        Comment


        • #5
          Thanks for the info Jim!

          So if most of the pairs in a contig contain reads that are like this:

          Code:
          =====>----------<=====
          (red)           (blue)
          then this means that sequencing was strand-specific.

          Am I right?

          What if both of them are red or blue?

          Comment


          • #6
            No, even in a non-strand-specific library, the two reads of a pair should point towards each other (or away from each other if it was a library-prep protocol with circularization). You should check whether it is predominantly the first read that point in the sense direction of the gene it aligns to.

            Comment


            • #7
              Originally posted by Jim Robinson View Post
              red = neg strand (first read of the pair is no the neg strand)
              blue = pos strand
              grey = pair information not available, so "first of pair" is undefined
              If I have Illumina directional RNA-seq data, should the colours be the opposite or the same as the strand of the RefSeq gene, when colouring by the first read of the pair ? My colours are opposite to the RefSeq gene, but when isoforms are assembled by Cufflinks with --library-type fr-secondstrand, the isoforms are, as expected, on the same strand as the gene.

              Comment


              • #8
                So is directional RNA-seq the same as strand-specific RNA-seq? any clarification will be greatly appreciated.

                Thanx

                Comment


                • #9
                  According to this, directional and strand-specific are the same.

                  Comment


                  • #10
                    Thanx panos-ed..

                    Comment


                    • #11
                      Hi,

                      I am trying to align and count the reads of paired-end strand-specific RNA-seq data using Hisat and htseq-count.
                      This is how my IGV look like. https://www.dropbox.com/s/snpapn051b...ness.jpeg?dl=0
                      (Color alignment by "first-of-pair strand") (view as pairs).
                      I am confused now. Why all the read pairs are red for NFE2L2? and clur for HNRNPA3?
                      I know that my data is strand specific, and that this color is showing the first pair read. So why both pairs have the same color?
                      Also can anyone tell me what options I should use for Hisat (or Tophat2) and htseq-count?

                      Thanks,
                      Last edited by ArghavanAri; 09-14-2015, 07:18 AM.

                      Comment


                      • #12
                        The two genes are on different strands, you can see by the little chevrons in the annotation track at bottom, so that is expected.

                        Comment


                        • #13
                          As others have mentioned, you need to consider the direction of mapping as well as the read number. For a standard Illumina strand-specific sample preparation, the reverse read is the first read:
                          • Reverse mapping, First read -- consistent with a gene model reading left to right on IGV
                          • Reverse mapping, Second read -- consistent with a gene model reading right to left on IGV
                          • Forward mapping, First read -- consistent with a gene model reading right to left on IGV
                          • Forward mapping, second read -- consistent with a gene model reading left to right on IGV


                          Just in case you weren't aware, Tablet (since v1.14.10.20) has a concordant / directional colouring mode that colours sequences appropriately.
                          Last edited by gringer; 09-14-2015, 07:31 PM.

                          Comment


                          • #14
                            Thanks for the information.
                            But I still cannot understand the difference between fr-firststrand & fr-secondstrand in Tophat2 (or RF & FR in Hisat).
                            Can anyone help me with that?

                            Comment


                            • #15
                              Originally posted by ArghavanAri View Post
                              Thanks for the information.
                              But I still cannot understand the difference between fr-firststrand & fr-secondstrand in Tophat2 (or RF & FR in Hisat).
                              Can anyone help me with that?
                              I am fairly sure fr-firststrand means read one (aka forward read or fr) corresponds to the reference genome sequence (firststrand). fr-secondstrand means read one corresponds to the reverse complement (secondstrand) of the reference genome. FR and RF would be a similar concept - but they are just making a shorthand where if the read one or forward read corresponds to the reference strand it maps to the left of the reverse read on a linear representation of the genome (FR) and if it corresponds to the reverse complement of the reference it maps to the right of the reverse (RF).

                              I hope that makes sense -- you might need to go back to DNA/genome structure and how Illumina works to really understand the above.

                              Comment

                              Latest Articles

                              Collapse

                              • seqadmin
                                Best Practices for Single-Cell Sequencing Analysis
                                by seqadmin



                                While isolating and preparing single cells for sequencing was historically the bottleneck, recent technological advancements have shifted the challenge to data analysis. This highlights the rapidly evolving nature of single-cell sequencing. The inherent complexity of single-cell analysis has intensified with the surge in data volume and the incorporation of diverse and more complex datasets. This article explores the challenges in analysis, examines common pitfalls, offers...
                                06-06-2024, 07:15 AM
                              • seqadmin
                                Latest Developments in Precision Medicine
                                by seqadmin



                                Technological advances have led to drastic improvements in the field of precision medicine, enabling more personalized approaches to treatment. This article explores four leading groups that are overcoming many of the challenges of genomic profiling and precision medicine through their innovative platforms and technologies.

                                Somatic Genomics
                                “We have such a tremendous amount of genetic diversity that exists within each of us, and not just between us as individuals,”...
                                05-24-2024, 01:16 PM

                              ad_right_rmr

                              Collapse

                              News

                              Collapse

                              Topics Statistics Last Post
                              Started by seqadmin, Yesterday, 07:49 AM
                              0 responses
                              14 views
                              0 likes
                              Last Post seqadmin  
                              Started by seqadmin, 06-20-2024, 07:23 AM
                              0 responses
                              14 views
                              0 likes
                              Last Post seqadmin  
                              Started by seqadmin, 06-17-2024, 06:54 AM
                              0 responses
                              16 views
                              0 likes
                              Last Post seqadmin  
                              Started by seqadmin, 06-14-2024, 07:24 AM
                              0 responses
                              25 views
                              0 likes
                              Last Post seqadmin  
                              Working...
                              X