Unconfigured Ad

Collapse
X
 
  • Filter
  • Time
  • Show
Clear All
new posts
  • jiwu2573
    Member
    • Jun 2009
    • 34

    Differentially expressed exons?

    I know Cufflinks and DEGseq can do DE gene analysis. Consider the resolution of mRNA sequence, it will be even more interesting to look at the DE at the exon level rather than the gene level. Does anyone know any available software or adapted method for identification of DE exons for mRNA sequencing?
  • Xi Wang
    Senior Member
    • Oct 2009
    • 317

    #2
    Certainly, DEGseq can do this job if you give the gene annotation per exon instead of gene. You need write some simple script to split genes into exons, but remember to follow the refflat format.
    Xi Wang

    Comment

    • jiwu2573
      Member
      • Jun 2009
      • 34

      #3
      Originally posted by Xi Wang View Post
      Certainly, DEGseq can do this job if you give the gene annotation per exon instead of gene. You need write some simple script to split genes into exons, but remember to follow the refflat format.
      Can you explain it in more details about making the exon annotation?

      All those format conversions are a nightmare for a biology person without too much background in computer science like me. But if you can give more detailed insturctions, I can get help from IT person and get it done.

      P.S. Affymetrix is providing exon microarrays now, and I think more and more people will be interested in exon analysis intead of gene. A function of exon expression analysis will certainly be an advantage of DEGseq, which will attract more users as well.

      Comment

      • Xi Wang
        Senior Member
        • Oct 2009
        • 317

        #4
        Ok, let's start from the exon regions you already have, for example,

        Code:
        chr1  100    200 -
        chr1  1000  1200 -
        chr1  2000  2300 -
        then you write into the format below:

        Code:
        Exon_1       Exon_1       chr1    -       100         200         100        200        1      100,      200,
        Exon_2       Exon_2       chr1    -       1000        1200        1000       1200       1      1000,     1200,
        Exon_3       Exon_3       chr1    -       2000        2300        2000       2300       1      2000,     2300,
        A script will do this automatically.

        If you don't have the data for every exon, you need extract the information from gene annotation, and be careful of the redundant exon regions.

        Thanks for you suggestion. We may add this function of analyzing differentially expressed exons in our package in next update. That will be more direct to get what you want.
        Xi Wang

        Comment

        • Thomas Doktor
          Senior Member
          • Apr 2009
          • 105

          #5
          An easier way would be to pull down the exons from UCSC in bed format and then use BEDTools' intersectBed utility to count the number of reads mapping to the individual exons. You can then do count statistics or calculate RPKM values for individual exons. If you save your alignment in SAM/BAM format BEDTools can convert your alignments to bed format, otherwise you will have to write your own script or use another utility to convert your aligned reads to bed format.

          I think that you might find it difficult to identify differentially spliced exons this way, because you would find so many differentially expressed exons between samples that reflect differential expression rather than differential splicing. To really identify differentially expressed isoforms I think you need a more dedicated algorithm to compare the individual exon levels within a gene to other exons within the same gene. The problem here is that different exons are sequenced with variable efficiency and also mapped with different efficiency (some might contain repeat regions and). These problems are also present when comparing expression between two different genes, but I would think the problem increases as the length of the element decreases because it becomes more and more vulnerable to sequencing and mapping artefacts. Comparing expression of the same gene between to samples is another matter as they are subjected to the same biases, but of course the dynamic range of the measurement may decrease.

          Btw, DEGseq seems like a really nice and easy to use package, my concerns regarding its use in this way is merely of a general nature and not specific to DEGseq.
          Last edited by Thomas Doktor; 02-19-2010, 07:53 AM.

          Comment

          • steven
            Senior Member
            • Aug 2009
            • 269

            #6
            Originally posted by Thomas Doktor View Post
            I think that you might find it difficult to identify differentially spliced exons this way, because you would find so many differentially expressed exons between samples that reflect differential expression rather than differential splicing. To really identify differentially expressed isoforms I think you need a more dedicated algorithm to compare the individual exon levels within a gene to other exons within the same gene. The problem here is that different exons are sequenced with variable efficiency and also mapped with different efficiency (some might contain repeat regions and). These problems are also present when comparing expression between two different genes, but I would think the problem increases as the length of the element decreases because it becomes more and more vulnerable to sequencing and mapping artefacts. Comparing expression of the same gene between to samples is another matter as they are subjected to the same biases, but of course the dynamic range of the measurement may decrease.
            I totally agree.

            Comment

            • jiwu2573
              Member
              • Jun 2009
              • 34

              #7
              If I am particularly interested in a few genes, any easy way to compare those exon expressions within these genes?

              Comment

              • Xi Wang
                Senior Member
                • Oct 2009
                • 317

                #8
                Originally posted by jiwu2573 View Post
                If I am particularly interested in a few genes, any easy way to compare those exon expressions within these genes?
                If you have the well-defined exons for these genes, you can simply use some tools, such as BEDtools, to get the number of reads falling in these exonic regions. Than, statistic methods can be applied to identify the differentially expressed exons. Or some existing tools, such as DEGseq, can help you with the statistical inference.
                Xi Wang

                Comment

                Latest Articles

                Collapse

                • SEQadmin2
                  Nine Things a Sample Prep Scientist Thinks About Before Sequencing
                  by SEQadmin2


                  I’m not a sequencing expert. I’m a purification scientist who uses NGS to evaluate workflows my group develops. With this perspective, we think about the sample first and the NGS workflow second. The sequencer is an exceptionally honest reporter, but it can only report on what you give it, so whether you get clean, interpretable data from an NGS workflow is largely determined before you begin.


                  Here are nine questions we think about, in roughly the order they matter, before...
                  06-18-2026, 07:11 AM
                • SEQadmin2
                  From Collection to Sequencing: Why Sample Preparation and Preservation Define Sequencing Data
                  by SEQadmin2


                  Data variability is still an issue in sequencing technologies despite the advances in reproducibility and accuracy of these platforms. But the problem does not originate in the sequencing itself, but in the previous steps, before the sample reaches the sequencer.


                  The first step is collection, followed by preservation and sample preparation for analysis. Most scientists overlook those steps, but not being careful might just be skewing the experiment’s results.
                  ...
                  06-02-2026, 10:05 AM
                • SEQadmin2
                  Single-Cell Sequencing at an Inflection Point: Early Impacts of New Platforms and Emerging Trends
                  by SEQadmin2


                  With the launch of new single-cell sequencing platforms in 2026, the field stands at an exciting inflection point. This article surveys the most impactful advances in the field and discusses how they’re reshaping research in cancer, immunology, and beyond.


                  Introduction

                  Single-cell sequencing technologies have undergone remarkable advances over the past decade, transitioning from low-throughput experimental approaches to highly scalable platforms capable of...
                  05-22-2026, 06:42 AM

                ad_right_rmr

                Collapse

                News

                Collapse

                Topics Statistics Last Post
                Started by SEQadmin2, 06-17-2026, 06:09 AM
                0 responses
                21 views
                0 reactions
                Last Post SEQadmin2  
                Started by SEQadmin2, 06-09-2026, 11:58 AM
                0 responses
                39 views
                0 reactions
                Last Post SEQadmin2  
                Started by SEQadmin2, 06-05-2026, 10:09 AM
                0 responses
                46 views
                0 reactions
                Last Post SEQadmin2  
                Started by SEQadmin2, 06-04-2026, 08:59 AM
                0 responses
                49 views
                0 reactions
                Last Post SEQadmin2  
                Working...