Unconfigured Ad

Collapse
X
 
  • Filter
  • Time
  • Show
Clear All
new posts
  • Chuckytah
    Member
    • Mar 2011
    • 65

    [NGS - analysis of gene expression data] Machine Learning + RNAseq data

    Hello,

    i'm a portuguese student girl of master degree in bioinformatics

    Do you advise me some good websites/articles for theese topics:
    --‐ Next generation sequencing for gene expression measurement (RNA‐seq)
    --‐ data analysis challenges in RNA--‐Seq data
    --‐ applications of machine learning in classification of RNAseq data
    --‐ identification and critical analysis available tools


    I wanted to focus more in "applications of machine learning in
    classification of RNAseq data" do you know some pratical case that i could
    present to my classmates?

    Thanks a lot,

    Inês Martins
    University of Minho, Portugal
  • steven
    Senior Member
    • Aug 2009
    • 269

    #2
    Hi Inês,

    Here is a recent review about RNA-seq challenges in bioinformatics:


    However, no real "machine learning" method comes to my mind about RNA-seq.. I do not recall Cufflinks or Scripture are trained on some learning dataset. Are they?

    Comment

    • altodor
      Member
      • Nov 2009
      • 12

      #3
      Storey paper

      There are excellent references in this paper:



      John Storey

      H. Craig Mak
      Nature Biotechnology 29, 331–333 (2011) doi:10.1038/nbt.1831
      Published online 08 April 2011
      John Storey provides his take on the importance of new statistical methods for high-throughput sequencing.

      Good luck!

      Comment

      • Simon Anders
        Senior Member
        • Feb 2010
        • 995

        #4
        By "machine learning", do you mean clustering and classification? To my k nowledge, not much has been done there yet so far. This is because, typically, you want to have large data sets with tens, better hundreds, of samples, to bring ML techniques to fruitful use and then, microarrays are still preferred as they are still cheaper. So, have a look at what people have done for microarray studies. I'm sure people will pretty soon start thinking about how to adapt these methods to RNA-Seq, so stay tuned.

        Comment

        • Chuckytah
          Member
          • Mar 2011
          • 65

          #5
          thank you all guys

          Comment

          • Chuckytah
            Member
            • Mar 2011
            • 65

            #6
            In the topic: Next generation sequencing for gene expression measurement

            i'm doing RA (relative abundances)... I have the number of EST in one gene and i divide it by the total number of EST in the sample... is this right?

            Comment

            • sphil
              Senior Member
              • Apr 2010
              • 192

              #7
              It's not enough to normalise with the number of est in sample. let me referr you to RPKM (PMID: 18516045) or packages like deg-seq, rna-seqc.

              Comment

              • steven
                Senior Member
                • Aug 2009
                • 269

                #8
                Originally posted by Chuckytah View Post
                In the topic: Next generation sequencing for gene expression measurement

                i'm doing RA (relative abundances)... I have the number of EST in one gene and i divide it by the total number of EST in the sample... is this right?
                I am confused about the terminology. What is your data/experiment exactly? ESTs are not reliable and should not be used to infer expression values. If you are doing some Digital Gene Expression or SAGE experiment then it is possible. With full RNA-seq too but methods differ: with SAGE tags you do not expect the length of the transcript to matter as the goal is to get all the reads of a given transcript to originate from a unique position. For Whole Transcriptome Sequencing RNA-seq reads should originate from all over the transcripts. Therefore the number of reads is expected to correlate with the size of the transcript, so a normalization may be required (see RPKM above). Note that if you are comparing gene expressions between different conditions this size-normalization is not required.

                Comment

                Latest Articles

                Collapse

                • SEQadmin2
                  From Collection to Sequencing: Why Sample Preparation and Preservation Define Sequencing Data
                  by SEQadmin2


                  Data variability is still an issue in sequencing technologies despite the advances in reproducibility and accuracy of these platforms. But the problem does not originate in the sequencing itself, but in the previous steps, before the sample reaches the sequencer.


                  The first step is collection, followed by preservation and sample preparation for analysis. Most scientists overlook those steps, but not being careful might just be skewing the experiment’s results.
                  ...
                  06-02-2026, 10:05 AM
                • SEQadmin2
                  Single-Cell Sequencing at an Inflection Point: Early Impacts of New Platforms and Emerging Trends
                  by SEQadmin2


                  With the launch of new single-cell sequencing platforms in 2026, the field stands at an exciting inflection point. This article surveys the most impactful advances in the field and discusses how they’re reshaping research in cancer, immunology, and beyond.


                  Introduction

                  Single-cell sequencing technologies have undergone remarkable advances over the past decade, transitioning from low-throughput experimental approaches to highly scalable platforms capable of...
                  05-22-2026, 06:42 AM
                • SEQadmin2
                  Environmental Genomics in the Age of NGS: From Microbes to Conservation Strategies
                  by SEQadmin2

                  Studying ecosystems means dealing with complex, multi-species communities that are hard to observe at scale. This complexity, however, hides many important questions to be answered, from how biogeochemical cycles work and how climate change can affect species distribution to how conservation strategies can work best.


                  Genomics, particularly since the expansion of NGS, has transformed ecosystem ecology. By sequencing environmental DNA, we can now assess biodiversity without direct...
                  05-06-2026, 09:04 AM

                ad_right_rmr

                Collapse

                News

                Collapse

                Topics Statistics Last Post
                Started by SEQadmin2, 06-02-2026, 12:03 PM
                0 responses
                19 views
                0 reactions
                Last Post SEQadmin2  
                Started by SEQadmin2, 06-02-2026, 11:40 AM
                0 responses
                14 views
                0 reactions
                Last Post SEQadmin2  
                Started by SEQadmin2, 05-28-2026, 11:40 AM
                0 responses
                29 views
                0 reactions
                Last Post SEQadmin2  
                Started by SEQadmin2, 05-26-2026, 10:12 AM
                0 responses
                31 views
                0 reactions
                Last Post SEQadmin2  
                Working...