
  • RNA-Seq quality controls: golden standard tool?

    Hi,
    I need to establish a protocol for checking the quality of RNA-Seq data. A few pipelines for this purpose have been published over the last few years (ShortRead, htSeqTools, ArrayExpressHTS), and I'm wondering which of these are commonly used and could be considered gold standards. The questions I ask in my QC are: what is the degradation level of the RNA, what is the quality of the sequences coming off the Illumina platform (HiSeq at the moment), and how does a given sample differ from the rest (is it a weird outlier)?
    I have other datasets that I would like to evaluate as well; they were produced on the GAII platform with different read lengths. So ideally, I am looking for a QC tool that:
    1) is flexible enough to handle different read lengths
    2) provides rigorous QC with nice graphs
    3) can use the output from the TopHat pipeline.

    I would very much appreciate any help, suggestions, or advice.
    Thanks!!!
    Anna

  • #2
    I might not be a great reference but I don't think there IS a standard at this point.

    FastQC is a nice tool for getting a set of quality assessment tests and graphs all at once for the raw reads (in FASTQ format): http://www.bioinformatics.babraham.a...ojects/fastqc/. You might use the FastQC output to help decide whether to trim bases off the 5' or 3' ends of your reads. Some aligners, like BWA, can do that for you, and most aligners provide an option to specify some kind of threshold on the base qualities accepted for alignment. So tools like FastQC are there for you to check the quality of your run; they aren't directly used to control what you run through the aligners.
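
    As a rough illustration of the per-position quality check described above, here is a minimal Python sketch. It is not FastQC's implementation. It assumes 4-line FASTQ records and Phred+33 quality encoding (older Illumina pipelines, including some GAII runs, used Phred+64, so `phred_offset` may need changing). The function names, the toy reads, and the cutoff of 20 are all illustrative assumptions.

```python
import io


def mean_quality_per_position(fastq_handle, phred_offset=33):
    """Return the mean Phred quality at each read position.

    Assumes 4-line FASTQ records; reads may have different lengths.
    """
    totals = []  # sum of quality scores seen at each position
    counts = []  # number of reads that reach each position
    for i, line in enumerate(fastq_handle):
        if i % 4 != 3:  # the quality string is the 4th line of each record
            continue
        for pos, char in enumerate(line.rstrip("\n")):
            if pos == len(totals):
                totals.append(0)
                counts.append(0)
            totals[pos] += ord(char) - phred_offset
            counts[pos] += 1
    return [t / c for t, c in zip(totals, counts)]


def suggest_3prime_trim(means, min_quality=20):
    """Return how many leading positions to keep.

    Trailing positions whose mean quality is below min_quality are dropped.
    The cutoff of 20 is an arbitrary illustrative choice.
    """
    keep = len(means)
    while keep > 0 and means[keep - 1] < min_quality:
        keep -= 1
    return keep


# Two made-up reads whose quality drops off towards the 3' end.
example = io.StringIO(
    "@read1\nACGTAC\n+\nIIII5#\n"
    "@read2\nACGTAC\n+\nIIIH5#\n"
)
means = mean_quality_per_position(example)
keep = suggest_3prime_trim(means)
print(means)
print(keep)
```

    On real data you would pass an open file handle instead of the toy `io.StringIO`, and treat the suggested length only as a starting point next to the FastQC plots.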

    As for determining how "any given sample differs from the rest", this question can be pretty complex to answer. You can look at SNPs, differential gene expression, or splice variant differences (from a novel transcript assembler like Cufflinks). You can use the "tuxedo" pipeline to assess differential expression and splicing variation between samples. For SNPs I like to use samtools mpileup followed by bcftools for variant calling. After that I use bedtools to compare the VCF outputs from bcftools and determine which SNPs are unique to which samples, which are shared, etc.
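
    The shared-versus-unique SNP comparison can also be sketched in plain Python. This is a stand-in for illustration, not the bedtools workflow described above. It assumes uncompressed, tab-separated VCF text and identifies a variant by (CHROM, POS, REF, ALT), with a multi-allelic ALT field kept as a single string. The function names and the toy records are made up.

```python
import io


def load_variants(vcf_handle):
    """Collect (chrom, pos, ref, alt) keys from VCF text, skipping header lines."""
    variants = set()
    for line in vcf_handle:
        if line.startswith("#") or not line.strip():
            continue
        fields = line.rstrip("\n").split("\t")
        # VCF columns: CHROM, POS, ID, REF, ALT, ...
        variants.add((fields[0], int(fields[1]), fields[3], fields[4]))
    return variants


def compare_samples(variants_by_sample):
    """Return (variants shared by all samples, variants unique to each sample)."""
    shared = set.intersection(*variants_by_sample.values())
    unique = {}
    for name, variants in variants_by_sample.items():
        others = [v for other, v in variants_by_sample.items() if other != name]
        unique[name] = variants - set().union(*others)
    return shared, unique


# Made-up VCF records for two hypothetical samples.
vcf_a = io.StringIO(
    "##fileformat=VCFv4.1\n"
    "#CHROM\tPOS\tID\tREF\tALT\tQUAL\tFILTER\tINFO\n"
    "chr1\t100\t.\tA\tG\t50\t.\t.\n"
    "chr1\t200\t.\tC\tT\t40\t.\t.\n"
)
vcf_b = io.StringIO(
    "##fileformat=VCFv4.1\n"
    "#CHROM\tPOS\tID\tREF\tALT\tQUAL\tFILTER\tINFO\n"
    "chr1\t100\t.\tA\tG\t60\t.\t.\n"
    "chr2\t50\t.\tG\tA\t30\t.\t.\n"
)
shared, unique = compare_samples({
    "sample_a": load_variants(vcf_a),
    "sample_b": load_variants(vcf_b),
})
print(shared)
print(unique)
```

    Keep in mind that in RNA-Seq data a SNP can be "missing" from a sample simply because the gene is not expressed there, so unique calls are worth checking against coverage.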

    I've had good results clustering samples in R, using its hierarchical clustering function on gene expression output across multiple samples from multiple lines. However, determining why any sample clusters separately from the others (or, more specifically, producing a list of the genes responsible for the clustering) has been neither straightforward nor "established", from what I can tell.
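
    A quick way to spot a candidate outlier before any clustering is to look at the pairwise distances between samples. The Python sketch below is not the R analysis mentioned above and does not build a tree. It only computes a correlation distance (1 minus Pearson's r on log2-transformed values), a common choice of input for hierarchical clustering, and ranks samples by their mean distance to all the others. The sample names and expression values are invented.

```python
import math


def pearson(x, y):
    """Pearson correlation of two equal-length lists (undefined for constant input)."""
    n = len(x)
    mean_x, mean_y = sum(x) / n, sum(y) / n
    cov = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    var_x = sum((a - mean_x) ** 2 for a in x)
    var_y = sum((b - mean_y) ** 2 for b in y)
    return cov / math.sqrt(var_x * var_y)


def mean_distance_to_others(expression):
    """Map each sample to its mean (1 - Pearson r) distance to every other sample."""
    # log2(x + 1) keeps a handful of highly expressed genes from dominating.
    logged = {
        sample: [math.log2(value + 1) for value in values]
        for sample, values in expression.items()
    }
    scores = {}
    for sample in logged:
        distances = [
            1 - pearson(logged[sample], logged[other])
            for other in logged
            if other != sample
        ]
        scores[sample] = sum(distances) / len(distances)
    return scores


# Invented expression values: one list per sample, same gene order in each.
expression = {
    "ctrl_1": [10, 200, 35, 0, 80],
    "ctrl_2": [12, 180, 40, 1, 75],
    "ctrl_3": [9, 210, 30, 0, 90],
    "odd_1": [150, 5, 2, 60, 3],
}
scores = mean_distance_to_others(expression)
most_distant = max(scores, key=scores.get)
print(most_distant)
```

    A sample that ranks far above the rest is worth a closer look, but this does not say which genes drive the difference.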
    /* Shawn Driscoll, Gene Expression Laboratory, Pfaff
    Salk Institute for Biological Studies, La Jolla, CA, USA */


    • #3
      There is a program called RNASeqQC which is more useful than FastQC for this purpose.
