Unconfigured Ad

Collapse
X
 
  • Filter
  • Time
  • Show
Clear All
new posts
  • liumangmang
    Junior Member
    • Nov 2010
    • 9

    human RNA seq alignment QC

    Hi,
    I used HISAT2 to align human RNA-seq data (Illumina PE, Stranded, rRNA depleted).
    We are interested in comparing gene expression levels, snp and different isoform usage between treatment conditions in the end.
    I got about 70-75% reads aligned concordantly 1 time and about 20% concordantly >1 time with less than 10% disconcordantly aligned. Overall mapped rate is over 95%.
    Is there a common benchmark that how much % unique concordantly mapped reads is typical for human RNA?
    I'm wondering how to determine whether alignment result is acceptable for carrying on the downstream analysis and how I could tweak with the parameters in HISAT if needed.....

    Thank you,
    Last edited by liumangmang; 10-05-2016, 11:59 AM.
  • SylvainL
    Senior Member
    • Feb 2012
    • 180

    #2
    Sounds good to me... Over 70% of uniquely mapped reads for Human is quite good. You also have to keep in mind that this percentage can obviously depend on the expressed genes...

    Comment

    • Michael.Ante
      Senior Member
      • Oct 2011
      • 127

      #3
      Hi,

      Before alignment I'd perform QC with FastQC and FastQC screen (especially in case of rRNA depletion).
      After alignment, I'd perform QC with a set of RSeQC's tools:
      • bam_stat.py
      • clipping_profile.py
      • geneBody_coverage.py
      • infer_experiment.py (if your library was strand-preserving)
      • inner_distance.py (for PE-runs)
      • read_distribution.py


      A lot of results can be collected by the multiQC reporting tool.

      Cheers,
      Michael

      Comment

      • Persistent LABS
        Member
        • Apr 2016
        • 21

        #4
        Hi liumangmang,
        Your alignment summary looks good. There is a similar post here: http://seqanswers.com/forums/showthread.php?t=29769
        You can refer this.
        Persistent LABS

        Comment

        • liumangmang
          Junior Member
          • Nov 2010
          • 9

          #5
          Thank you all, I'll try those QC tools and come back to update.

          Comment

          • liumangmang
            Junior Member
            • Nov 2010
            • 9

            #6
            Update:
            We get average about 15%-25% multi mapped reads. And there are some public available data from SRA (same tissue type and protocal), also get similar multi mapped ratio.
            And RSeQC + multiQC really worked well, it gave me pretty summary report!

            Thank you all.

            Comment

            Latest Articles

            Collapse

            ad_right_rmr

            Collapse

            News

            Collapse

            Topics Statistics Last Post
            Started by SEQadmin2, Yesterday, 10:09 AM
            0 responses
            10 views
            0 reactions
            Last Post SEQadmin2  
            Started by SEQadmin2, 06-04-2026, 08:59 AM
            0 responses
            17 views
            0 reactions
            Last Post SEQadmin2  
            Started by SEQadmin2, 06-02-2026, 12:03 PM
            0 responses
            26 views
            0 reactions
            Last Post SEQadmin2  
            Started by SEQadmin2, 06-02-2026, 11:40 AM
            0 responses
            21 views
            0 reactions
            Last Post SEQadmin2  
            Working...