  • Monika_bioinf
    Junior Member
    • Sep 2011
    • 7

    smrtpipe.py, HGAP run - detailed description of reports generated

    Hello,

    I would like to learn a few details about the following reports from an HGAP run, generated by smrtpipe.py:



    Do you know how the fitted line is calculated?
    EDIT: I already figured this out and added a figure with an explanation in case anyone is interested.



    In the quality report, what does the vertical line represent? Is it the median read quality or a similar metric?

    Thank you.
    Last edited by Monika_bioinf; 05-14-2015, 01:36 PM.
  • gconcepcion
    Member
    • Dec 2010
    • 68

    #2
    That's not a vertical line; it's actually the same line as in the first figure that you annotated. It shows the amount of sequence (in Mb) contained in reads above a certain read quality. I'm not sure why the scale on the right side of the figure has been cropped off; it should be the same as in the first figure (Mb). I admit that it looks odd, almost like a vertical line, but not quite. It's always possible there is a bug in the report code. What do your filtered read statistics look like, pre- and post-filter?
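
To make the "Mb above a certain read quality" idea concrete, here is a minimal Python sketch of such a cumulative curve. It is only an illustration under stated assumptions, not the report's actual code: the function name, the example reads, and the choice to count reads at or above each threshold are all hypothetical.

```python
def megabases_above_quality(reads, thresholds):
    """For each threshold, sum the bases of all reads whose quality is
    at or above that threshold and return the totals in megabases.

    reads:      iterable of (read_length_in_bases, read_quality) pairs
    thresholds: iterable of read-quality cutoffs
    """
    reads = list(reads)  # allow the input to be a one-shot iterator
    return [
        sum(length for length, quality in reads if quality >= t) / 1e6
        for t in thresholds
    ]


# Hypothetical reads, chosen only to make the arithmetic easy to follow.
example_reads = [(2_000_000, 0.90), (1_000_000, 0.80), (500_000, 0.75)]
curve = megabases_above_quality(example_reads, [0.75, 0.80, 0.85, 0.90])
print(curve)  # [3.5, 3.0, 2.0, 2.0]
```

The curve can only stay flat or drop as the threshold rises, so if most reads share a narrow range of qualities it falls steeply within that range and can look almost vertical.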


    • GenoMax
      Senior Member
      • Feb 2008
      • 7142

      #3
      @gconcepcion: Consider this a feature request.

      The inflexibility of the plots in SMRTportal analysis reports (e.g. no way to adjust the axis scales) makes them hard to work with and understand. Can this be changed in a future version? Is the data used to generate them accessible somewhere in the directory hierarchy?


      • gconcepcion
        Member
        • Dec 2010
        • 68

        #4
        The request for greater flexibility in charting and reporting options is duly noted and is certainly on the table for future versions of SMRTAnalysis, but I can guarantee you that nothing is going to change in the 2.X.X time frame.

        All data used to create the figures can be found in the job directory.
        The two particular Filter Report figures posted in this thread are generated by the 'filter_stats.py' script which takes $JOB_DIR/data/filtered_summary.csv as input.

        The log for that particular task can be found at:
        $JOB_DIR/log/P_FilterReports/statsRpt.log

        The actual command to generate the figures, as well as the rest of the filter stats, is:
        filter_stats.py --output=$JOB_DIR --report=$JOB_DIR/results/filter_reports_filter_stats.json $JOB_DIR/data/filtered_summary.csv


        TL;DR: take a look at filtered_summary.csv if you want to play with the distributions in your analysis software of choice.
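
As a starting point for exploring filtered_summary.csv outside the report, here is a small Python sketch using only the standard library. The column names `Readlength` and `ReadScore` are assumptions and are not confirmed anywhere in this thread, so check the header line of your own file and pass the real names. The helper names and the demo rows are hypothetical.

```python
import csv


def load_length_quality(lines, length_col="Readlength", quality_col="ReadScore"):
    """Parse CSV text (an open file or any iterable of lines) into
    (read_length, read_quality) pairs.

    The default column names are assumptions; pass the names that
    actually appear in the header of your filtered_summary.csv.
    """
    return [
        (int(row[length_col]), float(row[quality_col]))
        for row in csv.DictReader(lines)
    ]


def summarize(pairs):
    """Return the number of reads and the total sequence in megabases."""
    return {
        "reads": len(pairs),
        "total_mb": sum(length for length, _ in pairs) / 1e6,
    }


# Hypothetical stand-in rows; for a real job, open the file instead, e.g.
#   with open("filtered_summary.csv", newline="") as handle:
#       pairs = load_length_quality(handle)
demo_lines = ["Readlength,ReadScore", "1000,0.85", "3000,0.80"]
pairs = load_length_quality(demo_lines)
summary = summarize(pairs)
print(summary)  # {'reads': 2, 'total_mb': 0.004}
```

Once the pairs are loaded, they can be binned, filtered, or plotted with whatever axis scales you prefer.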
