Seqanswers Leaderboard Ad

Collapse

Announcement

Collapse
No announcement yet.
X
 
  • Filter
  • Time
  • Show
Clear All
new posts

  • #16
    I am very interested on this discussion..
    Can Pysamstats give me the difference between reds (forward or reverse)?

    I need a table with these informations:

    Code:
    Chr/position/reference_base/read_base (number or %(A,C,G,T) found in forward or reverse strand)/insertion/deletion
    
    Example:
    
    chr1 20 A A:30 C:20 G:0 T:0 a:0 c:0 g:0 t:0 Ins:0 del:0
    IGV give me something like this when I put the arrow of the mouse over a read but I need of a table...

    Comment


    • #17
      Originally posted by Giffredo View Post
      I am very interested on this discussion..
      Can Pysamstats give me the difference between reds (forward or reverse)?
      Use variation_strand instead of variation. From the manual (https://pypi.python.org/pypi/pysamstats)

      * variation_strand - as variation but with forward/reverse strand counts

      Comment


      • #18
        Unfortunately I cannot install pysamstat package..
        So, Is it possible to reach the same results using GATK for you?.. looking at the tutorial I think it cannot discriminate the variation in one strand from the other.

        Comment


        • #19
          Originally posted by Giffredo View Post
          Unfortunately I cannot install pysamstat package..
          What problems do you encounter? Installation errors or permissions?
          pysamstats depends on numpy and pysam which can be tricky to install but it should be possible.

          Comment


          • #20
            I have not any idea.. I cannot install anything in this Lab. there is an expert for this and he told me that is not possible. I guess it is a problem about compatibility..

            Comment


            • #21
              My personal opinion:
              impossible == I am too lazy

              The differences between -z and -w:
              -z is just a memory optimization parameter. If you reduce the zoom level, you reduce the memory usage of the process
              -w The window size where the coverage will be calculated. If it is 25, it means you get coverage values for every 25 bp length slice.

              As far as I know GATK can not produce stats like this.

              Comment


              • #22
                Ok, I understood... In which way I could discriminate the splicing site osing pysamstat output?
                I would see the deletion sites present on my reads and not present on the references and I would check the presence of the classic splicing site bases...
                I m right or there are others easier methods?

                Comment


                • #23
                  When I insert this code
                  Code:
                  pysamstats -f x.fas --type variation_strand mapq_strand  x.sorted.bam > stat_pyout.txt&
                  Py gives me this error:
                  pysamstats: error: missing file operand

                  What it means?

                  instead using:

                  Code:
                  pysamstats -f x.fas --type variation_strand x.sorted.bam > pyout.txt&
                  Py works well...

                  What am I forgetting?

                  Comment


                  • #24
                    IGV tdf file

                    Hi all,

                    About coverage file from IGV, I generated a .tdf file with count in IGV with a bin of 25 bases.

                    When I visualize under IGV, I have values for exemple 145,76 for a bin.
                    If I multiply by 25, I obtain 3644 for this bin.

                    Now, when I zoom and obtain the coverage track calculated by IGV from .bam file, I see that I have about 33 reads by base so about 825 reads on the bin.

                    So seeing this difference, my question is : does someone know how coverage is calculted in IGV's .tdf files?

                    thanks!

                    Comment


                    • #25
                      Originally posted by Giffredo View Post
                      When I insert this code
                      Code:
                      pysamstats -f x.fas --type variation_strand mapq_strand  x.sorted.bam > stat_pyout.txt&
                      Py gives me this error:
                      pysamstats: error: missing file operand

                      What it means?

                      instead using:

                      Code:
                      pysamstats -f x.fas --type variation_strand x.sorted.bam > pyout.txt&
                      Py works well...

                      What am I forgetting?
                      You should pass only one option to the argument --type. So either variation_strand or mapq_strand, not both.

                      Comment

                      Latest Articles

                      Collapse

                      • seqadmin
                        Recent Advances in Sequencing Analysis Tools
                        by seqadmin


                        The sequencing world is rapidly changing due to declining costs, enhanced accuracies, and the advent of newer, cutting-edge instruments. Equally important to these developments are improvements in sequencing analysis, a process that converts vast amounts of raw data into a comprehensible and meaningful form. This complex task requires expertise and the right analysis tools. In this article, we highlight the progress and innovation in sequencing analysis by reviewing several of the...
                        05-06-2024, 07:48 AM
                      • seqadmin
                        Essential Discoveries and Tools in Epitranscriptomics
                        by seqadmin




                        The field of epigenetics has traditionally concentrated more on DNA and how changes like methylation and phosphorylation of histones impact gene expression and regulation. However, our increased understanding of RNA modifications and their importance in cellular processes has led to a rise in epitranscriptomics research. “Epitranscriptomics brings together the concepts of epigenetics and gene expression,” explained Adrien Leger, PhD, Principal Research Scientist...
                        04-22-2024, 07:01 AM

                      ad_right_rmr

                      Collapse

                      News

                      Collapse

                      Topics Statistics Last Post
                      Started by seqadmin, 05-14-2024, 07:03 AM
                      0 responses
                      19 views
                      0 likes
                      Last Post seqadmin  
                      Started by seqadmin, 05-10-2024, 06:35 AM
                      0 responses
                      42 views
                      0 likes
                      Last Post seqadmin  
                      Started by seqadmin, 05-09-2024, 02:46 PM
                      0 responses
                      53 views
                      0 likes
                      Last Post seqadmin  
                      Started by seqadmin, 05-07-2024, 06:57 AM
                      0 responses
                      42 views
                      0 likes
                      Last Post seqadmin  
                      Working...
                      X