Seqanswers Leaderboard Ad

Collapse

Announcement

Collapse
No announcement yet.
X
 
  • Filter
  • Time
  • Show
Clear All
new posts

  • CisGenome Analysis

    I could not find any topic committed to such a tool, therefore I decided to open a new one. In case it already exists, feel free to merge.

    I am handling some output sequences from Illumina GAII and trying to figure out how to analyze them.
    My ChIP is against histone modifications and I usually look at very large ditributions of signals (I would not call them "peaks", if you get what I mean).

    Now, I am trying to catch a good tool to analyze them and I was suggested to use CisGenome. I tried to figure out how it works, especially last updated feture (CisGenomev2) which is supposed to be quite easy and user friendly.
    However, when I include my sample and control and call for "peaks", I only get one peak, which I know cannot be true from other tools/observations.
    I must likely be doing some mistakes.

    If there someone who knows the tool, I will try to explain my issue.

    1) I am using genome database (hg18)
    2) I include my sample (high signal) and control (low signal)
    3) I then select the parameters:

    # Read Extension Length E: 150 (what am I supposed to use here?)
    # Bin Size B: 500 (I tried to increase over 3000)
    # Half Window Size W: 1
    # Max Gap: 50
    # Min Peak: 100
    # Standardize Windows Statistics: checked (what am I supposed to use here?)
    # Win Stat Cutoff >= 3 (what am I supposed to use here?)
    # Apply Local Read Sampling Rate Filter: checked (I tried with and w/o this with no differences in the output)

    ## Local Rate Window: 10000
    ## Local Rate Cutoff: 1e-005

    # Boundary Refinement: checked

    ## Boundary Resolution: 5

    4) I start the search and it turns out only one peak in a place where I know the signal is high.

    I wonder how this can be possible.
    My sample and control differ a lot in terms of reads. Namely, the control has a lower depth. Does this influence the readout?

    I apologise for the long and complicated post, I hope someone can help.

    Thanks in advance.

  • #2
    I am not an expert. My opinion is only for your reference.

    1.Win Stat Cutoff >= 3 (what am I supposed to use here?)
    the number is based on the exploration result. negbinomial_exp/obs<10%
    2. For boundary refinement: I don't checked if I do histone modifications thinking histone modification is different from TF binding that TF binding needs two strand while histone modification could be on single strand. but I may be wrong.

    Comment


    • #3
      Before tweaking software parameters it might actually be helpful to look at the raw genomic read coverage to confirm that the raw data looks as expected, in your case containing broad regions of enrichment (relative to control). Otherwise you try to get something from the software that your data does not provide.

      How many mapped reads do you have? The difference in reads will matter a lot, as it is not simple (i.e. impossible) to normalize the read counts properly. Therefore you will probably get many false positives/negatives.

      Comment


      • #4
        Thanks a lot for the replies.

        I totally agree that tweaking should be the refining part. However, I think that even though the filtering cuts down lots of the reads, the total read numbers are pretty much similar among the samples:
        Code:
                raw           mapped       uniq. map.    non-redundant
        1a   39,462,703     29,738,325     24,113,845     22,722,389
        1b   37,286,139     28,958,710     23,448,540     21,408,884
        1c   39,499,025     29,346,076     24,366,428     22,060,482
        2a   33,161,351     26,928,919     22,220,217     17,646,007
        2b   36,682,484     28,621,303     23,846,721     20,401,399
        2c   39,406,479     28,787,186     24,292,959     17,977,853
        What do you think?

        Comment


        • #5
          I've gotten good output with SICER when looking at such data sets. Follow their advice on installing Numpy and Scipy in the ReadMe.txt. Other then that it was pretty easy to get running.

          --------------
          Ethan

          Comment

          Latest Articles

          Collapse

          • seqadmin
            Non-Coding RNA Research and Technologies
            by seqadmin




            Non-coding RNAs (ncRNAs) do not code for proteins but play important roles in numerous cellular processes including gene silencing, developmental pathways, and more. There are numerous types including microRNA (miRNA), long ncRNA (lncRNA), circular RNA (circRNA), and more. In this article, we discuss innovative ncRNA research and explore recent technological advancements that improve the study of ncRNAs.

            Nobel Prize for MicroRNA Discovery
            This week,...
            Yesterday, 08:07 AM
          • seqadmin
            Recent Developments in Metagenomics
            by seqadmin





            Metagenomics has improved the way researchers study microorganisms across diverse environments. Historically, studying microorganisms relied on culturing them in the lab, a method that limits the investigation of many species since most are unculturable1. Metagenomics overcomes these issues by allowing the study of microorganisms regardless of their ability to be cultured or the environments they inhabit. Over time, the field has evolved, especially with the advent...
            09-23-2024, 06:35 AM
          • seqadmin
            Understanding Genetic Influence on Infectious Disease
            by seqadmin




            During the COVID-19 pandemic, scientists observed that while some individuals experienced severe illness when infected with SARS-CoV-2, others were barely affected. These disparities left researchers and clinicians wondering what causes the wide variations in response to viral infections and what role genetics plays.

            Jean-Laurent Casanova, M.D., Ph.D., Professor at Rockefeller University, is a leading expert in this crossover between genetics and infectious...
            09-09-2024, 10:59 AM

          ad_right_rmr

          Collapse

          News

          Collapse

          Topics Statistics Last Post
          Started by seqadmin, 10-02-2024, 04:51 AM
          0 responses
          86 views
          0 likes
          Last Post seqadmin  
          Started by seqadmin, 10-01-2024, 07:10 AM
          0 responses
          94 views
          0 likes
          Last Post seqadmin  
          Started by seqadmin, 09-30-2024, 08:33 AM
          1 response
          95 views
          0 likes
          Last Post EmiTom
          by EmiTom
           
          Started by seqadmin, 09-26-2024, 12:57 PM
          0 responses
          20 views
          0 likes
          Last Post seqadmin  
          Working...
          X