Seqanswers Leaderboard Ad

Collapse

Announcement

Collapse
No announcement yet.
X
 
  • Filter
  • Time
  • Show
Clear All
new posts

  • Linux based enzyme restriction tool

    Hi all,
    I guess that the answer should be simple, but honestly i googled it and didnt find the answe...
    What im looking for is linux based tool to recognize enzyme restriction sites:
    1) over the genome (reference genome)
    2) over SAM/BAM file

    Right now what im doing is simply using Bowtie2 for multiple perfect hits to solve the first problem (reference genome), but it cant solve the SAM/BAM file.
    Moreover, i want to go over all the the rebase database (http://rebase.neb.com/rebase/rebase.ftp.html), and im sure there is already a tool that doing it...

    Any ideas?
    Thanks,
    Pap

  • #2
    I wrote a genomic digester , see https://www.biostars.org/p/86480/#86535

    Code:
    curl -s "http://hgdownload.cse.ucsc.edu/goldenPath/hg19/chromosomes/chr3.fa.gz" |\
    gunzip -c  |\
    java -jar dist/biostar86480.jar -E AarI -E EcoRI 
    
    chr3    60645    60651    GAATTC    1000    +    EcoRI    G^AATTC
    chr3    60953    60959    GAATTC    1000    +    EcoRI    G^AATTC
    chr3    68165    68172    GCAGGTG    1000    -    AarI    CACCTGC(4/8)
    chr3    70263    70269    GAATTC    1000    +    EcoRI    G^AATTC
    chr3    70945    70952    GCAGGTG    1000    -    AarI    CACCTGC(4/8)
    chr3    71140    71146    GAATTC    1000    +    EcoRI    G^AATTC
    chr3    72264    72270    GAATTC    1000    +    EcoRI    G^AATTC
    chr3    74150    74156    GAATTC    1000    +    EcoRI    G^AATTC
    chr3    75063    75069    GAATTC    1000    +    EcoRI    G^AATTC
    chr3    78438    78444    GAATTC    1000    +    EcoRI    G^AATTC
    chr3    81052    81059    CACCTGC    1000    +    AarI    CACCTGC(4/8)
    chr3    84498    84504    GAATTC    1000    +    EcoRI    G^AATTC
    chr3    84546    84552    GAATTC    1000    +    EcoRI    G^AATTC
    chr3    84780    84787    CACCTGC    1000    +    AarI    CACCTGC(4/8)
    chr3    87771    87777    GAATTC    1000    +    EcoRI    G^AATTC
    chr3    95344    95351    GCAGGTG    1000    -    AarI    CACCTGC(4/8)
    chr3    96358    96364    GAATTC    1000    +    EcoRI    G^AATTC
    chr3    96734    96740    GAATTC    1000    +    EcoRI    G^AATTC
    chr3    105956    105962    GAATTC    1000    +    EcoRI    G^AATTC

    Comment


    • #3
      Thanks lindenb!
      I saw your post before,but i have there a compliation error and our system admin is not in the area.. so we cant install anything right now..
      The error is about the "Requires apache ANT."

      Anyway, can it solve also the SAM/BAM problem?

      Thanks,
      Pap

      Originally posted by lindenb View Post
      I wrote a genomic digester , see https://www.biostars.org/p/86480/#86535

      Code:
      curl -s "http://hgdownload.cse.ucsc.edu/goldenPath/hg19/chromosomes/chr3.fa.gz" |\
      gunzip -c  |\
      java -jar dist/biostar86480.jar -E AarI -E EcoRI 
      
      chr3    60645    60651    GAATTC    1000    +    EcoRI    G^AATTC
      chr3    60953    60959    GAATTC    1000    +    EcoRI    G^AATTC
      chr3    68165    68172    GCAGGTG    1000    -    AarI    CACCTGC(4/8)
      chr3    70263    70269    GAATTC    1000    +    EcoRI    G^AATTC
      chr3    70945    70952    GCAGGTG    1000    -    AarI    CACCTGC(4/8)
      chr3    71140    71146    GAATTC    1000    +    EcoRI    G^AATTC
      chr3    72264    72270    GAATTC    1000    +    EcoRI    G^AATTC
      chr3    74150    74156    GAATTC    1000    +    EcoRI    G^AATTC
      chr3    75063    75069    GAATTC    1000    +    EcoRI    G^AATTC
      chr3    78438    78444    GAATTC    1000    +    EcoRI    G^AATTC
      chr3    81052    81059    CACCTGC    1000    +    AarI    CACCTGC(4/8)
      chr3    84498    84504    GAATTC    1000    +    EcoRI    G^AATTC
      chr3    84546    84552    GAATTC    1000    +    EcoRI    G^AATTC
      chr3    84780    84787    CACCTGC    1000    +    AarI    CACCTGC(4/8)
      chr3    87771    87777    GAATTC    1000    +    EcoRI    G^AATTC
      chr3    95344    95351    GCAGGTG    1000    -    AarI    CACCTGC(4/8)
      chr3    96358    96364    GAATTC    1000    +    EcoRI    G^AATTC
      chr3    96734    96740    GAATTC    1000    +    EcoRI    G^AATTC
      chr3    105956    105962    GAATTC    1000    +    EcoRI    G^AATTC

      Comment


      • #4
        I don't think you need a sysadmin ,just download ant from http://ant.apache.org/bindownload.cgi and add the executable to your PATH.

        Running this for a SAM file would be easy but I just wonder why you would need this ... :-)

        Comment

        Latest Articles

        Collapse

        • seqadmin
          Understanding Genetic Influence on Infectious Disease
          by seqadmin




          During the COVID-19 pandemic, scientists observed that while some individuals experienced severe illness when infected with SARS-CoV-2, others were barely affected. These disparities left researchers and clinicians wondering what causes the wide variations in response to viral infections and what role genetics plays.

          Jean-Laurent Casanova, M.D., Ph.D., Professor at Rockefeller University, is a leading expert in this crossover between genetics and infectious...
          09-09-2024, 10:59 AM
        • seqadmin
          Addressing Off-Target Effects in CRISPR Technologies
          by seqadmin






          The first FDA-approved CRISPR-based therapy marked the transition of therapeutic gene editing from a dream to reality1. CRISPR technologies have streamlined gene editing, and CRISPR screens have become an important approach for identifying genes involved in disease processes2. This technique introduces targeted mutations across numerous genes, enabling large-scale identification of gene functions, interactions, and pathways3. Identifying the full range...
          08-27-2024, 04:44 AM

        ad_right_rmr

        Collapse

        News

        Collapse

        Topics Statistics Last Post
        Started by seqadmin, Today, 06:25 AM
        0 responses
        13 views
        0 likes
        Last Post seqadmin  
        Started by seqadmin, Yesterday, 01:02 PM
        0 responses
        12 views
        0 likes
        Last Post seqadmin  
        Started by seqadmin, 09-18-2024, 06:39 AM
        0 responses
        14 views
        0 likes
        Last Post seqadmin  
        Started by seqadmin, 09-11-2024, 02:44 PM
        0 responses
        14 views
        0 likes
        Last Post seqadmin  
        Working...
        X