**maasha** · 01-22-2011, 02:21 AM

Hello,

This can be done with Biopieces (www.biopieces.org) using digest_seq, here with BamHI as an example:

Code:

read_fasta -i genome.fna | digest_seq -p GGATCC -c 1 | plot_lendist -k SEQ_LEN -x

To get a BED file:

Code:

read_fasta -i genome.fna | digest_seq -p GGATCC -c 1 | rename_keys -k SEQ_NAME,S_ID | write_bed -xo fragments.bed

Or to do both in one go:

Code:

read_fasta -i genome.fna |
digest_seq -p GGATCC -c 1 |
plot_lendist -k SEQ_LEN -t post -o dist_plot.ps |
rename_keys -k SEQ_NAME,S_ID |
write_bed -xo fragments.bed

Restriction enzyme patterns and cut positions can be found at REBASE (http://rebase.neb.com) or by typing "rescan_seq --help".
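To make the pattern and cut position concrete, here is a minimal sketch in plain shell and awk, independent of Biopieces. The function name `digest` is hypothetical, and it assumes that a cut position of 1 means the cut falls after the first base of the site, which matches BamHI (G^GATCC). It handles one sequence given as a string and ignores strand and ambiguity codes.

```shell
# Hypothetical helper, not part of Biopieces: split one sequence at every
# occurrence of a recognition site, cutting `cut` bases into the site.
# Usage: digest SEQUENCE SITE CUT
digest() {
    printf '%s\n' "$1" | awk -v site="$2" -v cut="$3" '{
        s = toupper($0); site = toupper(site)
        start = 1                            # first base of the current fragment
        from = 1                             # where the next search begins
        while ((rel = index(substr(s, from), site)) > 0) {
            pos = from + rel - 1             # absolute start of the site
            cutpos = pos + cut               # first base after the cut
            print substr(s, start, cutpos - start)
            start = cutpos
            from = pos + 1
        }
        print substr(s, start)               # trailing fragment
    }'
}

# Two BamHI sites, cut after the first G of each:
digest AAAGGATCCTTTTGGATCCAA GGATCC 1
# prints:
# AAAG
# GATCCTTTTG
# GATCCAA
```

The fragment lengths (4, 10 and 7) add up to the 21 bases of the input, because the bases of the site stay in the fragments on either side of the cut.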

Cheers,

Martin

**azneto** · 01-22-2011, 07:02 AM

You definitely should take a look at the remap tool from the EMBOSS package.

EMBOSS: remap

http://emboss.bioinformatics.nl/cgi-bin/emboss/remap

Cheers,
Adhemar

**sunsnow86** · 01-24-2011, 12:23 PM

thank you!

Thank you for helping me out! I will give it a try.

**sunsnow86** · 01-25-2011, 12:50 PM

I am having difficulty running the command. I installed the packages on my desktop and followed the instructions listed on the website. I am not sure whether the code has been sourced: when I run the test command, nothing seems to have changed. Could you give more detailed information on how to install and test it? I am a bench scientist and not that familiar with command-line programs. Thanks

nexgen@nexgen-desktop:~/Desktop/biopieces$ bp_test
bp_test: command not found

**maasha** · 01-25-2011, 01:15 PM

Did you add the following section to your ~/.bashrc file?

Code:

# >>>>>>>>>>>>>>>>>>>>>>> Enabling Biopieces if installed <<<<<<<<<<<<<<<<<<<<<<<

# Modify the below paths according to your settings.
# If you have followed the installation step-by-step as described above,
# the below should work just fine.

export BP_DIR="$HOME/biopieces"  # Directory where biopieces are installed
export BP_DATA="$HOME/BP_DATA"   # Contains genomic data etc.
export BP_TMP="$HOME/tmp"        # Required temporary directory.
export BP_LOG="$HOME/BP_LOG"     # Required log directory.

if [ -f "$BP_DIR/bp_conf/bashrc" ]; then
    source "$BP_DIR/bp_conf/bashrc"
fi  

# >>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>><<<<<<<<<<<<<<<<<<<<<<<<<<<<<<<<<<<<<<<

And then run the command:

Code:

source ~/.bashrc
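If `bp_test` is still not found after that, a quick check of the current shell can narrow things down. The sketch below only assumes the four variables from the section above and that `bp_test` should be on the PATH once the Biopieces bashrc has been sourced; the function name `check_biopieces_env` is made up for illustration.

```shell
# Sketch: report whether the Biopieces environment looks usable in this shell.
# check_biopieces_env is a hypothetical name, not a Biopieces command.
check_biopieces_env() {
    status=0
    for var in BP_DIR BP_DATA BP_TMP BP_LOG; do
        eval "dir=\${$var:-}"                # value of the variable named in $var
        if [ -z "$dir" ]; then
            echo "$var is not set"
            status=1
        elif [ ! -d "$dir" ]; then
            echo "$var points to a missing directory: $dir"
            status=1
        fi
    done
    if ! command -v bp_test >/dev/null 2>&1; then
        echo "bp_test is not on PATH"
        status=1
    fi
    return $status
}

check_biopieces_env || echo "Biopieces is not fully set up in this shell"
```

An unset variable usually means the section is missing from ~/.bashrc or the file has not been sourced in this terminal; a missing directory means the path in the export line does not match where things were actually installed.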

Martin

**raphael123** · 07-02-2014, 01:33 PM

Hi, I am facing the same problem: in silico digestion at CCGG.
I have a file with hg19, one line per chromosome sequence, and I do:

Code:

cat hg19.txt | sed "s/CCGG/\n/g" | awk '{l=length($1); mem[l]++;}
END{for(i=0;i<=1000;i++){print mem[i]}}'

Is this a stupid approach? I don't understand why this team has such different results...

Here is one of my results, for instance: I get 9975 cases of a single nucleotide between two CCGG sites.

Any ideas?
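One possible source of the difference, offered as a guess rather than a diagnosis: the sed command deletes the four bases of each site, whereas a real digest leaves them on the fragments, and the match is case-sensitive, so sites in lowercase (soft-masked) sequence would be missed. The sketch below assumes an MspI-style cut (C^CGG) and one sequence per line; the function name `ccgg_lengths` is hypothetical, and it has only been reasoned through on tiny inputs, not run on whole chromosomes.

```shell
# Hypothetical helper: tabulate fragment lengths for a C^CGG digest of
# one-sequence-per-line input, keeping the site bases in the fragments.
ccgg_lengths() {
    awk -F'CCGG' '{
        $0 = toupper($0)            # re-splits the line, so lowercase sites match
        for (i = 1; i <= NF; i++) {
            len = length($i)
            if (i < NF) len += 1    # the C left of the cut stays on this fragment
            if (i > 1)  len += 3    # the CGG right of the previous cut starts it
            count[len]++
        }
    }
    END { for (len in count) print len "\t" count[len] }' | sort -n
}

# Tiny example with one uppercase and one lowercase site:
printf '%s\n' AACCGGTccggTAAA | ccgg_lengths
# prints (length, count):
# 3	1
# 5	1
# 7	1
```

Under these assumptions, a single nucleotide between two sites corresponds to a 5-base fragment (CGG + N + C), so a histogram built by deleting the sites is shifted by four bases for internal fragments compared with one built from real cut positions.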

Is there a in silico enzyme digestion script?