Seqanswers Leaderboard Ad

**seq_GA** · 10-26-2009, 11:45 PM

Hi Aaron,
I am trying to compare two bed files. For example I started exploring a small example as below to test the usuage of the tool.

Code:

track name=pairedReads2 description="Clone Paired Reads2" useScore=1
chr22   1000    5000    cloneA  960     +       1000    5000    0       2       567,488,        0,3512
chr22   2000    6000    cloneB  900     -       2000    6000    0       2       433,399,        0,3601

But I get the error as below:

HTML Code:

 ./mergeBed -i ../../chr22_data/test2.bed 
Only one BED field detected: 1.  Verify that your files are TAB-delimited.  Exiting... 

or 

 ./mergeBed -i ../../chr22_data/test1.bed 
Unexpected number of fields: 1.  Verify that your files are TAB-delimited and that your BED file has 3,4,5 or 6 fields.  Exiting...

How do I proceed further. I have a bed file with 12 columns. B'cos each line in the bed file contains 2 blocks of sequence. Is it possible to use the tool for this kind of analysis. Please verify. Thanks.

**quinlana** · 10-27-2009, 05:13 AM

Originally posted by seq_GA View Post

Hi Aaron,
I am trying to compare two bed files. For example I started exploring a small example as below to test the usuage of the tool.

Code:

track name=pairedReads2 description="Clone Paired Reads2" useScore=1
chr22   1000    5000    cloneA  960     +       1000    5000    0       2       567,488,        0,3512
chr22   2000    6000    cloneB  900     -       2000    6000    0       2       433,399,        0,3601

But I get the error as below:

HTML Code:

 ./mergeBed -i ../../chr22_data/test2.bed 
Only one BED field detected: 1.  Verify that your files are TAB-delimited.  Exiting... 

or 

 ./mergeBed -i ../../chr22_data/test1.bed 
Unexpected number of fields: 1.  Verify that your files are TAB-delimited and that your BED file has 3,4,5 or 6 fields.  Exiting...

How do I proceed further. I have a bed file with 12 columns. B'cos each line in the bed file contains 2 blocks of sequence. Is it possible to use the tool for this kind of analysis. Please verify. Thanks.

Hi,
BEDTools only supports tab-delimited BED files with a minimum of 3 (chrom, start and end) fields and a maximum of 6 (optionally adding name, score and strand).

For example, if you extracted the first 6 columns of your example file, it could be merged as follows:

PHP Code:


$ cut -f 1-6 test.bed | mergeBed -i stdin

chr22    1000    6000

I also note that you seem to be dealing with paired sequences. BEDTools has a utility (peIntersectBed) that will intersect paired-end fearures with normal BED files. The file format paired-end BED entries can be found by using the "-h" option with peIntersectBed.

Lastly, if you are using exactly version 2.0.0, there is a much newer version available here:
http://code.google.com/p/bedtools.

All the best,
Aaron

**quinlana** · 10-27-2009, 05:55 AM

I should also note that one can track the names of which entries were merged (separated by a semicolon) by using the "-names" option.

From your example:

PHP Code:


$ cut -f 1-6 test.bed | mergeBed -i stdin -names

chr22    1000    6000    cloneA;cloneB

This is undocumented in the help and I am changing this as we "speak".
--Aaron

**seq_GA** · 10-28-2009, 01:08 AM

Hi Aaron,

Thanks for your response. I have downloaded the recent version and start using.

Code:

./mergeBed -n -i ../newdata/full.bed > /../newdata/merged.bed

The above command works.

When I try to force with -s options to check the strand information, I don't get any output.

Code:

./mergeBed -n -s -i ../newdata/full.bed > /../newdata/merged.bed

Without strand, it works fine. Even in the example you have give above no strand info is being printed in the output. Why is it so?

Basically I am trying to remove duplicate records and merge them as 1 record.

Thanks and Regards

**quinlana** · 10-28-2009, 06:23 AM

Originally posted by seq_GA View Post

Hi Aaron,

Thanks for your response. I have downloaded the recent version and start using.

Code:

./mergeBed -n -i ../newdata/full.bed > /../newdata/merged.bed

The above command works.

When I try to force with -s options to check the strand information, I don't get any output.

Code:

./mergeBed -n -s -i ../newdata/full.bed > /../newdata/merged.bed

Without strand, it works fine. Even in the example you have give above no strand info is being printed in the output. Why is it so?

Basically I am trying to remove duplicate records and merge them as 1 record.

Thanks and Regards

Hmm, it works as expected for me using Version 2.2.4. test.bed below is the same as your file above.

__without__ strand, thus ignores the fact that the two entries are on different strands and combines them:

PHP Code:


$ cut -f 1-6 test.bed | mergeBed -i stdin -names

chr22    1000    6000    cloneA;cloneB

__with__ strand, thus observes the fact that the two entries are on different strands and does not combines them:

PHP Code:


$ cut -f 1-6 test.bed | mergeBed -i stdin -s

chr22    1000    5000    +

chr22    2000    6000    -

**ewilbanks** · 11-05-2009, 02:45 PM

Hi Aaron,

How would you like me to cite your tools if we use them in a publication?

Thanks!
Lizzy

**quinlana** · 11-06-2009, 07:00 AM

Originally posted by ewilbanks View Post

Hi Aaron,

How would you like me to cite your tools if we use them in a publication?

Thanks!
Lizzy

Hi Lizzy,
We are working on the manuscript, but until then, please cite it as: Aaron R. Quinlan and Ira M. Hall, unpublished: http://code.google.com/p/bedtools/).
Thanks for asking and good luck with your manuscript.
Aaron

Topics	Statistics	Last Post
Gene Misexpression in the Healthy Human Population by seqadmin Started by seqadmin, Yesterday, 06:46 AM	0 responses 9 views 0 likes	Last Post by seqadmin Yesterday, 06:46 AM
New Method for Rapid Genetic Diagnosis of Mendelian Disorders by seqadmin Started by seqadmin, 07-24-2024, 11:09 AM	0 responses 26 views 0 likes	Last Post by seqadmin 07-24-2024, 11:09 AM
Advancing Nanopore Technology for Portable Sensing Devices by seqadmin Started by seqadmin, 07-19-2024, 07:20 AM	0 responses 159 views 0 likes	Last Post by seqadmin 07-19-2024, 07:20 AM
New RNA-Based Gene Writing Technology Achieves Precise Gene Integration by seqadmin Started by seqadmin, 07-16-2024, 05:49 AM	0 responses 127 views 0 likes	Last Post by seqadmin 07-16-2024, 05:49 AM

Seqanswers Leaderboard Ad

Announcement

BEDTools Version 2.0

Comment

Comment

Comment

Comment

Comment

Comment

Comment

Latest Articles

ad_right_rmr

News