Seqanswers Leaderboard Ad

Collapse

Announcement

Collapse
No announcement yet.
X
 
  • Filter
  • Time
  • Show
Clear All
new posts

  • problems with bedtools mapBed

    I am trying to use mapBed but I am running into a problem that I can illustrate with the examples below. I am using this testfile.

    >cat test.bed
    chr1 100000 109000
    chr2 100000 109000
    chr3 100000 109000
    chr4 100000 109000
    chr5 100000 109000
    chr6 100000 109000
    chr7 100000 109000
    chr8 100000 109000
    chr9 100000 109000
    chr10 100000 109000
    chr11 100000 109000
    chr12 100000 109000
    chr13 100000 109000
    chr14 100000 109000
    chr15 100000 109000
    chr16 100000 109000
    chr17 100000 109000
    chr18 100000 109000
    chr19 100000 109000
    chr20 100000 109000
    chr21 100000 109000
    chr22 100000 109000

    I am querying that against a bedgraph file that I downloaded and sorted from UCSC genome browser. I can't show the whole file but I can show you how it is sorted.

    >cat wgEncodeSydhHistoneK562bH3k4me3bUcdSig.bedgraph.noM | awk '{print $1}' | uniq
    chr1
    chr2
    chr3
    chr4
    chr5
    chr6
    chr7
    chr8
    chr9
    chr10
    chr11
    chr12
    chr13
    chr14
    chr15
    chr16
    chr17
    chr18
    chr19
    chr20
    chr21
    chr22
    chrX

    When I run the command...

    >mapBed -c 3 -o sum -a test.bed -b wgEncodeSydhHistoneK562H3k9acbUcdSig.bedgraph.noM

    I get...

    chr1 100000 109000 1584710
    chr2 100000 109000 6965660
    chr3 100000 109000 4067179
    chr4 100000 109000 3997623
    chr5 100000 109000 22672757
    chr6 100000 109000 100786
    chr7 100000 109000 13187590
    chr8 100000 109000 .
    chr9 100000 109000 5536045
    chr10 100000 109000 .
    chr11 100000 109000 .
    chr12 100000 109000 .
    chr13 100000 109000 .
    chr14 100000 109000 .
    chr15 100000 109000 .
    chr16 100000 109000 .
    chr17 100000 109000 .
    chr18 100000 109000 .
    chr19 100000 109000 .
    chr20 100000 109000 .
    chr21 100000 109000 .
    chr22 100000 109000 .

    It is fine that I dont get anything for chr8. I checked that one and there is no coverage in that region. But for the later chromosomes (10 and on) I should be seeing a signal. Can anyone explain why this is happening or what I can do to fix it? mapBed says...

    "Notes:
    (1) Both input files must be sorted by chrom, then start."

    Are my files sorted wrong? Thanks so much.
    Doug
    www.sharedproteomics.com

  • #2
    column

    Hi Doug,

    I know its a super old post, but i came across it, since i have the same problem.

    Though i can see from your command that you used -c 3 for a bedgraph file, it should be -c 4 (4th column is the score in bedgraph).

    My problem is, i am in a similar situation and mapBed is assigning 0's to coordinates that i know are non-zero!

    Comment

    Latest Articles

    Collapse

    • seqadmin
      Essential Discoveries and Tools in Epitranscriptomics
      by seqadmin




      The field of epigenetics has traditionally concentrated more on DNA and how changes like methylation and phosphorylation of histones impact gene expression and regulation. However, our increased understanding of RNA modifications and their importance in cellular processes has led to a rise in epitranscriptomics research. “Epitranscriptomics brings together the concepts of epigenetics and gene expression,” explained Adrien Leger, PhD, Principal Research Scientist...
      Yesterday, 07:01 AM
    • seqadmin
      Current Approaches to Protein Sequencing
      by seqadmin


      Proteins are often described as the workhorses of the cell, and identifying their sequences is key to understanding their role in biological processes and disease. Currently, the most common technique used to determine protein sequences is mass spectrometry. While still a valuable tool, mass spectrometry faces several limitations and requires a highly experienced scientist familiar with the equipment to operate it. Additionally, other proteomic methods, like affinity assays, are constrained...
      04-04-2024, 04:25 PM

    ad_right_rmr

    Collapse

    News

    Collapse

    Topics Statistics Last Post
    Started by seqadmin, 04-11-2024, 12:08 PM
    0 responses
    58 views
    0 likes
    Last Post seqadmin  
    Started by seqadmin, 04-10-2024, 10:19 PM
    0 responses
    53 views
    0 likes
    Last Post seqadmin  
    Started by seqadmin, 04-10-2024, 09:21 AM
    0 responses
    45 views
    0 likes
    Last Post seqadmin  
    Started by seqadmin, 04-04-2024, 09:00 AM
    0 responses
    55 views
    0 likes
    Last Post seqadmin  
    Working...
    X