Seqanswers Leaderboard Ad

Collapse

Announcement

Collapse
No announcement yet.
X
 
  • Filter
  • Time
  • Show
Clear All
new posts

  • Breakdancer Output in multisample problem

    Hi everyone,
    I have 4 samples (matepair illumina) and run Breakdancer_max as follows:
    bam2cfg.pl -h -g -q 10 sample1.bam sample2.bam sample3.bam sample4.bam >sample1_2_sample3_4.cfg
    it worked without error msg.
    breakdancer-max -o chrX -q 10 -d Fastq_SVReadsFile_X -g GBrowse_OUTFile_X.bed -l -h sample1_2_sample3_4.cfg >chrX_SV_allSample.ctx
    This also worked generating outputs with last mag as Kahan error (which i ignored as read on other blogs https://www.biostars.org/p/97367/#98866)
    The problem is that the output file chrX_SV_allSample.ctx only contains SV specific to sample4. while all other samples contains NA in their columns. I donot know where i am making mistake.
    e.g.
    Software: 1.4.4-unstable-7-6213d5a (commit 6213d5a)
    Command: breakdancer-max -o chrX -q 10 -d Fastq_SVReadsFile_X -g GBrowse_OUTFile_X.bed -l -h sample1_2_sample3_4.cfg
    Library Statistics:
    sample4.bam mean:4218.95 std:552.31 uppercutoff:6393.21 lowercutoff:1975.9 readlen:50.82 library:tight reflen:111700677 seqcov:0.939753 phycov:39.008 1:8326 2:6802 3:2818 4:436 8:8414 32:20695
    sample4.bam mean:3889.35 std:680.67 uppercutoff:6865.34 lowercutoff:1401.76 readlen:50.82 library:wide reflen:111700677 seqcov:0.701866 phycov:26.8576 1:6586 2:2710 3:2022 4:326 8:6506 32:15637
    Chr1 Pos1 Orientation1 Chr2 Pos2 Orientation2 Type Size Score num_Reads num_Reads_lib Allele_frequency sample1.bam sample2.bam sample3.bam sample4.bam
    chrX 229 312+317- chrX 292963 312+317- INV 6106 99 118 sample4.bam|118 -nan NA NA NA NA
    chrX 327014 58+52- chrX 372108 58+52- INV -1620 70 20 sample4.bam|20 -nan NA NA NA NA
    chrX 372154 58+52- chrX 430662 103+43- INV 10905 99 43 sample4.bam|43 0.301335 NA NA NA 1.40
    chrX 491312 5+2- chrX 494412 5+2- INV -3118 34 2 sample4.bam|2 -nan NA NA NA NA

    Please guide.
    Any help or suggestion is valuable

  • #2
    Do your BAM files have proper readgroup information in the header and in the reads? Can you post your configuration file?

    Comment

    Latest Articles

    Collapse

    • seqadmin
      Essential Discoveries and Tools in Epitranscriptomics
      by seqadmin




      The field of epigenetics has traditionally concentrated more on DNA and how changes like methylation and phosphorylation of histones impact gene expression and regulation. However, our increased understanding of RNA modifications and their importance in cellular processes has led to a rise in epitranscriptomics research. “Epitranscriptomics brings together the concepts of epigenetics and gene expression,” explained Adrien Leger, PhD, Principal Research Scientist...
      04-22-2024, 07:01 AM
    • seqadmin
      Current Approaches to Protein Sequencing
      by seqadmin


      Proteins are often described as the workhorses of the cell, and identifying their sequences is key to understanding their role in biological processes and disease. Currently, the most common technique used to determine protein sequences is mass spectrometry. While still a valuable tool, mass spectrometry faces several limitations and requires a highly experienced scientist familiar with the equipment to operate it. Additionally, other proteomic methods, like affinity assays, are constrained...
      04-04-2024, 04:25 PM

    ad_right_rmr

    Collapse

    News

    Collapse

    Topics Statistics Last Post
    Started by seqadmin, 04-25-2024, 11:49 AM
    0 responses
    19 views
    0 likes
    Last Post seqadmin  
    Started by seqadmin, 04-24-2024, 08:47 AM
    0 responses
    17 views
    0 likes
    Last Post seqadmin  
    Started by seqadmin, 04-11-2024, 12:08 PM
    0 responses
    62 views
    0 likes
    Last Post seqadmin  
    Started by seqadmin, 04-10-2024, 10:19 PM
    0 responses
    60 views
    0 likes
    Last Post seqadmin  
    Working...
    X