Header Leaderboard Ad

Collapse

Cufflinks 2.0.2 segmentation fault

Collapse

Announcement

Collapse
No announcement yet.
X
 
  • Filter
  • Time
  • Show
Clear All
new posts

  • biocomputer
    replied
    I was able to solve my problem. Despite the "-b genome.fa" seemingly being the cause of the problem it's actually the .gtf file. See here how to modify the .gtf file:

    https://groups.google.com/d/msg/tuxe...c/p47AwnCXxvwJ

    https://groups.google.com/d/msg/tuxe...U/LJhbCHBsITAJ
    Last edited by biocomputer; 01-21-2015, 12:28 PM.

    Leave a comment:


  • offspring
    replied
    Please file an issue report at https://github.com/cole-trapnell-lab/cufflinks containing a description of the problem and how to reproduce it, otherwise the cufflinks team won't even be aware of the problem.

    Leave a comment:


  • biocomputer
    replied
    Originally posted by sterding View Post
    hi Anelda,

    I made the genome.fa and gtf file in the same order, but still I got the " Segmentation fault" error in the step of " Learning bias parameters" if I use -b option. Without the "-b" option, I don't get the error. So, I think the bug is in the "-b" option. Hopefully cufflinks team can get attention to the problem.
    I'm using Cufflinks 2.2.1 and having the same problem with cufflinks and cuffdiff. -b causes a segfault, it works fine without it. I ensured the genome.fa and gtf file have their chromosomes in the same order and contain the same chromosomes and there is lots of free memory available.

    Leave a comment:


  • sterding
    replied
    hi Anelda,

    I made the genome.fa and gtf file in the same order, but still I got the " Segmentation fault" error in the step of " Learning bias parameters" if I use -b option. Without the "-b" option, I don't get the error. So, I think the bug is in the "-b" option. Hopefully cufflinks team can get attention to the problem.

    Leave a comment:


  • sterding
    replied
    Originally posted by Anelda View Post
    I have found that the order of the reference chromosomes in the genome.fasta file and the chromosomes in the GFF/GTF file, must be exactly the same otherwise a segmentation fault occurs.
    Thanks. I am testing this. I am also curious how you found the trick

    Leave a comment:


  • Anelda
    replied
    I have found that the order of the reference chromosomes in the genome.fasta file and the chromosomes in the GFF/GTF file, must be exactly the same otherwise a segmentation fault occurs. This is specifically valid in the case of Cufflinks. To demonstrate..

    grep ">" genome.fasta > fasta.order
    cut -f 1 genome.gff | uniq > gff.order

    diff fasta.order gff.order

    If the order of the chromosomes are not the same, you'll have to reshuffle. Easiest might be to reshuffle the GFF/GTF - I'm not sure if there are any scripts that can sort fasta/gff files. I just grep each chromosome from the GFF file and send it to a separate file, then cat the individual chromosome.gff files in the correct order and create new genome.gff.

    Hope this helps someone!

    Leave a comment:


  • chrisjohn86
    replied
    I have had a lot of segmentation faults with tuxedo over the last few weeks. I finally figured out it was due to bad RAM, I removed 8GB of 16GB total and it started working fine. Memoryxp is a good RAM diagnostic tool. This is one possible reason for seg faults. There are others:
    http://www.cyberciti.biz/tips/segmen...inux-unix.html

    Leave a comment:


  • bmicro_mit1
    replied
    Yes, nothing in the 'o' file, and only what I had put in the initial post from the 'e' file.

    Leave a comment:


  • GenoMax
    replied
    Are you using the -o and -e directives with your qsub (or SGE job submission script) to capture the output/stderr output? Contents of those would be useful as well.

    Leave a comment:


  • bmicro_mit1
    replied
    No, just the segmentation fault. I am running with verbose mode right now to see if there is more output.

    Leave a comment:


  • GenoMax
    replied
    Are there any other error messages (stderr output)?

    Leave a comment:


  • bmicro_mit1
    replied
    This was run on a couple of different machines in a SGE cluster. Some of the nodes had up to 48Gb of RAM, but in the SGE email reporting the program had died, it never reported more than 5Gb of memory usage.

    Leave a comment:


  • GenoMax
    replied
    How much RAM do you have on this machine?

    Leave a comment:


  • bmicro_mit1
    started a topic Cufflinks 2.0.2 segmentation fault

    Cufflinks 2.0.2 segmentation fault

    I am using 100 bp paired end Illumina Hi-seq data with about 50M reads and trying to use tophat / cufflinks for RNA-seq analysis for human data, using Ensemble v68 gtf along with Gencode v13 lncRNA gtf annotations. These files were concatenated together to run both tophat 2.0.6 with bowtie2 2.0.6:

    tophat -p 4 --solexa1.3-quals --read-realign-edit-dist 0 --no-novel-juncs --library-type fr-unstranded -G $GTF -o $OUT $GENOME $FASTQ_1 $FASTQ_2

    and Cufflinks 2.0.2
    cufflinks -o $OUT -p 4 -G $GTF -b $FASTA --multi-read-correct $OUT/accepted_hits.bam

    A segmentation fault has continued to occur with multiple samples at similar locations as Cufflinks is re-estimating abundacnes with bias and multi-read correction. Below is the output:

    [09:41:21] Learning bias parameters.
    > Processed 635 loci. [*************************] 100%
    [09:45:58] Re-estimating abundances with bias and multi-read correction.
    > Processing Locus chr16:5289802-6826015 [******* ] 31%Segmentation fault (core dumped)

    Any input would be greatly appreciated.

Latest Articles

Collapse

  • seqadmin
    Improved Targeted Sequencing: A Comprehensive Guide to Amplicon Sequencing
    by seqadmin



    Amplicon sequencing is a targeted approach that allows researchers to investigate specific regions of the genome. This technique is routinely used in applications such as variant identification, clinical research, and infectious disease surveillance. The amplicon sequencing process begins by designing primers that flank the regions of interest. The DNA sequences are then amplified through PCR (typically multiplex PCR) to produce amplicons complementary to the targets. RNA targets...
    Yesterday, 01:49 PM
  • seqadmin
    Targeted Sequencing: Choosing Between Hybridization Capture and Amplicon Sequencing
    by seqadmin




    Targeted sequencing is an effective way to sequence and analyze specific genomic regions of interest. This method enables researchers to focus their efforts on their desired targets, as opposed to other methods like whole genome sequencing that involve the sequencing of total DNA. Utilizing targeted sequencing is an attractive option for many researchers because it is often faster, more cost-effective, and only generates applicable data. While there are many approaches...
    03-10-2023, 05:31 AM
  • seqadmin
    Expert Advice on Automating Your Library Preparations
    by seqadmin



    Using automation to prepare sequencing libraries isn’t a new concept, and most researchers are aware that there are numerous benefits to automating this process. However, many labs are still hesitant to switch to automation and often believe that it’s not suitable for their lab. To combat these concerns, we’ll cover some of the key advantages, review the most important considerations, and get real-world advice from automation experts to remove any lingering anxieties....
    02-21-2023, 02:14 PM

ad_right_rmr

Collapse

News

Collapse

Topics Statistics Last Post
Started by seqadmin, 03-17-2023, 12:32 PM
0 responses
13 views
0 likes
Last Post seqadmin  
Started by seqadmin, 03-15-2023, 12:42 PM
0 responses
18 views
0 likes
Last Post seqadmin  
Started by seqadmin, 03-09-2023, 10:17 AM
0 responses
68 views
1 like
Last Post seqadmin  
Started by seqadmin, 03-03-2023, 12:03 PM
0 responses
64 views
0 likes
Last Post seqadmin  
Working...
X