Unconfigured Ad

Collapse
X
 
  • Filter
  • Time
  • Show
Clear All
new posts
  • gokhulkrishnakilaru
    Member
    • Jul 2011
    • 39

    gffread segmentation fault

    Hello All,

    I donno what is going wrong with the gffread utility from cufflinks.

    This is the command I have been using

    gffread -w /path/test.fa -g /path/hg19_ucsc.fa /path/test.gtf
    My GTF file is just 4MB with around 48K records. The output fasta file prints only around 42K records with a size of 480MB and then I see this error.

    /path: line 16: 28068 Segmentation fault (core dumped) gffread -w /path/test.fa -g /path/hg19_ucsc.fa /path/test.gtf
    I have another GTF file of size 12MB which is being used to generate a fasta file of size 3.7GB and then it generates the same segmentation error.

    All helps are appreciated.
  • swaraj
    Member
    • Feb 2012
    • 50

    #2
    Fragment your gtf file into five separate chunks and run each with gffread. it may help you unserstand the core of the fault.

    Comment

    • gokhulkrishnakilaru
      Member
      • Jul 2011
      • 39

      #3
      In the third file, I am getting the segmentation fault error. Could you please help me what it means?

      Comment

      • seb567
        Senior Member
        • Jul 2008
        • 260

        #4
        Originally posted by gokhulkrishnakilaru View Post
        In the third file, I am getting the segmentation fault error. Could you please help me what it means?
        A program is represented as segments in a computer's memory. A segmentation fault means that the program tried to access an address that is not located in one of its segments.

        Usually, this means that there is a bug in the software.

        Comment

        • gokhulkrishnakilaru
          Member
          • Jul 2011
          • 39

          #5
          Originally posted by seb567 View Post
          A program is represented as segments in a computer's memory. A segmentation fault means that the program tried to access an address that is not located in one of its segments.

          Usually, this means that there is a bug in the software.
          I broke my large file into chunks of 10000 lines each which left me with 10 files. The program runs fine for 9 files and throws this error for the last file.

          Any thoughts?

          Comment

          • seb567
            Senior Member
            • Jul 2008
            • 260

            #6
            Originally posted by gokhulkrishnakilaru View Post
            I broke my large file into chunks of 10000 lines each which left me with 10 files. The program runs fine for 9 files and throws this error for the last file.

            Any thoughts?
            Is there something unusual in the last compared to the nine others ?

            Comment

            • gokhulkrishnakilaru
              Member
              • Jul 2011
              • 39

              #7
              Originally posted by seb567 View Post
              Is there something unusual in the last compared to the nine others ?
              I am trying to figure out those things. I broke the last file into 1000 line chunks now. And all of them are giving a segmentation fault. I cross checked the end coordinate against the chrom sizes files and all of them seem to be within the limit.

              I don't know whats wrong with it.

              Comment

              • will.jackson
                Junior Member
                • Aug 2012
                • 9

                #8
                I have had the same problem with the filtering parameters of gffread (-J, -V -H etc). Can anyone suggest another program that does a similar thing to gffread? That is, filter transcripts based on CDS features and provide a multi-fasta format sequence file at the end?

                Comment

                • prios
                  Junior Member
                  • Jun 2012
                  • 4

                  #9
                  Hello all,

                  I had the same problem a while ago. I fragment my file into 10 and I got errors in two of them. In both cases I broke the files into smaller ones and (after more splitting) I found located the problem in two a gene with alternative splicing event. In my case the secondary transcript was bigger than the annotated gene sequence (the last exon coordinates were placed outside the gene).

                  I just erased the two features from my original file (I considered that it was not a great loss for my purposes) and it works fine now.

                  Hope it helps,

                  Pablo

                  Comment

                  • will.jackson
                    Junior Member
                    • Aug 2012
                    • 9

                    #10
                    Hi,
                    Thanks for your reply Pablo, I tried doing what you said and I broke down my files and kept getting the segfault even when the file was only 100 lines long(!). I think maybe I have a lot of alternate splicing in my organism (a basidiomycete (fungi)). Does anyone know of a program like gffread that can handle alternate splicing? Or another way I could get around this problem?

                    Thanks very much

                    Will

                    Comment

                    • asperjelly
                      Member
                      • Jan 2013
                      • 11

                      #11
                      I'm having the same problem as Will. I've done the tophat2-cufflinks-cuffmerge pipeline to generate a merged GTF file. However, my organism has fairly high gene density, so cufflinks is predicting very long transcripts, which are not correct. I wanted to filter the merged GTF file using gffread to discard any transcripts that have internal stops (either the -V or -J parameter). However, I keep getting the 'segmentation_fault' error. I have tried to break up the merged GTF file into smaller sizes (such as 1000 lines), however the segmentation error persists. Does anybody know a solution to this problem?

                      Comment

                      Latest Articles

                      Collapse

                      • SEQadmin2
                        Nine Things a Sample Prep Scientist Thinks About Before Sequencing
                        by SEQadmin2


                        I’m not a sequencing expert. I’m a purification scientist who uses NGS to evaluate workflows my group develops. With this perspective, we think about the sample first and the NGS workflow second. The sequencer is an exceptionally honest reporter, but it can only report on what you give it, so whether you get clean, interpretable data from an NGS workflow is largely determined before you begin.

                        Here are nine questions we think about, in roughly the order they matter, before...
                        06-18-2026, 07:11 AM
                      • SEQadmin2
                        From Collection to Sequencing: Why Sample Preparation and Preservation Define Sequencing Data
                        by SEQadmin2


                        Data variability is still an issue in sequencing technologies despite the advances in reproducibility and accuracy of these platforms. But the problem does not originate in the sequencing itself, but in the previous steps, before the sample reaches the sequencer.


                        The first step is collection, followed by preservation and sample preparation for analysis. Most scientists overlook those steps, but not being careful might just be skewing the experiment’s results.
                        ...
                        06-02-2026, 10:05 AM

                      ad_right_rmr

                      Collapse

                      News

                      Collapse

                      Topics Statistics Last Post
                      Started by SEQadmin2, Yesterday, 11:10 AM
                      0 responses
                      7 views
                      0 reactions
                      Last Post SEQadmin2  
                      Started by SEQadmin2, 06-17-2026, 06:09 AM
                      0 responses
                      42 views
                      0 reactions
                      Last Post SEQadmin2  
                      Started by SEQadmin2, 06-09-2026, 11:58 AM
                      0 responses
                      103 views
                      0 reactions
                      Last Post SEQadmin2  
                      Started by SEQadmin2, 06-05-2026, 10:09 AM
                      0 responses
                      125 views
                      0 reactions
                      Last Post SEQadmin2  
                      Working...