Seqanswers Leaderboard Ad

Collapse

Announcement

Collapse
No announcement yet.
X
 
  • Filter
  • Time
  • Show
Clear All
new posts

  • Need help in running FAR - The Flexible Adapter Remover' on Mac OS

    Hi,

    I downloaded "FAR - The Flexible Adapter Remover" from http://sourceforge.net/projects/theflexibleadap/, and installed it in MacOS follwoing the instructions in its "README.TXT" file. The program ran successfully when I used a small fasta sequencing file to test it.

    However, when I really applied it to my real sequencing file (in fasta format), It only processed very few read sequences and then stopped, showing "Terminate called throwing an exceptionabort trap 6".

    Could anyone help to explain what this sentence mean? How to fix it? Thanks

  • #2
    Perhaps you do not have enough memory to process the entire file (you can check on this using a terminal window and "top" while you run FAR). If that is true then you could split your file in pieces and then process the pieces individually.

    Comment


    • #3
      Originally posted by GenoMax View Post
      Perhaps you do not have enough memory to process the entire file (you can check on this using a terminal window and "top" while you run FAR). If that is true then you could split your file in pieces and then process the pieces individually.
      Thank you for your reply. I used "top" but I cannot tell which term I should look at. Here is the terms listed:
      "PID COMMAND %CPU TIME #TH #WQ #POR #MREG RPRVT RSHRD RSIZE VPRVT VSIZE PGRP PPID STATE UID FAULTS COW MSGSENT MSGRECV SYSBSD SYSMACH CSW PAGEINS KPRVT KSHRD USER"

      But based on my observation, I did not see any of them show a significant large number when "Far" was running except "%CPU". Can you tell me what I should look at? Thx!

      Comment


      • #4
        Look for "PhysMem" in the lines that show up above the list you have included below.

        Does PhysMem line end up showing "0M free" (or a near zero number) at some point during processing when the error occurs?

        How much RAM do you have and how big are your sequence files?


        Originally posted by weicy View Post
        Thank you for your reply. I used "top" but I cannot tell which term I should look at. Here is the terms listed:
        "PID COMMAND %CPU TIME #TH #WQ #POR #MREG RPRVT RSHRD RSIZE VPRVT VSIZE PGRP PPID STATE UID FAULTS COW MSGSENT MSGRECV SYSBSD SYSMACH CSW PAGEINS KPRVT KSHRD USER"

        But based on my observation, I did not see any of them show a significant large number when "Far" was running except "%CPU". Can you tell me what I should look at? Thx!

        Comment


        • #5
          Originally posted by GenoMax View Post
          Look for "PhysMem" in the lines that show up above the list you have included below.

          Does PhysMem line end up showing "0M free" (or a near zero number) at some point during processing when the error occurs?

          How much RAM do you have and how big are your sequence files?

          No, PhysMem actually did not change too much, and about 450mb is free when FAR is stopped.

          I am trying to run FAR in a macbook pro, which has 4G RAM. The sequence files is 13GB and it is in fasta format.

          thanks

          Comment


          • #6
            The fasta is 13GB? That is huge! I have used whole genome in fasta format which are less than 2BG.
            Are you sure this file doesn't not include run/quality information also?

            Even trying to commit ~2GB of data may cause a stack overflow on a 4GB Macbook (I presume?), and throw a crash. If you do not have access to a Linux machine with much more RAM you have to chop up your fasta file. Still 13BG is very large for such a simple format.

            Comment


            • #7
              I think the best solution is to split the file and then do the adapter removal on the pieces. Use the "split" command (http://www.computerhope.com/unix/usplit.htm).


              Originally posted by weicy View Post
              No, PhysMem actually did not change too much, and about 450mb is free when FAR is stopped.

              I am trying to run FAR in a macbook pro, which has 4G RAM. The sequence files is 13GB and it is in fasta format.

              thanks

              Comment


              • #8
                Uncompressed HiSeq sample files can be rather large. It is not unusual to have a 13GB fasta file.

                Originally posted by JackieBadger View Post
                The fasta is 13GB? That is huge!. Still 13BG is very large for such a simple format.

                Comment


                • #9
                  Originally posted by GenoMax View Post
                  I think the best solution is to split the file and then do the adapter removal on the pieces. Use the "split" command (http://www.computerhope.com/unix/usplit.htm).
                  Thx, I will try

                  Comment


                  • #10
                    Originally posted by JackieBadger View Post
                    The fasta is 13GB? That is huge! I have used whole genome in fasta format which are less than 2BG.
                    Are you sure this file doesn't not include run/quality information also?

                    Even trying to commit ~2GB of data may cause a stack overflow on a 4GB Macbook (I presume?), and throw a crash. If you do not have access to a Linux machine with much more RAM you have to chop up your fasta file. Still 13BG is very large for such a simple format.
                    13Gb is pretty small for what I do. And my macbook pro with 4Gb handles it pretty well.

                    Can you try a different program other than FAR? Maybe fasta-mcf, cutadapt or the fastx toolkit?

                    Comment


                    • #11
                      Originally posted by GenoMax View Post
                      I think the best solution is to split the file and then do the adapter removal on the pieces. Use the "split" command (http://www.computerhope.com/unix/usplit.htm).
                      I spliced the file and now there is 10m reads per file (around 600mb). The problem still exists. I guess file size is not the key...

                      but thx anyway

                      Comment

                      Latest Articles

                      Collapse

                      • seqadmin
                        Essential Discoveries and Tools in Epitranscriptomics
                        by seqadmin




                        The field of epigenetics has traditionally concentrated more on DNA and how changes like methylation and phosphorylation of histones impact gene expression and regulation. However, our increased understanding of RNA modifications and their importance in cellular processes has led to a rise in epitranscriptomics research. “Epitranscriptomics brings together the concepts of epigenetics and gene expression,” explained Adrien Leger, PhD, Principal Research Scientist...
                        04-22-2024, 07:01 AM
                      • seqadmin
                        Current Approaches to Protein Sequencing
                        by seqadmin


                        Proteins are often described as the workhorses of the cell, and identifying their sequences is key to understanding their role in biological processes and disease. Currently, the most common technique used to determine protein sequences is mass spectrometry. While still a valuable tool, mass spectrometry faces several limitations and requires a highly experienced scientist familiar with the equipment to operate it. Additionally, other proteomic methods, like affinity assays, are constrained...
                        04-04-2024, 04:25 PM

                      ad_right_rmr

                      Collapse

                      News

                      Collapse

                      Topics Statistics Last Post
                      Started by seqadmin, 04-25-2024, 11:49 AM
                      0 responses
                      20 views
                      0 likes
                      Last Post seqadmin  
                      Started by seqadmin, 04-24-2024, 08:47 AM
                      0 responses
                      20 views
                      0 likes
                      Last Post seqadmin  
                      Started by seqadmin, 04-11-2024, 12:08 PM
                      0 responses
                      62 views
                      0 likes
                      Last Post seqadmin  
                      Started by seqadmin, 04-10-2024, 10:19 PM
                      0 responses
                      61 views
                      0 likes
                      Last Post seqadmin  
                      Working...
                      X