Seqanswers Leaderboard Ad

Collapse

Announcement

Collapse
No announcement yet.
X
 
  • Filter
  • Time
  • Show
Clear All
new posts

  • Need help in running FAR - The Flexible Adapter Remover' on Mac OS

    Hi,

    I downloaded "FAR - The Flexible Adapter Remover" from http://sourceforge.net/projects/theflexibleadap/, and installed it in MacOS follwoing the instructions in its "README.TXT" file. The program ran successfully when I used a small fasta sequencing file to test it.

    However, when I really applied it to my real sequencing file (in fasta format), It only processed very few read sequences and then stopped, showing "Terminate called throwing an exceptionabort trap 6".

    Could anyone help to explain what this sentence mean? How to fix it? Thanks

  • #2
    Perhaps you do not have enough memory to process the entire file (you can check on this using a terminal window and "top" while you run FAR). If that is true then you could split your file in pieces and then process the pieces individually.

    Comment


    • #3
      Originally posted by GenoMax View Post
      Perhaps you do not have enough memory to process the entire file (you can check on this using a terminal window and "top" while you run FAR). If that is true then you could split your file in pieces and then process the pieces individually.
      Thank you for your reply. I used "top" but I cannot tell which term I should look at. Here is the terms listed:
      "PID COMMAND %CPU TIME #TH #WQ #POR #MREG RPRVT RSHRD RSIZE VPRVT VSIZE PGRP PPID STATE UID FAULTS COW MSGSENT MSGRECV SYSBSD SYSMACH CSW PAGEINS KPRVT KSHRD USER"

      But based on my observation, I did not see any of them show a significant large number when "Far" was running except "%CPU". Can you tell me what I should look at? Thx!

      Comment


      • #4
        Look for "PhysMem" in the lines that show up above the list you have included below.

        Does PhysMem line end up showing "0M free" (or a near zero number) at some point during processing when the error occurs?

        How much RAM do you have and how big are your sequence files?


        Originally posted by weicy View Post
        Thank you for your reply. I used "top" but I cannot tell which term I should look at. Here is the terms listed:
        "PID COMMAND %CPU TIME #TH #WQ #POR #MREG RPRVT RSHRD RSIZE VPRVT VSIZE PGRP PPID STATE UID FAULTS COW MSGSENT MSGRECV SYSBSD SYSMACH CSW PAGEINS KPRVT KSHRD USER"

        But based on my observation, I did not see any of them show a significant large number when "Far" was running except "%CPU". Can you tell me what I should look at? Thx!

        Comment


        • #5
          Originally posted by GenoMax View Post
          Look for "PhysMem" in the lines that show up above the list you have included below.

          Does PhysMem line end up showing "0M free" (or a near zero number) at some point during processing when the error occurs?

          How much RAM do you have and how big are your sequence files?

          No, PhysMem actually did not change too much, and about 450mb is free when FAR is stopped.

          I am trying to run FAR in a macbook pro, which has 4G RAM. The sequence files is 13GB and it is in fasta format.

          thanks

          Comment


          • #6
            The fasta is 13GB? That is huge! I have used whole genome in fasta format which are less than 2BG.
            Are you sure this file doesn't not include run/quality information also?

            Even trying to commit ~2GB of data may cause a stack overflow on a 4GB Macbook (I presume?), and throw a crash. If you do not have access to a Linux machine with much more RAM you have to chop up your fasta file. Still 13BG is very large for such a simple format.

            Comment


            • #7
              I think the best solution is to split the file and then do the adapter removal on the pieces. Use the "split" command (http://www.computerhope.com/unix/usplit.htm).


              Originally posted by weicy View Post
              No, PhysMem actually did not change too much, and about 450mb is free when FAR is stopped.

              I am trying to run FAR in a macbook pro, which has 4G RAM. The sequence files is 13GB and it is in fasta format.

              thanks

              Comment


              • #8
                Uncompressed HiSeq sample files can be rather large. It is not unusual to have a 13GB fasta file.

                Originally posted by JackieBadger View Post
                The fasta is 13GB? That is huge!. Still 13BG is very large for such a simple format.

                Comment


                • #9
                  Originally posted by GenoMax View Post
                  I think the best solution is to split the file and then do the adapter removal on the pieces. Use the "split" command (http://www.computerhope.com/unix/usplit.htm).
                  Thx, I will try

                  Comment


                  • #10
                    Originally posted by JackieBadger View Post
                    The fasta is 13GB? That is huge! I have used whole genome in fasta format which are less than 2BG.
                    Are you sure this file doesn't not include run/quality information also?

                    Even trying to commit ~2GB of data may cause a stack overflow on a 4GB Macbook (I presume?), and throw a crash. If you do not have access to a Linux machine with much more RAM you have to chop up your fasta file. Still 13BG is very large for such a simple format.
                    13Gb is pretty small for what I do. And my macbook pro with 4Gb handles it pretty well.

                    Can you try a different program other than FAR? Maybe fasta-mcf, cutadapt or the fastx toolkit?

                    Comment


                    • #11
                      Originally posted by GenoMax View Post
                      I think the best solution is to split the file and then do the adapter removal on the pieces. Use the "split" command (http://www.computerhope.com/unix/usplit.htm).
                      I spliced the file and now there is 10m reads per file (around 600mb). The problem still exists. I guess file size is not the key...

                      but thx anyway

                      Comment

                      Latest Articles

                      Collapse

                      • seqadmin
                        Recent Developments in Metagenomics
                        by seqadmin





                        Metagenomics has improved the way researchers study microorganisms across diverse environments. Historically, studying microorganisms relied on culturing them in the lab, a method that limits the investigation of many species since most are unculturable1. Metagenomics overcomes these issues by allowing the study of microorganisms regardless of their ability to be cultured or the environments they inhabit. Over time, the field has evolved, especially with the advent...
                        09-23-2024, 06:35 AM
                      • seqadmin
                        Understanding Genetic Influence on Infectious Disease
                        by seqadmin




                        During the COVID-19 pandemic, scientists observed that while some individuals experienced severe illness when infected with SARS-CoV-2, others were barely affected. These disparities left researchers and clinicians wondering what causes the wide variations in response to viral infections and what role genetics plays.

                        Jean-Laurent Casanova, M.D., Ph.D., Professor at Rockefeller University, is a leading expert in this crossover between genetics and infectious...
                        09-09-2024, 10:59 AM

                      ad_right_rmr

                      Collapse

                      News

                      Collapse

                      Topics Statistics Last Post
                      Started by seqadmin, Yesterday, 04:51 AM
                      0 responses
                      8 views
                      0 likes
                      Last Post seqadmin  
                      Started by seqadmin, 10-01-2024, 07:10 AM
                      0 responses
                      12 views
                      0 likes
                      Last Post seqadmin  
                      Started by seqadmin, 09-30-2024, 08:33 AM
                      0 responses
                      16 views
                      0 likes
                      Last Post seqadmin  
                      Started by seqadmin, 09-26-2024, 12:57 PM
                      0 responses
                      16 views
                      0 likes
                      Last Post seqadmin  
                      Working...
                      X