Unconfigured Ad

Collapse
X
 
  • Filter
  • Time
  • Show
Clear All
new posts
  • jazz710
    Member
    • Oct 2012
    • 41

    #91
    There is no output yet...

    40 threads, 800G memory.

    Comment

    • GenoMax
      Senior Member
      • Feb 2008
      • 7142

      #92
      I have a feeling the job is hung. Is this under a job scheduler? In my experience, bbduk starts writing output even when running under a scheduler right away.

      Comment

      • jazz710
        Member
        • Oct 2012
        • 41

        #93
        It was submitted to a server with a scheduler, yes.

        It's just odd that it's the exact same code and dataset, but the only difference is hdist=1...same set up with hdist=0 runs fine.

        Comment

        • GenoMax
          Senior Member
          • Feb 2008
          • 7142

          #94
          Are you able to ssh to the node this is job is on and see if the bbduk is running (something like top -H)?

          Comment

          • jazz710
            Member
            • Oct 2012
            • 41

            #95

            Comment

            • GenoMax
              Senior Member
              • Feb 2008
              • 7142

              #96
              The java process (should correspond to bbduk) appears to be sleeping (at least in this screenshot). Does it change back to R periodically or is it staying in S?

              @Brian had talked about memory requirements with hdist=1 before. It may not be applicable in your case.

              Comment

              • jazz710
                Member
                • Oct 2012
                • 41

                #97
                Based on a minute of observation, it appears to be sleeping, however the %CPU and %MEM change as I re-issue 'top'.

                Comment

                • jazz710
                  Member
                  • Oct 2012
                  • 41

                  #98
                  Based on Brian's comments, I bet I have run out of memory. My reference is very large, and a 3*K increase in memory would put me over my server capacity. I will try to re-run with a smaller reference.

                  Comment

                  • GenoMax
                    Senior Member
                    • Feb 2008
                    • 7142

                    #99
                    Unless your scheduler is buffering the entire output in some temp location you should have seen trimmed files right away (have you ever looked before if a job does that)? Did you try top -H to see all ~40 java threads?

                    Based on your last comment that must be the case.

                    Comment

                    • susanklein
                      Senior Member
                      • Feb 2014
                      • 116

                      reads mapped per gene

                      Hi,

                      can you suggest how to get raw reads mapped per gene from bbmap?

                      Thanks,

                      S.

                      Comment

                      • GenoMax
                        Senior Member
                        • Feb 2008
                        • 7142

                        Originally posted by susanklein View Post
                        Hi,

                        can you suggest how to get raw reads mapped per gene from bbmap?

                        Thanks,

                        S.
                        Based on your other posts you appear to have discovered htseq-count and featureCounts. BBMap suite does not have a read counting program (though @Brian has one on the wishlist of things to add to BBMap).

                        Comment

                        • JeanLove
                          Junior Member
                          • Jan 2016
                          • 5

                          hi! is there a way to extract specific sequences from fasta file, regarding their positions in the file using BBMap tools? I know the numbers of their name lines and want to extract whole sequences to a file.

                          Comment

                          • GenoMax
                            Senior Member
                            • Feb 2008
                            • 7142

                            Originally posted by JeanLove View Post
                            hi! is there a way to extract specific sequences from fasta file, regarding their positions in the file using BBMap tools? I know the numbers of their name lines and want to extract whole sequences to a file.
                            It is not clear if you know their identifiers (or just their positions). If you have the identifiers then you can do the following

                            By default, "filterbyname" discards reads with names in your name list, and keeps the rest. To include them and discard the others, do this:

                            Code:
                            $ filterbyname.sh in=003.fastq out=filter003.fq names=names003.txt include=t
                            Even though this example is for fastq files I am reasonsably certain that it would work for fasta. Make sure your file names end in .fa to facilitate that.

                            In case BBMap does not work you can try faSomeRecords utility from Jim Kent @UCSC which is described here: http://seqanswers.com/forums/showthread.php?t=64004

                            Comment

                            • JeanLove
                              Junior Member
                              • Jan 2016
                              • 5

                              Originally posted by GenoMax View Post
                              It is not clear if you know their identifiers (or just their positions). If you have the identifiers then you can do the following[/url]
                              I do not know identifiers..


                              Originally posted by GenoMax View Post
                              In case BBMap does not work you can try faSomeRecords utility from Jim Kent @UCSC which is described here: http://seqanswers.com/forums/showthread.php?t=64004
                              thanks for your help, i'll look faSomeRecords utility up!

                              Comment

                              • Brian Bushnell
                                Super Moderator
                                • Jan 2014
                                • 2709

                                If you know the sequence numbers, you can use getreads.sh:

                                Usage: getreads.sh in=<file> id=<number,number,number...> out=<file>

                                The first read (or pair) has ID 0, the second read (or pair) has ID 1, etc.

                                Parameters:
                                in=<file> Specify the input file, or stdin.
                                out=<file> Specify the output file, or stdout.
                                id= Comma delimited list of numbers or ranges, in any order.
                                For example: id=5,93,17-31,8,0,12-13

                                Comment

                                Latest Articles

                                Collapse

                                • SEQadmin2
                                  From Collection to Sequencing: Why Sample Preparation and Preservation Define Sequencing Data
                                  by SEQadmin2


                                  Data variability is still an issue in sequencing technologies despite the advances in reproducibility and accuracy of these platforms. But the problem does not originate in the sequencing itself, but in the previous steps, before the sample reaches the sequencer.


                                  The first step is collection, followed by preservation and sample preparation for analysis. Most scientists overlook those steps, but not being careful might just be skewing the experiment’s results.
                                  ...
                                  06-02-2026, 10:05 AM
                                • SEQadmin2
                                  Single-Cell Sequencing at an Inflection Point: Early Impacts of New Platforms and Emerging Trends
                                  by SEQadmin2


                                  With the launch of new single-cell sequencing platforms in 2026, the field stands at an exciting inflection point. This article surveys the most impactful advances in the field and discusses how they’re reshaping research in cancer, immunology, and beyond.


                                  Introduction

                                  Single-cell sequencing technologies have undergone remarkable advances over the past decade, transitioning from low-throughput experimental approaches to highly scalable platforms capable of...
                                  05-22-2026, 06:42 AM
                                • SEQadmin2
                                  Environmental Genomics in the Age of NGS: From Microbes to Conservation Strategies
                                  by SEQadmin2

                                  Studying ecosystems means dealing with complex, multi-species communities that are hard to observe at scale. This complexity, however, hides many important questions to be answered, from how biogeochemical cycles work and how climate change can affect species distribution to how conservation strategies can work best.


                                  Genomics, particularly since the expansion of NGS, has transformed ecosystem ecology. By sequencing environmental DNA, we can now assess biodiversity without direct...
                                  05-06-2026, 09:04 AM

                                ad_right_rmr

                                Collapse

                                News

                                Collapse

                                Topics Statistics Last Post
                                Started by SEQadmin2, 06-02-2026, 12:03 PM
                                0 responses
                                21 views
                                0 reactions
                                Last Post SEQadmin2  
                                Started by SEQadmin2, 06-02-2026, 11:40 AM
                                0 responses
                                14 views
                                0 reactions
                                Last Post SEQadmin2  
                                Started by SEQadmin2, 05-28-2026, 11:40 AM
                                0 responses
                                29 views
                                0 reactions
                                Last Post SEQadmin2  
                                Started by SEQadmin2, 05-26-2026, 10:12 AM
                                0 responses
                                31 views
                                0 reactions
                                Last Post SEQadmin2  
                                Working...