Unconfigured Ad

Collapse
X
 
  • Filter
  • Time
  • Show
Clear All
new posts
  • ronaldrcutler
    Member
    • May 2016
    • 80

    hisat2 multiple thread usage

    Hello all, I am trying to switch over to using hisat2 instead of tophat. I have a couple questions:

    1. It seems that we are able to input a list of mates for -1 & -2. Would this be referring to inputing multiple paired reads of the same sample in to be aligned at once? For example I have 8 .fastq files (4 paired reads) - how would the input look like?

    2. How do I create a specific output file and directory? Would this be using the -S flag and inputing a file name after that? I see that the default output is .sam, is there a way to output .bam files?

    3. I tried to use the multithread option, -p/--threads 4, (as I have 4 cores) but got this error: -p/--threads arg must be at least 1. Any help with this?
  • dpryan
    Devon Ryan
    • Jul 2011
    • 3478

    #2
    1. hisat2 -x something -1 sample_1_1.fq,sample_1_2.fq -2 sample_2_1.fq,sample_2_2.fq
    2. Redirect output to a file in a directory that's already created. Hisat2 won't create directories for you.
    3. Use "-p 4" or "--nthreads 4". The "/" in the documentation indicates that things on either side are the same.

    Comment

    • ronaldrcutler
      Member
      • May 2016
      • 80

      #3
      Thank you very much. Just to be clear:
      1. This means that multiple lanes in each sample can all be run at once?
      2. To redirect output to a file in a directory I would use: -S Results.sam? Any way to output a .bam, or would I just have to use samtools to do this?
      3.This will make things go much faster, thanks. What is your opinion at running multiple instances of hisat2 (on separate command line windows) at once to run multiple samples simultaneously?

      Comment

      • dpryan
        Devon Ryan
        • Jul 2011
        • 3478

        #4
        1. Yup
        2. hisat ...options... | samtools -Sbo output.bam -
        3. We commonly run a few instances at once with 20-30 threads each (on cluster nodes with 64 cores each). At some point you'll saturate disk I/O, but it'll probably take a while.

        Comment

        • ronaldrcutler
          Member
          • May 2016
          • 80

          #5
          So I have been trying to run multiple lanes within a sample on hisat2 using this command:
          Code:
          hisat2 -q  -p 4 -x Xenopus_Laevis -1 /Volumes/cachannel/RNA_SEQ/Notch_RNASeq/in_silico_test/Sample_1/Results_1_ATCACG_L005_R1_001.fastq, /Volumes/cachannel/RNA_SEQ/Notch_RNASeq/in_silico_test/Sample_1/Results_1_ATCACG_L006_R1_001.fastq, /Volumes/cachannel/RNA_SEQ/Notch_RNASeq/in_silico_test/Sample_1/Results_1_ATCACG_L007_R1_001.fastq, /Volumes/cachannel/RNA_SEQ/Notch_RNASeq/in_silico_test/Sample_1/Results_1_ATCACG_L008_R1_001.fastq -2 /Volumes/cachannel/RNA_SEQ/Notch_RNASeq/in_silico_test/Sample_1/Results_1_ATCACG_L005_R2_001.fastq, /Volumes/cachannel/RNA_SEQ/Notch_RNASeq/in_silico_test/Sample_1/Results_1_ATCACG_L006_R2_001.fastq, /Volumes/cachannel/RNA_SEQ/Notch_RNASeq/in_silico_test/Sample_1/Results_1_ATCACG_L007_R2_001.fastq, /Volumes/cachannel/RNA_SEQ/Notch_RNASeq/in_silico_test/Sample_1/Results_1_ATCACG_L008_R2_001.fastq -S Sample_1_hisat2_results.sam
          These are paired reads as R1 for each read indicates mate 1 and R2 indicates mate 2. When I try and run this, I am getting this error:

          Code:
          Note that if <mates> files are specified using -1/-2, a <singles> file cannot
          also be specified.  Please run bowtie separately for mates and singles.
          Error: Encountered internal HISAT2 exception (#1)
          This doesn't make sense as these are all mates and not single reads. Any help with this?

          Comment

          • GenoMax
            Senior Member
            • Feb 2008
            • 7142

            #6
            Pretty sure you can't have spaces between the R1/R2 read file names.

            Try

            Code:
            hisat2 -q  -p 4 -x Xenopus_Laevis -1 /Volumes/cachannel/RNA_SEQ/Notch_RNASeq/in_silico_test/Sample_1/Results_1_ATCACG_L005_R1_001.fastq,/Volumes/cachannel/RNA_SEQ/Notch_RNASeq/in_silico_test/Sample_1/Results_1_ATCACG_L006_R1_001.fastq,/Volumes/cachannel/RNA_SEQ/Notch_RNASeq/in_silico_test/Sample_1/Results_1_ATCACG_L007_R1_001.fastq,/Volumes/cachannel/RNA_SEQ/Notch_RNASeq/in_silico_test/Sample_1/Results_1_ATCACG_L008_R1_001.fastq -2 /Volumes/cachannel/RNA_SEQ/Notch_RNASeq/in_silico_test/Sample_1/Results_1_ATCACG_L005_R2_001.fastq,/Volumes/cachannel/RNA_SEQ/Notch_RNASeq/in_silico_test/Sample_1/Results_1_ATCACG_L006_R2_001.fastq,/Volumes/cachannel/RNA_SEQ/Notch_RNASeq/in_silico_test/Sample_1/Results_1_ATCACG_L007_R2_001.fastq,/Volumes/cachannel/RNA_SEQ/Notch_RNASeq/in_silico_test/Sample_1/Results_1_ATCACG_L008_R2_001.fastq -S Sample_1_hisat2_results.sam

            Comment

            • ronaldrcutler
              Member
              • May 2016
              • 80

              #7
              Works perfectly, thanks a lot.

              Comment

              Latest Articles

              Collapse

              • SEQadmin2
                Nine Things a Sample Prep Scientist Thinks About Before Sequencing
                by SEQadmin2


                I’m not a sequencing expert. I’m a purification scientist who uses NGS to evaluate workflows my group develops. With this perspective, we think about the sample first and the NGS workflow second. The sequencer is an exceptionally honest reporter, but it can only report on what you give it, so whether you get clean, interpretable data from an NGS workflow is largely determined before you begin.

                Here are nine questions we think about, in roughly the order they matter, before...
                06-18-2026, 07:11 AM
              • SEQadmin2
                From Collection to Sequencing: Why Sample Preparation and Preservation Define Sequencing Data
                by SEQadmin2


                Data variability is still an issue in sequencing technologies despite the advances in reproducibility and accuracy of these platforms. But the problem does not originate in the sequencing itself, but in the previous steps, before the sample reaches the sequencer.


                The first step is collection, followed by preservation and sample preparation for analysis. Most scientists overlook those steps, but not being careful might just be skewing the experiment’s results.
                ...
                06-02-2026, 10:05 AM

              ad_right_rmr

              Collapse

              News

              Collapse

              Topics Statistics Last Post
              Started by SEQadmin2, 06-26-2026, 11:10 AM
              0 responses
              12 views
              0 reactions
              Last Post SEQadmin2  
              Started by SEQadmin2, 06-17-2026, 06:09 AM
              0 responses
              46 views
              0 reactions
              Last Post SEQadmin2  
              Started by SEQadmin2, 06-09-2026, 11:58 AM
              0 responses
              106 views
              0 reactions
              Last Post SEQadmin2  
              Started by SEQadmin2, 06-05-2026, 10:09 AM
              0 responses
              125 views
              0 reactions
              Last Post SEQadmin2  
              Working...