  • Bowtie output zipped but not zipped

    I recently used Bowtie2 to separate out reads mapping to a set of genomes, which works fine. However, due to the size of my current dataset, I had to use gzipped input and gzipped output, using the command below:

    bowtie2 -f --threads 8THREADS -N 1 --un-gz NZ_bowtie_nonaligned_hungate_thread.done --al-gz NZ_bowtie_aligned_hungate_thread.done -x /ibers/ernie/home/thh32/thh32/mcCabe_qualtrimmed_trimmed5P_files_.fastq/Hungate1000_index_bowtie -U /ibers/ernie/scratch/thh32/alldata.fa.gz -S /dev/null

    The process itself worked fine except for the output compression. The output files were not recognized as gzipped and so could not be decompressed, yet the data inside looked like this:

    �95j����q�?�k�6��t��<^L����N� ��m��(N��k9ߟ4�|^��!�\/�߹��M����33���}^�6���۩s�}wsUo/��|.e^6{��\��Sf��sz̋��A���9H�%WLO���u~

    Is there any way to convert this back into FASTA format?

    Any help would be greatly appreciated.

    Many thanks,
    Tom

  • #2
    That looks a lot like compressed data to me. Remember that "zipped" here means gzip, not another zip variant. If you want to decompress it, just use gunzip.



    • #3
      I thought it looked like compressed data, so I tried gunzip, but gunzip wouldn't accept the file as gzipped and gave this error:

      gzip: NZ_bowtie_nonaligned_hungate_thread.done: unknown suffix -- ignored

      And unzip gave this response:

      Archive: NZ_bowtie_nonaligned_hungate_thread.done
      End-of-central-directory signature not found. Either this file is not
      a zipfile, or it constitutes one disk of a multi-part archive. In the
      latter case the central directory and zipfile comment will be found on
      the last disk(s) of this archive.



      • #4
        Managed to sort it out by running mv FILE FILE.gz; the file was then accepted and could be unzipped.



        • #5
          You just have to change the file name, since that's what gunzip is complaining about (the suffix in this case is "done", rather than "gz"). Try calling it "NZ_bowtie_nonaligned_hungate_thread.done.gz".

          Edit: Ah, you just figured that out, good
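For anyone finding this later, here is a minimal sketch of the rename fix. The file name demo_thread.done and its contents are made up for illustration; they only stand in for the real Bowtie2 output, which is gzip data stored under a name that does not end in ".gz".

```shell
#!/bin/sh
set -e

# Hypothetical stand-in for the Bowtie2 --un-gz output:
# gzip-compressed FASTA saved under a name without a .gz suffix.
workdir=$(mktemp -d)
printf '>read1\nACGT\n' | gzip -c > "$workdir/demo_thread.done"

# gunzip rejects the unknown ".done" suffix, so add ".gz" first.
mv "$workdir/demo_thread.done" "$workdir/demo_thread.done.gz"

# gunzip strips the ".gz" again and leaves plain FASTA in demo_thread.done.
gunzip "$workdir/demo_thread.done.gz"

restored=$(cat "$workdir/demo_thread.done")
printf '%s\n' "$restored"
```

Naming the outputs with a ".gz" suffix in the original command (for example passing a name ending in ".fa.gz" to --un-gz) should avoid the rename step altogether.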



          • #6
            One other method, aside from renaming the file, is to use 'zcat'. As the manual says:

            zcat will uncompress files that have the correct magic number whether they have a .gz suffix or not.
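A rough sketch of the suffix-independent route, again using a made-up file (demo_thread.done) in place of the real output. It first checks the two-byte gzip magic number (1f 8b) and then decompresses to standard output. The sketch uses gzip -dc, which GNU gzip documents as identical to zcat; on some non-GNU systems zcat behaves differently, so treat that equivalence as an assumption about your platform.

```shell
#!/bin/sh
set -e

# Hypothetical stand-in for the Bowtie2 output: gzip data without a .gz suffix.
workdir=$(mktemp -d)
printf '>read1\nACGT\n' | gzip -c > "$workdir/demo_thread.done"

# Every gzip stream starts with the bytes 1f 8b; read them as hex.
magic=$(head -c 2 "$workdir/demo_thread.done" | od -An -tx1 | tr -d ' \n')
echo "magic bytes: $magic"

# Writing to stdout skips the suffix check, so no rename is needed
# (gzip -dc is the same operation as zcat on GNU systems).
gzip -dc "$workdir/demo_thread.done" > "$workdir/demo.fa"
cat "$workdir/demo.fa"
```

Because the compressed file is left untouched, this is also handy when the data is too large to keep both compressed and uncompressed copies: the output can be piped straight into the next tool.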
