Unconfigured Ad

Collapse
X
 
  • Filter
  • Time
  • Show
Clear All
new posts
  • hui_shi
    Junior Member
    • Oct 2010
    • 6

    problem with sra toolkit fastq-dump sratoolkit.2.1.10-win64

    Hello,

    I was trying to use fastq-dump to convert a .sra file to .fastq format.

    The data contains paired end reads, so when I type in commands,
    fastq-dump --split-files .sra, it gives two files, *_1.fastq and *_2.fastq.

    But the problem is the size of the two files are different, one is a lot bigger than the other. This doesn't seems to be right.

    Does anyone know why this happens? and how to fix it? Shouldn't the two files contain the same number of reads, so that the size should be the same as well?

    Thank you,

    Hui
    Last edited by hui_shi; 06-15-2012, 05:11 AM.
  • vadim
    Member
    • Sep 2009
    • 37

    #2
    Could you post the run accession please?

    Comment

    • hui_shi
      Junior Member
      • Oct 2010
      • 6

      #3
      Thanks for your reply. I didn't use --accession or -A, don't know if that's what you mean. Actually, I don't understand why using -A, except to modify the output name.

      The full command I typed in was:

      fastq-dump --split-files SRR443885.sra

      it generates two files, SRR443885_1.fastq and SRR443885_2.fastq, but they are of different size.

      Thanks,

      Hui
      Last edited by hui_shi; 06-15-2012, 06:01 AM.

      Comment

      • vadim
        Member
        • Sep 2009
        • 37

        #4
        The spot descriptor claims that the read length is 150bp, 75 forward and 75 reverse. But the actual read length is 100 bases only. I guess fastq-dump just obeys the rules by splitting the read at 75, so the reverse read becomes 3 times smaller than the forward.

        The run metadata is here:


        The experiment metadata is here:


        You can contact them at sra@ncbi about this.

        Comment

        • emp
          Member
          • Jan 2014
          • 11

          #5
          regarding small sra format file

          I wanted to convert .sra file of size 289.98 Mb. when i am using sratool fastq-dump.. it is generating error message as:-



          An error occurred during processing.
          A report was generated into the file '.../ncbi_error_report.xml'.
          If the problem persists, you may consider sending the file


          kindly help me out for this ASAP

          Comment

          • dpryan
            Devon Ryan
            • Jul 2011
            • 3478

            #6
            Did you try looking at what was in the file? Given that it's an error report, perhaps it's informative...

            Comment

            • emp
              Member
              • Jan 2014
              • 11

              #7
              Respected dpryan,


              The file is in xml format which on opening with google chrome is giving code. It does'nt have any error specified there in...

              Here in is the file which i am trying to convert to fastq.

              Comment

              • dpryan
                Devon Ryan
                • Jul 2011
                • 3478

                #8
                Try upgrading your copy of the SRA toolkit, that usually fixes this sort of issue (I tried to extract the file you linked and couldn't until I upgraded my local copy of fastq-dump).

                Comment

                • emp
                  Member
                  • Jan 2014
                  • 11

                  #9
                  kindly tell me how can i upgrade sra tool kit??


                  Is it through verbose??

                  Comment

                  • dpryan
                    Devon Ryan
                    • Jul 2011
                    • 3478

                    #10
                    Try here, on NCBI

                    Comment

                    • tw7649116
                      Junior Member
                      • Mar 2014
                      • 8

                      #11
                      Dear hui_shi,

                      I have the same problem with convert the sra to pair-end fastq.
                      Do you remember how to solve this problem?

                      Thank you very much!!

                      Comment

                      • GenoMax
                        Senior Member
                        • Feb 2008
                        • 7142

                        #12
                        Are you using the latest SRAtoolkit? Have you done the steps described in this post: http://seqanswers.com/forums/showpos...6&postcount=7?

                        Comment

                        • tw7649116
                          Junior Member
                          • Mar 2014
                          • 8

                          #13
                          Thank you GenoMax,

                          I used the latest version.
                          In that way we need to redownload the file?
                          Thank you! If I can't solve this, may be I need to download again!

                          Best wishes

                          Comment

                          • GenoMax
                            Senior Member
                            • Feb 2008
                            • 7142

                            #14
                            If you have downloaded the .sra file have you tried the command this way?

                            Code:
                            c:\> fastq-dump.exe -F --split-files c:\path_to\SRAfile.sra
                            Sometimes people do submit unequal length data sets so if you have mismatched cycle numbers that is a possibility. What SRA# are you looking at?

                            Comment

                            Latest Articles

                            Collapse

                            • SEQadmin2
                              From Collection to Sequencing: Why Sample Preparation and Preservation Define Sequencing Data
                              by SEQadmin2


                              Data variability is still an issue in sequencing technologies despite the advances in reproducibility and accuracy of these platforms. But the problem does not originate in the sequencing itself, but in the previous steps, before the sample reaches the sequencer.


                              The first step is collection, followed by preservation and sample preparation for analysis. Most scientists overlook those steps, but not being careful might just be skewing the experiment’s results.
                              ...
                              06-02-2026, 10:05 AM
                            • SEQadmin2
                              Single-Cell Sequencing at an Inflection Point: Early Impacts of New Platforms and Emerging Trends
                              by SEQadmin2


                              With the launch of new single-cell sequencing platforms in 2026, the field stands at an exciting inflection point. This article surveys the most impactful advances in the field and discusses how they’re reshaping research in cancer, immunology, and beyond.


                              Introduction

                              Single-cell sequencing technologies have undergone remarkable advances over the past decade, transitioning from low-throughput experimental approaches to highly scalable platforms capable of...
                              05-22-2026, 06:42 AM
                            • SEQadmin2
                              Environmental Genomics in the Age of NGS: From Microbes to Conservation Strategies
                              by SEQadmin2

                              Studying ecosystems means dealing with complex, multi-species communities that are hard to observe at scale. This complexity, however, hides many important questions to be answered, from how biogeochemical cycles work and how climate change can affect species distribution to how conservation strategies can work best.


                              Genomics, particularly since the expansion of NGS, has transformed ecosystem ecology. By sequencing environmental DNA, we can now assess biodiversity without direct...
                              05-06-2026, 09:04 AM

                            ad_right_rmr

                            Collapse

                            News

                            Collapse

                            Topics Statistics Last Post
                            Started by SEQadmin2, 06-02-2026, 12:03 PM
                            0 responses
                            19 views
                            0 reactions
                            Last Post SEQadmin2  
                            Started by SEQadmin2, 06-02-2026, 11:40 AM
                            0 responses
                            14 views
                            0 reactions
                            Last Post SEQadmin2  
                            Started by SEQadmin2, 05-28-2026, 11:40 AM
                            0 responses
                            29 views
                            0 reactions
                            Last Post SEQadmin2  
                            Started by SEQadmin2, 05-26-2026, 10:12 AM
                            0 responses
                            31 views
                            0 reactions
                            Last Post SEQadmin2  
                            Working...