Seqanswers Leaderboard Ad

Collapse

Announcement

Collapse
No announcement yet.
X
 
  • Filter
  • Time
  • Show
Clear All
new posts

  • problem with sra toolkit fastq-dump sratoolkit.2.1.10-win64

    Hello,

    I was trying to use fastq-dump to convert a .sra file to .fastq format.

    The data contains paired end reads, so when I type in commands,
    fastq-dump --split-files .sra, it gives two files, *_1.fastq and *_2.fastq.

    But the problem is the size of the two files are different, one is a lot bigger than the other. This doesn't seems to be right.

    Does anyone know why this happens? and how to fix it? Shouldn't the two files contain the same number of reads, so that the size should be the same as well?

    Thank you,

    Hui
    Last edited by hui_shi; 06-15-2012, 05:11 AM.

  • #2
    Could you post the run accession please?

    Comment


    • #3
      Thanks for your reply. I didn't use --accession or -A, don't know if that's what you mean. Actually, I don't understand why using -A, except to modify the output name.

      The full command I typed in was:

      fastq-dump --split-files SRR443885.sra

      it generates two files, SRR443885_1.fastq and SRR443885_2.fastq, but they are of different size.

      Thanks,

      Hui
      Last edited by hui_shi; 06-15-2012, 06:01 AM.

      Comment


      • #4
        The spot descriptor claims that the read length is 150bp, 75 forward and 75 reverse. But the actual read length is 100 bases only. I guess fastq-dump just obeys the rules by splitting the read at 75, so the reverse read becomes 3 times smaller than the forward.

        The run metadata is here:


        The experiment metadata is here:


        You can contact them at sra@ncbi about this.

        Comment


        • #5
          regarding small sra format file

          I wanted to convert .sra file of size 289.98 Mb. when i am using sratool fastq-dump.. it is generating error message as:-



          An error occurred during processing.
          A report was generated into the file '.../ncbi_error_report.xml'.
          If the problem persists, you may consider sending the file


          kindly help me out for this ASAP

          Comment


          • #6
            Did you try looking at what was in the file? Given that it's an error report, perhaps it's informative...

            Comment


            • #7
              Respected dpryan,


              The file is in xml format which on opening with google chrome is giving code. It does'nt have any error specified there in...

              Here in is the file which i am trying to convert to fastq.

              Comment


              • #8
                Try upgrading your copy of the SRA toolkit, that usually fixes this sort of issue (I tried to extract the file you linked and couldn't until I upgraded my local copy of fastq-dump).

                Comment


                • #9
                  kindly tell me how can i upgrade sra tool kit??


                  Is it through verbose??

                  Comment


                  • #10
                    Try here, on NCBI

                    Comment


                    • #11
                      Dear hui_shi,

                      I have the same problem with convert the sra to pair-end fastq.
                      Do you remember how to solve this problem?

                      Thank you very much!!

                      Comment


                      • #12
                        Are you using the latest SRAtoolkit? Have you done the steps described in this post: http://seqanswers.com/forums/showpos...6&postcount=7?

                        Comment


                        • #13
                          Thank you GenoMax,

                          I used the latest version.
                          In that way we need to redownload the file?
                          Thank you! If I can't solve this, may be I need to download again!

                          Best wishes

                          Comment


                          • #14
                            If you have downloaded the .sra file have you tried the command this way?

                            Code:
                            c:\> fastq-dump.exe -F --split-files c:\path_to\SRAfile.sra
                            Sometimes people do submit unequal length data sets so if you have mismatched cycle numbers that is a possibility. What SRA# are you looking at?

                            Comment

                            Latest Articles

                            Collapse

                            • seqadmin
                              Recent Advances in Sequencing Analysis Tools
                              by seqadmin


                              The sequencing world is rapidly changing due to declining costs, enhanced accuracies, and the advent of newer, cutting-edge instruments. Equally important to these developments are improvements in sequencing analysis, a process that converts vast amounts of raw data into a comprehensible and meaningful form. This complex task requires expertise and the right analysis tools. In this article, we highlight the progress and innovation in sequencing analysis by reviewing several of the...
                              05-06-2024, 07:48 AM
                            • seqadmin
                              Essential Discoveries and Tools in Epitranscriptomics
                              by seqadmin




                              The field of epigenetics has traditionally concentrated more on DNA and how changes like methylation and phosphorylation of histones impact gene expression and regulation. However, our increased understanding of RNA modifications and their importance in cellular processes has led to a rise in epitranscriptomics research. “Epitranscriptomics brings together the concepts of epigenetics and gene expression,” explained Adrien Leger, PhD, Principal Research Scientist...
                              04-22-2024, 07:01 AM

                            ad_right_rmr

                            Collapse

                            News

                            Collapse

                            Topics Statistics Last Post
                            Started by seqadmin, Today, 07:03 AM
                            0 responses
                            10 views
                            0 likes
                            Last Post seqadmin  
                            Started by seqadmin, 05-10-2024, 06:35 AM
                            0 responses
                            29 views
                            0 likes
                            Last Post seqadmin  
                            Started by seqadmin, 05-09-2024, 02:46 PM
                            0 responses
                            37 views
                            0 likes
                            Last Post seqadmin  
                            Started by seqadmin, 05-07-2024, 06:57 AM
                            0 responses
                            31 views
                            0 likes
                            Last Post seqadmin  
                            Working...
                            X