  • emanlee
    Member
    • Apr 2013
    • 15

    md5sum got different hash code of SRA files

    Hi all,
    We downloaded ftp://ftp-trace.ncbi.nlm.nih.gov/sra.../SRR308015.sra
    and checked its MD5 checksum with the following command on Linux:
    md5sum SRR308015.sra
    We got this result:
    5205d800cef6eff41beaa94f4a00b0e9
    This differs from the hash values listed for the run in SraRunInfo.csv from SRA:
    RunHash
    0602E4F9BF0EDC10871B6AD6A4136CF4
    ReadHash
    D5D02A4031F17B6CA30ED7EBE52D5654

    Thanks!

    Aimin
  • dariober
    Senior Member
    • May 2010
    • 311

    #2
    I downloaded the .sra file you linked and got the same md5sum:

    5205d800cef6eff41beaa94f4a00b0e9

    Are you sure the hashes you are looking at (RunHash, ReadHash) are MD5 checksums of the file?
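
A minimal sketch of the comparison being attempted here, assuming a POSIX shell with `md5sum` available. The function name `check_md5` is hypothetical. It lowercases the expected value before comparing, because `md5sum` prints lowercase hex while SraRunInfo.csv lists uppercase. The result is only meaningful if the published value really is an MD5 digest of the file.

```shell
# check_md5 FILE EXPECTED_HEX   (hypothetical helper, not part of any toolkit)
# Exit status: 0 = digests match, 1 = mismatch, 2 = md5sum could not read FILE.
# The body runs in a subshell so its variables do not leak into the caller.
check_md5() (
    file=$1
    expected=$2
    # md5sum prints "<digest>  <filename>"; capture it, then keep the digest only.
    actual=$(md5sum -- "$file") || exit 2
    actual=${actual%% *}
    # Normalise the expected value to lowercase hex.
    expected=$(printf '%s' "$expected" | tr 'A-F' 'a-f')
    [ "$actual" = "$expected" ]
)
```

With the values from this thread, `check_md5 SRR308015.sra 0602E4F9BF0EDC10871B6AD6A4136CF4` returns non-zero, since the computed digest is 5205d800cef6eff41beaa94f4a00b0e9.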

    • emanlee
      Member
      • Apr 2013
      • 15

      #3
      Thanks! We are not sure which hash function was used for RunHash and ReadHash.
      We have seen such a post:

      • matheus.cburger
        Junior Member
        • Jun 2011
        • 1

        #4
        Answer from NCBI support:

        "That field you mentioned (RunHash) is a field used in internal file checking that has nothing to do with the MD5 value. In fact, SRA FTP files do not have that value. You will need to use the vdb-validate tool to validate a downloaded SRR#.sra file."

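Following the NCBI suggestion, validating a download with `vdb-validate` (shipped with the SRA Toolkit) might look like the sketch below. The wrapper name `validate_sra` is hypothetical. It assumes `vdb-validate` is on `PATH` and that it reports problems through a non-zero exit status; check the toolkit documentation for the behaviour of your version.

```shell
# validate_sra FILE   (hypothetical wrapper around the SRA Toolkit's vdb-validate)
# Exit status: 127 if vdb-validate is not installed, otherwise whatever
# vdb-validate itself returns for FILE.
validate_sra() (
    sra_file=$1
    if ! command -v vdb-validate >/dev/null 2>&1; then
        echo "vdb-validate not found; install the SRA Toolkit first" >&2
        exit 127
    fi
    # Assumption: a non-zero exit status means the archive failed validation.
    vdb-validate "$sra_file"
)

# Example call (not executed here):
#   validate_sra SRR308015.sra && echo "SRR308015.sra passed validation"
```

Unlike comparing against RunHash, this checks the archive's own internal consistency, so it does not need a separately published checksum.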