Unconfigured Ad

Collapse
X
 
  • Filter
  • Time
  • Show
Clear All
new posts
  • Giancarlo
    Junior Member
    • Sep 2009
    • 6

    454 Raw datas

    Hello!

    I am quite new with 454 sequencing and I just got my first run (raw data). The files are in this format:
    C1226_SV911631_RunID28380309_reads.tgz
    C1226_SV911631_RunID28380309_sff.tgz
    C1226_SV911631_RunID28413700_reads.tgz
    C1226_SV911631_RunID28413700_sff.tgz
    C1226_SV911631_SID16663_assembly.tgz
    C1226_SV911631_SID16664_assembly.tgz

    I have really no idea where to start....we used an external sequencing service and this we get...Does anyone know if I need special software to open the files?
    What do they mean reads.tgz, sff.tgz, and assembly.tgz?
    I really need help for something that you could find very simple and stupid.
    Thanks a lot!
    Giancarlo
  • RCJK
    Senior Member
    • May 2009
    • 156

    #2
    Hi Giancarlo,

    If you are unsure of what files your provider has given you, you should ask them to clarify what they are. I'd say they have given you the reads, the sff files, and the assembly (de novo) files for two region or two samples. The .tgz is a compressed archive (if I am getting the terminology correct), so you should be able to decompress it and more files will be present.

    Comment

    • Giancarlo
      Junior Member
      • Sep 2009
      • 6

      #3
      Hi RCJK,

      thanks for your reply. I also think I have received the reads and assembly. Yes, these are two samples, but when I decompress the files I have at the end this sff file that I am not able to open on a windows operating system. That's why I was wondering if I need some other programs for visualize the files. What I want to see is which sequences are inside...if we got what we want or not
      Thanks again,
      Giancarlo

      Comment

      • nicolallias
        Member
        • Jan 2010
        • 23

        #4
        From a sff you can extract (even on a Windows) fasta and quality files with sff_extract.
        You may also have a look on this page this page.

        Comment

        • ulz_peter
          Senior Member
          • Feb 2010
          • 219

          #5
          Hi Giancarlo,

          You can open sff files within the Software shipped by Roche (which unfortunately only runs in Linux). However, you should be able to open the reads files (usually in Fastq format which you should be able to open using a text editor like Notepad in windows). The only problem could be the size of the files...
          For the assembly files there are several possible format although the Roche software only outputs .ace files. These are text-based as well and also very large. If you do not possess a 64bit Windows system this could be a problem...

          If you want to visualize the assembly you could go with Tablet:
          Link: http://bioinf.scri.ac.uk/tablet/

          it can handle .ace files and runs in windows.

          I hope this helps...
          (Nevertheless, when dealing with NGS data I would suggest changing to Linux as most bioinformatic analysis programs work well in Linux but not in Windows...)

          Could you describe the experiment in more detail (which organism, de-novo sequencing, resequencing, targetted sequencing,...). This has a great influence on the choice of analysis software.

          Comment

          • Giancarlo
            Junior Member
            • Sep 2009
            • 6

            #6
            hi guys,

            thanks for the suggestions. I will check carefully all of these otpions and I will let you know (or I will ask Roche to ship me the software, should be not a problem). Yes, the files are quite big, but I'll try anyway. To be short this is a transcriptome sequencing of a tumor cell line. If you know good and more specific software for RNAseq would be nice to know which one you find best.
            All the best,
            Giancarlo

            Comment

            • joscarhuguet
              Member
              • Feb 2010
              • 18

              #7
              Originally posted by Giancarlo View Post
              hi guys,

              thanks for the suggestions. I will check carefully all of these otpions and I will let you know (or I will ask Roche to ship me the software, should be not a problem). Yes, the files are quite big, but I'll try anyway. To be short this is a transcriptome sequencing of a tumor cell line. If you know good and more specific software for RNAseq would be nice to know which one you find best.
              All the best,
              Giancarlo
              You could extract fasta and quality files from the sff files. Then you could use MIRA assembler. The information here.
              Download MIRA for free. MIRA V5 is available only on GitHub! The V4 version released here on SourceForge stay up as some automated release fetching packages rely on V4.
              Last edited by joscarhuguet; 03-22-2011, 02:37 PM.

              Comment

              Latest Articles

              Collapse

              • GATTACAT
                Reply to Nine Things a Sample Prep Scientist Thinks About Before Sequencing
                by GATTACAT
                Love this - good data definitely starts from good input, and poor input can only give relatively poor data. I particularly like the mention of Nanodrop/absorbance based methods for quantification. It's such a toss up if you'll get an accurate reading or what amounts to a randomly generated number, and a lot of library/sequencing related issues can be traced back to poor quant.
                07-01-2026, 11:43 AM
              • SEQadmin2
                Nine Things a Sample Prep Scientist Thinks About Before Sequencing
                by SEQadmin2


                I’m not a sequencing expert. I’m a purification scientist who uses NGS to evaluate workflows my group develops. With this perspective, we think about the sample first and the NGS workflow second. The sequencer is an exceptionally honest reporter, but it can only report on what you give it, so whether you get clean, interpretable data from an NGS workflow is largely determined before you begin.

                Here are nine questions we think about, in roughly the order they matter, before...
                06-18-2026, 07:11 AM

              ad_right_rmr

              Collapse

              News

              Collapse

              Topics Statistics Last Post
              Started by SEQadmin2, 07-02-2026, 11:08 AM
              0 responses
              7 views
              0 reactions
              Last Post SEQadmin2  
              Started by SEQadmin2, 06-30-2026, 05:37 AM
              0 responses
              12 views
              0 reactions
              Last Post SEQadmin2  
              Started by SEQadmin2, 06-26-2026, 11:10 AM
              0 responses
              20 views
              0 reactions
              Last Post SEQadmin2  
              Started by SEQadmin2, 06-17-2026, 06:09 AM
              0 responses
              54 views
              0 reactions
              Last Post SEQadmin2  
              Working...