Seqanswers Leaderboard Ad

Collapse

Announcement

Collapse
No announcement yet.
X
 
  • Filter
  • Time
  • Show
Clear All
new posts

  • need a small illumina run data for practice

    Could someone please be so kind as to give me some illumina data (preferably miseq/hiseq) to play with? I'm trying to process the raw data with CASAVA, but the runs I have seem to be faulty, because they miss all kind of information files.
    Of course , it should only be small runs , since large ones would be impossible to download via network.
    I would be infinitely grateful, I really need to learn to work with illumina data.

  • #2
    You should check with your local illumina field applications scientist for help. They should be able to get you a copy from another institution that is local.

    That said why do you think the copies you have are faulty? Are you getting errors when trying to run CASAVA?

    Comment


    • #3
      Yep, plenty of errors while trying to convert .bcl to fasta. First, it says that samplesheet.csv doesn't exist, when I create one and try to run,then it says that it cannot find bclconverter.cpp(although CASAVA has been configured, built and installed properly), then that .clocs files are missing (now where do I take them? I only have .locs, filter, .bcl,.control, .stats,).
      I think that CASAVA is up-todate (1.8.2) and the runs are >2 years old (dated 2011)

      Comment


      • #4
        Is the data folder that you have access to a complete copy as made by the instrument?

        Depending on the RTA version used (make sure that your folder has the RunInfo.xml and config.xml files) the BCL to FASTQ converter is supposed to use the right position files (.clocs or .locs).

        You can explicitly provide (--positions-format .locs) option to the configureBclToFastq.pl command and see if that works.

        Comment


        • #5
          Yeah , I made sure the file tree matches the one pointed out in the user guide. But since the config file was missing , I had to take one from a different run. When it didn't work I made one myself. It worked to some extent, but well, I can't be sure that the absence of the original config file doesn't screw the following process.
          That's why I'd like to have a nice good run, preferably multiplex to practice demultiplexing as well, though it isn't supposed to be difficult. Well it all isn't supposed to be difficult but somehow I can't figure it out.

          Comment


          • #6
            If you don't have the full flowcell folder then you are likely to run into issues. The XML files store run related information that is needed for downstream analysis (as you discovered).

            There are a couple of data sets (not sure if they are complete) included in the CASAVA install (they should be under /casava-1.8.2/src/CASAVA_v1.8.2/data/share/examples/Validation/ directory). Look into those while you locate a data set.

            Comment

            Latest Articles

            Collapse

            • seqadmin
              The Impact of AI in Genomic Medicine
              by seqadmin



              Artificial intelligence (AI) has evolved from a futuristic vision to a mainstream technology, highlighted by the introduction of tools like OpenAI's ChatGPT and Google's Gemini. In recent years, AI has become increasingly integrated into the field of genomics. This integration has enabled new scientific discoveries while simultaneously raising important ethical questions1. Interviews with two researchers at the center of this intersection provide insightful perspectives into...
              02-26-2024, 02:07 PM
            • seqadmin
              Multiomics Techniques Advancing Disease Research
              by seqadmin


              New and advanced multiomics tools and technologies have opened new avenues of research and markedly enhanced various disciplines such as disease research and precision medicine1. The practice of merging diverse data from various ‘omes increasingly provides a more holistic understanding of biological systems. As Maddison Masaeli, Co-Founder and CEO at Deepcell, aptly noted, “You can't explain biology in its complex form with one modality.”

              A major leap in the field has
              ...
              02-08-2024, 06:33 AM

            ad_right_rmr

            Collapse

            News

            Collapse

            Topics Statistics Last Post
            Started by seqadmin, 02-28-2024, 06:12 AM
            0 responses
            25 views
            0 likes
            Last Post seqadmin  
            Started by seqadmin, 02-23-2024, 04:11 PM
            0 responses
            72 views
            0 likes
            Last Post seqadmin  
            Started by seqadmin, 02-21-2024, 08:52 AM
            0 responses
            80 views
            0 likes
            Last Post seqadmin  
            Started by seqadmin, 02-20-2024, 08:57 AM
            0 responses
            69 views
            0 likes
            Last Post seqadmin  
            Working...
            X