Header Leaderboard Ad

Collapse

Subsampling Fastq

Collapse

Announcement

Collapse
No announcement yet.
X
 
  • Filter
  • Time
  • Show
Clear All
new posts

  • Subsampling Fastq

    Anyone know an easy way to randomly subsample from a fastq (Illumina data)?

    ShortReads seems to have a function FastqSampler, but I can't seem to make it work (?FastqSampler gives No documentation for 'FastqSampler' in specified packages and libraries:
    you could try '??FastqSampler')

    Thanks!

    Chris

  • #2
    i don't think you loaded ShortRead
    start R
    > R
    in R:
    > install.packages("ShortRead")
    > library("ShortRead")
    > ?FastqSampler
    --
    Jeremy Leipzig
    Bioinformatics Programmer
    --
    My blog
    Twitter

    Comment


    • #3
      haha, I'm a bit of a newbie but not quite THAT bad...

      I was testing it out on my Mac/GUI version of R. That does not seem to work.

      Linux version seems fine. Maybe I'm still missing something ... but thats my guess as to what was going on.

      Thanks though!

      Comment


      • #4
        exporting

        Ok, followup question...

        Once I've done

        Code:
        donkey <- FastqSampler(con, n=1e6, readerBlockSize=1e8, verbose=FALSE)
        How do I write the sampled sequences to a fastq file?

        THanks, and apologies if this obvious.

        Chris

        Comment


        • #5
          You can also use the mothur package which has a similar subsampling feature. The examples shows fasta only but I think it can handle paired fasta/qual files.

          Comment


          • #6
            Hi chrisbala,
            in ShortRead package ( manual :
            http://www.bioconductor.org/packages.../ShortRead.pdf
            )

            there is writeFastq method that looks like what you need:
            from manual:

            writeFastq signature(object = "ShortReadQ", file = "character", mode="character",
            ...): Write object to file in fastq format. mode defaults to ‘w’. This creates a new
            file, or fails if file already exists. Use mode="a" to append to an existing file. file is
            expanded using path.expand.


            hope it helps
            Last edited by pbseq; 01-11-2011, 06:04 AM.

            Comment


            • #7
              On a related note, I was trying to reduce my data size by using khmer, from a paper on Digitial Normalization. Does anyone have experience with this tool? I am getting an error when using it where it says I have no paired reads ... but I do. Are there simliar tools?

              Comment


              • #8
                @shanebrubaker:

                I suspect that you will get a better response if you ask your questing in a different thread with a title that contains "Digital Normalization" instead of burying it inside this thread. As for the answer to your question, I do not know. I am just starting to explore the program and may find out the answer later today.

                Comment


                • #9
                  i need to subsampling my fastq file of Iontorrent shotgun reads. I install R-2.15 and going by the previous suggesitios in this thread i tried to install ShortRead by > install.packages("ShortRead") but get the folowing error
                  Warning message:
                  In getDependencies(pkgs, dependencies, available, lib) :
                  package ‘Shortread’ is not available

                  ne help regarding this and if any other ways to random sampling of my fastq file. Thanx in advannce

                  Regards
                  Chayan

                  Comment


                  • #10
                    Chayan: see this thread for other options in case the R solution does not work: http://seqanswers.com/forums/showthread.php?t=16505

                    Comment

                    Latest Articles

                    Collapse

                    • seqadmin
                      How RNA-Seq is Transforming Cancer Studies
                      by seqadmin



                      Cancer research has been transformed through numerous molecular techniques, with RNA sequencing (RNA-seq) playing a crucial role in understanding the complexity of the disease. Maša Ivin, Ph.D., Scientific Writer at Lexogen, and Yvonne Goepel Ph.D., Product Manager at Lexogen, remarked that “The high-throughput nature of RNA-seq allows for rapid profiling and deep exploration of the transcriptome.” They emphasized its indispensable role in cancer research, aiding in biomarker...
                      09-07-2023, 11:15 PM
                    • seqadmin
                      Methods for Investigating the Transcriptome
                      by seqadmin




                      Ribonucleic acid (RNA) represents a range of diverse molecules that play a crucial role in many cellular processes. From serving as a protein template to regulating genes, the complex processes involving RNA make it a focal point of study for many scientists. This article will spotlight various methods scientists have developed to investigate different RNA subtypes and the broader transcriptome.

                      Whole Transcriptome RNA-seq
                      Whole transcriptome sequencing...
                      08-31-2023, 11:07 AM

                    ad_right_rmr

                    Collapse

                    News

                    Collapse

                    Topics Statistics Last Post
                    Started by seqadmin, Yesterday, 09:05 AM
                    0 responses
                    12 views
                    0 likes
                    Last Post seqadmin  
                    Started by seqadmin, 09-21-2023, 06:18 AM
                    0 responses
                    10 views
                    0 likes
                    Last Post seqadmin  
                    Started by seqadmin, 09-20-2023, 09:17 AM
                    0 responses
                    12 views
                    0 likes
                    Last Post seqadmin  
                    Started by seqadmin, 09-19-2023, 09:23 AM
                    0 responses
                    26 views
                    0 likes
                    Last Post seqadmin  
                    Working...
                    X