  • Ka123$
    Member
    • Jul 2009
    • 27

    Hi Anthony,
    So here is what I am doing. We have decided to stick with the GERALD files and convert them to wig (PI's orders!).
    I checked for unaligned files and none were there.
    I have a .export file (from s_#_export.txt) and ran:
    java -Xmx2G -jar SeparateReads.jar elandext 7_XXXXXX_GERALD-YYYY-MM-DD.export G_sep_7
    Version: Initializing class SeparateReads $Revision: 1082 $
    Version: Initializing class Generic_AlignRead_Iterator $Revision: 1318 $
    Version: Initializing class Log_Buffer $Revision: 1145 $
    Version: Initializing class ElandExtIterator $Revision: 832 $
    Exception in thread "main" java.lang.NumberFormatException: For input string: ""
    at java.lang.NumberFormatException.forInputString(Unknown Source)
    at java.lang.Integer.parseInt(Unknown Source)
    at java.lang.Integer.parseInt(Unknown Source)
    at src.lib.ioInterfaces.ElandExtIterator.next(ElandExtIterator.java:180)
    at src.lib.ioInterfaces.ElandExtIterator.next(ElandExtIterator.java:20)
    at src.lib.ioInterfaces.Generic_AlignRead_Iterator.hasNext(Generic_AlignRead_Iterator.java:103)
    at src.fileUtilities.SeparateReads.main(SeparateReads.java:69)

    It looks like GERALD outputs a .txt file. How can I specify the aligner type for GERALD output? Neither elandext nor eland_extended works.

    Is there a way to convert a .txt Solexa export file directly to .wig in FindPeaks?


    • apfejes
      Senior Member
      • Feb 2008
      • 236

      Hi Ka123$,

      Thanks for the detailed report! I've managed to re-create the problem by parsing a data set that is similar. I observed that the iterator crashes on reads marked with "QC", so I've modified the code in order to reject those reads.

      I can do two things for you. The first is that I can compile the code for you and send you the latest version via email. The second is that I can check in the code changes so that you can check it out and compile it yourself. Either option is open.

      Thanks again for the very helpful bug report!

      Anthony

      Edit: The code has been checked in to the repository, if you're interested in building from scratch.
      Last edited by apfejes; 09-30-2009, 03:42 PM.
      The more you know, the more you know you don't know. —Aristotle
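As a rough illustration of the fix described in this reply (rejecting reads marked "QC" before their position is parsed), here is a minimal Python sketch of a pre-filter for a tab-separated export file. It is not the FindPeaks code. The column indices are assumptions about the export layout and should be checked against your own file; `keep_read` and `filter_export` are hypothetical helper names.

```python
QC_MARKER = "QC"  # marker the reply reports on the reads that crashed the iterator
CHROM_COL = 10    # ASSUMPTION: 0-based column holding the match chromosome
POS_COL = 12      # ASSUMPTION: 0-based column holding the match position


def keep_read(line, chrom_col=CHROM_COL, pos_col=POS_COL):
    """Return True if a tab-separated line has a parseable alignment position."""
    fields = line.rstrip("\n").split("\t")
    if len(fields) <= max(chrom_col, pos_col):
        return False  # truncated line, nothing to parse
    if fields[chrom_col] == QC_MARKER:
        return False  # read failed quality control
    try:
        int(fields[pos_col])  # an empty string raises ValueError, as in the stack trace
    except ValueError:
        return False
    return True


def filter_export(lines):
    """Split an iterable of lines into (kept_lines, dropped_count)."""
    kept = []
    dropped = 0
    for line in lines:
        if keep_read(line):
            kept.append(line)
        else:
            dropped += 1
    return kept, dropped
```

Passing an open file handle as `lines` and writing `kept` to a new file would give an export file without the reads that trigger the `NumberFormatException`, assuming the column guesses hold.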


      • Ka123$
        Member
        • Jul 2009
        • 27

        Thanks so much, Anthony! If you could compile it and email it to me, that would be great! I appreciate it so much!


        • apfejes
          Senior Member
          • Feb 2008
          • 236

          Hi Ka123$,

          I'm sorry - I can't seem to find your email address. Could you send it to me again? I'll package up a copy for you in the morning.

          Anthony

          • Ka123$
            Member
            • Jul 2009
            • 27

            Hi apfejes,
            I sent you my email address last week. I was wondering whether you got it; could you please check again? I am also sending you an email about this thread, and you can reply to that. Thanks.


            • apfejes
              Senior Member
              • Feb 2008
              • 236

              Hi Ka123$,

              I emailed it to you last week. If it didn't arrive, it may have been too large. Can you check the maximum attachment size your email account can accept? The attachment was 10.6 MB.

              If that's the case, please let me know, and I'll arrange to host it somewhere for you.

              Anthony

              • eslondon
                Member
                • Jul 2009
                • 21

                Same log problem, probably silly but still...

                When using SeparateReads.jar I have no issues if I use only one input file; all works fine. If I try to use it the way the example describes, i.e. using an asterisk to pass several input files from one directory, it decides that it should write the output into one of the inputs...

                Here is the command line:
                java -jar ~/programs/VancouverShortRead/fp4/SeparateReads.jar bowtie /data/bioinfo/302KC/*.map /data/bioinfo/Analysis/mapping/brain/

                And here is the output:
                Error: Coundn't create log file : /data/bioinfo/302KC/HCT449_brain_s_2_sequence.fastq.map/SeparateReads.log

                Any tips?

                thanks

                Elia
                --------------------------------------
                Elia Stupka
                Co-Director and Head of Unit
                Center for Translational Genomics and Bioinformatics
                San Raffaele Scientific Institute
                Via Olgettina 58
                20132 Milano
                Italy
                ---------------------------------------


                • eslondon
                  Member
                  • Jul 2009
                  • 21

                  Hmmm... must be something silly that I will regret having posted... I have the same problem with SortFiles as well: rather than taking in all *.gz files, it takes the first one and assumes the second one is the location for the log file. Could it be a shell/environment issue?

                  Update: it also fails without the asterisk. Basically it accepts only one input file and takes the second input file as the output directory.

                  Elia
                  Last edited by eslondon; 10-17-2009, 02:09 AM.
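If the tool really does accept only one input file per invocation, as this post observes, one possible workaround is to call it once per file rather than passing a wildcard that the shell expands into several arguments. The Python sketch below assumes the argument order seen in the commands quoted in this thread (aligner, input, output); `build_commands` and `run_all` are hypothetical helper names, and none of this has been checked against the tool's documentation.

```python
import subprocess


def build_commands(jar, aligner, input_files, output_dir):
    """Return one argv list per input file: java -jar JAR ALIGNER INPUT OUTPUT."""
    return [
        ["java", "-jar", str(jar), aligner, path, str(output_dir)]
        for path in sorted(str(p) for p in input_files)
    ]


def run_all(commands):
    """Run each command in turn; check=True stops at the first non-zero exit."""
    for argv in commands:
        subprocess.run(argv, check=True)
```

For example, `run_all(build_commands("SeparateReads.jar", "elandext", ["lane1.txt", "lane2.txt"], "out/"))` would launch two separate runs; the file names here are placeholders.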

                  • apfejes
                    Senior Member
                    • Feb 2008
                    • 236

                    Hi Elia,

                    The short answer is that you don't need to run SeparateReads/SortReads on map files, as the reads they contain are already sorted by chromosome and start position. Of course, if you're trying to do something other than run FindPeaks with them, that's a different story.

                    Edit: I should probably also add that it's not a good idea to try. These two particular utilities were intended only for use with text format files - not pre-sorted binary files. I've never tested it out on a .map file.

                    Anthony
                    Last edited by apfejes; 10-18-2009, 08:20 AM. Reason: Additional information

                    • greggrant
                      Member
                      • Dec 2008
                      • 28

                      Thanks for this list, that's really awesome. What do people think is the best way at this point to map approximately 5 million 100 bp reads to a transcriptome? I'm looking for alignment allowing (a specified number of) mismatches but no gaps. Thanks again for this list!


                      • dan
                        wiki wiki
                        • Jul 2008
                        • 194

                        Originally posted by greggrant:
                        Thanks for this list, that's really awesome. What do people think is the best way at this point to map approximately 5 million 100 bp reads to a transcriptome? I'm looking for alignment allowing (a specified number of) mismatches but no gaps. Thanks again for this list!
                        I'd map against the genome (you never know) using bowtie or SOAP.

                        You can look them up here:

                        http://seqanswers.com/wiki/Software

                        Homepage: Dan Bolser
                        MetaBase the database of biological databases.


                        • greggrant
                          Member
                          • Dec 2008
                          • 28

                          Originally posted by dan:
                          I'd map against the genome (you never know) using bowtie or SOAP.

                          You can look them up here:

                          http://seqanswers.com/wiki/Software
                          Those options won't find reads that map across exon/exon junctions. I need something that can map ungapped to the transcriptome. BLAST will probably do the trick, but there should be something faster.
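To make the requirement concrete, ungapped alignment with a fixed mismatch budget amounts to comparing the read against every same-length window of the reference and counting differing bases. The Python sketch below shows only that criterion; it is far too slow for millions of reads and is not how any of the aligners mentioned in this thread work internally. `ungapped_hits` is a hypothetical name.

```python
def ungapped_hits(read, reference, max_mismatches):
    """Return (start, mismatches) for each window within the mismatch budget."""
    hits = []
    for start in range(len(reference) - len(read) + 1):
        window = reference[start:start + len(read)]
        mismatches = 0
        for a, b in zip(read, window):
            if a != b:
                mismatches += 1
                if mismatches > max_mismatches:
                    break  # over budget, abandon this window early
        else:
            # Loop finished without break: the window is an acceptable hit.
            hits.append((start, mismatches))
    return hits
```

No insertions or deletions are ever considered, which is what distinguishes this from a gapped search.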


                          • apfejes
                            Senior Member
                            • Feb 2008
                            • 236

                            We use a database of all predicted/potential exon/exon junctions in addition to the genome, and then use maq/bwa - it seems to do very well.
                            Last edited by apfejes; 10-27-2009, 07:12 AM. Reason: too sleepy to get it right the first time.
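One common way to build such a junction database, sketched here under assumptions and not taken from the poster's pipeline, is to join the last `read_length - 1` bases of an upstream exon to the first `read_length - 1` bases of a downstream exon, so that any read aligned to the result must span the junction. The function name, the naming scheme, and the choice to pair every upstream exon with every downstream exon are all illustrative.

```python
def junction_sequences(exons, read_length):
    """Yield (name, sequence) for every ordered exon pair i < j of one transcript.

    exons: exon sequences in transcript order (5' to 3').
    read_length: read length in bases; each flank is read_length - 1 bases long.
    """
    if read_length < 2:
        raise ValueError("read_length must be at least 2")
    flank = read_length - 1
    for i in range(len(exons) - 1):
        left = exons[i][-flank:]  # tail of the upstream exon
        for j in range(i + 1, len(exons)):
            right = exons[j][:flank]  # head of the downstream exon
            yield f"junction_{i + 1}_{j + 1}", left + right
```

Writing each pair out as a FASTA record alongside the genome would give a single reference for an ungapped aligner. Exons shorter than the flank simply contribute their full length.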

                            • ewilbanks
                              Member
                              • Mar 2009
                              • 83

                              Try TopHat http://tophat.cbcb.umd.edu/

                              It uses Bowtie to map reads and analyzes the mapping results to identify splice junctions between exons.


                              • Xi Wang
                                Senior Member
                                • Oct 2009
                                • 317

                                Hi all,

                                Have you seen the review of computational methods for ChIP-seq and RNA-seq studies? It summarizes some of the available tools for processing ChIP-seq and RNA-seq data.

                                Review: Computation for ChIP-seq and RNA-seq studies
                                Shirley Pepke, Barbara Wold & Ali Mortazavi
                                Nature Methods 6, S22-S32 (2009)
                                doi:10.1038/nmeth.1371


                                Best wishes,
                                Xi
                                Xi Wang
