Unconfigured Ad

Collapse
X
 
  • Filter
  • Time
  • Show
Clear All
new posts
  • Xi Wang
    Senior Member
    • Oct 2009
    • 317

    Originally posted by James View Post
    Hi guys,

    I'm new to seqanswers and to alignment. So I started using bowtie as it seems to have a good reputation manual and tutorial.

    I am working through the tutorial, I have got to the point where I am supposed to build a new index using the E. coli strain O157:H7 downloaded from ncbi. However when I run the command I get this error

    could not open NC_002127.fna

    I'm using terminal in Mac OSX 10.5.8. Anybody had the same problem? Am I not putting the NC-002127.fna file in the correct directory?

    I would move on in the tutorial however I need to use the build option to create an index for the organism I work on.

    Thanks in advance.
    Is your file NC_002127.fna in the work directory, which means the current directory where you type the command. If not, please give the path to the file, then the program can find the file.
    Xi Wang

    Comment

    • varunkilaru
      Junior Member
      • Mar 2010
      • 1

      Out of Memory allocating the offs[] array for bowtie index

      Hi Ben
      First of all,Bowtie is great. Thanks for that.I am running into a few problems when I was working with the human index.

      The command I used was bowtie -C hg19_c reads/e_coli_1000.fq. This fails with the message : Out of Memory allocating the offs[] array for bowtie index.

      I am using a 8GB Windows 64 bit processor with a 3 ghz quad core processor. I am not understanding why it is running out of memory. Can you kindly let me know what could be the problem? Thanks a lot and sorry for the inconvenience

      Comment

      • betty
        Junior Member
        • Dec 2008
        • 3

        Bowtie for meta data

        hello everyone,
        I am using Bowtie for SOLiD based metagenomic mapping. The references are millions of 1kb-length fragments, I don't know wheather it is the reason that index-building spent quite a lot of time (Total time for backward call to driver() for mirror index: 01:56:46).

        My former analysis using AB software indicated that most of the reads could aligned to many references, and the mappable reads were not considerable.

        For my case, could you give some suggestion on index-building and alignment parameters setting?

        Thanks a lot,

        Betty

        Comment

        • Xi Wang
          Senior Member
          • Oct 2009
          • 317

          Originally posted by betty View Post
          hello everyone,
          I am using Bowtie for SOLiD based metagenomic mapping. The references are millions of 1kb-length fragments, I don't know wheather it is the reason that index-building spent quite a lot of time (Total time for backward call to driver() for mirror index: 01:56:46).

          My former analysis using AB software indicated that most of the reads could aligned to many references, and the mappable reads were not considerable.

          For my case, could you give some suggestion on index-building and alignment parameters setting?

          Thanks a lot,

          Betty
          I think it could be such long time. For the human genome (~ 3Gb and 25 chromosomes), it takes about 3-4 hours.
          Xi Wang

          Comment

          • Lien
            Member
            • Dec 2009
            • 47

            Originally posted by JimC View Post
            Ben,
            I've tried to read all the posts, but I may have missed this answer if posted.

            I'm having problem with running bowtie on a mouse genome dataset.
            The error is a large number of reads giving the warning:

            Warning: Exhausted best-first chunk memory for read .....

            current command line: bowtie -S -p 1 --solexa1.3-quals --un unmapped.fq -m 10 --max maxmapped.fq -n 3 -X 600 /ccmb/CoreBA/Data/BowtieData/mm9 -1 ../s_7_1_sequence.txt -2 ../s_7_2_sequence.txt mm9_align.sam


            version: bowtie --version
            bowtie version 0.12.3
            64-bit
            Built on ccmb-comp1.umms.med.umich.edu
            Tue Mar 2 12:33:36 EST 2010
            Compiler: gcc version 4.1.2 20080704 (Red Hat 4.1.2-46)
            Options: -O3
            Sizeof {int, long, long long, void*, size_t, off_t}: {4, 8, 8, 8, 8, 8}


            Any suggestions would be helpful as I feel that I'm not getting the level of alignment I should be seeing with this data.

            Thanks !
            Jim
            Hello,

            I have the same problems using a 64-bit computer ('Warning: Exhausted best-first chunk memory for read HWUSI_ ...; skipping read). My paired-end data are in .txt format. Could this have anything to do with the problem? Otherwise, as I'm only starting to work with these data, I have no clue to other things that could be causing this problem.

            Thanks a lot!
            Lien

            Comment

            • betty
              Junior Member
              • Dec 2008
              • 3

              mismatches in color space

              hi,
              I have a doubt about color space mismatches, and I don't know whether my understanding is correct.

              I set -C -n 2 -l 20 for SOLiD data alignment. So, it permits at most 2 color space mismatches in the first 20 characters of the read (trimming the tag "T" and the first base).
              In the output file, the single mismatch was treated as system error and was ignored. Only the adjacent mismatches which could be correctly explained by SNP were reported.

              So, there may be many many single mismatches ignored in Bowtie because of system error? Is there any other consideration besides single and adjacent mismatches?

              Any suggestions would be appreciated.

              Regards,

              Betty

              Comment

              • Lien
                Member
                • Dec 2009
                • 47

                Originally posted by Lien View Post
                Hello,

                I have the same problems using a 64-bit computer ('Warning: Exhausted best-first chunk memory for read HWUSI_ ...; skipping read). My paired-end data are in .txt format. Could this have anything to do with the problem? Otherwise, as I'm only starting to work with these data, I have no clue to other things that could be causing this problem.

                Thanks a lot!
                Lien
                Apparently, it is normal that some reads are skipped because they can't align. Just hope the percentage that is skipped isn't too high!

                Comment

                • Lien
                  Member
                  • Dec 2009
                  • 47

                  Hi again,

                  I performed a paired-end run on the Illumina, with 100bp reads. The original DNA fragments are +/- 200 bp. I have a percentage of about 40% of the reads that failed to align. I think some of the original DNA fragments are smaller than 200 bp, so there would be an overlap between both paired reads. To solve this, I think I would have to change the minimum insert size. But this would mean that this number would become negative (for example -30). Is this possible?

                  Thanks!

                  Comment

                  • isharon
                    Junior Member
                    • Feb 2010
                    • 4

                    I have a reference genome that is in fact a collection of many contigs whose lengths range between a few hundreds to a few thousands bps. Also most of my reads probably won't align to any of the contigs. Will bowtie work for such case?

                    Thanks,
                    Itai

                    Comment

                    • Subho
                      Junior Member
                      • Apr 2010
                      • 2

                      Hi Ben, how different are hg18 and NCBI36 genomes? Genomic co-ordinates for quite a few single nucleotide mutations obtained from RNAseq data based on bowtie hg18 index are not mapping to the same bases on NCBI36.
                      Thanx,-s

                      Comment

                      • Ben Langmead
                        Senior Member
                        • Sep 2008
                        • 200

                        Originally posted by isharon View Post
                        I have a reference genome that is in fact a collection of many contigs whose lengths range between a few hundreds to a few thousands bps. Also most of my reads probably won't align to any of the contigs. Will bowtie work for such case?

                        Thanks,
                        Itai
                        Yes, it should.
                        Ben

                        Comment

                        • Ben Langmead
                          Senior Member
                          • Sep 2008
                          • 200

                          Originally posted by Subho View Post
                          Hi Ben, how different are hg18 and NCBI36 genomes? Genomic co-ordinates for quite a few single nucleotide mutations obtained from RNAseq data based on bowtie hg18 index are not mapping to the same bases on NCBI36.
                          Thanx,-s
                          My understanding is that they're hg18 and NCBI v36.* are the same. See, e.g., the first question on this faq:



                          Thanks,
                          Ben

                          Comment

                          • guisinmm
                            Junior Member
                            • Mar 2010
                            • 3

                            I'm new to NGS, and I'm wondering if Bowtie could be of use to me. I have Illumina reads, and I want to assemble chloroplast genomes. I have built an index using another chloroplast genome, and I am successful in running Bowtie using default parameters. Sequence differences should be relatively minor in coding regions, but they could be significant in non-coding regions. I'll need to tweek the parameters accordingly, I guess. So, here are my questions: Would Bowtie be useful for me? How would you suggest I set -n or -v to allow for between species sequence differences? Once I get the output, how can I map these to the reference to create contigs? I see the location of the alignment in the output, but what step do I take next to work with the output?

                            Thanks for the help!
                            guisinmm

                            Comment

                            • seq_GA
                              Senior Member
                              • Feb 2009
                              • 124

                              Hi,
                              I have downloaded fastq file format from ncbi (Illumina reads) and trying to use Bowtie and I get the following error.

                              Code:
                              Error: reads file does not look like a FASTA file
                              terminate called after throwing an instance of 'int'
                              Let me know how to overcome this problem. Thanks.

                              Comment

                              • seq_GA
                                Senior Member
                                • Feb 2009
                                • 124

                                I managed to figure out the problem. I was using -f option for input file instead of -q option for inputfastq format. Thanks.

                                Comment

                                Latest Articles

                                Collapse

                                • SEQadmin2
                                  From Collection to Sequencing: Why Sample Preparation and Preservation Define Sequencing Data
                                  by SEQadmin2


                                  Data variability is still an issue in sequencing technologies despite the advances in reproducibility and accuracy of these platforms. But the problem does not originate in the sequencing itself, but in the previous steps, before the sample reaches the sequencer.


                                  The first step is collection, followed by preservation and sample preparation for analysis. Most scientists overlook those steps, but not being careful might just be skewing the experiment’s results.
                                  ...
                                  06-02-2026, 10:05 AM
                                • SEQadmin2
                                  Single-Cell Sequencing at an Inflection Point: Early Impacts of New Platforms and Emerging Trends
                                  by SEQadmin2


                                  With the launch of new single-cell sequencing platforms in 2026, the field stands at an exciting inflection point. This article surveys the most impactful advances in the field and discusses how they’re reshaping research in cancer, immunology, and beyond.


                                  Introduction

                                  Single-cell sequencing technologies have undergone remarkable advances over the past decade, transitioning from low-throughput experimental approaches to highly scalable platforms capable of...
                                  05-22-2026, 06:42 AM
                                • SEQadmin2
                                  Environmental Genomics in the Age of NGS: From Microbes to Conservation Strategies
                                  by SEQadmin2

                                  Studying ecosystems means dealing with complex, multi-species communities that are hard to observe at scale. This complexity, however, hides many important questions to be answered, from how biogeochemical cycles work and how climate change can affect species distribution to how conservation strategies can work best.


                                  Genomics, particularly since the expansion of NGS, has transformed ecosystem ecology. By sequencing environmental DNA, we can now assess biodiversity without direct...
                                  05-06-2026, 09:04 AM

                                ad_right_rmr

                                Collapse

                                News

                                Collapse

                                Topics Statistics Last Post
                                Started by SEQadmin2, Today, 08:59 AM
                                0 responses
                                7 views
                                0 reactions
                                Last Post SEQadmin2  
                                Started by SEQadmin2, 06-02-2026, 12:03 PM
                                0 responses
                                21 views
                                0 reactions
                                Last Post SEQadmin2  
                                Started by SEQadmin2, 06-02-2026, 11:40 AM
                                0 responses
                                14 views
                                0 reactions
                                Last Post SEQadmin2  
                                Started by SEQadmin2, 05-28-2026, 11:40 AM
                                0 responses
                                29 views
                                0 reactions
                                Last Post SEQadmin2  
                                Working...