Seqanswers Leaderboard Ad

Collapse

Announcement

Collapse
No announcement yet.
X
 
  • Filter
  • Time
  • Show
Clear All
new posts

  • Originally posted by James View Post
    Hi guys,

    I'm new to seqanswers and to alignment. So I started using bowtie as it seems to have a good reputation manual and tutorial.

    I am working through the tutorial, I have got to the point where I am supposed to build a new index using the E. coli strain O157:H7 downloaded from ncbi. However when I run the command I get this error

    could not open NC_002127.fna

    I'm using terminal in Mac OSX 10.5.8. Anybody had the same problem? Am I not putting the NC-002127.fna file in the correct directory?

    I would move on in the tutorial however I need to use the build option to create an index for the organism I work on.

    Thanks in advance.
    Is your file NC_002127.fna in the work directory, which means the current directory where you type the command. If not, please give the path to the file, then the program can find the file.
    Xi Wang

    Comment


    • Out of Memory allocating the offs[] array for bowtie index

      Hi Ben
      First of all,Bowtie is great. Thanks for that.I am running into a few problems when I was working with the human index.

      The command I used was bowtie -C hg19_c reads/e_coli_1000.fq. This fails with the message : Out of Memory allocating the offs[] array for bowtie index.

      I am using a 8GB Windows 64 bit processor with a 3 ghz quad core processor. I am not understanding why it is running out of memory. Can you kindly let me know what could be the problem? Thanks a lot and sorry for the inconvenience

      Comment


      • Bowtie for meta data

        hello everyone,
        I am using Bowtie for SOLiD based metagenomic mapping. The references are millions of 1kb-length fragments, I don't know wheather it is the reason that index-building spent quite a lot of time (Total time for backward call to driver() for mirror index: 01:56:46).

        My former analysis using AB software indicated that most of the reads could aligned to many references, and the mappable reads were not considerable.

        For my case, could you give some suggestion on index-building and alignment parameters setting?

        Thanks a lot,

        Betty

        Comment


        • Originally posted by betty View Post
          hello everyone,
          I am using Bowtie for SOLiD based metagenomic mapping. The references are millions of 1kb-length fragments, I don't know wheather it is the reason that index-building spent quite a lot of time (Total time for backward call to driver() for mirror index: 01:56:46).

          My former analysis using AB software indicated that most of the reads could aligned to many references, and the mappable reads were not considerable.

          For my case, could you give some suggestion on index-building and alignment parameters setting?

          Thanks a lot,

          Betty
          I think it could be such long time. For the human genome (~ 3Gb and 25 chromosomes), it takes about 3-4 hours.
          Xi Wang

          Comment


          • Originally posted by JimC View Post
            Ben,
            I've tried to read all the posts, but I may have missed this answer if posted.

            I'm having problem with running bowtie on a mouse genome dataset.
            The error is a large number of reads giving the warning:

            Warning: Exhausted best-first chunk memory for read .....

            current command line: bowtie -S -p 1 --solexa1.3-quals --un unmapped.fq -m 10 --max maxmapped.fq -n 3 -X 600 /ccmb/CoreBA/Data/BowtieData/mm9 -1 ../s_7_1_sequence.txt -2 ../s_7_2_sequence.txt mm9_align.sam


            version: bowtie --version
            bowtie version 0.12.3
            64-bit
            Built on ccmb-comp1.umms.med.umich.edu
            Tue Mar 2 12:33:36 EST 2010
            Compiler: gcc version 4.1.2 20080704 (Red Hat 4.1.2-46)
            Options: -O3
            Sizeof {int, long, long long, void*, size_t, off_t}: {4, 8, 8, 8, 8, 8}


            Any suggestions would be helpful as I feel that I'm not getting the level of alignment I should be seeing with this data.

            Thanks !
            Jim
            Hello,

            I have the same problems using a 64-bit computer ('Warning: Exhausted best-first chunk memory for read HWUSI_ ...; skipping read). My paired-end data are in .txt format. Could this have anything to do with the problem? Otherwise, as I'm only starting to work with these data, I have no clue to other things that could be causing this problem.

            Thanks a lot!
            Lien

            Comment


            • mismatches in color space

              hi,
              I have a doubt about color space mismatches, and I don't know whether my understanding is correct.

              I set -C -n 2 -l 20 for SOLiD data alignment. So, it permits at most 2 color space mismatches in the first 20 characters of the read (trimming the tag "T" and the first base).
              In the output file, the single mismatch was treated as system error and was ignored. Only the adjacent mismatches which could be correctly explained by SNP were reported.

              So, there may be many many single mismatches ignored in Bowtie because of system error? Is there any other consideration besides single and adjacent mismatches?

              Any suggestions would be appreciated.

              Regards,

              Betty

              Comment


              • Originally posted by Lien View Post
                Hello,

                I have the same problems using a 64-bit computer ('Warning: Exhausted best-first chunk memory for read HWUSI_ ...; skipping read). My paired-end data are in .txt format. Could this have anything to do with the problem? Otherwise, as I'm only starting to work with these data, I have no clue to other things that could be causing this problem.

                Thanks a lot!
                Lien
                Apparently, it is normal that some reads are skipped because they can't align. Just hope the percentage that is skipped isn't too high!

                Comment


                • Hi again,

                  I performed a paired-end run on the Illumina, with 100bp reads. The original DNA fragments are +/- 200 bp. I have a percentage of about 40% of the reads that failed to align. I think some of the original DNA fragments are smaller than 200 bp, so there would be an overlap between both paired reads. To solve this, I think I would have to change the minimum insert size. But this would mean that this number would become negative (for example -30). Is this possible?

                  Thanks!

                  Comment


                  • I have a reference genome that is in fact a collection of many contigs whose lengths range between a few hundreds to a few thousands bps. Also most of my reads probably won't align to any of the contigs. Will bowtie work for such case?

                    Thanks,
                    Itai

                    Comment


                    • Hi Ben, how different are hg18 and NCBI36 genomes? Genomic co-ordinates for quite a few single nucleotide mutations obtained from RNAseq data based on bowtie hg18 index are not mapping to the same bases on NCBI36.
                      Thanx,-s

                      Comment


                      • Originally posted by isharon View Post
                        I have a reference genome that is in fact a collection of many contigs whose lengths range between a few hundreds to a few thousands bps. Also most of my reads probably won't align to any of the contigs. Will bowtie work for such case?

                        Thanks,
                        Itai
                        Yes, it should.
                        Ben

                        Comment


                        • Originally posted by Subho View Post
                          Hi Ben, how different are hg18 and NCBI36 genomes? Genomic co-ordinates for quite a few single nucleotide mutations obtained from RNAseq data based on bowtie hg18 index are not mapping to the same bases on NCBI36.
                          Thanx,-s
                          My understanding is that they're hg18 and NCBI v36.* are the same. See, e.g., the first question on this faq:



                          Thanks,
                          Ben

                          Comment


                          • I'm new to NGS, and I'm wondering if Bowtie could be of use to me. I have Illumina reads, and I want to assemble chloroplast genomes. I have built an index using another chloroplast genome, and I am successful in running Bowtie using default parameters. Sequence differences should be relatively minor in coding regions, but they could be significant in non-coding regions. I'll need to tweek the parameters accordingly, I guess. So, here are my questions: Would Bowtie be useful for me? How would you suggest I set -n or -v to allow for between species sequence differences? Once I get the output, how can I map these to the reference to create contigs? I see the location of the alignment in the output, but what step do I take next to work with the output?

                            Thanks for the help!
                            guisinmm

                            Comment


                            • Hi,
                              I have downloaded fastq file format from ncbi (Illumina reads) and trying to use Bowtie and I get the following error.

                              Code:
                              Error: reads file does not look like a FASTA file
                              terminate called after throwing an instance of 'int'
                              Let me know how to overcome this problem. Thanks.

                              Comment


                              • I managed to figure out the problem. I was using -f option for input file instead of -q option for inputfastq format. Thanks.

                                Comment

                                Latest Articles

                                Collapse

                                • seqadmin
                                  Choosing Between NGS and qPCR
                                  by seqadmin



                                  Next-generation sequencing (NGS) and quantitative polymerase chain reaction (qPCR) are essential techniques for investigating the genome, transcriptome, and epigenome. In many cases, choosing the appropriate technique is straightforward, but in others, it can be more challenging to determine the most effective option. A simple distinction is that smaller, more focused projects are typically better suited for qPCR, while larger, more complex datasets benefit from NGS. However,...
                                  10-18-2024, 07:11 AM
                                • seqadmin
                                  Non-Coding RNA Research and Technologies
                                  by seqadmin




                                  Non-coding RNAs (ncRNAs) do not code for proteins but play important roles in numerous cellular processes including gene silencing, developmental pathways, and more. There are numerous types including microRNA (miRNA), long ncRNA (lncRNA), circular RNA (circRNA), and more. In this article, we discuss innovative ncRNA research and explore recent technological advancements that improve the study of ncRNAs.

                                  Nobel Prize for MicroRNA Discovery
                                  This week,...
                                  10-07-2024, 08:07 AM

                                ad_right_rmr

                                Collapse

                                News

                                Collapse

                                Topics Statistics Last Post
                                Started by seqadmin, Yesterday, 05:31 AM
                                0 responses
                                10 views
                                0 likes
                                Last Post seqadmin  
                                Started by seqadmin, 10-24-2024, 06:58 AM
                                0 responses
                                20 views
                                0 likes
                                Last Post seqadmin  
                                Started by seqadmin, 10-23-2024, 08:43 AM
                                0 responses
                                48 views
                                0 likes
                                Last Post seqadmin  
                                Started by seqadmin, 10-17-2024, 07:29 AM
                                0 responses
                                58 views
                                0 likes
                                Last Post seqadmin  
                                Working...
                                X