Unconfigured Ad

Collapse
X
 
  • Filter
  • Time
  • Show
Clear All
new posts
  • aakriti
    Member
    • Jul 2014
    • 11

    Help with Bowtie2

    Hi all!

    I am currently working on a project where I am trying to align assembled scaffolds from the Human Microbiome Project to a gene sequence (agaA from C. striatum). I am having trouble with this since every time I try to make an alignment, I get a result like this:

    27528 reads; of these:
    27528 (100.00%) were unpaired; of these:
    27528 (100.00%) aligned 0 times
    0 (0.00%) aligned exactly 1 time
    0 (0.00%) aligned >1 times
    0.00% overall alignment rate

    I'm not sure if this is an accurate result. Is there a better way to approach this problem? The reason I'm doing this is because I'm trying to find mutational hot spots in the agaA gene (which I know for sure is found on the skin microbiome) and so I thought that I could perhaps use the data available already in the HMP.

    Thanks!
  • hui_shi
    Junior Member
    • Oct 2010
    • 6

    #2
    So basically, none of your reads were mapped according to what you have shown here.

    I think you could try to relax the parameters a bit, so see if any of your reads align to the reference genome.

    Comment

    • aakriti
      Member
      • Jul 2014
      • 11

      #3
      Thanks for the response! Where can I set the parameters for the alignment? Is it in the Bowtie2_build file?

      Comment

      • Brian Bushnell
        Super Moderator
        • Jan 2014
        • 2709

        #4
        If the query sequences are longer than the reference sequence, that may cause problems. You might be better off shredding the sequences to something like 200bp before mapping.

        Comment

        • aakriti
          Member
          • Jul 2014
          • 11

          #5
          Do you mean dividing the query sequences further? I believe this could be the case since the reference sequence is just one gene.

          Comment

          • Brian Bushnell
            Super Moderator
            • Jan 2014
            • 2709

            #6
            I mean breaking the query sequences into pieces each of length ~200bp, so that they will be shorter then the reference sequence.

            Comment

            • GenoMax
              Senior Member
              • Feb 2008
              • 7142

              #7
              If the scaffolds are much larger than the gene how about doing the search other way round (gene against the scaffolds)?

              Comment

              • aakriti
                Member
                • Jul 2014
                • 11

                #8
                OKay I can write a script to try that. Do you think it would be possible to use the query sequences as the reference and try to align my gene of interest to that? Or would that not work (still trying to understand how the indexing works :|)

                Comment

                • aakriti
                  Member
                  • Jul 2014
                  • 11

                  #9
                  Ah! Genomax, you took the words out of my mouth :P

                  Comment

                  • GenoMax
                    Senior Member
                    • Feb 2008
                    • 7142

                    #10
                    Depending on how many mutations you expect you may want to use a non-NGS aligner.

                    Brian: Would Pacbio aligner from BBMap work?

                    Comment

                    • dpryan
                      Devon Ryan
                      • Jul 2011
                      • 3478

                      #11
                      Actually, you can also just use local alignment if the query is longer than the reference. That often works.

                      Comment

                      • aakriti
                        Member
                        • Jul 2014
                        • 11

                        #12
                        Thank you for all the suggestions! I tried the alignment method suggestions you guys sent me (actually still have to look into BBMap), but am getting the same results (no alignment).

                        Do you guys have any suggestions on loosening the parameters? Will this affect the integrity of the alignment though?

                        Also, I was wondering if you guys knew much about the HMP database in general. I'm still trying to get my head around what the scaffolds are referring to and what exactly the assembly fasta files contain. The documentation on this seems a bit sparse/doesn't correspond well with the data on the website. Is the chance high that my gene is actually not even present in the sequences assembled? (Though this seems highly unlikely that it wouldn't be present in any of the scaffolds).

                        Thanks!

                        Comment

                        • Brian Bushnell
                          Super Moderator
                          • Jan 2014
                          • 2709

                          #13
                          Originally posted by GenoMax View Post
                          Depending on how many mutations you expect you may want to use a non-NGS aligner.

                          Brian: Would Pacbio aligner from BBMap work?
                          If the scaffolds diverge sufficiently from the gene sequence, then yes, the BBMap Pac Bio version (mapPacBio8k.sh) or Blast would be better, as they are much more tolerant of low-identity alignments than Bowtie2.

                          Comment

                          • Mark
                            Member
                            • Nov 2008
                            • 54

                            #14
                            Agreed. Just make a blast db out of your scaffolds and blastn them with your gene

                            Comment

                            Latest Articles

                            Collapse

                            • SEQadmin2
                              Nine Things a Sample Prep Scientist Thinks About Before Sequencing
                              by SEQadmin2


                              I’m not a sequencing expert. I’m a purification scientist who uses NGS to evaluate workflows my group develops. With this perspective, we think about the sample first and the NGS workflow second. The sequencer is an exceptionally honest reporter, but it can only report on what you give it, so whether you get clean, interpretable data from an NGS workflow is largely determined before you begin.


                              Here are nine questions we think about, in roughly the order they matter, before...
                              06-18-2026, 07:11 AM
                            • SEQadmin2
                              From Collection to Sequencing: Why Sample Preparation and Preservation Define Sequencing Data
                              by SEQadmin2


                              Data variability is still an issue in sequencing technologies despite the advances in reproducibility and accuracy of these platforms. But the problem does not originate in the sequencing itself, but in the previous steps, before the sample reaches the sequencer.


                              The first step is collection, followed by preservation and sample preparation for analysis. Most scientists overlook those steps, but not being careful might just be skewing the experiment’s results.
                              ...
                              06-02-2026, 10:05 AM

                            ad_right_rmr

                            Collapse

                            News

                            Collapse

                            Topics Statistics Last Post
                            Started by SEQadmin2, 06-17-2026, 06:09 AM
                            0 responses
                            30 views
                            0 reactions
                            Last Post SEQadmin2  
                            Started by SEQadmin2, 06-09-2026, 11:58 AM
                            0 responses
                            44 views
                            0 reactions
                            Last Post SEQadmin2  
                            Started by SEQadmin2, 06-05-2026, 10:09 AM
                            0 responses
                            48 views
                            0 reactions
                            Last Post SEQadmin2  
                            Started by SEQadmin2, 06-04-2026, 08:59 AM
                            0 responses
                            50 views
                            0 reactions
                            Last Post SEQadmin2  
                            Working...