Seqanswers Leaderboard Ad

Collapse

Announcement

Collapse
No announcement yet.
X
 
  • Filter
  • Time
  • Show
Clear All
new posts

  • BFAST- match quick question

    Hello all,

    I have 30 million RNA-Seq data (illumina), 75bp in length and wish to map it to the chicken genome. I did make the necessary index files (10), using the instruction in "7.1" and started the "bfast match" process to search the index. The problem i see is that the read out, which says...for the first index...
    Reads processed: 1
    Cleaning.....
    Complete.....
    Found 1 match.
    My question is what is this 1 match? Did it find only 1 match out of 30 million reads or just some kind of output. The reason i ask i just don't want to waste machine time. Kindly help

  • #2
    That is what it means. You should save that output for the posterity .

    Something may have went wrong. Post the full commands and outputs (use gist or pastie to avoid flooding SA).

    Also, for testing, you may want to reduce your initial dataset to just a few hundred of thousands of reads.
    -drd

    Comment


    • #3
      This what the read out looks like:

      /bfast+bwa-0.6.4e$ bfast match -f chicken.fa -n 10 -r 1F_4L_pf.fastq > bfast.matches.1F_4L_pf.bmf
      ************************************************************
      Checking input parameters supplied by the user ...
      Validating fastaFileName chicken.fa.
      Validating readsFileName 1F_4L_pf.fastq.
      Validating tmpDir path ./.
      **** Input arguments look good!
      ************************************************************
      ************************************************************
      Printing Program Parameters:
      programMode: [ExecuteProgram]
      fastaFileName: chicken.fa
      mainIndexes [Auto-recognizing]
      secondaryIndexes [Not Using]
      readsFileName: 1F_4L_pf.fastq
      offsets: [Using All]
      loadAllIndexes: [Not Using]
      compression: [Not Using]
      space: [NT Space]
      startReadNum: 1
      endReadNum: 2147483647
      keySize: [Not Using]
      maxKeyMatches: 8
      maxNumMatches: 384
      whichStrand: [Both Strands]
      numThreads: 10
      queueLength: 250000
      tmpDir: ./
      timing: [Not Using]
      ************************************************************
      Searching for main indexes...
      Found 10 index (10 total files).
      Not using secondary indexes.
      ************************************************************
      Reading in reference genome from chicken.fa.nt.brg.
      In total read 1 contigs for a total of 1100480441 bases
      ************************************************************
      Reading 1F_4L_pf.fastq into a temp file.
      Will process 1 reads.
      ************************************************************
      Searching index file 1/10 (index #1, bin #1)...
      Reading index from chicken.fa.nt.1.1.bif.
      Read index from chicken.fa.nt.1.1.bif.
      Reads processed: 1
      Cleaning up index.
      Searching index file 1/10 (index #1, bin #1) complete...
      Found 1 matches.
      ************************************************************
      Searching index file 2/10 (index #2, bin #1)...
      Reading index from chicken.fa.nt.2.1.bif.
      Read index from chicken.fa.nt.2.1.bif.
      Reads processed: 1
      Cleaning up index.
      Searching index file 2/10 (index #2, bin #1) complete...
      Found 1 matches.
      ************************************************************
      Searching index file 3/10 (index #3, bin #1)...
      Reading index from chicken.fa.nt.3.1.bif.
      Read index from chicken.fa.nt.3.1.bif.
      Reads processed: 0


      Thanks for the help.......

      Comment


      • #4
        Can you post the first few reads that would be great.

        Comment


        • #5
          Notice:

          Code:
          Reads processed: 1
          Against all the indexes.
          Most likely there is something wrong in your fastq file.
          Yes, can you post a few entries from your fastq?
          -drd

          Comment


          • #6
            these are few lines from the fastq file:

            @No name
            AAAGCCACGTGCAACCATCATCAAACCAGTTGGTGGAGATAAGAATGGAGGCAG
            +No name
            C@@@?CCCCCCBCCCCCCCCCCCBCC;CC@:<<0<=;?A?A@CCA>B=@@C@CC
            @No name
            TTTCTAAGGTCACGTTAACTGTAAACCAGTTCAATATTGAACTTCCTTTTCAATTTGGTT
            +No name
            BABBBBB@@BBBBBBB?BBBB@BB2B=BB@B==BB@BB@B@BBBBBBBB?BBBBBBBBBB
            @No name
            GGCAAATACACCATAGACAAGGTTCAGCCAGAGGATGCAGGAAAATATGAGTGCACATT
            +No name
            CC??BCBBCCC?C?CC?CBC@=:8=?A=??CBCCB3@A=@CC8BCCBCCC?@@@CA?=@
            @No name
            TTTTAGGGGCAGACTCAGAAGAGCTGGATTCTGATGATCTGGATGAAGAGGAGGAGTTTA
            +No name
            CCCC@CCAAACC?CCCCCC?CCAAC>CCCCC>BC@CB?CCCC=CA2@CCCCC?@<?>:?C
            @No name
            AGTCTCACACAACAGTTTGAGGAAAAAGCTGCTTCTTATGACAAACTGGAAAAAACCAAG
            +No name
            CCACCBCCCC><BBBBBBBBBBBBBBBBBBBB9BBBBBB@ABB><:B>B>>>7>DBB5>B
            @No name
            ACGTACAAATTCAGTATGTGTAAGTGACTTATGCTTCATTAAGGCAAAAGTAGATCATGC
            +No name
            CCCCCABCCCCCCC@B?B>??@@@=B@B=@@1@<@BBBB<@AB>@A>>@B>?@>>@@BBB
            @No name
            CCCTAAATGCAGCAACATCAAGCAGATATACTTCACAGATTGCTGCTGTGTATCTTTGTG
            +No name
            CCCCCCCCCCCCCDCC@CCCBCCACCCCCCCCCCBC=CCCCCCBCCCCCBCCCCCB@CD@

            Comment


            • #7
              The reads appear odd, they all have the same name. I would imaging BFAST would not like this.

              Comment


              • #8
                Do you really have "No name" as read ids for all the reads? Where is that data coming from? Try to change the reads id on a few reads and make them unique (test_#), then rerun match.
                -drd

                Comment


                • #9
                  The data was from illumina reads from the sequence center. I only know that they are 75 bp long and was sequenced using the GA system (i think). I am not sure what pipeline was used. I will give it a try (change names) but i have used this for other aligners such as bwa, bowtie etc and have not encountered any problems with that.

                  Comment


                  • #10
                    So it turns out its the header @____ made the mistake. So i got the original fastq files with unique names and the BFAST index worked. So thanks for all your help

                    Comment

                    Latest Articles

                    Collapse

                    • seqadmin
                      Strategies for Sequencing Challenging Samples
                      by seqadmin


                      Despite advancements in sequencing platforms and related sample preparation technologies, certain sample types continue to present significant challenges that can compromise sequencing results. Pedro Echave, Senior Manager of the Global Business Segment at Revvity, explained that the success of a sequencing experiment ultimately depends on the amount and integrity of the nucleic acid template (RNA or DNA) obtained from a sample. “The better the quality of the nucleic acid isolated...
                      03-22-2024, 06:39 AM
                    • seqadmin
                      Techniques and Challenges in Conservation Genomics
                      by seqadmin



                      The field of conservation genomics centers on applying genomics technologies in support of conservation efforts and the preservation of biodiversity. This article features interviews with two researchers who showcase their innovative work and highlight the current state and future of conservation genomics.

                      Avian Conservation
                      Matthew DeSaix, a recent doctoral graduate from Kristen Ruegg’s lab at The University of Colorado, shared that most of his research...
                      03-08-2024, 10:41 AM

                    ad_right_rmr

                    Collapse

                    News

                    Collapse

                    Topics Statistics Last Post
                    Started by seqadmin, Yesterday, 06:37 PM
                    0 responses
                    10 views
                    0 likes
                    Last Post seqadmin  
                    Started by seqadmin, Yesterday, 06:07 PM
                    0 responses
                    9 views
                    0 likes
                    Last Post seqadmin  
                    Started by seqadmin, 03-22-2024, 10:03 AM
                    0 responses
                    49 views
                    0 likes
                    Last Post seqadmin  
                    Started by seqadmin, 03-21-2024, 07:32 AM
                    0 responses
                    67 views
                    0 likes
                    Last Post seqadmin  
                    Working...
                    X