Unconfigured Ad

Collapse
X
 
  • Filter
  • Time
  • Show
Clear All
new posts
  • kalinka23
    Junior Member
    • Aug 2011
    • 7

    bfast index

    Hi everybody,

    i am new here at this forum and hereby i would like to greet everybody.

    I would like to try the bfast algorithm to map Solid reads (colourspace) to human genome. Now i try to index the genome (hg19) but i am really confused how to do that.

    Did i understand it right that, if i want to use the recommended masks (given in the bfast-book, section 7.1.2 for ABI Solid reads with at least 50bp in length) that i have to index the genome 10 times using the following commands:

    bfast index -f hg19.fa -m 1111111111111111111111 -w 14 -i 1
    bfast index -f hg19.fa -m 111110100111110011111111111 -w 14 -i 2
    bfast index -f hg19.fa -m 10111111011001100011111000111111 -w 14 -i 3
    .....

    bfast index -f hg19.fa -m 1110110001011010011100101111101111 -w 14 -i 9
    bfast index -f hg19.fa -m 111111001000110001011100110001100011111 -w 14 -i 10

    ???

    And another question is:
    for the mapping the reads in colorspace are converted to fastq files and than used for the mapping (using the command: solid2fastq -n ... -o reads *.csfasta *.qual). Doesn't have the conversion from colorspace reads to fastq reads an adverse affect (in terms of e.g. loss of information ...)?

    Thank you very much!

    Best regards,
    Karin
  • nilshomer
    Nils Homer
    • Nov 2008
    • 1283

    #2
    1. Yes, you must create ten indices.

    2. No loss of information. The original data is present in the mapped SAM file in the CS/CZ tags.

    Comment

    Latest Articles

    Collapse

    • GATTACAT
      Reply to Nine Things a Sample Prep Scientist Thinks About Before Sequencing
      by GATTACAT
      Love this - good data definitely starts from good input, and poor input can only give relatively poor data. I particularly like the mention of Nanodrop/absorbance based methods for quantification. It's such a toss up if you'll get an accurate reading or what amounts to a randomly generated number, and a lot of library/sequencing related issues can be traced back to poor quant.
      07-01-2026, 11:43 AM
    • SEQadmin2
      Nine Things a Sample Prep Scientist Thinks About Before Sequencing
      by SEQadmin2


      I’m not a sequencing expert. I’m a purification scientist who uses NGS to evaluate workflows my group develops. With this perspective, we think about the sample first and the NGS workflow second. The sequencer is an exceptionally honest reporter, but it can only report on what you give it, so whether you get clean, interpretable data from an NGS workflow is largely determined before you begin.

      Here are nine questions we think about, in roughly the order they matter, before...
      06-18-2026, 07:11 AM

    ad_right_rmr

    Collapse

    News

    Collapse

    Topics Statistics Last Post
    Started by SEQadmin2, Yesterday, 11:08 AM
    0 responses
    6 views
    0 reactions
    Last Post SEQadmin2  
    Started by SEQadmin2, 06-30-2026, 05:37 AM
    0 responses
    11 views
    0 reactions
    Last Post SEQadmin2  
    Started by SEQadmin2, 06-26-2026, 11:10 AM
    0 responses
    19 views
    0 reactions
    Last Post SEQadmin2  
    Started by SEQadmin2, 06-17-2026, 06:09 AM
    0 responses
    53 views
    0 reactions
    Last Post SEQadmin2  
    Working...