Unconfigured Ad

Collapse
X
 
  • Filter
  • Time
  • Show
Clear All
new posts
  • jflores
    Junior Member
    • Mar 2011
    • 7

    Conversion from Complete Genomics Data to SAM/BAM

    Hi There !

    This thread is intended to questions related to conversion from Complete Genomics Data to SAM/BAM formats.

    So here's mine:

    I'm trying to get BAM files for some of the individuals sequenced by CG. I want all reads mapped to the reference and not just alignments around variants (evidenceDnbs files) so I downloaded all MAP data associated with the individuals of my interest and I got an structure data like this for each of the individuals:

    i.e
    NA06994
    GS06994-1100-36-MAP
    GS19360-FS3-L01
    mapping_GS19360-FS3-L01_001.tsv.bz2
    reads_GS19360-FS3-L01_001.tsv.bz2
    ..
    reads_GS19360-FS3-L01_011.tsv.bz2
    mapping_GS19360-FS3-L01_011.tsv.bz2
    GS19360-FS3-L02
    GS19360-FS3-L03
    GS19360-FS3-L04
    GS19360-FS3-L05
    ..
    GS20181-FS3-L08

    There’s no other way to get a BAM file for a particular location or chromosome (my interest) than going through the entire set of files. I’m currently converting each of the reads and mapping files to BAM files (It’s taking quite a long) using the map2sam command provided by CG here:



    My question is: In order to get a full genome BAM file do I need to make first BAM files for all those reads/mapping pairs and them merge them all using samtools? I think the answer is ‘yes you do’ but I’m still a bit confused with this data structure and why there are so many folders and files for every single individual.

    Many thanks in advance,
    J. Rodrigo Flores
    [email protected]
    --
    J. Rodrigo Flores
    [email protected]
    Center for Genomic Sciences, UNAM, Mexico
  • jason.laramie
    Junior Member
    • Feb 2011
    • 3

    #2
    Hi,
    Yes that is correct. If you are interested in a particular region we have a program to do just that which is not part of cgatools. Please email me and I can send it to you.

    [email protected]
    Jason Laramie, PhD
    Principal Application Scientist
    Complete Genomics, Inc.

    Comment

    • jflores
      Junior Member
      • Mar 2011
      • 7

      #3
      Hi Jason,

      Cheers for the response ! I'm interested just in one chromosome so I'm e-mailing you right now,
      --
      J. Rodrigo Flores
      [email protected]
      Center for Genomic Sciences, UNAM, Mexico

      Comment

      • ECO
        --Site Admin--
        • Oct 2007
        • 1360

        #4
        Jflores,

        Please don't repost all over the site again. Thanks!

        Eric

        Comment

        Latest Articles

        Collapse

        ad_right_rmr

        Collapse

        News

        Collapse

        Topics Statistics Last Post
        Started by SEQadmin2, 06-09-2026, 11:58 AM
        0 responses
        16 views
        0 reactions
        Last Post SEQadmin2  
        Started by SEQadmin2, 06-05-2026, 10:09 AM
        0 responses
        26 views
        0 reactions
        Last Post SEQadmin2  
        Started by SEQadmin2, 06-04-2026, 08:59 AM
        0 responses
        37 views
        0 reactions
        Last Post SEQadmin2  
        Started by SEQadmin2, 06-02-2026, 12:03 PM
        0 responses
        61 views
        0 reactions
        Last Post SEQadmin2  
        Working...