Seqanswers Leaderboard Ad

Collapse

Announcement

Collapse
No announcement yet.
X
 
  • Filter
  • Time
  • Show
Clear All
new posts

  • Opgen MapSolver output to fasta scaffold?

    Greetings,

    I'm trying to close a genome using optimal maps and sequence data. I have mapped my contigs to the Whole Genome Map and was hoping to try and close gaps in the scaffold. I know that MapSolver currently doesn't have the functionality to export a scaffold based on the map, but was wondering if anyone has figured out a way to create a fasta scaffold based on information in the Placement Report.

    thank you,

    Jason

  • #2
    It is pretty remarkable that OpGen doesn't provide this, especially when a workable one can be written in a page of Perl or Python. I'm a bit too tired to do so tonight, but here is an outline (maybe next week I'll actually throw this together & post it)

    1) Read the MapSolver placement report file -- there is a one line header and then tab-delimited lines with the information of interest; stop scanning when you hit a blank line (there are later pieces with other information).
    2) Parse the lines just read. Optical map id is column 0, start and end on map are columns 1 & 2, contig id in column 3 (plus mapping method; need to strip this out by removing 1st space and everything afterwards), start & end positions of contig are columns 4 & 5 and the orientation in column 6. If placement did not include entire contig, need to decide whether to truncate contig to what MapSolver liked OR cram entire piece in
    4) Generate list of contigs plus the intervening gaps. If MapSolver thinks they overlapped, need to decide whether to still pad with a gap or not
    5) Read in FASTA file with contigs, saving those that are needed; trim & reverse complement as needed
    6) Build scaffold(s)
    7) Write scaffold(s)

    SIMPLE! :-)

    Bio::SeqIO & Bio::Seq will be essential for doing this in Perl

    Comment


    • #3
      Thanks for your reply. I figured that there was a solution, but I didn't want to re-invent the wheel if someone had already written something that works. I will start writing a script following your work flow. Thank you.

      Jason

      Comment


      • #4
        Originally posted by themerlin View Post
        Thanks for your reply. I figured that there was a solution, but I didn't want to re-invent the wheel if someone had already written something that works. I will start writing a script following your work flow. Thank you.

        Jason
        Hi Jason, have you found the solution to use mapsolver map to produce the scaffold? Would you mind sharing it with me? Thanks

        Comment


        • #5
          Originally posted by krobison View Post
          It is pretty remarkable that OpGen doesn't provide this, especially when a workable one can be written in a page of Perl or Python. I'm a bit too tired to do so tonight, but here is an outline (maybe next week I'll actually throw this together & post it)

          1) Read the MapSolver placement report file -- there is a one line header and then tab-delimited lines with the information of interest; stop scanning when you hit a blank line (there are later pieces with other information).
          2) Parse the lines just read. Optical map id is column 0, start and end on map are columns 1 & 2, contig id in column 3 (plus mapping method; need to strip this out by removing 1st space and everything afterwards), start & end positions of contig are columns 4 & 5 and the orientation in column 6. If placement did not include entire contig, need to decide whether to truncate contig to what MapSolver liked OR cram entire piece in
          4) Generate list of contigs plus the intervening gaps. If MapSolver thinks they overlapped, need to decide whether to still pad with a gap or not
          5) Read in FASTA file with contigs, saving those that are needed; trim & reverse complement as needed
          6) Build scaffold(s)
          7) Write scaffold(s)

          SIMPLE! :-)

          Bio::SeqIO & Bio::Seq will be essential for doing this in Perl
          Hi Krobison, I'm not good at programming. Do you have any script to produce scaffold fasta file using mapsolver's output? Thanks a lot.

          Comment

          Latest Articles

          Collapse

          • seqadmin
            Recent Advances in Sequencing Technologies
            by seqadmin







            Innovations in next-generation sequencing technologies and techniques are driving more precise and comprehensive exploration of complex biological systems. Current advancements include improved accessibility for long-read sequencing and significant progress in single-cell and 3D genomics. This article explores some of the most impactful developments in the field over the past year.

            Long-Read Sequencing
            Long-read sequencing has...
            12-02-2024, 01:49 PM
          • seqadmin
            Genetic Variation in Immunogenetics and Antibody Diversity
            by seqadmin



            The field of immunogenetics explores how genetic variations influence immune responses and susceptibility to disease. In a recent SEQanswers webinar, Oscar Rodriguez, Ph.D., Postdoctoral Researcher at the University of Louisville, and Ruben Martínez Barricarte, Ph.D., Assistant Professor of Medicine at Vanderbilt University, shared recent advancements in immunogenetics. This article discusses their research on genetic variation in antibody loci, antibody production processes,...
            11-06-2024, 07:24 PM

          ad_right_rmr

          Collapse

          News

          Collapse

          Topics Statistics Last Post
          Started by seqadmin, 12-02-2024, 09:29 AM
          0 responses
          139 views
          0 likes
          Last Post seqadmin  
          Started by seqadmin, 12-02-2024, 09:06 AM
          0 responses
          49 views
          0 likes
          Last Post seqadmin  
          Started by seqadmin, 12-02-2024, 08:03 AM
          0 responses
          38 views
          0 likes
          Last Post seqadmin  
          Started by seqadmin, 11-22-2024, 07:36 AM
          0 responses
          69 views
          0 likes
          Last Post seqadmin  
          Working...
          X