Unconfigured Ad

Collapse
X
 
  • Filter
  • Time
  • Show
Clear All
new posts
  • CHObot
    Member
    • May 2013
    • 11

    Finding a read's mate

    I have a BWA alignment of reads to a small (50 kb) reference sequence. It is a transgenic sequence inserted into a host cell genome. I want to be able to locate the insert's position in the host cell genome. There are reads at the ends (pointing outwards) which have their pairs unmapped. These mates would presumably be in the flanking genomic sequence that I want to identify. Is there an easy way to get the unmapped mates? I suppose I could make a list of the reads and write a script to parse the original fastQ files, but I am hoping there is a tool already available for this (seemingly common) purpose. Any help would be greatly appreciated.
  • GenoMax
    Senior Member
    • Feb 2008
    • 7142

    #2
    See this thread for the info you need: http://seqanswers.com/forums/showthread.php?t=12283

    Comment

    • swbarnes2
      Senior Member
      • May 2008
      • 910

      #3
      You could parse the .bam for unmapped reads whose mates mapped close to your boundaries in the correct orientation.

      You could also align the fastqs to the 50kb genome and the host genome, then filter for reads that aligned to the host whose mates aligned to the insert. That's probably the best solution. You'd want the mapping position of the reads that aligned to host anyway, so this way you'd have them.

      Comment

      • CHObot
        Member
        • May 2013
        • 11

        #4
        Thanks for the suggestions. I'll look at the other thread and try your suggestions.

        Comment

        • kashi
          Junior Member
          • Aug 2021
          • 1

          #5
          Covid-19

          I am confused on the issue:

          the service provider company provide AmpliSeq for Illumina On-Demand, Custom, and Community Panels. for COVID diagnostic, the library was prepared, the issue started @ sample sheet, manifest file - covid- successfully added, genome (we try our level best to integrate the genome file but no use, after creating multipath the genome was integrated in sample sheet and run started, output was 93.8=Q score, Cluster passing 96.7%, Cluster density 774K) but analysis failed (Sunday) till now we try all possible methods with illumina support but no use. initially RNA amplicon was downloaded and added in sample sheet, the sample sheet was headed by DNA amplicon (no use) & now PCR amplicon was added in sample sheet but same error. plz guide.

          Comment

          Latest Articles

          Collapse

          • SEQadmin2
            From Collection to Sequencing: Why Sample Preparation and Preservation Define Sequencing Data
            by SEQadmin2


            Data variability is still an issue in sequencing technologies despite the advances in reproducibility and accuracy of these platforms. But the problem does not originate in the sequencing itself, but in the previous steps, before the sample reaches the sequencer.


            The first step is collection, followed by preservation and sample preparation for analysis. Most scientists overlook those steps, but not being careful might just be skewing the experiment’s results.
            ...
            06-02-2026, 10:05 AM
          • SEQadmin2
            Single-Cell Sequencing at an Inflection Point: Early Impacts of New Platforms and Emerging Trends
            by SEQadmin2


            With the launch of new single-cell sequencing platforms in 2026, the field stands at an exciting inflection point. This article surveys the most impactful advances in the field and discusses how they’re reshaping research in cancer, immunology, and beyond.


            Introduction

            Single-cell sequencing technologies have undergone remarkable advances over the past decade, transitioning from low-throughput experimental approaches to highly scalable platforms capable of...
            05-22-2026, 06:42 AM
          • SEQadmin2
            Environmental Genomics in the Age of NGS: From Microbes to Conservation Strategies
            by SEQadmin2

            Studying ecosystems means dealing with complex, multi-species communities that are hard to observe at scale. This complexity, however, hides many important questions to be answered, from how biogeochemical cycles work and how climate change can affect species distribution to how conservation strategies can work best.


            Genomics, particularly since the expansion of NGS, has transformed ecosystem ecology. By sequencing environmental DNA, we can now assess biodiversity without direct...
            05-06-2026, 09:04 AM

          ad_right_rmr

          Collapse

          News

          Collapse

          Topics Statistics Last Post
          Started by SEQadmin2, Today, 08:59 AM
          0 responses
          10 views
          0 reactions
          Last Post SEQadmin2  
          Started by SEQadmin2, 06-02-2026, 12:03 PM
          0 responses
          21 views
          0 reactions
          Last Post SEQadmin2  
          Started by SEQadmin2, 06-02-2026, 11:40 AM
          0 responses
          17 views
          0 reactions
          Last Post SEQadmin2  
          Started by SEQadmin2, 05-28-2026, 11:40 AM
          0 responses
          31 views
          0 reactions
          Last Post SEQadmin2  
          Working...