Announcement

Collapse
No announcement yet.
X
 
  • Filter
  • Time
  • Show
Clear All
new posts

  • Masking A/C/G/T with N?

    I'm working with a tetraploid soybean species, and I want to determine which DNA-seq reads exist on only one of the two diploid progenitor genomes. I have two consensus fasta files (one for each diploid genome) and want to map reads to them to determine which reads map to only one genome based on mapping scores. I already have a pipeline for accomplishing this, but I'm running into a problem. One consensus file has 30 million more Ns than the other. Reads that should map equally to both species favor one because it has fewer Ns, and therefore a higher mapping score. Is there a way to replace A/C/G/T's in one consensus file with an N where the other consensus file has an N? The consensus sequences were made by mapping RNA-seq and DNA-seq reads to the Glycine max genome, so they should be comparable.

  • #2
    I'm not aware of a ready-made tool to do this job but it sounds like something you could do with Perl. I'm sure there are online tutorials you could use otherwise I found this book useful when first starting (http://shop.oreilly.com/product/9780596000806.do)

    Are both consensus sequences the same length for each chromosome/scaffold? Doesn't really matter if they aren't, would just make the perl script to do this job easier.

    Comment


    • #3
      Ah, I guess I'll just have to build up my Perl skills. Probably a good idea, anyway. And yes, the consensus sequences are the same size, so it shouldn't be too difficult.

      Thanks for your help!

      Comment

      Latest Articles

      Collapse

      • seqadmin
        Advanced Methods for the Detection of Infectious Disease
        by seqadmin




        The recent pandemic caused worldwide health, economic, and social disruptions with its reverberations still felt today. A key takeaway from this event is the need for accurate and accessible tools for detecting and tracking infectious diseases. Timely identification is essential for early intervention, managing outbreaks, and preventing their spread. This article reviews several valuable tools employed in the detection and surveillance of infectious diseases.
        ...
        11-27-2023, 01:15 PM
      • seqadmin
        Strategies for Investigating the Microbiome
        by seqadmin




        Microbiome research has led to the discovery of important connections to human and environmental health. Sequencing has become a core investigational tool in microbiome research, a subject that we covered during a recent webinar. Our expert speakers shared a number of advancements including improved experimental workflows, research involving transmission dynamics, and invaluable analysis resources. This article recaps their informative presentations, offering insights...
        11-09-2023, 07:02 AM

      ad_right_rmr

      Collapse

      News

      Collapse

      Topics Statistics Last Post
      Started by seqadmin, Today, 02:24 PM
      0 responses
      5 views
      0 likes
      Last Post seqadmin  
      Started by seqadmin, Today, 07:37 AM
      0 responses
      15 views
      0 likes
      Last Post seqadmin  
      Started by seqadmin, Yesterday, 08:23 AM
      0 responses
      8 views
      0 likes
      Last Post seqadmin  
      Started by seqadmin, 12-01-2023, 09:55 AM
      0 responses
      23 views
      0 likes
      Last Post seqadmin  
      Working...
      X