Seqanswers Leaderboard Ad

Collapse

Announcement

Collapse
No announcement yet.
X
 
  • Filter
  • Time
  • Show
Clear All
new posts

  • base position

    Is there a script or tool that will identify each base position numbering in a bed file of 1200 amplicons? I am trying to identify homopolymers in an exact position not just a region.
    So, if there was a 10-base strech of sequence each nucleotide numbered and identified. A=1,A=2,A=3,A=4,A=5,A=6,T=7,C=8,G=9,C=10. Then a homopolymer of 6A could be found instead of saying there is a homopolmer of 6A in the region 1-10. Thank you.

  • #2
    It's unclear exactly what you want to do, but I imagine that could be done with GenomicRanges in R. Just load the BED file into a GRanges object, use the getSeq() function (or whatever that's called), and then apply() a function to determine if a homopolymer run exists in a given sequence (outputting whatever textual information you want).

    The unclear part is what a BED file of amplicons would even mean. BED files just give coordinates of regions in something. If the BED file described bounds for amplicons produced from some larger sequence, then what I wrote above would make sense. Otherwise, describe in more depth what you really mean.

    Comment

    Latest Articles

    Collapse

    • seqadmin
      Essential Discoveries and Tools in Epitranscriptomics
      by seqadmin




      The field of epigenetics has traditionally concentrated more on DNA and how changes like methylation and phosphorylation of histones impact gene expression and regulation. However, our increased understanding of RNA modifications and their importance in cellular processes has led to a rise in epitranscriptomics research. “Epitranscriptomics brings together the concepts of epigenetics and gene expression,” explained Adrien Leger, PhD, Principal Research Scientist...
      04-22-2024, 07:01 AM
    • seqadmin
      Current Approaches to Protein Sequencing
      by seqadmin


      Proteins are often described as the workhorses of the cell, and identifying their sequences is key to understanding their role in biological processes and disease. Currently, the most common technique used to determine protein sequences is mass spectrometry. While still a valuable tool, mass spectrometry faces several limitations and requires a highly experienced scientist familiar with the equipment to operate it. Additionally, other proteomic methods, like affinity assays, are constrained...
      04-04-2024, 04:25 PM

    ad_right_rmr

    Collapse

    News

    Collapse

    Topics Statistics Last Post
    Started by seqadmin, Yesterday, 11:49 AM
    0 responses
    15 views
    0 likes
    Last Post seqadmin  
    Started by seqadmin, 04-24-2024, 08:47 AM
    0 responses
    16 views
    0 likes
    Last Post seqadmin  
    Started by seqadmin, 04-11-2024, 12:08 PM
    0 responses
    61 views
    0 likes
    Last Post seqadmin  
    Started by seqadmin, 04-10-2024, 10:19 PM
    0 responses
    60 views
    0 likes
    Last Post seqadmin  
    Working...
    X