  • Using 50bp short-reads to find a translocation

    I know that a translocation occurred in my sequenced DNA, and I know the sequence that was translocated, but I'm not sure where it moved to. The reads from the translocated sequence still line up against the reference at the original location. Is there any way to find a translocation using short reads?

  • #2
    Hi Agc,
    It would help people if you supplied more information:

    1) Is it genome re-sequencing or transcriptome data?
    2) Is it paired-end or single reads?
    3) What is the coverage/number of reads generated?
    4) What species is it?

    I don't know if there is a ready-made solution for you, but my bodge-it-and-leg-it approach would be:

    1) Reads that span the translocation breakpoint won't map to your reference sequence. For example, using human as an illustration, the first 25bp of a read might map to chromosome 1 and the last 25bp to chromosome Y.
    2) Assuming your situation is similar to my example, I would take all the reads that did not map. For each of these reads, take the first 21bp and the last 21bp and map them to the genome separately (maybe with BLAT or BLAST, altering the parameters). See which reads have their first 21bp and last 21bp mapping to different chromosomes. These are your candidate translocation regions, and hopefully one stands out as being real (lots of reads mapping to it).
    3) For each putative translocation, take 49bp from each chromosome, join them into a pseudo translocation sequence, and build a BLAST database from these sequences.
    4) BLAST all reads against the translocation sequences and record those reads that align over the full length of the read. The most likely real translocation will have the most reads mapping that overlap the translocation point by at least 1 base.
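
    The read-splitting and junction-building parts of steps 2 to 4 can be sketched in a few lines of Python. This is only a rough illustration under stated assumptions: the function names (`split_read_ends`, `build_junction`, `spans_junction`) are made up for this example, coordinates are assumed to be 0-based and end-exclusive, and the actual mapping with BLAT or BLAST is not shown.

```python
def split_read_ends(read_id, seq, k=21):
    """Return (name, sequence) pairs for the first and last k bases of a read."""
    if len(seq) < 2 * k:
        return []  # too short to give two non-overlapping ends
    return [(f"{read_id}/head", seq[:k]), (f"{read_id}/tail", seq[-k:])]


def build_junction(left_seq, right_seq, flank=49):
    """Join the last `flank` bases left of a breakpoint to the first
    `flank` bases right of it (the pseudo translocation sequence)."""
    return left_seq[-flank:] + right_seq[:flank]


def spans_junction(aln_start, aln_end, flank=49, min_overlap=1):
    """True if an alignment [aln_start, aln_end) on the junction sequence
    covers at least `min_overlap` bases on each side of the breakpoint.
    Assumes 0-based, end-exclusive coordinates (BLAST reports 1-based)."""
    return aln_start <= flank - min_overlap and aln_end >= flank + min_overlap


if __name__ == "__main__":
    read = "A" * 25 + "C" * 25          # toy 50bp read
    for name, end_seq in split_read_ends("read1", read):
        print(f">{name}\n{end_seq}")    # FASTA records to feed to BLAT/BLAST
```

    With 50bp reads, a 49bp flank on each side means any read that crosses the breakpoint by at least one base still fits entirely inside the 98bp junction sequence.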

    This is my rough approach, which will get you the right answer but involves a bit of BLAST, BLAT, Perl/Python magic and some result filtering.

    I expect more experienced NGS people know of better/easier solutions, probably with some already-developed software. So give it a day or two before embarking on my solution.

    Good luck.

    :-)

    P.S. If it is paired-end, this will help a lot, as one mate of a pair will map to one chromosome and the other mate to another chromosome. There should definitely be software to help with that, or you can just parse the SAM output from TopHat or Bowtie.
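
    For the paired-end case, parsing the SAM output could look roughly like the sketch below. It relies only on the standard SAM columns (FLAG in column 2, RNAME in column 3, RNEXT in column 7) and the standard flag bits 0x1 (paired), 0x4 (read unmapped) and 0x8 (mate unmapped). The function name `discordant_pairs` is hypothetical, and no filtering on mapping quality is done.

```python
def discordant_pairs(sam_lines):
    """Yield (read name, chrom, pos, mate chrom, mate pos) for paired reads
    whose two mates are both mapped but to different reference sequences."""
    for line in sam_lines:
        if line.startswith("@"):
            continue  # header line
        fields = line.rstrip("\n").split("\t")
        if len(fields) < 11:
            continue  # not a complete alignment record
        qname, flag, rname, pos = fields[0], int(fields[1]), fields[2], int(fields[3])
        rnext, pnext = fields[6], int(fields[7])
        if not flag & 0x1:
            continue  # not a paired read
        if flag & 0x4 or flag & 0x8:
            continue  # this read or its mate is unmapped
        if rnext in ("=", "*") or rnext == rname:
            continue  # mate is on the same reference, or unknown
        yield qname, rname, pos, rnext, pnext


if __name__ == "__main__":
    import sys
    # Usage (hypothetical file name): python discordant.py < alignments.sam
    for record in discordant_pairs(sys.stdin):
        print(*record, sep="\t")
```

    Each discordant pair is reported twice, once per mate, because both mates have their own SAM record.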
    Last edited by poisson200; 07-22-2010, 04:08 AM.


    • #3
      Thanks for the quick reply!

      1) Genome re-sequencing
      2) Single ~50bp reads
      3) Not sure where I can obtain that information.
      4) S. cerevisiae
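
      On point 3, coverage can be estimated from the read file itself as (number of reads × read length) / genome size. A minimal sketch, assuming a plain 4-line-per-record FASTQ file and a genome size of roughly 12.1 Mb for S. cerevisiae (an approximate figure; substitute the length of the reference actually used). The function names are made up for this example.

```python
def count_fastq_reads(lines):
    """Count reads in a FASTQ stream, assuming exactly 4 lines per record."""
    n_lines = sum(1 for _ in lines)
    return n_lines // 4


def estimate_coverage(n_reads, read_len=50, genome_size=12_100_000):
    """Average depth = total sequenced bases / genome size.
    genome_size defaults to an approximate S. cerevisiae value (assumption)."""
    return n_reads * read_len / genome_size


if __name__ == "__main__":
    import sys
    # Usage (hypothetical file name): python coverage.py < reads.fastq
    n = count_fastq_reads(sys.stdin)
    print(f"{n} reads, ~{estimate_coverage(n):.1f}x coverage")
```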

      The translocation occurred within the same chromosome, but I'll try to develop the idea of using the unmapped reads. That said, I'd find some sort of ready-made solution, or any other suggestions, very helpful.


      • #4
        It will still work: map the 21bp read ends and find the reads where both ends map to the chromosome in question. Then look at the distance between the two ends. Most reads whose ends map on either side of the translocation site should have a similar distance between their ends.
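
        A rough sketch of that distance check, assuming the end-mapping results have already been collected into a dictionary of read ID to ((head chromosome, head position), (tail chromosome, tail position)). That layout, the function name `end_distances`, and the `tolerance` parameter are all inventions for this example, and strand is ignored. For an unbroken 50bp read split into 21bp ends, the two start positions are 50 - 21 = 29bp apart, so anything close to that is skipped.

```python
from collections import Counter


def end_distances(hits, chrom, read_len=50, k=21, tolerance=2):
    """Count head-to-tail distances for reads with both ends on `chrom`.

    hits: {read_id: ((head_chrom, head_pos), (tail_chrom, tail_pos))}
    Reads whose distance is within `tolerance` of the expected spacing
    (read_len - k) look contiguous and are ignored.
    Returns [(distance, read_count), ...], most common first.
    """
    expected = read_len - k
    counts = Counter()
    for (head_chrom, head_pos), (tail_chrom, tail_pos) in hits.values():
        if head_chrom != chrom or tail_chrom != chrom:
            continue  # keep only reads with both ends on the chromosome of interest
        distance = abs(tail_pos - head_pos)
        if abs(distance - expected) <= tolerance:
            continue  # ends sit where an unbroken read would put them
        counts[distance] += 1
    return counts.most_common()


if __name__ == "__main__":
    toy_hits = {
        "r1": (("chrIV", 1000), ("chrIV", 50029)),
        "r2": (("chrIV", 1005), ("chrIV", 50034)),
        "r3": (("chrIV", 200), ("chrIV", 229)),
    }
    print(end_distances(toy_hits, "chrIV"))
```

        A distance shared by many reads points at one junction, since every read that crosses the same breakpoint with both 21bp ends intact gives the same head-to-tail spacing.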
