Seqanswers Leaderboard Ad

Collapse

Announcement

Collapse
No announcement yet.
X
 
  • Filter
  • Time
  • Show
Clear All
new posts

  • How to identify a gene targeted by external sequence into NGS data?

    Dear all
    I am working on a Trascriptome project (Illumina GAIIx, paired ends).
    I have 2 samples of Arabidopsis Thaliana. Within their sequences, an external sequence of a different genome has been inserted (transfection should be the word): the reason behind this choice is that we would able to identify the gene (prior unknown) that this sequence links, so that we can easily look at this gene expression.
    My problem is that I need to identify which gene the sequence links.
    Is there any software that achieves my goal?

    I did classical RNA-seq steps:
    - QC
    - Alignment (Tophat - Bowtie)
    - DEG (Cufflinks/diff)

    I cannot search the external sequence into the aligned samples since the external sequence does not align with the samples' reference genome.

    Thank you!!
    Best
    Andrea

  • #2
    I'm not entirely clear on what the insertion is. Is it a T-DNA from Agrobacterium? That seems the most likely candidate since you are working with Arabidopsis.

    The other problem is, do you know whether or not the insert is actually in a transcribed region?

    If the sequence of insert is known, say if it is a T-DNA, then there are methods like TAIL-PCR that can be used to identify the flanking sequences of the insert.
    Last edited by chadn737; 08-10-2011, 11:17 AM.

    Comment


    • #3
      I would probably just create a database from your possible insertion material & search it with your favorite short read search tool (e.g. BWA, Bowtie). That will identify any reads carrying the inserted DNA. If you can find a read pair (or do you not have them) with one read in the insertion and one in an Arabidopsis message, then you are home.

      There are some tools out there specifically to look for these, but I forget the names.

      Comment


      • #4
        Thank you all for your messages!

        Is it a T-DNA from Agrobacterium?
        The other problem is, do you know whether or not the insert is actually in a transcribed region?
        to chadn737: the insertion is GUS and the method is the gene trap. This time we tried without labeling a specific gene but letting GUS attach to all available genes.

        There are some tools out there specifically to look for these, but I forget the names.
        (...) just create a database from your possible insertion material & search it (...)
        to krobison: your suggestion is really nice; if you figure out those tools, let me know ;-) Do you know how to create a database of reads? How can I call BWA/Bowtie to search for this insertion? Is it a Blast?
        Last edited by aner; 08-18-2011, 11:03 PM.

        Comment


        • #5
          By database of possible insertion material I simply mean a FASTA format file with the possible insertions. The documentation for BWA, Bowtie & BLAST will instruct you how to index that file for searching (alas, each requires its own indexing).

          BTW, a paper on finding such large insertions is

          Identifying insertion mutations by whole-genome sequencing

          Comment


          • #6
            By database of possible insertion material I simply mean a FASTA format file with the possible insertions. The documentation for BWA, Bowtie & BLAST will instruct you how to index that file for searching (alas, each requires its own indexing).
            BTW, a paper on finding such large insertions is
            Identifying insertion mutations by whole-genome sequencing
            Thank you very much. I will test this method and I will let you know results.

            Comment


            • #7
              as keith suggested, its straight forward to make a bwa formatted reference sequence (for the external sequence) and see what reads map to it... I hope you did paired reads, and the pair of the mapping read maps to the arabidopsis genome, so you can ascertain where the sequence links on the genome....
              --
              bioinfosm

              Comment


              • #8
                I hope you did paired reads
                Thank you bioinfosm. Yes, I have paired reads; I will see soon if the pairs maps to Arabidopsis genome. ...I hope to be lucky...!

                Comment


                • #9
                  You might try building a Bowtie index that includes the external sequence as an additional 'chromosome' and then running TopHat Fusion.

                  Comment


                  • #10
                    You might try building a Bowtie index that includes the external sequence as an additional 'chromosome' and then running TopHat Fusion.
                    Thank you dcfargo. I read the poster and I found the relative article (so far draft). I will try also this approach!

                    Comment

                    Latest Articles

                    Collapse

                    • seqadmin
                      Best Practices for Single-Cell Sequencing Analysis
                      by seqadmin



                      While isolating and preparing single cells for sequencing was historically the bottleneck, recent technological advancements have shifted the challenge to data analysis. This highlights the rapidly evolving nature of single-cell sequencing. The inherent complexity of single-cell analysis has intensified with the surge in data volume and the incorporation of diverse and more complex datasets. This article explores the challenges in analysis, examines common pitfalls, offers...
                      06-06-2024, 07:15 AM
                    • seqadmin
                      Latest Developments in Precision Medicine
                      by seqadmin



                      Technological advances have led to drastic improvements in the field of precision medicine, enabling more personalized approaches to treatment. This article explores four leading groups that are overcoming many of the challenges of genomic profiling and precision medicine through their innovative platforms and technologies.

                      Somatic Genomics
                      “We have such a tremendous amount of genetic diversity that exists within each of us, and not just between us as individuals,”...
                      05-24-2024, 01:16 PM

                    ad_right_rmr

                    Collapse

                    News

                    Collapse

                    Topics Statistics Last Post
                    Started by seqadmin, 06-07-2024, 06:58 AM
                    0 responses
                    13 views
                    0 likes
                    Last Post seqadmin  
                    Started by seqadmin, 06-06-2024, 08:18 AM
                    0 responses
                    20 views
                    0 likes
                    Last Post seqadmin  
                    Started by seqadmin, 06-06-2024, 08:04 AM
                    0 responses
                    18 views
                    0 likes
                    Last Post seqadmin  
                    Started by seqadmin, 06-03-2024, 06:55 AM
                    0 responses
                    13 views
                    0 likes
                    Last Post seqadmin  
                    Working...
                    X