Unconfigured Ad

Collapse
X
 
  • Filter
  • Time
  • Show
Clear All
new posts
  • ifiddes
    Junior Member
    • Feb 2012
    • 3

    Software for identification of Trinity/Cufflinks transcripts

    Are there any good available programs/scripts for analyzing assembled transcripts?

    I imagine something like a script to blastn, blastx and tblastx each transcript and report the best hit. Something like that wouldn't be too hard to write, but I don't want to re-invent the wheel, and I also am concerned that sometimes the highest scoring hit reported by blast has a lot of gaps and is not the right result, while a lower scoring shorter hit is more likely correct, but the only way I can think of to accurately determine this is manually.

    The genome I am interested in is poorly annotated, and particularly bad in my region of interest, so just using a reference gtf with my cufflinks transcripts would not be very helpful.
  • gringer
    David Eccles (gringer)
    • May 2011
    • 845

    #2
    Your subject title suggests you're working on Trinity transcripts, so why not try the workflows suggested on the Trinity website?

    Compare the best free open source Bio-Informatics Software at SourceForge. Free, secure and fast Bio-Informatics Software downloads from the largest Open Source applications and software directory


    In particular, it sounds like you might be interestested in the Read Alignment / Abundance Estimation workflow:



    On the other hand, if you're working from cufflinks transcripts then you should probably use cufflinks for the initial analysis:

    Last edited by gringer; 02-29-2012, 12:08 AM. Reason: added cufflinks cuff link

    Comment

    • ifiddes
      Junior Member
      • Feb 2012
      • 3

      #3
      I have already done all of that, I am talking about analysis downstream of cuffdiff for cufflinks/tophat and RSEM for trinity.

      As I mentioned, my region of interest and my model organism as a whole have poor annotation and assembly, so many genes that I find to be differentially expressed are either unannotated or only annotated as hypothetical or xenoref genes. With a few hundred to a few thousand differentially expressed genes, it is not really feasible to manually examine each one. I have seen reports done by other groups where they provide an excel sheet with every Trinity transcript, FPKM, log change, and then (importantly) the blastn results for each gene, and then the blastx/tblastx for any gene that does not map or does not have annotation.

      It seems simple to write a script to do this, except for the problem I mentioned regarding gapped alignments, but rather than re-invent the wheel I was wondering if there was an available script/program.

      Comment

      • swaraj
        Member
        • Feb 2012
        • 50

        #4
        If I have understood you perfectly you have a file with sequences of transcripts and you want to annotate them. I would suggest for that blast2GO is a good platform (http://www.blast2go.com/b2ghome).

        Comment

        • ifiddes
          Junior Member
          • Feb 2012
          • 3

          #5
          This looks promising, I will try using it, thanks.

          Is there anything similar that can be installed locally and run over a command line instead of a GUI?

          Comment

          • swaraj
            Member
            • Feb 2012
            • 50

            #6
            Blast2Go has a pipeline option which you can run locally (http://www.blast2go.com/b2glaunch/resources).

            Another interesting package is SeqGene which performs many common tasks for a next generation sequencing analysis. It has a one script complete analysis pipeline which works well with a little bit of configuration.

            Comment

            Latest Articles

            Collapse

            ad_right_rmr

            Collapse

            News

            Collapse

            Topics Statistics Last Post
            Started by SEQadmin2, 06-05-2026, 10:09 AM
            0 responses
            14 views
            0 reactions
            Last Post SEQadmin2  
            Started by SEQadmin2, 06-04-2026, 08:59 AM
            0 responses
            26 views
            0 reactions
            Last Post SEQadmin2  
            Started by SEQadmin2, 06-02-2026, 12:03 PM
            0 responses
            33 views
            0 reactions
            Last Post SEQadmin2  
            Started by SEQadmin2, 06-02-2026, 11:40 AM
            0 responses
            23 views
            0 reactions
            Last Post SEQadmin2  
            Working...