Seqanswers Leaderboard Ad

Collapse

Announcement

Collapse
No announcement yet.
X
 
  • Filter
  • Time
  • Show
Clear All
new posts

  • Tophat --library-type option

    Dear all,

    I have a quick question about the "--library-type" option in the latest Tophat version 1.2.0:

    If the reads are from single end, strand non specific library preparation, then one should use "--library-type fr-unstranded", but the description about this standard Illumina type sounds like only for paired-end reads.

    So Tophat will treat the reads as strand specific by default if one does not specify *--library-type* option. Does anyone have experience about this?

    Many thanks!

  • #2
    I have the same question........

    Comment


    • #3
      http://seqanswers.com/forums/showpos...95&postcount=9

      Originally posted by jingerlu View Post
      I have the same question........

      Comment


      • #4
        Thanks a lot for the useful info.

        Comment


        • #5
          Tophat --library-type option

          I have Ion Proton single end/strandness RNA-Seq data and have been trying to figure out what options to give to --library-type option. I used the same dataset (~413982 reads) with no --library-type specified or with --library-type specified as either fr-firststrand or fr-secondstrand using the hints from Tophat FAQ page, but the counts from junctions.bed are quite comparable. I wonder whether this is because the strand info is only applicable to paired-end data (as some of the posts suggested)? For single end data, could tophat leverage strand info?

          #I am not sure which library type to use (fr-firststrand or fr-secondstrand), what should I do?
          #
          #One possible way to figure out the correct library-type is to run TopHat with a small subset of the reads (e.g., 1M) as follows.
          #run TopHat with fr-firststrand and count the number of junctions in junctions.bed (one of the output files from TopHat)
          #run TopHat with fr-secondstrand and count the number of junctions in junctions.bed
          #Since the splice junction finding algorithm of TopHat makes use of library-type information (if provided), one of the two TopHat runs would result in many m
          ore splice junctions than the other one. You can then use the library type that gives more junctions. If this is not the case TopHat might not work well with
          your sequencing protocol. Please let us know more details about your protocol so we can add support for new library types.

          > more tophat_coverage-search_no_library-type_option/junctions.bed|wc -l
          188
          > more tophat_no-coverage-search_fr-secondstrand/junctions.bed|wc -l
          181
          > more tophat_no-coverage-search_fr-firststrand/junctions.bed|wc -l
          175

          I would also like to confirm if tophat assumes fr-unstranded when no --library-type option is provided? While both tophat manual and tophat -h mention three supported library-type options, cufflinks manual mentioned the same library-type options and cufflinks -h on the other hand mentioned 7 supported options below. I wonder whether exactly how many options are there and whether the library-type only applies to paired-end data?
          Supported library types:
          ff-firststrand
          ff-secondstrand
          ff-unstranded
          fr-firststrand
          fr-secondstrand
          fr-unstranded (default)
          transfrags

          I am using TopHat v2.0.8 and cufflinks v2.1.1.

          Comment


          • #6
            Tophat and Proton

            Did anyone figure out the type of library to use with Ion Torrrent Proton reads when using tophat? --library-type specified as either fr-firststrand or fr-secondstrand?
            Also has anyone used tophat-fusion-post with Proton reads? I always get 0 fusions in the post directory!

            Comment

            Latest Articles

            Collapse

            • seqadmin
              Current Approaches to Protein Sequencing
              by seqadmin


              Proteins are often described as the workhorses of the cell, and identifying their sequences is key to understanding their role in biological processes and disease. Currently, the most common technique used to determine protein sequences is mass spectrometry. While still a valuable tool, mass spectrometry faces several limitations and requires a highly experienced scientist familiar with the equipment to operate it. Additionally, other proteomic methods, like affinity assays, are constrained...
              04-04-2024, 04:25 PM
            • seqadmin
              Strategies for Sequencing Challenging Samples
              by seqadmin


              Despite advancements in sequencing platforms and related sample preparation technologies, certain sample types continue to present significant challenges that can compromise sequencing results. Pedro Echave, Senior Manager of the Global Business Segment at Revvity, explained that the success of a sequencing experiment ultimately depends on the amount and integrity of the nucleic acid template (RNA or DNA) obtained from a sample. “The better the quality of the nucleic acid isolated...
              03-22-2024, 06:39 AM

            ad_right_rmr

            Collapse

            News

            Collapse

            Topics Statistics Last Post
            Started by seqadmin, 04-11-2024, 12:08 PM
            0 responses
            27 views
            0 likes
            Last Post seqadmin  
            Started by seqadmin, 04-10-2024, 10:19 PM
            0 responses
            31 views
            0 likes
            Last Post seqadmin  
            Started by seqadmin, 04-10-2024, 09:21 AM
            0 responses
            27 views
            0 likes
            Last Post seqadmin  
            Started by seqadmin, 04-04-2024, 09:00 AM
            0 responses
            52 views
            0 likes
            Last Post seqadmin  
            Working...
            X