Unconfigured Ad

Collapse
X
 
  • Filter
  • Time
  • Show
Clear All
new posts
  • stephenhart
    Member
    • Sep 2011
    • 16

    gene tracking with cummerBund - persistent problem

    Hello. I'm having some difficulty with the makeGeneRegionTrack feature of cummerBund. If I rebuild my cuffData.db with 'mm10_ensGene.gtf', then the following happens:

    Code:
    > cuff <- readCufflinks(genome='mm10',gtfFile='mm10_ensGene.gtf',rebuild=TRUE)
    
    > 	cuff
    CuffSet instance with:
    	 4 samples
    	 23285 genes
    	 30073 isoforms
    	 25872 TSS
    	 24748 CDS
    	 139440 promoters
    	 155232 splicing
    	 122340 relCDS
    
    > 	myGene
    CuffGene instance for gene Insm2 
    Short name:	 Insm2 
    Slots:
    	 annotation
    	 features
    	 fpkm
    	 repFpkm
    	 diff
    	 count
    	 isoforms	 CuffFeature instance of size 1 
    	 TSS		 CuffFeature instance of size 1 
    	 CDS		 CuffFeature instance of size 1 
    
    > 	genetrack <- makeGeneRegionTrack(myGene)
    Error in .Call2("Rle_constructor", values, lengths, check, 0L, PACKAGE = "IRanges") : 
      Rle of type 'list' is not supported
    On the other hand, if I rebuild my cuffData.db file with 'genes.gtf' from the Tuxedo package, then the following happens:



    Code:
    > cuff <- readCufflinks(genome='mm10',gtfFile='genes.gtf',rebuild=TRUE)
    
    > 	cuff
    CuffSet instance with:
    	 4 samples
    	 23285 genes
    	 30073 isoforms
    	 25872 TSS
    	 24748 CDS
    	 139440 promoters
    	 155232 splicing
    	 122340 relCDS
    
    > 	myGene
    CuffGene instance for gene Insm2 
    Short name:	 Insm2 
    Slots:
    	 annotation
    	 features
    	 fpkm
    	 repFpkm
    	 diff
    	 count
    	 isoforms	 CuffFeature instance of size 1 
    	 TSS		 CuffFeature instance of size 1 
    	 CDS		 CuffFeature instance of size 1 
    
    > 	genetrack <- makeGeneRegionTrack(myGene)
    Error in `[.data.frame`(features(object), , featCols) : 
      undefined columns selected

    I've noticed similar problems in other posts, but with no solutions. Any suggestions would be appreciated.

    Thanks.
  • axa9070
    Member
    • Oct 2014
    • 11

    #2
    stephenhart: I have the first issue but not that second one.

    Either way, does anyone have any ideas yet?


    Using EMBL annotation:
    Code:
    > cuff <- readCufflinks(genome="vinifera.fa", gtfFile="vinifera.gtf", rebuild=T)
    
    >       cuff
    CuffSet instance with:
             6 samples
             30446 genes
             32764 isoforms
             31480 TSS
             29927 CDS
             456690 promoters
             472200 splicing
             437595 relCDS
    
    >       myGene
    CuffGene instance for gene XLOC_001303
    Short name:      SUT2
    Slots:
             annotation
             features
             fpkm
             repFpkm
             diff
             count
             isoforms        CuffFeature instance of size 1
             TSS             CuffFeature instance of size 1
             CDS             CuffFeature instance of size 1
    > genetrack <- makeGeneRegionTrack(myGene)
    Error in .Call2("Rle_constructor", values, lengths, check, 0L, PACKAGE = "S4Vectors") :
      Rle of type 'list' is not supported

    Using my merged annotation:
    Code:
    > cuff <- readCufflinks(genome="vinifera.fa", gtfFile="merged.gtf", rebuild=T)
    
    >       cuff
    CuffSet instance with:
             6 samples
             30446 genes
             32764 isoforms
             31480 TSS
             29927 CDS
             456690 promoters
             472200 splicing
             437595 relCDS
    
    >       myGene
    CuffGene instance for gene XLOC_001303
    Short name:      SUT2
    Slots:
             annotation
             features
             fpkm
             repFpkm
             diff
             count
             isoforms        CuffFeature instance of size 1
             TSS             CuffFeature instance of size 1
             CDS             CuffFeature instance of size 1
    > genetrack <- makeGeneRegionTrack(myGene)
    Error in .Call2("Rle_constructor", values, lengths, check, 0L, PACKAGE = "S4Vectors") :
      Rle of type 'list' is not supported
    Here's my sessionInfo:
    Code:
    > sessionInfo()
    R version 3.1.1 (2014-07-10)
    Platform: x86_64-unknown-linux-gnu (64-bit)
    
    locale:
     [1] LC_CTYPE=en_US.UTF-8       LC_NUMERIC=C
     [3] LC_TIME=en_US.UTF-8        LC_COLLATE=en_US.UTF-8
     [5] LC_MONETARY=en_US.UTF-8    LC_MESSAGES=en_US.UTF-8
     [7] LC_PAPER=en_US.UTF-8       LC_NAME=C
     [9] LC_ADDRESS=C               LC_TELEPHONE=C
    [11] LC_MEASUREMENT=en_US.UTF-8 LC_IDENTIFICATION=C
    
    attached base packages:
     [1] grid      stats4    parallel  stats     graphics  grDevices utils
     [8] datasets  methods   base
    
    other attached packages:
     [1] cummeRbund_2.8.0     Gviz_1.10.2          rtracklayer_1.26.1
     [4] GenomicRanges_1.18.1 GenomeInfoDb_1.2.2   IRanges_2.0.0
     [7] S4Vectors_0.4.0      fastcluster_1.1.13   reshape2_1.4
    [10] ggplot2_1.0.0        RSQLite_1.0.0        DBI_0.3.1
    [13] BiocGenerics_0.12.0
    
    loaded via a namespace (and not attached):
     [1] acepack_1.3-3.3          AnnotationDbi_1.28.1     base64enc_0.1-2
     [4] BatchJobs_1.5            BBmisc_1.8               Biobase_2.26.0
     [7] BiocParallel_1.0.0       biomaRt_2.22.0           Biostrings_2.34.0
    [10] biovizBase_1.14.0        bitops_1.0-6             brew_1.0-6
    [13] BSgenome_1.34.0          checkmate_1.5.0          cluster_1.15.3
    [16] codetools_0.2-9          colorspace_1.2-4         dichromat_2.0-0
    [19] digest_0.6.4             fail_1.2                 foreach_1.4.2
    [22] foreign_0.8-61           Formula_1.1-2            GenomicAlignments_1.2.0
    [25] GenomicFeatures_1.18.2   gtable_0.1.2             Hmisc_3.14-5
    [28] iterators_1.0.7          lattice_0.20-29          latticeExtra_0.6-26
    [31] MASS_7.3-35              matrixStats_0.10.3       munsell_0.4.2
    [34] nnet_7.3-8               plyr_1.8.1               proto_0.3-10
    [37] RColorBrewer_1.0-5       Rcpp_0.11.3              RCurl_1.95-4.3
    [40] R.methodsS3_1.6.1        rpart_4.1-8              Rsamtools_1.18.1
    [43] scales_0.2.4             sendmailR_1.2-1          splines_3.1.1
    [46] stringr_0.6.2            survival_2.37-7          tools_3.1.1
    [49] VariantAnnotation_1.12.2 XML_3.98-1.1             XVector_0.6.0
    [52] zlibbioc_1.12.0

    Comment

    • axa9070
      Member
      • Oct 2014
      • 11

      #3
      hey stephenhart:
      Would you mind posting this function's output:

      Code:
      > head(features(myGene))
        seqnames    start      end width strand    source type score phase
      1        1 12442003 12442535   533      - Cufflinks exon    NA    NA
      2        1 12442642 12442922   281      - Cufflinks exon    NA    NA
      3        1 12443021 12443186   166      - Cufflinks exon    NA    NA
            gene_id     isoform_id exon_number gene_name                   oId
      1 XLOC_001339 TCONS_00001459           1       NAP VIT_01s0026g02710.t01
      2 XLOC_001339 TCONS_00001459           2       NAP VIT_01s0026g02710.t01
      3 XLOC_001339 TCONS_00001459           3       NAP VIT_01s0026g02710.t01
                  nearest_ref class_code TSS_group_id CDS_id contained_in
      1 VIT_01s0026g02710.t01          =      TSS1387  P1304         <NA>
      2 VIT_01s0026g02710.t01          =      TSS1387  P1304         <NA>
      3 VIT_01s0026g02710.t01          =      TSS1387  P1304         <NA>
      I'm curious only because in the CummeRbund manual the example gives:
      Code:
      > head(features(myGene))
       
         seqnames    start      end width strand source type score
      1     chr1 20959948 20960428   481      + coding exon    NA
      2     chr1 20964335 20964622   288      + coding exon    NA
      3     chr1 20966385 20966485   101      + coding exon    NA
      4     chr1 20970983 20971165   183      + coding exon    NA
      5     chr1 20972053 20972216   164      + coding exon    NA
      6     chr1 20974998 20975125   128      + coding exon    NA
        phase class_code exon_number     gene_id gene_name nearest_ref
      1    NA          =           1 XLOC_000172     PINK1  uc001bdm.2
      2    NA          =           2 XLOC_000172     PINK1  uc001bdm.2
      3    NA          =           3 XLOC_000172     PINK1  uc001bdm.2
      4    NA          =           4 XLOC_000172     PINK1  uc001bdm.2
      5    NA          =           5 XLOC_000172     PINK1  uc001bdm.2
      6    NA          =           6 XLOC_000172     PINK1  uc001bdm.2
               oId CDS_id     isoform_id TSS_group_id
      1 uc001bdm.2   P364 TCONS_00000480       TSS264
      2 uc001bdm.2   P364 TCONS_00000480       TSS264
      3 uc001bdm.2   P364 TCONS_00000480       TSS264
      4 uc001bdm.2   P364 TCONS_00000480       TSS264
      5 uc001bdm.2   P364 TCONS_00000480       TSS264
      6 uc001bdm.2   P364 TCONS_00000480       TSS264

      Comment

      Latest Articles

      Collapse

      • SEQadmin2
        From Collection to Sequencing: Why Sample Preparation and Preservation Define Sequencing Data
        by SEQadmin2


        Data variability is still an issue in sequencing technologies despite the advances in reproducibility and accuracy of these platforms. But the problem does not originate in the sequencing itself, but in the previous steps, before the sample reaches the sequencer.


        The first step is collection, followed by preservation and sample preparation for analysis. Most scientists overlook those steps, but not being careful might just be skewing the experiment’s results.
        ...
        06-02-2026, 10:05 AM
      • SEQadmin2
        Single-Cell Sequencing at an Inflection Point: Early Impacts of New Platforms and Emerging Trends
        by SEQadmin2


        With the launch of new single-cell sequencing platforms in 2026, the field stands at an exciting inflection point. This article surveys the most impactful advances in the field and discusses how they’re reshaping research in cancer, immunology, and beyond.


        Introduction

        Single-cell sequencing technologies have undergone remarkable advances over the past decade, transitioning from low-throughput experimental approaches to highly scalable platforms capable of...
        05-22-2026, 06:42 AM
      • SEQadmin2
        Environmental Genomics in the Age of NGS: From Microbes to Conservation Strategies
        by SEQadmin2

        Studying ecosystems means dealing with complex, multi-species communities that are hard to observe at scale. This complexity, however, hides many important questions to be answered, from how biogeochemical cycles work and how climate change can affect species distribution to how conservation strategies can work best.


        Genomics, particularly since the expansion of NGS, has transformed ecosystem ecology. By sequencing environmental DNA, we can now assess biodiversity without direct...
        05-06-2026, 09:04 AM

      ad_right_rmr

      Collapse

      News

      Collapse

      Topics Statistics Last Post
      Started by SEQadmin2, Today, 08:59 AM
      0 responses
      10 views
      0 reactions
      Last Post SEQadmin2  
      Started by SEQadmin2, 06-02-2026, 12:03 PM
      0 responses
      21 views
      0 reactions
      Last Post SEQadmin2  
      Started by SEQadmin2, 06-02-2026, 11:40 AM
      0 responses
      17 views
      0 reactions
      Last Post SEQadmin2  
      Started by SEQadmin2, 05-28-2026, 11:40 AM
      0 responses
      30 views
      0 reactions
      Last Post SEQadmin2  
      Working...