Unconfigured Ad

Collapse
X
 
  • Filter
  • Time
  • Show
Clear All
new posts
  • chadn737
    Senior Member
    • Jan 2009
    • 392

    Finding per gene GC content.

    Does anyone know of a tool to calculate the GC content per gene?

    Alternatively if you know where to find the per gene GC content of Arabidopsis, that will work as well.
  • Dario1984
    Senior Member
    • Jun 2011
    • 166

    #2
    Try installing Repitools for R. Then, have a look at the gcContentCalc function. All you need as parameters is a BSgenome object of the genome sequence of organism you're studying and a GRanges of the gene coordinates.

    A list of BSgenome packages for different organisms is listed here. Scroll down until you see all the packages that begin with BSgenome and click on the one you want. For Arabidopsis, the available BSgenome packages are BSgenome.Athaliana.TAIR.04232008 and BSgenome.Athaliana.TAIR.TAIR9

    Here's a small example :

    Code:
    > library(Repitools)
    > library(BSgenome.Hsapiens.UCSC.hg18)
    > genes <- GRanges(c("chr1", "chr2"), IRanges(c(10000, 20000), c(20000, 30000)))
    > genes
    GRanges with 2 ranges and 0 elementMetadata values:
          seqnames           ranges strand
             <Rle>        <IRanges>  <Rle>
      [1]     chr1 [10000, 20000]      *
      [2]     chr2 [20000, 30000]      *
      ---
      seqlengths:
       chr1 chr2
         NA   NA
    
    > gcContentCalc(genes, Hsapiens)
    Calculating GC content.
         <NA>      <NA> 
    0.5181482 0.4384562
    So, there's 52 % GC in the first gene, and 44 % in the second gene.

    Comment

    • chadn737
      Senior Member
      • Jan 2009
      • 392

      #3
      Thanks, exactly what I needed.

      Comment

      • maubp
        Peter (Biopython etc)
        • Jul 2009
        • 1544

        #4
        For a command line tool, try geecee from EMBOSS


        If you are working in a programming language there is probably a Bio* library that can help. Biopython has an example of calculating the GC of each sequence in a FASTA file and plotting the distribution in its tutorial:

        Comment

        Latest Articles

        Collapse

        ad_right_rmr

        Collapse

        News

        Collapse

        Topics Statistics Last Post
        Started by SEQadmin2, 06-09-2026, 11:58 AM
        0 responses
        22 views
        0 reactions
        Last Post SEQadmin2  
        Started by SEQadmin2, 06-05-2026, 10:09 AM
        0 responses
        29 views
        0 reactions
        Last Post SEQadmin2  
        Started by SEQadmin2, 06-04-2026, 08:59 AM
        0 responses
        39 views
        0 reactions
        Last Post SEQadmin2  
        Started by SEQadmin2, 06-02-2026, 12:03 PM
        0 responses
        61 views
        0 reactions
        Last Post SEQadmin2  
        Working...