gene2cat argument in goseq

heathacoats


08-06-2014, 11:17 AM

Hi All,

I am currently trying to use goseq to analyze my RNAseq data. I believe I have all the required files, but I cannot seem to create the proper input for the gene2cat argument. I am working with a non-native species. Below is a snippet of my code:

> library(goseq)
Loading required package: BiasedUrn
Loading required package: geneLenDataBase
> de.genes<-scan('de_genes_GFOLD_24.txt', what=character())
Read 4078 items
> assayed.genes<-scan('all_genes_GFOLD_24.txt', what=character())
Read 17479 items
> gene.length=scan('gene_lengths_noIDs.txt', what=numeric())
Read 17479 items
> gene.vector=as.integer(assayed.genes%in%de.genes)
> names(gene.vector)=assayed.genes
> head(gene.vector)
AAEL000001 AAEL000002 AAEL000003 AAEL000004 AAEL000005 AAEL000006
         0          0          0          0          0          1
> pwf=nullp(gene.vector,bias.data=gene.length)
> head(pwf)
           DEgenes bias.data        pwf
AAEL000001       0      1590 0.26119999
AAEL000002       0       198 0.05383339
AAEL000003       0      2093 0.28937882
AAEL000004       0      2571 0.31792557
AAEL000005       0      1429 0.24989351
AAEL000006       1      4345 0.40676721
> rownames(pwf) <- names(gene.length)
> GOterms=read.delim('Goaccesions.txt',header=TRUE)
> GOterms=as.data.frame.matrix(GOterms)
> head(GOterms)
   Gomapping     geneID
1         na AAEL000001
2         na AAEL000002
3 GO:0016772 AAEL000003
4 GO:0016757 AAEL000004
5 GO:0008152 AAEL000004
6 GO:0003676 AAEL000005
> GO.wall=goseq(pwf,gene2cat=go.ids)
Error in goseq(pwf, gene2cat = go.ids) :
Was expecting a dataframe or a list mapping categories to genes. Check gene2cat input and try again.

From the goseq package documentation: "gene2cat: A data frame with two columns containing the mapping between genes and the categories of interest."

Could anyone provide an example of this file set-up? Thanks!

Heather
Tags: goseq, rnaseq
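
For reference, here is a minimal sketch of what a gene2cat input could look like, built in base R from the six rows shown in the head(GOterms) output above. The object names go.raw, go.annot, gene2cat.df and gene2cat.list are hypothetical, and treating the "na" entries as "no annotation" is an assumption. Note that the call in the post passes go.ids, while the data frame read from the file is named GOterms, so it may be worth checking class(go.ids) first.

```r
# Toy stand-in for the contents of 'Goaccesions.txt', copied from the
# head(GOterms) output in the post (only these six rows are known).
go.raw <- data.frame(
  Gomapping = c("na", "na", "GO:0016772", "GO:0016757", "GO:0008152", "GO:0003676"),
  geneID    = c("AAEL000001", "AAEL000002", "AAEL000003",
                "AAEL000004", "AAEL000004", "AAEL000005"),
  stringsAsFactors = FALSE
)

# Assumption: "na" marks genes without a GO annotation, so drop those rows.
go.annot <- go.raw[go.raw$Gomapping != "na", ]

# Form 1: a two-column data frame, one row per gene/category pair.
gene2cat.df <- data.frame(
  gene     = go.annot$geneID,
  category = go.annot$Gomapping,
  stringsAsFactors = FALSE
)

# Form 2: a named list, one entry per gene holding its GO terms.
gene2cat.list <- split(go.annot$Gomapping, go.annot$geneID)

print(gene2cat.df)
str(gene2cat.list)

# With goseq loaded and pwf built as in the post, the call would then be
# something like (not run here):
# GO.wall <- goseq(pwf, gene2cat = gene2cat.df)
```

The gene IDs in the mapping need to match rownames(pwf). Since scan() returns an unnamed vector, names(gene.length) is NULL, so the line rownames(pwf) <- names(gene.length) in the post probably replaces the gene IDs with plain row numbers; skipping that line should keep the IDs that nullp() took from names(gene.vector).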
