
**GenoMax** · 07-20-2016, 04:18 AM

Paul: Have you tried BioMart from Ensembl? You can find some help and videos on this page.

**SylvainL** · 07-21-2016, 06:49 AM

Using R...

Ref_annotations is your GFF file, imported with the function import.gff2 (with asRangedData=FALSE, so that you get a GRanges object).
Ref_genome is your genome, imported with read.DNAStringSet (named readDNAStringSet in current versions of Biostrings).

The following code should give you the 5'-most annotated base of each gene (the start of its first annotated exon), taking the strand into account:

Code:

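# Keep only annotations located on sequences present in the genome
B <- Ref_annotations[which(seqnames(Ref_annotations) %in% names(Ref_genome))]

# "+" strand: the 5' end of a gene is its smallest start coordinate
C <- B[which(strand(B) == "+")]
f <- as.factor(elementMetadata(C)$gene_name)
rg <- split(C,f)
# One range per gene spanning all of its features
# (assumes each gene name occurs on a single sequence only)
rh <- unlist(range(rg))
end(rh) <- start(rh)   # collapse to a 1-bp range at the start
names(rh) <- levels(f)
D <- rh

# "-" strand: the 5' end of a gene is its largest end coordinate
C <- B[which(strand(B) == "-")]
f <- as.factor(elementMetadata(C)$gene_name)
rg <- split(C,f)
rh <- unlist(range(rg))
start(rh) <- end(rh)   # collapse to a 1-bp range at the end
names(rh) <- levels(f)
E <- rh

# Combine both strands and sort by genomic position
F <- sort(c(D, E))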

Then you can export F as a BED file (function export.bed).
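To check the strand logic above on something small, here is a minimal base-R sketch that does not need any Bioconductor objects. The table `exons`, its coordinates, and the `upstream` window size are all made up for illustration; the promoter window is just one possible definition (a fixed number of bases 5' of the gene start), not something taken from the code above.

```r
# Toy annotation table (hypothetical coordinates, not from a real genome)
exons <- data.frame(
  gene_name = c("geneA", "geneA", "geneB", "geneB"),
  strand    = c("+", "+", "-", "-"),
  start     = c(100, 300, 500, 900),
  end       = c(200, 400, 600, 1000),
  stringsAsFactors = FALSE
)

# Group the rows by gene
by_gene <- split(exons, exons$gene_name)

# Strand of each gene (assumes all rows of a gene share one strand)
gene_strand <- vapply(by_gene, function(g) g$strand[1], character(1))

# 5' end of each gene: smallest start on "+", largest end on "-"
tss <- vapply(
  by_gene,
  function(g) if (g$strand[1] == "+") min(g$start) else max(g$end),
  numeric(1)
)

# Hypothetical promoter window: `upstream` bases immediately 5' of the gene start
upstream <- 50
promoters <- data.frame(
  gene_name      = names(tss),
  strand         = unname(gene_strand),
  tss            = unname(tss),
  promoter_start = unname(ifelse(gene_strand == "+", tss - upstream, tss + 1)),
  promoter_end   = unname(ifelse(gene_strand == "+", tss - 1, tss + upstream))
)

print(promoters)
```

On the minus strand the window lies at higher coordinates than the gene, so any sequence extracted from it would still need to be reverse-complemented.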

Hope it helps...

**pkstarstorm05** · 07-31-2016, 02:25 PM

Hi GenoMax and SylvainL,

Thanks so much for your suggestions and time! They were both very helpful.

For anyone who comes across this post later: I strongly urge you to familiarize yourself with biomaRt. It's a powerful tool for extracting all kinds of useful information.


Retrieving promoter sequences using gene symbol
