Unconfigured Ad

**TheSeqGeek** · 02-10-2015, 12:14 PM

I tried using excel and quickly realized I need to run loops.

**colindaven** · 02-12-2015, 06:28 AM

This sounds like a problem you can use Galaxy for

**sarvidsson** · 02-12-2015, 07:07 AM

So you have the chromosome and start/stop for your ~400 positions? Put them in BED format (tab-separated lines with "chromosome start stop"), get a GFF/GTF file for your genome with the genes (possibly filter it with grep for the features you are interested in) and use BEDTools (a swiss army knife for all annotation comparison needs); e.g. the "closest" command:

closest — bedtools 2.31.0 documentation

http://bedtools.readthedocs.org/en/latest/content/tools/closest.html

**TheSeqGeek** · 02-13-2015, 12:57 PM

I did as "sarvidsson" suggested

Both files contain chromosome name, start position, stop position, and name of feature/gene without headings

Here is an example

My list of 400 position are in the following format called "toanno.bed"
Chromosome 2985 2998 Site1
Chromosome 6738 6751 Site2

My list of genes I want to match them with are in the following format called "genome.bed"
Chromosome 351 1724 Gene1
Chromosome 1828 2946 Gene2

When I use the command
closestBed -a toanno.bed -b genome.bed > features.bed

I get a concatenated file containing both files head to tail... basically a long concatenate command...

I figured out I am not putting into .bed format. Basically the problem is with unicode.

Save your data with excel, which only does Unicode 16 then save it as Unicode 8. WoW ridiculous.

**AlliCox** · 02-18-2015, 08:57 PM

You could probably annotate the base pair positions using a tool that annotates lists of variants from NGS - if the position is near a gene, it would get annotated as upstream, downstream, intronic, etc. That would probably work for some of the positions. You could also align the bp positions to annotation information from 1000 genomes to find out if the site is in or near a gene.

**TheSeqGeek** · 02-19-2015, 05:52 AM

Originally posted by AlliCox View Post

You could probably annotate the base pair positions using a tool that annotates lists of variants from NGS .

So what's the tool?

**sarvidsson** · 02-19-2015, 06:05 AM

Originally posted by TheSeqGeek View Post

So what's the tool?

You could use SnpEff, but then you'd need to fake some VCF to get there. BEDTools is the tool for the job.

**TheSeqGeek** · 02-19-2015, 06:07 AM

Originally posted by sarvidsson View Post

You could use SnpEff, but then you'd need to fake some VCF to get there. BEDTools is the tool for the job.

Yeah, I already got it to work with bed tools closestBed command. Only issue was with type of text editor I was using to generate .bed file as I described for anyone else having similar issues.

Topics	Statistics	Last Post
Sequencing the Two-Toed Sloth Genome Reveals Jumping Genes Tied to Its Extreme Metabolism by SEQadmin2 Started by SEQadmin2, 06-09-2026, 11:58 AM	0 responses 30 views 0 reactions	Last Post by SEQadmin2 06-09-2026, 11:58 AM
A New Method Makes Hantavirus Genome Analysis Faster and More Accessible by SEQadmin2 Started by SEQadmin2, 06-05-2026, 10:09 AM	0 responses 38 views 0 reactions	Last Post by SEQadmin2 06-05-2026, 10:09 AM
A New Single-Cell Method Maps DNA-Protein Interactions by SEQadmin2 Started by SEQadmin2, 06-04-2026, 08:59 AM	0 responses 42 views 0 reactions	Last Post by SEQadmin2 06-04-2026, 08:59 AM
Long-Read RNA Sequencing Uncovers a Hidden Layer of Immune Cell Regulation by SEQadmin2 Started by SEQadmin2, 06-02-2026, 12:03 PM	0 responses 64 views 0 reactions	Last Post by SEQadmin2 06-02-2026, 12:03 PM

Unconfigured Ad

Promoter Analysis

Comment

Comment

Comment

Comment

Comment

Comment

Comment

Comment

Latest Articles

ad_right_rmr

News