A back-of-the-envelope approach might be to model it like rolling a 4-sided die, where the probabilities of A,C,G,T are independent and based on the global GC content of that species. If you want to model the dependency of a base on an adjacent base, then you could look into Markov chains (something like this link).
Justin
Seqanswers Leaderboard Ad
Collapse
Announcement
Collapse
No announcement yet.
X
-
Thanx GenoMax for quick reply.
wordCluster is doing what I already did. I have identified regulatory elements from my genome sequence. Now I want to find out the occurrence of such regulatory sequence in the genome by chance. May be some kind of probability calculation can be done? Or if in any way I can simulate genome sequence and then use simulated genome for calculating random occurrence of the regulatory element.
I already tried simple perl trick to simulate genome by creating array of NTs and calling random NT from array for number of times of genome length but that is a dirty trick and doesn't mimic real genome, like some genomes have different GC bias etc...
Leave a comment:
-
Wonder if this tool may do what you are looking for: http://bioinfo2.ugr.es/wordCluster/wordCluster.php
Leave a comment:
-
random occurrence of regulatory element in genome
Hi all,
I have detected some regulatory sequences of 13 mer from genome sequencing study. The 13-mer has a consensus sequence of GNACANNNTGTNC. How can I calculate occurrence of this 13-mer in the human genome by chance? Is there any way to simulate human genome sequence and then search for the occurrence of such 13-mer by chance? Or as an alternative, mathematical way of calculating the same?
Thanks in advance
Latest Articles
Collapse
-
by seqadmin
The field of immunogenetics explores how genetic variations influence immune responses and susceptibility to disease. In a recent SEQanswers webinar, Oscar Rodriguez, Ph.D., Postdoctoral Researcher at the University of Louisville, and Ruben Martínez Barricarte, Ph.D., Assistant Professor of Medicine at Vanderbilt University, shared recent advancements in immunogenetics. This article discusses their research on genetic variation in antibody loci, antibody production processes,...-
Channel: Articles
11-06-2024, 07:24 PM -
-
by seqadmin
Next-generation sequencing (NGS) and quantitative polymerase chain reaction (qPCR) are essential techniques for investigating the genome, transcriptome, and epigenome. In many cases, choosing the appropriate technique is straightforward, but in others, it can be more challenging to determine the most effective option. A simple distinction is that smaller, more focused projects are typically better suited for qPCR, while larger, more complex datasets benefit from NGS. However,...-
Channel: Articles
10-18-2024, 07:11 AM -
ad_right_rmr
Collapse
News
Collapse
Topics | Statistics | Last Post | ||
---|---|---|---|---|
Started by seqadmin, Today, 11:09 AM
|
0 responses
22 views
0 likes
|
Last Post
by seqadmin
Today, 11:09 AM
|
||
Started by seqadmin, Today, 06:13 AM
|
0 responses
20 views
0 likes
|
Last Post
by seqadmin
Today, 06:13 AM
|
||
Started by seqadmin, 11-01-2024, 06:09 AM
|
0 responses
30 views
0 likes
|
Last Post
by seqadmin
11-01-2024, 06:09 AM
|
||
New Model Aims to Explain Polygenic Diseases by Connecting Genomic Mutations and Regulatory Networks
by seqadmin
Started by seqadmin, 10-30-2024, 05:31 AM
|
0 responses
21 views
0 likes
|
Last Post
by seqadmin
10-30-2024, 05:31 AM
|
Leave a comment: