  • TCGA cancer data and bioinformatics design questions for SNP/miRNA analysis

    I'm looking for help either developing a pipeline or choosing the right tools and data sets for the goal below.

    My languages of choice are Python and R.

    Goal: I want to build a disease-specific profile of SNPs, including SNPs in miRNAs and in miRNA target sites. Ideally, for each SNP I would get the chromosome, the position, and how the SNPs above interact within the disease profile.

    PART 1: TCGA
    My first problem is using TCGA data, which lists a large number of aberrant mutations in an LOH (loss of heterozygosity) .txt format. I'd like to map those mutations to the SNPs, genes, or miRNAs they belong to. The TCGA datasheet is here. Example data is here for breast cancer. I guess I can use the miRNA and mRNA data from there as well.

    Questions here:
    1. How do I decipher the LOH data to figure out whether it is meaningful and where it maps?
    2. Which tools should I use for mapping, and what format should the final data be in? FASTA?
    3. miRNA/targets and SNPs: next up is getting cancer-specific miRNAs and mRNAs and mapping SNPs to them. I'm assuming I would use dbSNP or the Sanger miRNA databases to get miRNAs, targets, and seed sequences?
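
As a rough starting point for questions 1 and 2, the mapping step can be treated as an interval-overlap problem: each LOH segment is checked against a table of annotated features (genes, miRNAs, SNPs). The sketch below uses only the Python standard library. The column names (`sample`, `chrom`, `start`, `end`, `name`, `kind`), the tab-delimited layout, and all of the toy data are assumptions made for illustration; they are not the actual TCGA LOH file layout, which would need to be checked against the datasheet.

```python
import csv
import io
from collections import defaultdict


def load_features(handle):
    """Read a tab-delimited feature table into a dict keyed by chromosome.

    Assumed (hypothetical) columns: chrom, start, end, name, kind.
    """
    by_chrom = defaultdict(list)
    for row in csv.DictReader(handle, delimiter="\t"):
        by_chrom[row["chrom"]].append(
            (int(row["start"]), int(row["end"]), row["name"], row["kind"])
        )
    return by_chrom


def map_segments(segment_handle, features):
    """Yield one record per (segment, feature) pair that overlaps.

    Assumed (hypothetical) segment columns: sample, chrom, start, end.
    Coordinates are treated as closed intervals.
    """
    for row in csv.DictReader(segment_handle, delimiter="\t"):
        chrom, start, end = row["chrom"], int(row["start"]), int(row["end"])
        for f_start, f_end, name, kind in features.get(chrom, []):
            if f_start <= end and start <= f_end:
                yield (row["sample"], chrom, start, end, name, kind)


# Invented toy data, only to show the shape of the inputs.
FEATURES_TSV = (
    "chrom\tstart\tend\tname\tkind\n"
    "chr1\t100\t200\tgeneA\tgene\n"
    "chr1\t150\t160\tmirX\tmiRNA\n"
    "chr1\t155\t155\tsnp1\tSNP\n"
    "chr2\t500\t600\tgeneB\tgene\n"
)
SEGMENTS_TSV = (
    "sample\tchrom\tstart\tend\n"
    "S1\tchr1\t140\t158\n"
    "S1\tchr2\t700\t800\n"
)

features = load_features(io.StringIO(FEATURES_TSV))
hits = list(map_segments(io.StringIO(SEGMENTS_TSV), features))
for hit in hits:
    print("\t".join(str(field) for field in hit))
```

The linear scan per segment is fine for exploration but slow on genome-scale tables; an interval tree or a dedicated overlap tool would be the usual replacement. A tab-delimited table of hits is probably easier to work with downstream than FASTA, since the result is a set of coordinates and labels rather than sequences.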


    PART 2:
    I'm a bit lost as to how to combine all these pieces of information, what output formats to use (linked to the individual pieces), and which tools, if any, to use to gather all this data in Python. I think this tool, mirdsnp, is useful as well.

    I'd appreciate any help with how to combine all this data, best practices for mapping SNPs and miRNAs, and any Biopython/Bioconductor tools or approaches. I'm having trouble knowing where to start, how to parse the LOH files to get meaningful data out, and how to combine that with the other tools.
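
For the combining step, one possible output is a single tab-delimited table with one row per feature, listing which samples carry an aberration overlapping it. The sketch below is only an illustration of that idea using the Python standard library; the record layout `(sample, feature_name, feature_kind)`, the output column names, and the toy records are all assumptions, not an established format.

```python
import csv
import io
from collections import defaultdict


def build_profile(hits):
    """Group (sample, feature_name, feature_kind) records by feature."""
    samples_by_feature = defaultdict(set)
    for sample, name, kind in hits:
        samples_by_feature[(name, kind)].add(sample)
    return samples_by_feature


def write_profile(profile, handle):
    """Write one tab-delimited row per feature, sorted by feature name."""
    writer = csv.writer(handle, delimiter="\t", lineterminator="\n")
    writer.writerow(["feature", "kind", "n_samples", "samples"])
    for (name, kind), samples in sorted(profile.items()):
        writer.writerow([name, kind, len(samples), ",".join(sorted(samples))])


# Invented toy records; the duplicate entry is collapsed by the set.
toy_hits = [
    ("S1", "geneA", "gene"),
    ("S2", "geneA", "gene"),
    ("S1", "mirX", "miRNA"),
    ("S1", "geneA", "gene"),
]

out = io.StringIO()
write_profile(build_profile(toy_hits), out)
profile_text = out.getvalue()
print(profile_text)
```

A flat table like this loads directly into R (`read.delim`) or pandas, which keeps the Python and R halves of a pipeline loosely coupled.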

    I'm doing this in an exploratory way so that I can use this information to design and understand experiments later on.
