Seqanswers Leaderboard Ad

Collapse

Announcement

Collapse
No announcement yet.
X
 
  • Filter
  • Time
  • Show
Clear All
new posts

  • Refgene .gtf including model refseq records (XM, XP, XR_)

    hello,

    I am looking for a mouse .gtf file based on refseq including model refseqs starting with XP, XM, XR_ .
    refgene.gtf I found on UCSC does not include model refseqs.
    Althoug mouse.1.rna.fna I found on ncbi includes model refseqs, it gives no information about their loci.

    I am wondering if such a file is available or I have to make a .gtf by mapping the fasta file to genome, which I do not want to do.

    Thanks

  • #2
    You will probably need to make this file but you don't need to do it by mapping.

    Assuming that you actually want only model (X*) records you could do this.
    Code:
    $ wget -nc ftp://ftp.ncbi.nlm.nih.gov/gene/DATA/gene2refseq.gz
    
    $ gzip -d gene2refseq.gz
    Look at the header line in the resulting gene2refseq file to understand the columns in the file.

    Code:
    $ grep "X" gene2refseq | grep "GRCm38" > mouse_model_ref
    Modify result file "mouse_model_ref" into GTF format.

    Note: Current file includes GRCm38.p3 reference.

    Comment


    • #3
      Thank you very much for the answer.
      This will do for my purpose.

      Comment


      • #4
        Hi,

        I first thought Genomax's answer has solved my problem.

        The gene2refseq file does include model refseqs and their genomic positions and orientaion, but the genomic positions it has are not each refseq's specific position.
        Rather, all the refseqs sharing the same gene Symbol seem to have the same positions.

        What I would like to know is the transcription start site of all the possible transcript variants.

        I am sure NCBI has that kind of information since you can see the exact gene models in the genome browser of NCBI Gene Database like in
        http://www.ncbi.nlm.nih.gov/gene/14433


        I would be very glad if anyone could give me an answer.

        Thanks

        Comment

        Latest Articles

        Collapse

        • seqadmin
          Genetic Variation in Immunogenetics and Antibody Diversity
          by seqadmin



          The field of immunogenetics explores how genetic variations influence immune responses and susceptibility to disease. In a recent SEQanswers webinar, Oscar Rodriguez, Ph.D., Postdoctoral Researcher at the University of Louisville, and Ruben Martínez Barricarte, Ph.D., Assistant Professor of Medicine at Vanderbilt University, shared recent advancements in immunogenetics. This article discusses their research on genetic variation in antibody loci, antibody production processes,...
          11-06-2024, 07:24 PM
        • seqadmin
          Choosing Between NGS and qPCR
          by seqadmin



          Next-generation sequencing (NGS) and quantitative polymerase chain reaction (qPCR) are essential techniques for investigating the genome, transcriptome, and epigenome. In many cases, choosing the appropriate technique is straightforward, but in others, it can be more challenging to determine the most effective option. A simple distinction is that smaller, more focused projects are typically better suited for qPCR, while larger, more complex datasets benefit from NGS. However,...
          10-18-2024, 07:11 AM

        ad_right_rmr

        Collapse

        News

        Collapse

        Topics Statistics Last Post
        Started by seqadmin, Today, 11:09 AM
        0 responses
        22 views
        0 likes
        Last Post seqadmin  
        Started by seqadmin, Today, 06:13 AM
        0 responses
        20 views
        0 likes
        Last Post seqadmin  
        Started by seqadmin, 11-01-2024, 06:09 AM
        0 responses
        30 views
        0 likes
        Last Post seqadmin  
        Started by seqadmin, 10-30-2024, 05:31 AM
        0 responses
        21 views
        0 likes
        Last Post seqadmin  
        Working...
        X