Header Leaderboard Ad

Collapse

Genomic Positions off in db135.b37.vcf

Collapse

Announcement

Collapse
No announcement yet.
X
 
  • Filter
  • Time
  • Show
Clear All
new posts

  • Genomic Positions off in db135.b37.vcf

    I noticed some indels in db135.b37.vcf I downloaded from

    ftp://[email protected]

    For example:

    19 51835892 rs11402251 T TG . PASS GENEINFO=VSIG10L:147645;GNO;RSPOS=51835893;SAO=0;SLO;SSR=0;VC=DIV;VP=050100000000000100000200;WGT=0;dbSNPBuildID=120
    19 52004791 rs67024588 G GC . PASS GENEINFO=SIGLEC12:89858;GNO;RSPOS=52004794;RV;S3D;SAO=0;SLO;SSR=0;VC=DIV;VP=050300000000000100000200;WGT=0;dbSNPBuildID=130

    But we can see that the correct position is in "RSPOS=" but the second field is off.

    Is this a bug or a feature???

  • #2
    I found something that I think was similar and concluded was a bug months ago; I emailed them and one of them acknowledged it was a bug. I check back with them a few weeks later and I was told that he passed the information on to the relevant person and I haven't heard back. This was at least a few months ago.

    It may have been completely different then what you just posted but my conclusion is that I would not be surprised if there are numerous errors where things are off by 1 bp or maybe a couple more for indels.

    Comment


    • #3
      Oh I see. I think I will just fix that file with RSPOS positions. Thanks for your reply.

      Comment


      • #4
        Originally posted by ymc View Post
        Oh I see. I think I will just fix that file with RSPOS positions. Thanks for your reply.
        I recall considering that and realizing it wasn't that easy. Make sure you check a decent amount after changing them and confirming they are correct (again, whatever problems I saw may very well have been fixed by now).

        Comment


        • #5
          I am fixing this by hand now. How should I fix rs67024588? Are the alleles also wrong because at chr19:52004794 is C?

          19 52004794 rs67024588 C CC . PASS GENEINFO=SIGLEC12:89858;GNO;RSPOS=52004794;RV;S3D;SAO=0;SLO;SSR=0;VC=DIV;VP=050300000000000100000200;WGT=0;dbSNPBuildID=130

          Is this ok???

          Comment


          • #6
            According to the website it's a G: http://www.ncbi.nlm.nih.gov/projects...gi?rs=67024588, so it depends on what strand you are annotating with.

            Comment


            • #7
              But doesn't VCF always show forward strand??



              It is a G only if it is in the reverse strand. And the base before the event is a C in the forward strand, right?

              Comment


              • #8
                Ah, yes, you're probably right. My fault.

                Comment

                Latest Articles

                Collapse

                • seqadmin
                  How RNA-Seq is Transforming Cancer Studies
                  by seqadmin



                  Cancer research has been transformed through numerous molecular techniques, with RNA sequencing (RNA-seq) playing a crucial role in understanding the complexity of the disease. Maša Ivin, Ph.D., Scientific Writer at Lexogen, and Yvonne Goepel Ph.D., Product Manager at Lexogen, remarked that “The high-throughput nature of RNA-seq allows for rapid profiling and deep exploration of the transcriptome.” They emphasized its indispensable role in cancer research, aiding in biomarker...
                  09-07-2023, 11:15 PM
                • seqadmin
                  Methods for Investigating the Transcriptome
                  by seqadmin




                  Ribonucleic acid (RNA) represents a range of diverse molecules that play a crucial role in many cellular processes. From serving as a protein template to regulating genes, the complex processes involving RNA make it a focal point of study for many scientists. This article will spotlight various methods scientists have developed to investigate different RNA subtypes and the broader transcriptome.

                  Whole Transcriptome RNA-seq
                  Whole transcriptome sequencing...
                  08-31-2023, 11:07 AM

                ad_right_rmr

                Collapse

                News

                Collapse

                Topics Statistics Last Post
                Started by seqadmin, Today, 06:18 AM
                0 responses
                5 views
                0 likes
                Last Post seqadmin  
                Started by seqadmin, Yesterday, 09:17 AM
                0 responses
                8 views
                0 likes
                Last Post seqadmin  
                Started by seqadmin, 09-19-2023, 09:23 AM
                0 responses
                25 views
                0 likes
                Last Post seqadmin  
                Started by seqadmin, 09-19-2023, 09:14 AM
                0 responses
                7 views
                0 likes
                Last Post seqadmin  
                Working...
                X