Seqanswers Leaderboard Ad

Collapse

Announcement

Collapse
No announcement yet.
X
 
  • Filter
  • Time
  • Show
Clear All
new posts

  • Question about BED format - chromStart and End

    I am a bit confused about the chromStart and chromEnd positions in the BED format.

    According to UCSC:
    chromStart - The starting position of the feature in the chromosome or scaffold. The first base in a chromosome is numbered 0.
    chromEnd - The ending position of the feature in the chromosome or scaffold. The chromEnd base is not included in the display of the feature. For example, the first 100 bases of a chromosome are defined as chromStart=0, chromEnd=100, and span the bases numbered 0-99.

    Assuming I download a bed file for a gene from UCSC as below:

    chromStart: 300
    chromEnd: 500

    Now, I get a set of SNPs by mapping reads to hg18 and calling SNPs using whatever SNPcaller. I want to know how many SNPs were called within the above gene. Should I compare each SNP position with the gene range as

    300<=SNP_POSITION<=500

    or

    301<=SNP_POSITION<=500

    or

    301<=SNP_POSITION<=499

    ?

    Does anyone know which is correct?

    Thanks

  • #2
    300 <= x < 500

    so the first base is no. 300, the last base is no .499, and the range covers 200 bases.
    Just as the doc says. You may imagine the mark being between the bases.

    Comment


    • #3
      301st<=snp_position<=500th

      EDIT:

      BED is always 0-based. The first base in a sequence has coordinate 0 and therefore coordinate 300 denotes the 301st base. A more obvious example is

      0 1

      which denotes the first base.
      Last edited by lh3; 10-25-2010, 04:14 AM.

      Comment


      • #4
        It depends on how the range is defined: 0 based or 1 based positions? If one based, then 301 <= SNP_POSITION <= 500 is the range you had in the bed file.
        Last edited by Hena; 10-25-2010, 03:43 AM.

        Comment


        • #5
          300 <= x < 500

          Comment


          • #6
            Originally posted by lh3 View Post
            301st<=snp_position<=500th

            EDIT:

            BED is always 0-based. The first base in a sequence has coordinate 0 and therefore coordinate 300 denotes the 301st base. A more obvious example is

            0 1

            which denotes the first base.
            In case that everything is zero-based one has for example for range 3 (instead of 300) to 5 (instead of 500):


            0123456789 <-- positions
            ---gg----- <-- gene in range [3,5)


            therefore the correct answer is 300<=x<500!

            Comment

            Latest Articles

            Collapse

            • seqadmin
              Understanding Genetic Influence on Infectious Disease
              by seqadmin




              During the COVID-19 pandemic, scientists observed that while some individuals experienced severe illness when infected with SARS-CoV-2, others were barely affected. These disparities left researchers and clinicians wondering what causes the wide variations in response to viral infections and what role genetics plays.

              Jean-Laurent Casanova, M.D., Ph.D., Professor at Rockefeller University, is a leading expert in this crossover between genetics and infectious...
              09-09-2024, 10:59 AM
            • seqadmin
              Addressing Off-Target Effects in CRISPR Technologies
              by seqadmin






              The first FDA-approved CRISPR-based therapy marked the transition of therapeutic gene editing from a dream to reality1. CRISPR technologies have streamlined gene editing, and CRISPR screens have become an important approach for identifying genes involved in disease processes2. This technique introduces targeted mutations across numerous genes, enabling large-scale identification of gene functions, interactions, and pathways3. Identifying the full range...
              08-27-2024, 04:44 AM

            ad_right_rmr

            Collapse

            News

            Collapse

            Topics Statistics Last Post
            Started by seqadmin, Today, 06:25 AM
            0 responses
            13 views
            0 likes
            Last Post seqadmin  
            Started by seqadmin, Yesterday, 01:02 PM
            0 responses
            12 views
            0 likes
            Last Post seqadmin  
            Started by seqadmin, 09-18-2024, 06:39 AM
            0 responses
            14 views
            0 likes
            Last Post seqadmin  
            Started by seqadmin, 09-11-2024, 02:44 PM
            0 responses
            14 views
            0 likes
            Last Post seqadmin  
            Working...
            X