Seqanswers Leaderboard Ad

Collapse

Announcement

Collapse
No announcement yet.
X
 
  • Filter
  • Time
  • Show
Clear All
new posts

  • Homo Sapianes GRCh37 genomic coordinates

    Hi all,
    I am looking for coordinate file of the GRCh37 genome.
    My porpuse is to know what is the last coordinate in each chromosome.

    For example if i am using "samtools view" function.

    samtools view aln.sorted.bam chr2:20,100,000-20,200,000

    How can i know if "20,200,000" is not out of the boundries?

    Moreover, Do i have to sort the bam file before "view" specific coordinates?
    Can i know from the bam file if its already sorted?

    Thanks,
    Pap

  • #2
    i found this:

    But i want for specific version.

    Comment


    • #3
      find it here:

      Comment


      • #4
        If you want a quick way to do it from the .bam itself, samtools idxstats will make a file that breaks down how many reads hit every chromosome, and it includes how long each chromosome is.

        Comment


        • #5
          Originally posted by swbarnes2 View Post
          If you want a quick way to do it from the .bam itself, samtools idxstats will make a file that breaks down how many reads hit every chromosome, and it includes how long each chromosome is.
          Thanks!
          Do you know how could i figure out if the bam sorted by read names or by chromosomal coordinates or not sorted?

          Comment


          • #6
            There should be a line in the header that looks like this:

            @HD VN:1.4 SO:unsorted

            That would be unsorted. It could also say "SO:coordinate" which would mean it was sorted by coordinates.

            Comment


            • #7
              Originally posted by Brian Bushnell View Post
              There should be a line in the header that looks like this:

              @HD VN:1.4 SO:unsorted

              That would be unsorted. It could also say "SO:coordinate" which would mean it was sorted by coordinates.
              Great! thanks

              Comment


              • #8
                Hi,
                Can i get the sequencing coverage from the bam file?
                Or, any other easy alternative?
                Thanks

                Comment


                • #9
                  Depends on what you want. If you want average coverage, just calculate (# mapped reads) * (read length) / (genome size).

                  If you want per-base coverage, you can get that from samtools pileup, I believe. The file has to be sorted. There are also other kinds of coverage, so what are you looking for, specifically?

                  Comment


                  • #10
                    Originally posted by Brian Bushnell View Post
                    Depends on what you want. If you want average coverage, just calculate (# mapped reads) * (read length) / (genome size).

                    If you want per-base coverage, you can get that from samtools pileup, I believe. The file has to be sorted. There are also other kinds of coverage, so what are you looking for, specifically?
                    i think the first works for me!
                    Thanks

                    Comment

                    Latest Articles

                    Collapse

                    • seqadmin
                      Best Practices for Single-Cell Sequencing Analysis
                      by seqadmin



                      While isolating and preparing single cells for sequencing was historically the bottleneck, recent technological advancements have shifted the challenge to data analysis. This highlights the rapidly evolving nature of single-cell sequencing. The inherent complexity of single-cell analysis has intensified with the surge in data volume and the incorporation of diverse and more complex datasets. This article explores the challenges in analysis, examines common pitfalls, offers...
                      06-06-2024, 07:15 AM
                    • seqadmin
                      Latest Developments in Precision Medicine
                      by seqadmin



                      Technological advances have led to drastic improvements in the field of precision medicine, enabling more personalized approaches to treatment. This article explores four leading groups that are overcoming many of the challenges of genomic profiling and precision medicine through their innovative platforms and technologies.

                      Somatic Genomics
                      “We have such a tremendous amount of genetic diversity that exists within each of us, and not just between us as individuals,”...
                      05-24-2024, 01:16 PM

                    ad_right_rmr

                    Collapse

                    News

                    Collapse

                    Topics Statistics Last Post
                    Started by seqadmin, Yesterday, 07:24 AM
                    0 responses
                    10 views
                    0 likes
                    Last Post seqadmin  
                    Started by seqadmin, 06-13-2024, 08:58 AM
                    0 responses
                    11 views
                    0 likes
                    Last Post seqadmin  
                    Started by seqadmin, 06-12-2024, 02:20 PM
                    0 responses
                    16 views
                    0 likes
                    Last Post seqadmin  
                    Started by seqadmin, 06-07-2024, 06:58 AM
                    0 responses
                    184 views
                    0 likes
                    Last Post seqadmin  
                    Working...
                    X