Seqanswers Leaderboard Ad

Collapse

Announcement

Collapse
No announcement yet.
X
 
  • Filter
  • Time
  • Show
Clear All
new posts

  • earonesty
    replied
    Tabix allows you to access a file by quering for (for example) a chromosome and position and retrieve all the lines of that file that match your query.

    Problems arise when you need to do thousands of lookups. Tabix can take hours to do, say, 100k lookups into DBSNP data. I wrote a Perl/C++ (Text::Tidx) alternative to tabix for annotating Exomes, using DBSNP or 1K genomes data. It uses a lot of RAM, but is much faster than tabix for that stuff.

    Leave a comment:


  • pi101
    started a topic help using tabix

    help using tabix

    Dear All

    How do you view the tbi file generated by tabix? I am using tabix v0.2.5 to index a local BED file of gene positions.

    I would like to use the intervals generated by tabix and store them in a database for querying my sequence data from a RDMS. Is it possible to use tabix to
    a) generate the bin that a sequence resides in
    b) generate the bins that would need to be queried to get all feature within (or overlapping) a sequence region

    Is tabix suitable for this purpose? Being as I can't view the index file at present its difficult to answer myself. I suspect the index file will only contain the bin a sequence resides in and will not be suitable for finding the bins to query for a sequence range. If tabix, is there anything else you could recommend?

    thank you for your help

    Also, what the interval sizes used by tabix? I could not find details of this. Can you control them?

Latest Articles

Collapse

  • seqadmin
    Exploring the Dynamics of the Tumor Microenvironment
    by seqadmin




    The complexity of cancer is clearly demonstrated in the diverse ecosystem of the tumor microenvironment (TME). The TME is made up of numerous cell types and its development begins with the changes that happen during oncogenesis. “Genomic mutations, copy number changes, epigenetic alterations, and alternative gene expression occur to varying degrees within the affected tumor cells,” explained Andrea O’Hara, Ph.D., Strategic Technical Specialist at Azenta. “As...
    07-08-2024, 03:19 PM
  • seqadmin
    Exploring Human Diversity Through Large-Scale Omics
    by seqadmin


    In 2003, researchers from the Human Genome Project (HGP) announced the most comprehensive genome to date1. Although the genome wasn’t fully completed until nearly 20 years later2, numerous large-scale projects, such as the International HapMap Project and 1000 Genomes Project, continued the HGP's work, capturing extensive variation and genomic diversity within humans. Recently, newer initiatives have significantly increased in scale and expanded beyond genomics, offering a more detailed...
    06-25-2024, 06:43 AM

ad_right_rmr

Collapse

News

Collapse

Topics Statistics Last Post
Started by seqadmin, 07-10-2024, 07:30 AM
0 responses
30 views
0 likes
Last Post seqadmin  
Started by seqadmin, 07-03-2024, 09:45 AM
0 responses
201 views
0 likes
Last Post seqadmin  
Started by seqadmin, 07-03-2024, 08:54 AM
0 responses
212 views
0 likes
Last Post seqadmin  
Started by seqadmin, 07-02-2024, 03:00 PM
0 responses
194 views
0 likes
Last Post seqadmin  
Working...
X