Seqanswers Leaderboard Ad

Collapse

Announcement

Collapse
No announcement yet.
X
 
  • Filter
  • Time
  • Show
Clear All
new posts

  • help using tabix

    Dear All

    How do you view the tbi file generated by tabix? I am using tabix v0.2.5 to index a local BED file of gene positions.

    I would like to use the intervals generated by tabix and store them in a database for querying my sequence data from a RDMS. Is it possible to use tabix to
    a) generate the bin that a sequence resides in
    b) generate the bins that would need to be queried to get all feature within (or overlapping) a sequence region

    Is tabix suitable for this purpose? Being as I can't view the index file at present its difficult to answer myself. I suspect the index file will only contain the bin a sequence resides in and will not be suitable for finding the bins to query for a sequence range. If tabix, is there anything else you could recommend?

    thank you for your help

    Also, what the interval sizes used by tabix? I could not find details of this. Can you control them?

  • #2
    Tabix allows you to access a file by quering for (for example) a chromosome and position and retrieve all the lines of that file that match your query.

    Problems arise when you need to do thousands of lookups. Tabix can take hours to do, say, 100k lookups into DBSNP data. I wrote a Perl/C++ (Text::Tidx) alternative to tabix for annotating Exomes, using DBSNP or 1K genomes data. It uses a lot of RAM, but is much faster than tabix for that stuff.

    Comment

    Latest Articles

    Collapse

    • seqadmin
      Advancing Precision Medicine for Rare Diseases in Children
      by seqadmin




      Many organizations study rare diseases, but few have a mission as impactful as Rady Children’s Institute for Genomic Medicine (RCIGM). “We are all about changing outcomes for children,” explained Dr. Stephen Kingsmore, President and CEO of the group. The institute’s initial goal was to provide rapid diagnoses for critically ill children and shorten their diagnostic odyssey, a term used to describe the long and arduous process it takes patients to obtain an accurate...
      12-16-2024, 07:57 AM
    • seqadmin
      Recent Advances in Sequencing Technologies
      by seqadmin



      Innovations in next-generation sequencing technologies and techniques are driving more precise and comprehensive exploration of complex biological systems. Current advancements include improved accessibility for long-read sequencing and significant progress in single-cell and 3D genomics. This article explores some of the most impactful developments in the field over the past year.

      Long-Read Sequencing
      Long-read sequencing has seen remarkable advancements,...
      12-02-2024, 01:49 PM

    ad_right_rmr

    Collapse

    News

    Collapse

    Topics Statistics Last Post
    Started by seqadmin, 12-17-2024, 10:28 AM
    0 responses
    23 views
    0 likes
    Last Post seqadmin  
    Started by seqadmin, 12-13-2024, 08:24 AM
    0 responses
    42 views
    0 likes
    Last Post seqadmin  
    Started by seqadmin, 12-12-2024, 07:41 AM
    0 responses
    28 views
    0 likes
    Last Post seqadmin  
    Started by seqadmin, 12-11-2024, 07:45 AM
    0 responses
    42 views
    0 likes
    Last Post seqadmin  
    Working...
    X