Seqanswers Leaderboard Ad

Collapse

Announcement

Collapse
No announcement yet.
X
 
  • Filter
  • Time
  • Show
Clear All
new posts

  • extract interchoromosomal pairs from a BAM

    Hi,

    Is there a faster tool to extract inter-chromosomal reads from a BAM file directly rather than using samtools view and awk '($3!=$7 && $7!="=")' statement?

    Thanks.

  • #2
    Anyone has some solutions regarding spilitting a huge bam like whole genome library? thanks.

    Comment


    • #3
      The only faster method would be to write something in pysam or using either the C htslib of java htsjdk APIs. Those are likely only faster due to not needing to format things for printing. Alternatively, you could write something that starts a different thread for each chromosome or, or divides things according to where there are known gaps, or...

      There are a lot of ways to divide things, but realistically you're going to be limited by IO and decompression rate, since the actual processing you're doing is trivially fast.

      Comment


      • #4
        Hi,
        Thanks for your reply. right now I am using different threas or fork as many intervals as I need. but interchromosomal pairs using the above awk statement takes a longer time for whole genome libraries. Any thoughts? Thanks.

        Comment

        Latest Articles

        Collapse

        • seqadmin
          Advancing Precision Medicine for Rare Diseases in Children
          by seqadmin




          Many organizations study rare diseases, but few have a mission as impactful as Rady Children’s Institute for Genomic Medicine (RCIGM). “We are all about changing outcomes for children,” explained Dr. Stephen Kingsmore, President and CEO of the group. The institute’s initial goal was to provide rapid diagnoses for critically ill children and shorten their diagnostic odyssey, a term used to describe the long and arduous process it takes patients to obtain an accurate...
          12-16-2024, 07:57 AM
        • seqadmin
          Recent Advances in Sequencing Technologies
          by seqadmin



          Innovations in next-generation sequencing technologies and techniques are driving more precise and comprehensive exploration of complex biological systems. Current advancements include improved accessibility for long-read sequencing and significant progress in single-cell and 3D genomics. This article explores some of the most impactful developments in the field over the past year.

          Long-Read Sequencing
          Long-read sequencing has seen remarkable advancements,...
          12-02-2024, 01:49 PM

        ad_right_rmr

        Collapse

        News

        Collapse

        Topics Statistics Last Post
        Started by seqadmin, 12-17-2024, 10:28 AM
        0 responses
        27 views
        0 likes
        Last Post seqadmin  
        Started by seqadmin, 12-13-2024, 08:24 AM
        0 responses
        43 views
        0 likes
        Last Post seqadmin  
        Started by seqadmin, 12-12-2024, 07:41 AM
        0 responses
        29 views
        0 likes
        Last Post seqadmin  
        Started by seqadmin, 12-11-2024, 07:45 AM
        0 responses
        42 views
        0 likes
        Last Post seqadmin  
        Working...
        X