
**adaptivegenome** · 12-14-2011, 07:08 AM

samtools and bamtools both provide very fast APIs you can use; however, this requires at least some experience with a programming or scripting language...

**CHRYSES** · 12-14-2011, 07:13 AM

Originally posted by genericforms

samtools and bamtools both provide very fast APIs you can use; however, this requires at least some experience with a programming or scripting language...

Yeah, I tried to get into that, but I am not good at C and could not follow it. I hope someone else has created or thought of something...

I could go directly to the text format (i.e. SAM) and parse it with Perl, but that's really very slooooooow.

**adaptivegenome** · 12-14-2011, 07:18 AM

If you are going to examine every read in order, then I suppose you could parse a giant text file. This sort of sequential analysis is hard to speed up unless you parallelize it. If you are parsing a text file, you could break it into many parts and then run your Perl script on those parts in parallel (if you have access to that kind of equipment).

I am not aware of an off-the-shelf tool. Sorry!

Personally I opt for pthreads and C/C++...
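For anyone who would rather not write pthreads code, the split-and-parallelize idea above can be sketched in Python. This is only an illustration under stated assumptions: the SAM lines are already in memory (for a truly giant file you would split it on disk instead), the aligner wrote the optional `NM:i:` edit-distance tag, and the function names are made up for this example.

```python
from concurrent.futures import ProcessPoolExecutor


def split_into_chunks(lines, n_chunks):
    """Split a list into at most n_chunks contiguous pieces of similar size."""
    size = max(1, -(-len(lines) // n_chunks))  # ceiling division
    return [lines[i:i + size] for i in range(0, len(lines), size)]


def count_edited_reads(chunk):
    """Count SAM records in one chunk whose NM tag (edit distance) is > 0."""
    count = 0
    for line in chunk:
        if line.startswith("@"):  # skip SAM header lines
            continue
        # Optional tags start at the 12th column (index 11).
        for tag in line.rstrip("\n").split("\t")[11:]:
            if tag.startswith("NM:i:") and int(tag[5:]) > 0:
                count += 1
                break
    return count


def parallel_count(lines, n_workers=4):
    """Run count_edited_reads on each chunk in a separate process."""
    chunks = split_into_chunks(lines, n_workers)
    with ProcessPoolExecutor(max_workers=n_workers) as pool:
        return sum(pool.map(count_edited_reads, chunks))
```

Each worker only sees its own chunk, so this works for per-read statistics but not for anything that needs all reads at one position together.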

**swbarnes2** · 12-14-2011, 09:11 AM

Illumina reads are error prone. If you pull every single read with a discrepancy from reference, you are going to pull a lot of noise.

I don't think that a pileup can be generated with only variant positions, but you could grep the pileup to keep only the lines with alternate letters. The pileup will have the position, all the letters called by all the reads that cross the position, and all the qualities for all the reads that cross the position.
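A plain grep for letters can give false hits, because the read-bases column also contains letters that are not base calls (the mapping-quality character after `^`, and inserted or deleted sequence after `+n`/`-n`). A minimal Python sketch of the filter, assuming the six-column samtools pileup layout (chromosome, position, reference base, depth, read bases, qualities) and counting only substitutions, not indels, as "alternate letters":

```python
def strip_markup(bases):
    """Remove read-start/end markers and indel strings from a pileup bases column."""
    out = []
    i = 0
    while i < len(bases):
        c = bases[i]
        if c == "^":  # read start: the next character is a mapping quality
            i += 2
        elif c == "$":  # read end marker
            i += 1
        elif c in "+-":  # indel, written as +<length><sequence>
            j = i + 1
            while j < len(bases) and bases[j].isdigit():
                j += 1
            if j == i + 1:  # no length given; skip the sign defensively
                i += 1
            else:
                i = j + int(bases[i + 1:j])
        else:
            out.append(c)
            i += 1
    return "".join(out)


def has_alternate_base(pileup_line):
    """True if any read shows a base that differs from the reference."""
    fields = pileup_line.rstrip("\n").split("\t")
    if len(fields) < 5:
        return False
    return any(c in "ACGTNacgtn" for c in strip_markup(fields[4]))


def variant_lines(lines):
    """Yield only the pileup lines that contain at least one mismatch."""
    return (line for line in lines if has_alternate_base(line))
```

As noted above, sequencing errors mean that many of the lines this keeps will still be noise, so some depth or quality threshold is usually needed afterwards.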

**CHRYSES** · 12-14-2011, 12:28 PM

Originally posted by swbarnes2

Illumina reads are error prone. If you pull every single read with a discrepancy from reference, you are going to pull a lot of noise.

I don't think that a pileup can be generated with only variant positions, but you could grep the pileup to keep only the lines with alternate letters. The pileup will have the position, all the letters called by all the reads that cross the position, and all the qualities for all the reads that cross the position.

Yes, but how can I run a pileup on a single position with 500 million X coverage? I think I will need to do this read by read...
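One possible read-by-read approach, sketched here in Python rather than C, is to stream SAM text and read the mismatch positions from each record's `MD:Z:` tag. This assumes the alignments carry MD tags (not every aligner writes them; `samtools calmd` can add them), and the function name is invented for this example.

```python
import re

# An MD string is a run of: match counts, deleted reference bases (^ACG), mismatched reference bases.
MD_TOKEN = re.compile(r"(\d+)|(\^[A-Za-z]+)|([A-Za-z])")


def mismatch_positions(sam_line):
    """Return [(reference_position, reference_base), ...] for one SAM record.

    Positions are 1-based, like the POS column. Returns [] if there is no MD tag.
    """
    fields = sam_line.rstrip("\n").split("\t")
    ref_pos = int(fields[3])  # POS: leftmost aligned reference position
    md = next((f[5:] for f in fields[11:] if f.startswith("MD:Z:")), None)
    if md is None:
        return []
    found = []
    for matches, deletion, base in MD_TOKEN.findall(md):
        if matches:  # run of bases identical to the reference
            ref_pos += int(matches)
        elif deletion:  # reference bases missing from the read
            ref_pos += len(deletion) - 1  # minus the leading '^'
        else:  # single mismatch; 'base' is the reference base
            found.append((ref_pos, base))
            ref_pos += 1
    return found
```

Because the MD tag is written in reference coordinates, insertions and soft clips in the CIGAR do not shift these positions. Getting the read's own base at each mismatch would additionally require walking the CIGAR string, which this sketch leaves out.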

Fastest way to extract differing positions from each alignment in a BAM file