Hi, I am currently trying to write some programme that will transverse the whole bam file in a short period of time.
The standard samtools api works extreme fast if I use bam_fetch (I can read through the whole transcriptom regions (gtf file from UCSC) within half an hour using a 7gb input)
The problem is, I want to detect reads in regions not specified by the transcriptom region (not in the user input). If I use the standard function in reading the reads one after another, it will take me around 2 hours. Is there any suggestion for me to speed the process up?
Thank you!
The standard samtools api works extreme fast if I use bam_fetch (I can read through the whole transcriptom regions (gtf file from UCSC) within half an hour using a 7gb input)
The problem is, I want to detect reads in regions not specified by the transcriptom region (not in the user input). If I use the standard function in reading the reads one after another, it will take me around 2 hours. Is there any suggestion for me to speed the process up?
Thank you!
Comment