I am using samtools and bamToBed to get the properly-paired read like this:
samtools view -bf 0x2 read.bam | bamToBed -i stdin > A.bed
then I use coverageBed to get counts for eahc gene in hg18 like this:
coverageBed -a A.bed -b hg18.bed.sorted -s > gene_count.txt
chr1 67051159 67113208 uc009waw.1 0 0 62049 0.0000000
chr1 67051159 67129093 uc009wax.1 0 0 77934 0.0000000
chr1 67051159 67163158 uc001dcx.1 3 108 111999 0.0009643
chr1 67075873 67163158 uc001dcy.1 3 108 87285 0.0012373
chr1 16762998 16791395 uc009vor.1 14 445 28397 0.0156707
chr1 16762998 16812569 uc009vos.1 19 625 49571 0.0126082
chr1 16765489 16786343 uc001ayz.1 12 373 20854 0.0178863
chr1 16765489 16786343 uc009vot.1 12 373 20854 0.0178863
chr1 33544953 33559286 uc001bxd.1 0 0 14333 0.0000000
chr1 41748270 42157083 uc001cgz.2 281 7868 408813 0.0192460
5th column is count. I have two questions:
what is definition of properly-paired reads?
When we calcualte count for each gene using properly-paired reads,
whether we just count one time for each properly-paired reads?
thanks,
jlm
samtools view -bf 0x2 read.bam | bamToBed -i stdin > A.bed
then I use coverageBed to get counts for eahc gene in hg18 like this:
coverageBed -a A.bed -b hg18.bed.sorted -s > gene_count.txt
chr1 67051159 67113208 uc009waw.1 0 0 62049 0.0000000
chr1 67051159 67129093 uc009wax.1 0 0 77934 0.0000000
chr1 67051159 67163158 uc001dcx.1 3 108 111999 0.0009643
chr1 67075873 67163158 uc001dcy.1 3 108 87285 0.0012373
chr1 16762998 16791395 uc009vor.1 14 445 28397 0.0156707
chr1 16762998 16812569 uc009vos.1 19 625 49571 0.0126082
chr1 16765489 16786343 uc001ayz.1 12 373 20854 0.0178863
chr1 16765489 16786343 uc009vot.1 12 373 20854 0.0178863
chr1 33544953 33559286 uc001bxd.1 0 0 14333 0.0000000
chr1 41748270 42157083 uc001cgz.2 281 7868 408813 0.0192460
5th column is count. I have two questions:
what is definition of properly-paired reads?
When we calcualte count for each gene using properly-paired reads,
whether we just count one time for each properly-paired reads?
thanks,
jlm
Comment