Unconfigured Ad

**Brian Bushnell** · 09-18-2017, 04:21 PM

I suggest you try BBSplit as in this thread. That will give you one fastq file for human reads, and one for viral reads. Then, map the viral output fastq file to the virus reference with a normal aligner (such as BBMap or Nextgenmap which you used previously). BBMap can output coverage data directly (well, BBSplit can too, actually...) using the covstats or basecov flags, if you want.

In answer to your last question, read IDs do not change with respect to reference genomes. However, unmapped bam files are not very useful and you can't use them for coverage analysis.

**Vca80553** · 09-19-2017, 04:27 AM

Dear Brian, Thanks a lot. I did as you suggested.
I also used BBMap (pileup.sh) for output coverage. I get the following results
Avg fold 53035,994
Length 7904
Ref_GC 0.0000
Covered % 100
Covered bases 7904
Plus reads 3190731
Minus reads 3190731
Read GC 0.395
Median_fold 22882
Std 24900.22

I assume I covered the whole genome. Do you happen to know whey the Ref_GC equals to 0.0000? I calculated GC content % of my reference genome and it is 36.51%. Thanks!

**Brian Bushnell** · 09-19-2017, 12:16 PM

That's because you did not specify the reference file with "ref=" when you ran pileup.sh. It's not necessary; you only need it if you want the Ref_GC column to be correct.

**Vca80553** · 09-19-2017, 01:23 PM

Ref_GC

Originally posted by Brian Bushnell View Post

That's because you did not specify the reference file with "ref=" when you ran pileup.sh. It's not necessary; you only need it if you want the Ref_GC column to be correct.

I wrote the following, but still didn't get it. Maybe something not right?

/home/sara/bbmap/pileup.sh in=1409_cat_sorted.bam "ref=/home/sara/HPV16Reference.fasta" out=1409.ref_stats1.txt hist=1409.ref1_histogram.txtll

Thanks

**Brian Bushnell** · 09-19-2017, 02:48 PM

That's strange - I tested it and it works fine for me. No reference:

Code:

pileup.sh in=mapped.sam.gz stats=covstats.txt

cat covstats.txt

#ID     Avg_fold        Length  Ref_GC  Covered_percent Covered_bases   Plus_reads      Minus_reads     Read_GC Median_fold     Std_Dev
chr1    0.0908  249250621       0.0000  8.1241  20249466        76413   74517   0.4179  0       0.33
chr10   0.0960  135534747       0.0000  8.6332  11700973        43491   43228   0.4160  0       0.33

With reference:

Code:

pileup.sh in=mapped.sam.gz stats=covstats2.txt ref=hg19_main_mask_ribo_animal_allplant_allfungus.fa.gz

cat covstats2.txt

#ID     Avg_fold        Length  Ref_GC  Covered_percent Covered_bases   Plus_reads      Minus_reads     Read_GC Median_fold     Std_Dev
chr1    0.0908  249250621       0.4183  8.1241  20249466        76413   74517   0.4179  0       0.33
chr10   0.0960  135534747       0.4167  8.6332  11700973        43491   43228   0.4160  0       0.33

I tried various things including adding "hist=" and using "out=" instead of "stats=", and putting quotes around the reference flag, but was unable to replicate this. Are you sure you are looking at the correct output file, rather than an old one?

Topics	Statistics	Last Post
Whole-Genome Sequencing Traces Faroe Islands Ancestry to a North Atlantic Founder Population by SEQadmin2 Started by SEQadmin2, 06-17-2026, 06:09 AM	0 responses 24 views 0 reactions	Last Post by SEQadmin2 06-17-2026, 06:09 AM
Sequencing the Two-Toed Sloth Genome Reveals Jumping Genes Tied to Its Extreme Metabolism by SEQadmin2 Started by SEQadmin2, 06-09-2026, 11:58 AM	0 responses 41 views 0 reactions	Last Post by SEQadmin2 06-09-2026, 11:58 AM
A New Method Makes Hantavirus Genome Analysis Faster and More Accessible by SEQadmin2 Started by SEQadmin2, 06-05-2026, 10:09 AM	0 responses 48 views 0 reactions	Last Post by SEQadmin2 06-05-2026, 10:09 AM
A New Single-Cell Method Maps DNA-Protein Interactions by SEQadmin2 Started by SEQadmin2, 06-04-2026, 08:59 AM	0 responses 49 views 0 reactions	Last Post by SEQadmin2 06-04-2026, 08:59 AM

Unconfigured Ad

Bam file with unmapped reads from another genome than reference

Comment

Comment

Comment

Comment

Comment

Latest Articles

ad_right_rmr

News