Go Back   SEQanswers > Bioinformatics > Bioinformatics

Similar Threads
Thread Thread Starter Forum Replies Last Post
Question about Samtools Mpileup -u option clintp Bioinformatics 0 05-04-2018 03:15 PM
featureCounts option question gene_x RNA Sequencing 6 08-09-2016 12:23 PM
BBmap dedupe help JamesSeward Bioinformatics 7 07-15-2016 11:20 PM
Question about --genemodel=complete option of Augustus evolver Bioinformatics 1 10-19-2015 03:36 AM
question about VariantRecalibrator's mode option caswater Bioinformatics 0 04-16-2012 01:34 AM

Thread Tools
Old 09-29-2018, 05:02 PM   #1
Location: Saint Louis, MO

Join Date: Sep 2010
Posts: 58
Default question about dedupe renameclusters option


How does the option renameclusters work in When I set this option to true it does not rename the contigs from 0 -> n as it does in the stats file output from csf=t. Instead, the numbers are more sporadic and over a larger range. For example, I complied the renamed contig ids with the name and size from the stats file and obtain this:

renamed_contig_id stats_file_cluster_id size
0 0 110827
1 1 13812
2 2 6812
3 3 3719
6 4 3481
9 5 2817
10 6 1880
11 7 1743
13 8 1435

Here is the command that was used to generate the clustered contig and stats file: in=input.fastq ow=t \
ac=f am=f s=4 c fo pc cc \
rnc=t sort=id \
csf=stats.txt outbest=best.fa;

How does rename cluster number? Is there anyway to make these numbers have parity with the output from csf=t?
shandley is offline   Reply With Quote

bbmap, dedupe

Thread Tools

Posting Rules
You may not post new threads
You may not post replies
You may not post attachments
You may not edit your posts

BB code is On
Smilies are On
[IMG] code is On
HTML code is Off

All times are GMT -8. The time now is 11:01 AM.

Powered by vBulletin® Version 3.8.9
Copyright ©2000 - 2021, vBulletin Solutions, Inc.
Single Sign On provided by vBSSO