Seqanswers Leaderboard Ad

**N311V** · 11-08-2015, 07:14 PM

Looks great! I have a heap of fastQC and Tophat files I can test it out on.

**ewels** · 11-09-2015, 01:11 AM

Great - let me know how you get on!

Note that fkrueger redefined what I thought "a lot of reports" meant last night by running MultiQC on over 3000 samples in one go. Note that the HMTL report will probably fall over if you use sample numbers in this order of magnitude, but MultiQC also creates tab-delimited text files which you can open in Excel / use downstream.

**NeilPearson** · 11-09-2015, 07:44 AM

Tried it out earlier. Very impressed!

**N311V** · 11-09-2015, 01:08 PM

Originally posted by tallphil View Post

Great - let me know how you get on!

Note that fkrueger redefined what I thought "a lot of reports" meant last night by running MultiQC on over 3000 samples in one go.

That redefines what I mean by a lot also, I was talking about 142 samples! I'll hopefully get a chance to run it today.

**ewels** · 11-12-2015, 04:30 AM

Originally posted by NeilPearson View Post

Tried it out earlier. Very impressed!

Great, glad you liked it!

N311V - 142 samples is still quite a bit (it will probably not render all of the plots by default to prevent locking up the browser), but it should work fine I think

Note that I'm hoping to make a new template at some point in the future that has flat pre-rendered plots instead of interactive JavaScript plots. This should mean that reports with huge numbers of samples will work and don't have huge filesizes. See the GitHub issue about this here.

**N311V** · 12-08-2015, 08:33 PM

Hi ewels,

I've finally got time to run MultiQC but having trouble providing a list of directories to search. Do I need top put my fastqc and tophat results all in the same directory to produce a single report for both?

P.S. It's actually 144 samples of paired-end data. Each end is a separate file so MultiQC had to compile 288 fastqc results files. Everything looks great (love it!) and MultiQC does not appear to have had any trouble with that many files. I'm using chrome on a laptop with 16 GB of RAM so that is likely helping to stop the browser from crashing.

**ewels** · 12-08-2015, 10:58 PM

Great that it's working, and even better that you like it

You can either supply MultiQC with a parent directory that contains all files (it searches recursively through child directories), or give it multiple paths:

Code:

multiqc fastqc_dir tophat_dir

You can even give it a massive list of files if you want to:

Code:

multiqc *fastqc.zip *_tophat

Hope that helps..

Phil

**gringer** · 12-09-2015, 12:15 AM

Great idea, thanks. Any chance you could add in kallisto and kraken support?

**zinky** · 12-09-2015, 12:20 AM

nice job, i starred already

**ewels** · 12-09-2015, 12:39 AM

Originally posted by gringer View Post

Great idea, thanks. Any chance you could add in kallisto and kraken support?

Hi gringer, I can do yeah - I've noted these down as GitHub issues here and here.

If you have some typical log files that you could add, that would really help. Saves me from having to set up and run the programs myself (though I've been meaning to try them both out anyway).

**zinky** · 12-09-2015, 01:03 AM

Hi ewels, i run MultiQC on my 6 tophat output folders. i found all log files were parsed and it also gives a bowtie2 plot. So dose MultiQC check the log files fisrst then parsed keywords like "bowtie","tophat" to determine types of output? if so, i can modify file names manually and get exactly what i want(one item in plot with each sample) .

**ewels** · 12-09-2015, 01:10 AM

Hi zinky,

The strategy for parsing files varies for each module. Unfortunately bowtie has no consistent file name structure and its output is very generic. Also, as many other programs use it then its output often crops up inside other programs. I can't think of any way to know the difference between log files generated by bowtie and those generated by programs that use bowtie. If you have any suggestions I'd love to hear them!

Anyway - the easiest fix for you is to just stop the bowtie modules from running. You can do this with the -e / --exclude parameter:

Code:

multiqc -e bowtie1 -e bowtie2 .

Let me know if you have any problems with this.

Phil

**zinky** · 12-09-2015, 01:34 AM

Hi Phil, Thanks for your quick reply; Well , your suggestion is a good choice for me; Technically, parsing output of those tools to generated summary report is a lightweight job, and it therefore raise a request to the software developers. This means ask them (well inluding me) to generate logs with ID and someother markers, that's not easy. so why not to suggest users runnig MultiQC which give an interface to call third-party tools inside , rather than the original tools. I have been working on workflow building for years, i think this kinds of exprience could be better for users

**ewels** · 12-11-2015, 07:40 AM

Hi Zinky,

I think you're suggesting that I make MultiQC into a workflow / pipeline tool of some sort? I'd prefer to keep it focussed and as simple as possible I think, so that it can be easily added to the end of any workflow and used with data generated in any manner from these tools.

I've written and use a workflow tool called Cluster Flow, so MultiQC should inevitably work pretty well with that. But it should work well with everything.

Phil

Topics	Statistics	Last Post
Expanding the Horizons of Cellular Research with the Single Cell Atlas by seqadmin Started by seqadmin, Yesterday, 11:49 AM	0 responses 15 views 0 likes	Last Post by seqadmin Yesterday, 11:49 AM
Genetic Variants and Diabetes Risk in Childhood Cancer Survivors by seqadmin Started by seqadmin, 04-24-2024, 08:47 AM	0 responses 16 views 0 likes	Last Post by seqadmin 04-24-2024, 08:47 AM
Cancer Metastasis: A Deep Dive into Cellular Plasticity by seqadmin Started by seqadmin, 04-11-2024, 12:08 PM	0 responses 61 views 0 likes	Last Post by seqadmin 04-11-2024, 12:08 PM
Proteogenomic Profiles Offer New Clues in Prostate Cancer by seqadmin Started by seqadmin, 04-10-2024, 10:19 PM	0 responses 60 views 0 likes	Last Post by seqadmin 04-10-2024, 10:19 PM

Seqanswers Leaderboard Ad

Announcement

MultiQC - a new tool to create summary reports from any analysis output

Comment

Comment

Comment

Comment

Comment

Comment

Comment

Comment

Comment

Comment

Comment

Comment

Comment

Comment

Latest Articles

ad_right_rmr

News