I have a machine with 16 cores, 64GB memory and 16TB hard drive.
I have a very large RNASeq data set to analyse:
25 Cases (100 millions reads in each file).
25 Controls (100 millions reads in each file).
The cufflinks and cuffmerge part are all done.
The cuffdiff is given me some problems because when I use more than one thread (i.e,-p *> 1), I run out of memory after 1 day.
I am running cuffdiff with -p 1, and the job started 3 weeks ago and still running.
How can I speed up the process ? Or what are the other solutions ?
Can I split my files per chromosomes and run different analysis ? If so will my results be usable ?
Can I ask cuffdiff to write some partial results ?
Can I ask cuffdiff to compute juste the gene expression levels and discard the other results ?
Thanks for any advise.
Alpha
I have a very large RNASeq data set to analyse:
25 Cases (100 millions reads in each file).
25 Controls (100 millions reads in each file).
The cufflinks and cuffmerge part are all done.
The cuffdiff is given me some problems because when I use more than one thread (i.e,-p *> 1), I run out of memory after 1 day.
I am running cuffdiff with -p 1, and the job started 3 weeks ago and still running.
How can I speed up the process ? Or what are the other solutions ?
Can I split my files per chromosomes and run different analysis ? If so will my results be usable ?
Can I ask cuffdiff to write some partial results ?
Can I ask cuffdiff to compute juste the gene expression levels and discard the other results ?
Thanks for any advise.
Alpha
Comment