Seqanswers Leaderboard Ad

Collapse

Announcement

Collapse
No announcement yet.
X
 
  • Filter
  • Time
  • Show
Clear All
new posts

  • Copy number variation in cancer WGS

    Hi guys,

    I am looking at some whole genome sequencing (WGS) for tumor-normal pairs, and I want to find somatic copy number alteration in the tumors.
    What tools do you guys recommend for these?

    I have read about a few, e.g., BIC-Seq, OncoSNP-SEQ, CREST, etc., but have no experience. Any recommendations?

    Thanks in advance.

  • #2
    Control-FreeC is a good choice.
    And an interesting article:

    Comment


    • #3
      Thanks. Yeah I've read over that article just recently, and I like to see what are people's experiences with different software.

      Control-FREEC seems pretty good. What's your experience with it? I also want to try out BIC-Seq, OncoSNP-SEQ, and CNAnorm. I want to see how they perform. Does anyone have any experience with those?

      Comment


      • #4
        Control-FreeC is easy to use if you are working with human or mouse model. If you working with non model-organisms it is a bit tricky, because you need to generate the mappability files. Read carefully the documentation because there is a lots of settings. But the we were happy with the results.

        Comment


        • #5
          Originally posted by TiborNagy View Post
          Control-FreeC is easy to use if you are working with human or mouse model. If you working with non model-organisms it is a bit tricky, because you need to generate the mappability files. Read carefully the documentation because there is a lots of settings. But the we were happy with the results.
          I'm working with some tumor-normal WGS pairs. Finished some runs, trying to make the plot........ which is taking some time.

          I ran Control-FREEC with a pair of simulated tumor-normal data from this Syapse thing: https://www.synapse.org/#!Synapse:syn312572/wiki/60702

          It took 3 hours to complete. The *_ratio.txt file is 113M. It took 3 minutes to produce the graph.
          For the real tumor-normal pair (data file is slightly larger), it took 14 hours to complete. The The *_ratio.txt file is 169M. It took about an hour to finish plotting.

          Why that discrepancy in time?
          Last edited by lethalfang; 03-17-2014, 09:43 AM.

          Comment


          • #6
            Just checking, it's a bad idea to split Copy Number Analysis jobs into chromosome by chromosome, right? It would be bad for the sample power.

            Comment


            • #7
              I'm wondering for tumor-normal paired WGS, what are the commended settings for some of the following:

              coefficientOfVariation:
              The test file had 0.062. The manual has an "example" of 0.05. What's a good parameter from people's experience?

              forceGCcontentNormalization:
              Does GC content normalization improve the results for tumor-normal pairs? If so, is 1 or 2 better?
              1: normalize GC content first, and then calculate sample/control ratio.
              2: calculate sample/control ratio first, and then normalize GC content.

              intercept:
              1 - with GC content
              0 - with a control dataset.
              What if I have both?

              minCNAlength:
              What's a good setting based on people's experience?

              Any other setting that people find particularly better for tumor-normal pairs?

              Thanks.

              Comment


              • #8
                Originally posted by lethalfang View Post
                Why that discrepancy in time?
                It can be anything: slow hard disk, old computer, etc.

                Just checking, it's a bad idea to split Copy Number Analysis jobs into chromosome by chromosome, right?
                I am not an expert, but I think it is not a bad idea.
                Last edited by TiborNagy; 03-18-2014, 06:29 AM. Reason: spell correction

                Comment


                • #9
                  Originally posted by lethalfang View Post
                  Hi guys,

                  I am looking at some whole genome sequencing (WGS) for tumor-normal pairs, and I want to find somatic copy number alteration in the tumors.
                  What tools do you guys recommend for these?

                  I have read about a few, e.g., BIC-Seq, OncoSNP-SEQ, CREST, etc., but have no experience. Any recommendations?

                  Thanks in advance.
                  Hi, you could try sequenza, is available on CRAN.

                  Is composed by a python script that generate a suitable file for the R-package, or alternatively VarScan2 output could be used.

                  It was developed for exome sequencing, but with whole-genome it works even better.

                  It's available from the institute page or from CRAN, the python script is bundled with the R package, as the documentation and example data.

                  As usual, higher depth, and higher tumor content are a goo thing, but I managed to analyse tumor sample with relatively low depth (10x) as well samples around ~20% of tumor content with satisfying results.

                  Comment


                  • #10
                    Originally posted by ffavero View Post
                    Hi, you could try sequenza, is available on CRAN.

                    Is composed by a python script that generate a suitable file for the R-package, or alternatively VarScan2 output could be used.

                    It was developed for exome sequencing, but with whole-genome it works even better.

                    It's available from the institute page or from CRAN, the python script is bundled with the R package, as the documentation and example data.

                    As usual, higher depth, and higher tumor content are a goo thing, but I managed to analyse tumor sample with relatively low depth (10x) as well samples around ~20% of tumor content with satisfying results.
                    Cool.
                    Is it possible to see the pre-print of your submitted paper, or do you have to wait until it's published?

                    Comment


                    • #11
                      :-).
                      Well, I could but I'd like to see what the reviewers/editors have to say first.

                      We just describe the algorithm and compare the results runing on exome with respective SNP array from TCGA, as well compare the results of other similar algirithms.
                      Sequenza was, when not perfecly the same, was pretty close to the SNP array prediction...

                      Comment


                      • #12
                        Originally posted by ffavero View Post
                        :-).
                        Well, I could but I'd like to see what the reviewers/editors have to say first.

                        We just describe the algorithm and compare the results runing on exome with respective SNP array from TCGA, as well compare the results of other similar algirithms.
                        Sequenza was, when not perfecly the same, was pretty close to the SNP array prediction...
                        That sounds like OncoSNP-SEQ.

                        Comment


                        • #13
                          Originally posted by lethalfang View Post
                          That sounds like OncoSNP-SEQ.
                          Well oncoSNP-seq have a different inference implementation, plus it uses dbSNP to set heteozygous positions, while we use information from the germline.

                          It's written in MATLAB while we have implemented it in R, which should make it easier to use I suppose.

                          Anyway I've started the sofware from scratch , to have something working properly with exome, whitout borrow any concept from oncoSNP, they are ptetty different from each other, I would say.

                          I've tried to use oncoSNP-seq on exome, but it doesn't works well, as warned in the manual.
                          Although sequenza works pretty well with wgs.
                          If you want to perform some testing it it should be difficult to try them both and benchmark the difference.

                          Comment


                          • #14
                            Originally posted by ffavero View Post
                            Well oncoSNP-seq have a different inference implementation, plus it uses dbSNP to set heteozygous positions, while we use information from the germline.

                            It's written in MATLAB while we have implemented it in R, which should make it easier to use I suppose.

                            Anyway I've started the sofware from scratch , to have something working properly with exome, whitout borrow any concept from oncoSNP, they are ptetty different from each other, I would say.

                            I've tried to use oncoSNP-seq on exome, but it doesn't works well, as warned in the manual.
                            Although sequenza works pretty well with wgs.
                            If you want to perform some testing it it should be difficult to try them both and benchmark the difference.
                            Yeah installing MATLAB runtime was a pain.
                            It's much better to have a software without having to install a proprietary 3rd-party language.

                            Comment


                            • #15
                              Was trying



                              sequenza-utils.py pileup2seqz \
                              -gc ~/references/hg19.gc50Base.txt.gz \
                              -n ~/pileups_benchmark/wgs.normal.pileup.gz \
                              -t ~/pileups_benchmark/wgs.tumor.pileup.gz \
                              | bgzip > ~/out.seqz.gz

                              And I got an empty output file, with only the header. No error message.
                              The two pileups have chromosomes labeled as 1, 2, 3, ..., X, Y, M.

                              Any idea why?

                              Thanks.

                              Comment

                              Latest Articles

                              Collapse

                              • seqadmin
                                Advancing Precision Medicine for Rare Diseases in Children
                                by seqadmin




                                Many organizations study rare diseases, but few have a mission as impactful as Rady Children’s Institute for Genomic Medicine (RCIGM). “We are all about changing outcomes for children,” explained Dr. Stephen Kingsmore, President and CEO of the group. The institute’s initial goal was to provide rapid diagnoses for critically ill children and shorten their diagnostic odyssey, a term used to describe the long and arduous process it takes patients to obtain an accurate...
                                12-16-2024, 07:57 AM
                              • seqadmin
                                Recent Advances in Sequencing Technologies
                                by seqadmin



                                Innovations in next-generation sequencing technologies and techniques are driving more precise and comprehensive exploration of complex biological systems. Current advancements include improved accessibility for long-read sequencing and significant progress in single-cell and 3D genomics. This article explores some of the most impactful developments in the field over the past year.

                                Long-Read Sequencing
                                Long-read sequencing has seen remarkable advancements,...
                                12-02-2024, 01:49 PM

                              ad_right_rmr

                              Collapse

                              News

                              Collapse

                              Topics Statistics Last Post
                              Started by seqadmin, 12-17-2024, 10:28 AM
                              0 responses
                              33 views
                              0 likes
                              Last Post seqadmin  
                              Started by seqadmin, 12-13-2024, 08:24 AM
                              0 responses
                              49 views
                              0 likes
                              Last Post seqadmin  
                              Started by seqadmin, 12-12-2024, 07:41 AM
                              0 responses
                              34 views
                              0 likes
                              Last Post seqadmin  
                              Started by seqadmin, 12-11-2024, 07:45 AM
                              0 responses
                              46 views
                              0 likes
                              Last Post seqadmin  
                              Working...
                              X