Unconfigured Ad

Collapse
X
 
  • Filter
  • Time
  • Show
Clear All
new posts
  • sadiqsaleem09
    Junior Member
    • Apr 2011
    • 9

    Planning a cancer exome sequencing project

    Hello everyone,
    Hope you having a fine day!

    I have recently got interested into NGS and its applications for studying cancer genomes. My group is interested in studying somatic mutation landscape of a particular tumor. After going through some literature and considering my current funding, I have decided to pursue exome sequencing.

    The question for which I am or will be using exome sequencing is to see if there is any co-relation between the mutations harbored in the cancer exome and the cancer progression..

    Do you think 5 tumor samples will be a good number to start with? We will be using Illumina genome analyzer IIx, PE 58 bp...
  • Simon Anders
    Senior Member
    • Feb 2010
    • 995

    #2
    No. You are hopelessly underpowered.

    See this thread and maybe this News&Views article.

    Comment

    • sadiqsaleem09
      Junior Member
      • Apr 2011
      • 9

      #3
      Thanks Simon for your response and sharing the relevant links.

      Have you looked into these studies?



      and




      For whole exome sequencing, the first study sequenced 14 tumor and matched normal DNA and the second study sequenced 7 tumor and matched normal DNA. Considering these two reports both getting published in highly reputable journal, why do you think 5 or 6 tumor DNA with their matched normal DNA will be 'hopelessly underpowered'???

      Comment

      • Simon Anders
        Senior Member
        • Feb 2010
        • 995

        #4
        These two studies set out to find mutations that are frequent in a given tumour type. You have asked about correlating mutation with disease progression. This is quite a difference!

        Varela et al. and Wei et al. set out to find mutations which are frequent in a given cancer type, as opposed to mutations at random places. Given the large size of the human genome, it is in fact unlikely to see the same mutation twice unless this is a hotspot. (Nevertheless, both studies amplified the loci they saw mutated more than once in more than a hundred additional samples to make sure.)

        Finding such mutation is one thing. Figuring out which of these mutations are markers for good or bad prognosis is much more challenging. Imagine, you have 5 samples, two from patients who died soon afterwards and three from patients who survived long. Now, you find a mutation in three of your five samples, namely in both the bad and only in one of the three good cases. Of course, you cannot claim that this indicates that the mutation marks a bad prognosis without checking the mutation in samples from very many more patients.

        Comment

        • Heisman
          Senior Member
          • Dec 2010
          • 534

          #5
          Sadiq,

          Your first post makes it seems like you are interested in cancer progression, as opposed to simple association of mutations with cancer. For the latter there are only two distinct types of samples, those with cancer and those without. For the former there are many more distinct types of samples, those without cancer, those with initial signs of cancer, those with more developed cancer, etc. Or, probably more appropriately, you would need samples from cancer patients who have had their cancer progress to a severe stage and compare mutations in different patients with different rates of progression. I'm guessing that would make it much more underpowered.

          Comment

          • sadiqsaleem09
            Junior Member
            • Apr 2011
            • 9

            #6
            Thank you Simon and Heisman.

            To be more precise, what I meant by cancer progression was cancer metastasis. My hypothesis is that primary tumors (tumor from the original site) must harbor the genomic alterations that would indicate that the tumor has metastasized. In this case, primary tumors from patients with no lymph nodes and distant metastasis will be different from primary tumors from patients with positive lymph nodes atleast.

            To test this hypothesis and considering the limited funding, do you think searching for mutations through WES in two distinct sets of total 6 primary tumors (n = 3 + 3) followed by screening of hotspots in a bigger pool of samples (n = 100 or more) will be a viable idea?

            Please don't mind as I have just started to learn about NGS and would really like your responses. Thank you once again.

            Comment

            • krobison
              Senior Member
              • Nov 2007
              • 734

              #7
              The challenge you will face with most tumors is that there are likely to be huge numbers of mutations in the tumors, but it depends on the tumor type. Also, with only 6 primaries you'll only be likely to find very frequent hotspots (roughly those present at frequency 1/6, but of course it's more complicated than that -- and too late at night for me to even attempt to botch the calculation!). The number of mutations present at 15% or more is relatively limited compared to the many biologically important mutations which are not. The odds that you'll find a shared mutation specific to the distant metastasis samples with that kind of frequency are quite poor.

              Also, it may well be that the distant metastasis arises from a relatively rare clone within the primary, which means it will be challenging to detect.

              For the number of samples you can afford, I would think a project more likely to yield something would be to compare matched primary & distant metastasis samples ala Ding or Jones.

              You should look very carefully at what your cost internally is to run this vs. going outside; a number of commercial providers will generate human exomes for about $2K each (50X coverage). They are getting efficiencies from using the HiSeq & buying the capture reagents in large quantities. I'm not guaranteeing it will be cheaper, but it could well be since you are proposing to use a GAIIx. The one thing to be careful of is what coverage the provider will guarantee vs. what you want, but you need to watch that with an internal project as well.

              Finally, I would (of course! :-) recommend you get a copy of Brief Bioinform. 2010 Sep;11(5):524-34; it won't cover what's been published since February of 2010 & it doesn't really cover the experimental design issue you are getting into, but does give an overview of pretty much everything in cancer genomics with NGS up to them. I don't know of a more up-to-date review.

              Comment

              Latest Articles

              Collapse

              • GATTACAT
                Reply to Nine Things a Sample Prep Scientist Thinks About Before Sequencing
                by GATTACAT
                Love this - good data definitely starts from good input, and poor input can only give relatively poor data. I particularly like the mention of Nanodrop/absorbance based methods for quantification. It's such a toss up if you'll get an accurate reading or what amounts to a randomly generated number, and a lot of library/sequencing related issues can be traced back to poor quant.
                Today, 11:43 AM
              • SEQadmin2
                Nine Things a Sample Prep Scientist Thinks About Before Sequencing
                by SEQadmin2


                I’m not a sequencing expert. I’m a purification scientist who uses NGS to evaluate workflows my group develops. With this perspective, we think about the sample first and the NGS workflow second. The sequencer is an exceptionally honest reporter, but it can only report on what you give it, so whether you get clean, interpretable data from an NGS workflow is largely determined before you begin.

                Here are nine questions we think about, in roughly the order they matter, before...
                06-18-2026, 07:11 AM
              • SEQadmin2
                From Collection to Sequencing: Why Sample Preparation and Preservation Define Sequencing Data
                by SEQadmin2


                Data variability is still an issue in sequencing technologies despite the advances in reproducibility and accuracy of these platforms. But the problem does not originate in the sequencing itself, but in the previous steps, before the sample reaches the sequencer.


                The first step is collection, followed by preservation and sample preparation for analysis. Most scientists overlook those steps, but not being careful might just be skewing the experiment’s results.
                ...
                06-02-2026, 10:05 AM

              ad_right_rmr

              Collapse

              News

              Collapse

              Topics Statistics Last Post
              Started by SEQadmin2, Yesterday, 05:37 AM
              0 responses
              7 views
              0 reactions
              Last Post SEQadmin2  
              Started by SEQadmin2, 06-26-2026, 11:10 AM
              0 responses
              17 views
              0 reactions
              Last Post SEQadmin2  
              Started by SEQadmin2, 06-17-2026, 06:09 AM
              0 responses
              52 views
              0 reactions
              Last Post SEQadmin2  
              Started by SEQadmin2, 06-09-2026, 11:58 AM
              0 responses
              110 views
              0 reactions
              Last Post SEQadmin2  
              Working...