Seqanswers Leaderboard Ad

Collapse

Announcement

Collapse
No announcement yet.
X
 
  • Filter
  • Time
  • Show
Clear All
new posts

  • determining library size

    How does one go about determining what library size is the best to sequence, for example 200bp libraries versus 250bp or 300bp libraries?

  • #2
    Library size is determined by read length.If you aim for 2x100 reads,you can go for a 300bp insert which could produce two 100bp non overlapping reads subsequently increasing coverage.

    Comment


    • #3
      Hi!

      I had a few beginner questions about size selection when doing library prep.

      Some background info:
      -General project is a resequencing project
      -Have a reference genome
      -Using genomic DNA
      -Will be using Illumina Hiseq platform
      -Will be looking for SNPs when analyzing

      Big question: How do you determine what size your adapters/libraries should be (if doing paired ends)?
      -If using beads, your size selection can range from say 200-500 bp. From that range, how do you know which size will be best? I'm guessing this depends on the desired read lengths as stated in the above post?

      I think since I will be doing SNP detection, I'm concerned how much a couple hundred bp will affect the outcome.

      Hopefully I made some sense in my questions.
      Thanks!

      Comment


      • #4
        kjm, if you are looking for SNPs there are few things worth considering. When we do a nextRAD SNP discovery project, the main questions are 1) how many markers are needed, and 2) what kind of depth of coverage is needed.

        If you are doing whole-genome resequencing, you can't really target a desired number of SNPs. But you'll need a reasonably high coverage to sample both homologous chromosomes in a diploid (~20X), and lower coverage if the genome you are sampling is haploid.

        As to the fragment length of the library, in your case it doesn't matter too much. If you want to identify structural variation, then a longer insert size helps. You will want it to be >twice the read length so you don't waste sequencing efforts on redundant information. You might as well keep complexity in the library by not excluding much.
        Providing nextRAD genotyping and PacBio sequencing services. http://snpsaurus.com

        Comment


        • #5
          Thanks for the response SNPsaurus. Not sure if it will change anything, but I will be using the SNPs for recombination rate. So if I remember the number correctly, the goal is to have over 10,000 markers (this is the number that keeps popping in my head I think this is right).

          Comment


          • #6
            Unless you are re-sequencing closely related small genomes (like yeast), whole genome sequencing will give you many more markers than that. Genotyping-by-sequencing methods were invented for that reason--to sequence just a fraction the genome, allowing more samples to be sequenced in a lane. Check out http://www.ncbi.nlm.nih.gov/pubmed/21681211 for a review.
            Providing nextRAD genotyping and PacBio sequencing services. http://snpsaurus.com

            Comment

            Latest Articles

            Collapse

            • seqadmin
              Essential Discoveries and Tools in Epitranscriptomics
              by seqadmin


              The field of epigenetics has traditionally concentrated more on DNA and how changes like methylation and phosphorylation of histones impact gene expression and regulation. However, our increased understanding of RNA modifications and their importance in cellular processes has led to a rise in epitranscriptomics research. “Epitranscriptomics brings together the concepts of epigenetics and gene expression,” explained Adrien Leger, PhD, Principal Research Scientist on Modified Bases...
              Yesterday, 07:01 AM
            • seqadmin
              Current Approaches to Protein Sequencing
              by seqadmin


              Proteins are often described as the workhorses of the cell, and identifying their sequences is key to understanding their role in biological processes and disease. Currently, the most common technique used to determine protein sequences is mass spectrometry. While still a valuable tool, mass spectrometry faces several limitations and requires a highly experienced scientist familiar with the equipment to operate it. Additionally, other proteomic methods, like affinity assays, are constrained...
              04-04-2024, 04:25 PM

            ad_right_rmr

            Collapse

            News

            Collapse

            Topics Statistics Last Post
            Started by seqadmin, 04-11-2024, 12:08 PM
            0 responses
            54 views
            0 likes
            Last Post seqadmin  
            Started by seqadmin, 04-10-2024, 10:19 PM
            0 responses
            50 views
            0 likes
            Last Post seqadmin  
            Started by seqadmin, 04-10-2024, 09:21 AM
            0 responses
            44 views
            0 likes
            Last Post seqadmin  
            Started by seqadmin, 04-04-2024, 09:00 AM
            0 responses
            55 views
            0 likes
            Last Post seqadmin  
            Working...
            X