Seqanswers Leaderboard Ad

Collapse

Announcement

Collapse
No announcement yet.
X
 
  • Filter
  • Time
  • Show
Clear All
new posts

  • Is it possible to evaluate genome size with sequel data?

    Now we are doing the denovo assembly of marine organism with whole genome sequcing using sequel system. As we all know, the DNA extraction from marine organism is very difficult because of pollution and degradation. So is there any way to evaluate the genome size, heterozygus rate or genome repeat with DNA sequel data?
    happy

  • #2
    Use multipass pacbio reads for self error correction and Kmer counting.

    First try filtering out the multipass reads, and using those for kmer counting and self error correction.

    Make sure to remove any mitochondrial/symbionts reads before doing the kmer counting. (Identify and complete the respective genome(s) first).

    Get some good quality PCR-free illumina 2x250 reads or (BGIseq data if it works in your hands) and use it to confirm the kmer counting/self error correction/etc.

    Short reads are very helpful for getting the contaminant(s)/symbionts genomes to a good draft stage and for filtering them out from the main dataset.
    Usually such approach has to be done in the iterative fashion (with increasing amount of the input data after each iteration).

    Comment


    • #3
      Markiyan has alluded to it already; Pacbio data are not suitable for genome size estimates based on kmer analyses. The error rates of the uncorrected raw data are too high.

      Comment


      • #4
        While a kmer analysis is going to be difficult with the raw pacbio data, it is possible to estimate the (effective) genome size from overlap statistics, either for the raw reads, the error corrected preassembled reads or by mapping the raw reads to the assembled contigs.
        Run an initial assembly using a small seed read length, then plot the preassembled read overlap histogram.
        http://pb-falcon.readthedocs.io/en/l...pread-overlaps

        http://pb-falcon.readthedocs.io/en/l...GM2017_BFX.pdf

        Comment


        • #5
          I really appreciate for your help! I will have a try!
          happy

          Comment

          Latest Articles

          Collapse

          • seqadmin
            Techniques and Challenges in Conservation Genomics
            by seqadmin



            The field of conservation genomics centers on applying genomics technologies in support of conservation efforts and the preservation of biodiversity. This article features interviews with two researchers who showcase their innovative work and highlight the current state and future of conservation genomics.

            Avian Conservation
            Matthew DeSaix, a recent doctoral graduate from Kristen Ruegg’s lab at The University of Colorado, shared that most of his research...
            03-08-2024, 10:41 AM
          • seqadmin
            The Impact of AI in Genomic Medicine
            by seqadmin



            Artificial intelligence (AI) has evolved from a futuristic vision to a mainstream technology, highlighted by the introduction of tools like OpenAI's ChatGPT and Google's Gemini. In recent years, AI has become increasingly integrated into the field of genomics. This integration has enabled new scientific discoveries while simultaneously raising important ethical questions1. Interviews with two researchers at the center of this intersection provide insightful perspectives into...
            02-26-2024, 02:07 PM

          ad_right_rmr

          Collapse

          News

          Collapse

          Topics Statistics Last Post
          Started by seqadmin, 03-14-2024, 06:13 AM
          0 responses
          32 views
          0 likes
          Last Post seqadmin  
          Started by seqadmin, 03-08-2024, 08:03 AM
          0 responses
          71 views
          0 likes
          Last Post seqadmin  
          Started by seqadmin, 03-07-2024, 08:13 AM
          0 responses
          80 views
          0 likes
          Last Post seqadmin  
          Started by seqadmin, 03-06-2024, 09:51 AM
          0 responses
          68 views
          0 likes
          Last Post seqadmin  
          Working...
          X