Seqanswers Leaderboard Ad

Collapse

Announcement

Collapse
No announcement yet.
X
 
  • Filter
  • Time
  • Show
Clear All
new posts

  • SNP calling on low coverage NGS data

    Hi,

    I am new to SNP calling and would appreciate some input. I want to try and call SNP's for a specific organisms. The hurdles I have is, I do not have a reference genome to map against and I have a very low coverage 4x per organisms.

    Any suggestions would be appreciated.

  • #2
    The problem is the following: If you do not have reference genome, you need a de-novo assembly. You can find heterozygous positions in de-novo (which can be called SNPs), but in you case only two reads support this hipothesis.
    So, use this sequences to create a reference and sequence other samples to find SNPs or make a higher coverage sequencing from the initial sample.

    Comment


    • #3
      4X read depth can be even more misleading, since at heterozygous loci you will sometimes get 2 reads per allele, but also 3 reads to one allele vs 1 read for the other, or even 4 reads to one allele and no reads for the other. On the other hand, any allele supported by a few reads is probably real; you just can't fully genotype the sample but you can identify SNPs.

      Genotyping by sequencing was developed to solve this problem. By sequencing only 10,000-100,000 loci across a genome, the reads attain good depth with a small amount of sequencing. We do genotyping by sequencing, as do other companies and academic cores. You don't need a reference for the informatics, but rather the reads can just be compared from sample to sample using a core set of reads representing the loci as a reference.
      Providing nextRAD genotyping and PacBio sequencing services. http://snpsaurus.com

      Comment


      • #4
        You are in trouble. 4x is not enough for de novo assembly, and it is not really enough for calling homozygous SNPs, let alone heterozygous ones.

        If these are all the same species, I'd combine all the reads together, and do de novo assembly on that. Then align each sample's reads to that assembly, call SNPs on those alignments, and realize that you are going to miss most of the heterozygous SNPs, and many homozygous ones too.

        Comment


        • #5
          4X coveragedata without reference is almost impossible to call

          In my opinion,4X coverage reseq-data without any reference is almost impossible to call SNP.
          Best.
          Naibin

          Comment

          Latest Articles

          Collapse

          • seqadmin
            Essential Discoveries and Tools in Epitranscriptomics
            by seqadmin


            The field of epigenetics has traditionally concentrated more on DNA and how changes like methylation and phosphorylation of histones impact gene expression and regulation. However, our increased understanding of RNA modifications and their importance in cellular processes has led to a rise in epitranscriptomics research. “Epitranscriptomics brings together the concepts of epigenetics and gene expression,” explained Adrien Leger, PhD, Principal Research Scientist on Modified Bases...
            Yesterday, 07:01 AM
          • seqadmin
            Current Approaches to Protein Sequencing
            by seqadmin


            Proteins are often described as the workhorses of the cell, and identifying their sequences is key to understanding their role in biological processes and disease. Currently, the most common technique used to determine protein sequences is mass spectrometry. While still a valuable tool, mass spectrometry faces several limitations and requires a highly experienced scientist familiar with the equipment to operate it. Additionally, other proteomic methods, like affinity assays, are constrained...
            04-04-2024, 04:25 PM

          ad_right_rmr

          Collapse

          News

          Collapse

          Topics Statistics Last Post
          Started by seqadmin, 04-11-2024, 12:08 PM
          0 responses
          39 views
          0 likes
          Last Post seqadmin  
          Started by seqadmin, 04-10-2024, 10:19 PM
          0 responses
          41 views
          0 likes
          Last Post seqadmin  
          Started by seqadmin, 04-10-2024, 09:21 AM
          0 responses
          36 views
          0 likes
          Last Post seqadmin  
          Started by seqadmin, 04-04-2024, 09:00 AM
          0 responses
          55 views
          0 likes
          Last Post seqadmin  
          Working...
          X