Seqanswers Leaderboard Ad

Collapse

Announcement

Collapse
No announcement yet.
X
 
  • Filter
  • Time
  • Show
Clear All
new posts

  • Generic SNP/Indel Simulator for NGS data

    Expert is there any existing SNP/Indel Simulator for NGS reads?

    The principle the tool does something like taking any reference genome, pick positions randomly in that genome and begin to introduce SNP or indels in that position. Finally the tool should give reads given from the variant genome above with certain depth, and location of SNP/Indel is introduced in the reads fasta header.

  • #2
    I think Maq does that, you should take a look at http://maq.sourceforge.net/maq-manpage.shtml#7

    Comment


    • #3
      Hi Yilong,

      Thanks so much for your reply.
      I tried to run 'fakemut' from maq using this command.

      Code:
      maq fakemut chr22.fa > out.fakeref.fasta 2> out.fake.snp
      Do you know what does each column means from "out.fake.snp"?
      I can guess, but I am not quite sure.

      I can't seem find any explanation about it from the web?

      Code:
      chr22   16051056        G       c       99
      chr22   16054178        -       T       99
      chr22   16055216        C       G       99
      chr22   16056830        A       C       99
      chr22   16056930        A       G       99
      chr22   16057491        G       A       99
      chr22   16058778        C       t       99

      Comment


      • #4
        Also take a look to dnaa, specifically the dwgsim (forked from the maq tool).
        -drd

        Comment


        • #5
          I need to generate a small dataset of individuals organized in pool... with SNP and indel
          is there a way to do this ?

          Comment

          Latest Articles

          Collapse

          • seqadmin
            Current Approaches to Protein Sequencing
            by seqadmin


            Proteins are often described as the workhorses of the cell, and identifying their sequences is key to understanding their role in biological processes and disease. Currently, the most common technique used to determine protein sequences is mass spectrometry. While still a valuable tool, mass spectrometry faces several limitations and requires a highly experienced scientist familiar with the equipment to operate it. Additionally, other proteomic methods, like affinity assays, are constrained...
            04-04-2024, 04:25 PM
          • seqadmin
            Strategies for Sequencing Challenging Samples
            by seqadmin


            Despite advancements in sequencing platforms and related sample preparation technologies, certain sample types continue to present significant challenges that can compromise sequencing results. Pedro Echave, Senior Manager of the Global Business Segment at Revvity, explained that the success of a sequencing experiment ultimately depends on the amount and integrity of the nucleic acid template (RNA or DNA) obtained from a sample. “The better the quality of the nucleic acid isolated...
            03-22-2024, 06:39 AM

          ad_right_rmr

          Collapse

          News

          Collapse

          Topics Statistics Last Post
          Started by seqadmin, 04-11-2024, 12:08 PM
          0 responses
          18 views
          0 likes
          Last Post seqadmin  
          Started by seqadmin, 04-10-2024, 10:19 PM
          0 responses
          22 views
          0 likes
          Last Post seqadmin  
          Started by seqadmin, 04-10-2024, 09:21 AM
          0 responses
          17 views
          0 likes
          Last Post seqadmin  
          Started by seqadmin, 04-04-2024, 09:00 AM
          0 responses
          49 views
          0 likes
          Last Post seqadmin  
          Working...
          X