Seqanswers Leaderboard Ad

Collapse

Announcement

Collapse
No announcement yet.
X
 
  • Filter
  • Time
  • Show
Clear All
new posts

  • Best way to generate reads for testing of aligners/snp callers

    Hi all,

    I know of numerous programs for generating artificial reads e.g. wgsim, bgeneratereads, simNGS...

    BUT

    I am trying to find one which will annotate the output sequences with a description of various characteristics that will make downstream testing of aligners/snp callers easier.

    For example, the read name would contain info on chromosome and position of origin and the coordinates of any errors/snps/indels etc that have been introduced to the sequence.

    Does anything like this exist or would it be necessary to write a program/hack an existing one?

    Thanks in advance.

  • #2
    Shameless plug: http://sourceforge.net/apps/mediawik...ome_Simulation

    Comment


    • #3
      Originally posted by nilshomer View Post
      We are eagerly awaiting your evaluation scripts.

      Comment


      • #4
        They are there, but not documented on the wiki. Maybe you could try it out and help add some documentation? Open source FTW?

        Comment


        • #5
          Originally posted by nilshomer View Post
          I was hoping you could answer this one

          Comment


          • #6
            I know this thread is old, but I thought I might add that you can check out DNemulator at

            It allows you to generate simulated reads from a reference, using the fastq scores of a run from your favourite platform to set the scores on your simulated data. It also saves the "mapping" of where your reads come from, so you can figure out whether your mapper is doing a good job or not...

            Cheers,

            Elizabeth

            Comment

            Latest Articles

            Collapse

            • seqadmin
              Essential Discoveries and Tools in Epitranscriptomics
              by seqadmin




              The field of epigenetics has traditionally concentrated more on DNA and how changes like methylation and phosphorylation of histones impact gene expression and regulation. However, our increased understanding of RNA modifications and their importance in cellular processes has led to a rise in epitranscriptomics research. “Epitranscriptomics brings together the concepts of epigenetics and gene expression,” explained Adrien Leger, PhD, Principal Research Scientist...
              04-22-2024, 07:01 AM
            • seqadmin
              Current Approaches to Protein Sequencing
              by seqadmin


              Proteins are often described as the workhorses of the cell, and identifying their sequences is key to understanding their role in biological processes and disease. Currently, the most common technique used to determine protein sequences is mass spectrometry. While still a valuable tool, mass spectrometry faces several limitations and requires a highly experienced scientist familiar with the equipment to operate it. Additionally, other proteomic methods, like affinity assays, are constrained...
              04-04-2024, 04:25 PM

            ad_right_rmr

            Collapse

            News

            Collapse

            Topics Statistics Last Post
            Started by seqadmin, Today, 08:47 AM
            0 responses
            12 views
            0 likes
            Last Post seqadmin  
            Started by seqadmin, 04-11-2024, 12:08 PM
            0 responses
            60 views
            0 likes
            Last Post seqadmin  
            Started by seqadmin, 04-10-2024, 10:19 PM
            0 responses
            59 views
            0 likes
            Last Post seqadmin  
            Started by seqadmin, 04-10-2024, 09:21 AM
            0 responses
            54 views
            0 likes
            Last Post seqadmin  
            Working...
            X