Seqanswers Leaderboard Ad

Collapse

Announcement

Collapse
No announcement yet.
X
 
  • Filter
  • Time
  • Show
Clear All
new posts

  • #16
    Originally posted by biofqzhao View Post
    inGAP (Integrated Next-gen Genome Analysis Platform)

    Please refer to the following link for detail.
    http://sites.google.com/site/nextgengenomics/ingap

    inGAP is an integrated platform for next-generation sequencing project, the core function of which is to detect SNPs and indels using a Bayesian algorithm.

    (1) It does not have any read length restriction. It can handle 454 sequencing and/or Illumina Solexa sequencing and/or Sanger sequencing data sets.

    (2) It can detect most small indels in either single-end or paired-end data sets. Using the simulated data sets, inGAP could successfully identify 85%-98% of small indels with high accuracy (>99%).

    (3) It has a strong capability to identify variants based on a relatively divergent reference genome, which bring it to a much wider application other than resequencing projects.

    (4) It provides a user-friendly graphic interface, through which users can browse, search, check, classify, and even edit the identified variants.

    (5) It can be used to detect intraspecific polymorphisms (including SNPs and indels) based on a pairwise comparison of multiple whole genomes.

    (6) It employs a global heuristic searching approach to layout contigs based on one or more reference genomes.

    (7) It also provides a handful of bioinformatic tools for read simulation, mutation incorporation, format conversion, etc.

    (8) It's slower than other SNP detection programs (e.g. MAQ, SOAP, Bowtie), because it employs BLAST or BLAT to map reads. It generally takes about 220 minutes for comparing 10 million 75bp Illumina reads against a 12Mb yeast reference genome on an 8-core DELL machine.

    Any comments and suggestions are welcome.
    Can you help me please to setting the parameters, like: contig length, min (match_len/read_len) and min alignment identity, window length, window step? Thank you very much.

    Comment


    • #17
      "contig length" indicates the minimal length for your reference sequences. If you are using a whole bacterial genome as a reference, you need not make any change here and just use the default value; if you are using a batch of contigs (which may come from a de novo assembler), you need to set up a minimal length, say, 1000 bp, for the downstream read mapping. The contigs smaller than this value would be discarded.

      min (match_len/read_len) and min alignment identity: If you are mapping reads to a very close reference genome, just use the default value. But if you want to map reads to a much divergent species, please refine them as what you want. For example, if you are mapping repetitive reads to a reference transposase element, you can use a lower threshold.

      Window length and window step: just use the default value.



      Originally posted by gnome7 View Post
      Can you help me please to setting the parameters, like: contig length, min (match_len/read_len) and min alignment identity, window length, window step? Thank you very much.

      Comment


      • #18
        inGAP linux version

        Dear biofqzhao,
        in linux I can't set the parameters. Why?
        Please, look at the image uploaded.

        Thank you very much for your help,
        gnome7
        Attached Files

        Comment


        • #19
          The image uploaded looks like Windows to me, not linux.
          Nevertheless, certain parameters are only accessible if you use blast/blat.

          cheers,
          Sven

          Comment


          • #20
            inGAP manual and export sequence

            Dear Zhao, Ji and everybody else,

            This new new program inGAP looks very interesting. It was very easy to use and I was able to assemble some Illumina reads to a mitochondrial genome.

            But I found it a little more difficult to analyse the result from the assembly in the AlignViewer. Especially without any proper manual.

            I don't really understand how to export the consensus sequence of the assembled reads. Can you please explain how to do that?

            Thank you!

            Kajsa

            Comment


            • #21
              inGAP manual and export sequence

              Dear all,
              me too!

              Thank you!

              Comment


              • #22
                Thanks for your interest. We are now updating inGAP, and some functions were disabled. I think all this stuff will be done in a few weeks.

                Actually, you can write a script to extract consensus sequences based on the SNP table identified by inGAP and your reference sequence. Just as an alternative approach before the update.


                Originally posted by kajsa View Post
                Dear Zhao, Ji and everybody else,

                This new new program inGAP looks very interesting. It was very easy to use and I was able to assemble some Illumina reads to a mitochondrial genome.

                But I found it a little more difficult to analyse the result from the assembly in the AlignViewer. Especially without any proper manual.

                I don't really understand how to export the consensus sequence of the assembled reads. Can you please explain how to do that?

                Thank you!

                Kajsa

                Comment


                • #23
                  Okay, that's good. Could you make a new post here to tell us when you are finished with the updating?

                  Comment


                  • #24
                    too many contigs error

                    Hi,

                    I am trying to test out inGAP and am getting an error which states I have too many contigs (>= 1000). The program stops.

                    I am using Windows version.

                    Any ideas on what my issue is?

                    I am trying to align e. coli reads to a reference genome.

                    Thanks!

                    Comment


                    • #25
                      Hi,

                      Hopefully I get a reply on here as my emails bounce back from the main author.

                      I want to use BLAT to align my EST contigs to a reference genome. For the reference genome all I have is fasta file, and for the ESTs I have a fasta and qual file. However when I go into mapping options, BLAT is blocked out. I need to use BLAT so that spice sites are identified, thus BLAST is not an option.

                      Do I need a gene file in order to use BLAT? I doubt it seeing as im pretty sure this is not needed in the original program.

                      Any ideas?

                      Thanks,

                      J

                      Comment


                      • #26
                        I would suggest you to try BLAST, as there is no much difference between them when aligning ESTs to a reference sequence. In ingap, both aligners can only show you the best match, and the 2nd best spliced matches will be ignored. Maybe nucmer is a good option for you.

                        BTW, BLAT is disabled in the latest version of ingap.



                        Originally posted by JackieBadger View Post
                        Hi,

                        Hopefully I get a reply on here as my emails bounce back from the main author.

                        I want to use BLAT to align my EST contigs to a reference genome. For the reference genome all I have is fasta file, and for the ESTs I have a fasta and qual file. However when I go into mapping options, BLAT is blocked out. I need to use BLAT so that spice sites are identified, thus BLAST is not an option.

                        Do I need a gene file in order to use BLAT? I doubt it seeing as im pretty sure this is not needed in the original program.

                        Any ideas?

                        Thanks,

                        J

                        Comment


                        • #27
                          Thanks for the reply

                          Comment

                          Latest Articles

                          Collapse

                          • seqadmin
                            Essential Discoveries and Tools in Epitranscriptomics
                            by seqadmin




                            The field of epigenetics has traditionally concentrated more on DNA and how changes like methylation and phosphorylation of histones impact gene expression and regulation. However, our increased understanding of RNA modifications and their importance in cellular processes has led to a rise in epitranscriptomics research. “Epitranscriptomics brings together the concepts of epigenetics and gene expression,” explained Adrien Leger, PhD, Principal Research Scientist...
                            04-22-2024, 07:01 AM
                          • seqadmin
                            Current Approaches to Protein Sequencing
                            by seqadmin


                            Proteins are often described as the workhorses of the cell, and identifying their sequences is key to understanding their role in biological processes and disease. Currently, the most common technique used to determine protein sequences is mass spectrometry. While still a valuable tool, mass spectrometry faces several limitations and requires a highly experienced scientist familiar with the equipment to operate it. Additionally, other proteomic methods, like affinity assays, are constrained...
                            04-04-2024, 04:25 PM

                          ad_right_rmr

                          Collapse

                          News

                          Collapse

                          Topics Statistics Last Post
                          Started by seqadmin, Today, 08:47 AM
                          0 responses
                          10 views
                          0 likes
                          Last Post seqadmin  
                          Started by seqadmin, 04-11-2024, 12:08 PM
                          0 responses
                          60 views
                          0 likes
                          Last Post seqadmin  
                          Started by seqadmin, 04-10-2024, 10:19 PM
                          0 responses
                          57 views
                          0 likes
                          Last Post seqadmin  
                          Started by seqadmin, 04-10-2024, 09:21 AM
                          0 responses
                          53 views
                          0 likes
                          Last Post seqadmin  
                          Working...
                          X