Seqanswers Leaderboard Ad

Collapse

Announcement

Collapse
No announcement yet.
X
 
  • Filter
  • Time
  • Show
Clear All
new posts

  • Illumina Metagenomics data

    Hi All,
    I am new member to this forum.
    Earlier i used to work with 454 data, now i am switching to illumina.
    I am getting around 300 million reads (100bp) and its a metagenomic sample. So i am really confused about how to start my analysis.
    Earlier i used approaches like blastx but now i think this is not a good option.
    So i was just wondering if anyone had done something like this or have some idea on this.

    I would really appreciate your help.

    Thanks
    SS

  • #2
    Is it a 16S or WGS sample?

    Comment


    • #3
      Its a metagenomics environmental sample (no 16s).

      Comment


      • #4
        The real question is....what's the question? Are you looking for specific genes, or want to take an inventory of all genes?

        Have you tried assembling the reads yet? That's always a little sketchy with mixed communities, but it might be a good place to start.

        Comment


        • #5
          thanks for your input themerlin,
          Actually mainly its going to be a community study (in nut shell i need to annotate all of the sequences)
          Yes i tried assembly but it doesn't look good, but yes i will try again with different programs.

          Comment


          • #6
            Is this just sequence data from DNA straight from the environment, or did you clone it into vectors first?

            I handle metagenomics data, we do it in fosmids though, so it's easy to assemble contigs from one fosmid (phred/phrap). Trying to do the whole environment at once will likely be tougher. Once I have contigs, we use blastx to looks for homology and tools like fgenesb to find ORFs.

            Comment


            • #7
              hi i am new to this site can anyone tell me about effective working in schrodinger plz pass useful video tutorials if possible,

              Comment


              • #8
                Take a look at MG-RAST for annotation of your data http://metagenomics.nmpdr.org.
                Originally posted by ssharma View Post
                Hi All,
                I am new member to this forum.
                Earlier i used to work with 454 data, now i am switching to illumina.
                I am getting around 300 million reads (100bp) and its a metagenomic sample. So i am really confused about how to start my analysis.
                Earlier i used approaches like blastx but now i think this is not a good option.
                So i was just wondering if anyone had done something like this or have some idea on this.

                I would really appreciate your help.

                Thanks
                SS

                Comment


                • #9
                  Hi,

                  What do you mean by "annotate" ? Are you looking at "who is there" or "what are the functions" ?
                  Do you have reference genomes at hand, or genomes of organisms close to the ones in your sample ? Do you have an idea of the complexity of the population ? Is it eukaryote or microbes, or both ?
                  You can consider first trying to have an idea of the composition of your population, looking at some marker genes (eg : trying to find 16S or 18S reads in your dataset by mapping against reference databases)
                  If you have known reference genomes, you can also map reads against them, to evaluate the complexity/diversity
                  For a first glimpse at functions, you can try UniRef50 or KEGG genes (or any other functionally classified reference protein set) as a proxy.

                  Comment


                  • #10
                    You can try WebMGA: http://weizhong-lab.ucsd.edu/metagenomic-analysis/

                    Comment


                    • #11
                      You can try approaches like
                      -de novo assembly (metaVelvet, Abyss etc)
                      -fast clustering - (CD-Hit, RAMMCAP)
                      -reference based alignment (Genometa)

                      Comment


                      • #12
                        I would trim the reads (based on qual and remove adapters), then start assembling.
                        If you would like to know who are there, you could use MG-RAST or just blastn your trimmed reads against greengenes or SILVA 16S databases.

                        Comment


                        • #13
                          Originally posted by ssharma View Post
                          Hi All,
                          I am new member to this forum.
                          Earlier i used to work with 454 data, now i am switching to illumina.
                          I am getting around 300 million reads (100bp) and its a metagenomic sample. So i am really confused about how to start my analysis.
                          Earlier i used approaches like blastx but now i think this is not a good option.
                          So i was just wondering if anyone had done something like this or have some idea on this.

                          I would really appreciate your help.

                          Thanks
                          SS

                          You can reduce the volume of data by doing a de novo assembly.




                          HTML Code:
                          mpiexec -n 64 Ray \
                           -k \
                           31 \
                           -p \
                           Sample/ERR011142_1.fastq.gz \
                           Sample/ERR011142_2.fastq.gz \
                           -p \
                           Sample/ERR011143_1.fastq.gz \
                           Sample/ERR011143_2.fastq.gz \
                           -o \
                           Assembly

                          Sébastien Boisvert

                          Comment

                          Latest Articles

                          Collapse

                          • seqadmin
                            Recent Innovations in Spatial Biology
                            by seqadmin


                            Spatial biology is an exciting field that encompasses a wide range of techniques and technologies aimed at mapping the organization and interactions of various biomolecules in their native environments. As this area of research progresses, new tools and methodologies are being introduced, accompanied by efforts to establish benchmarking standards and drive technological innovation.

                            3D Genomics
                            While spatial biology often involves studying proteins and RNAs in their...
                            01-01-2025, 07:30 PM
                          • seqadmin
                            Advancing Precision Medicine for Rare Diseases in Children
                            by seqadmin




                            Many organizations study rare diseases, but few have a mission as impactful as Rady Children’s Institute for Genomic Medicine (RCIGM). “We are all about changing outcomes for children,” explained Dr. Stephen Kingsmore, President and CEO of the group. The institute’s initial goal was to provide rapid diagnoses for critically ill children and shorten their diagnostic odyssey, a term used to describe the long and arduous process it takes patients to obtain an accurate...
                            12-16-2024, 07:57 AM

                          ad_right_rmr

                          Collapse

                          News

                          Collapse

                          Topics Statistics Last Post
                          Started by seqadmin, 01-09-2025, 04:04 PM
                          0 responses
                          433 views
                          0 likes
                          Last Post seqadmin  
                          Started by seqadmin, 01-09-2025, 09:42 AM
                          0 responses
                          441 views
                          0 likes
                          Last Post seqadmin  
                          Started by seqadmin, 01-08-2025, 03:17 PM
                          0 responses
                          455 views
                          0 likes
                          Last Post seqadmin  
                          Started by seqadmin, 01-03-2025, 11:18 AM
                          1 response
                          50 views
                          1 like
                          Last Post Tonia
                          by Tonia
                           
                          Working...
                          X