Seqanswers Leaderboard Ad

Collapse

Announcement

Collapse
No announcement yet.
X
 
  • Filter
  • Time
  • Show
Clear All
new posts

  • New to sequencing and analysis

    Hello all!

    I am new to genome sequencing and analysis. My lab recently got a bacterial genome sequenced from a company who sent us the files containing the results of the genome sequence. Now I am expected to analyse this genome. Can anybody guide thru the basic steps of sequence analysis.
    As a first step, what should do with the sequence?

  • #2
    In response to New2Sequencing

    Hi New2Sequencing -

    Please excuse me if I tell you things you already know. I figure better that way than the reverse.

    The type of analysis problem that you have depends on whether or not you have a reference sequence for your bacterial sequence. If you don't, then you need to do a de novo assembly. If you do, then you will want to map your reads to that reference. There are a lot of different tools available to perform these types of analyses. Because I work for CLC bio, I am most familiar with our tools, but many other sources, commercial and free, also provide tools for de novo assembly or reference mapping. SeqAnswers has a great list of these in their Wiki. The tool that you select will also depend on the type of reads that you have, long or short, color space or base space. The technology that was used to generate your data will answer these questions for you.

    If you are interested, CLC's Genomics Workbench is free to try for two weeks, it is easy to use, and it has some nice tutorials to get you started. It is also convenient if you do not have access to powerful computers, because it uses compute resources very efficiently. I just read a blog from Nick Loman on a de novo assembly of E.coli that was generated with Ion Torrent. He was able to assemble the entire genome, 49Mb of data for 10x coverage, in less than two minutes on an 8g desktop. Other assemblers took days. CLC bio may not be the solution you end up with, but it is a really sweet place for a beginner to start. I hope it helps you.

    Comment


    • #3
      Start by finding & reading about 10 recent publications which are closest to your research project; this give you a feeling for some typical analysis strategies, but also the sorts of questions which are being asked.

      Of course, in the end the real question is "what question(s) do I want to know the answers to?". Why did your boss decide to sequence the organism(s) in question? Defining that in detail will drive the analysis.

      Comment


      • #4
        Thanks Nomijill: that was helpful. Now I will start reading about de novo assembly and try out the softwares/freewares u mentioned. Will keep all posted.

        @krobison: That is an important suggestion. I have started reading a book about genome sequencing, however it is taking time. i should simultaneously start reading the papers.

        Trying to answer your 'why' question: I think we isolated a unique biofuels producing bacteria that showed phylogenetic similarities (16s rRNA sequencing) to a previously known bacteria, however some of the properties of the new bacteria are very unique and hence the sequencing was done. Ours is more of gaining incremental knowledge !

        Comment

        Latest Articles

        Collapse

        • seqadmin
          Essential Discoveries and Tools in Epitranscriptomics
          by seqadmin




          The field of epigenetics has traditionally concentrated more on DNA and how changes like methylation and phosphorylation of histones impact gene expression and regulation. However, our increased understanding of RNA modifications and their importance in cellular processes has led to a rise in epitranscriptomics research. “Epitranscriptomics brings together the concepts of epigenetics and gene expression,” explained Adrien Leger, PhD, Principal Research Scientist...
          04-22-2024, 07:01 AM
        • seqadmin
          Current Approaches to Protein Sequencing
          by seqadmin


          Proteins are often described as the workhorses of the cell, and identifying their sequences is key to understanding their role in biological processes and disease. Currently, the most common technique used to determine protein sequences is mass spectrometry. While still a valuable tool, mass spectrometry faces several limitations and requires a highly experienced scientist familiar with the equipment to operate it. Additionally, other proteomic methods, like affinity assays, are constrained...
          04-04-2024, 04:25 PM

        ad_right_rmr

        Collapse

        News

        Collapse

        Topics Statistics Last Post
        Started by seqadmin, Yesterday, 11:49 AM
        0 responses
        15 views
        0 likes
        Last Post seqadmin  
        Started by seqadmin, 04-24-2024, 08:47 AM
        0 responses
        16 views
        0 likes
        Last Post seqadmin  
        Started by seqadmin, 04-11-2024, 12:08 PM
        0 responses
        62 views
        0 likes
        Last Post seqadmin  
        Started by seqadmin, 04-10-2024, 10:19 PM
        0 responses
        60 views
        0 likes
        Last Post seqadmin  
        Working...
        X