Unconfigured Ad

Collapse
X
 
  • Filter
  • Time
  • Show
Clear All
new posts
  • Francy87
    Member
    • May 2013
    • 28

    mapping with botwie/bwa

    Hello!
    I'm sorry for the stupid question, but I'm new in metagenomic and metatranscriptomic stuffs... I'd like to map my Hiseq shotgun metatranscriptomic reads to some reference genomes (I already know the microbial community composition, since I analysed 16S data), using botwie or bwa. The question is: do I have to map each sample to each genome? or is there a way to map to more genomes at the same time? And if not, how can I merge the multiple mapping files from the same sample, then?

    Thanks a bunch
    Francesca
  • jimmybee
    Senior Member
    • Sep 2010
    • 119

    #2
    Originally posted by Francy87 View Post
    Hello!
    I'm sorry for the stupid question, but I'm new in metagenomic and metatranscriptomic stuffs... I'd like to map my Hiseq shotgun metatranscriptomic reads to some reference genomes (I already know the microbial community composition, since I analysed 16S data), using botwie or bwa. The question is: do I have to map each sample to each genome? or is there a way to map to more genomes at the same time? And if not, how can I merge the multiple mapping files from the same sample, then?

    Thanks a bunch
    Francesca
    No you wont have to map them separately. Create a multi-fasta file of all your reference sequences and then build the mapping index from that.

    Comment

    • Francy87
      Member
      • May 2013
      • 28

      #3
      Originally posted by jimmybee View Post
      No you wont have to map them separately. Create a multi-fasta file of all your reference sequences and then build the mapping index from that.

      Hi Jimmy,
      Thanks for your reply. Can I just 'cat' all the reference genomes together or should I modify them in some way?
      Thanks again
      Francesca

      Comment

      • swbarnes2
        Senior Member
        • May 2008
        • 910

        #4
        Cat them together, then you use bowtie-build or bwa index to make the index. Then you align.

        Comment

        • jimmybee
          Senior Member
          • Sep 2010
          • 119

          #5
          Originally posted by Francy87 View Post
          Hi Jimmy,
          Thanks for your reply. Can I just 'cat' all the reference genomes together or should I modify them in some way?
          Thanks again
          Francesca
          Yeah no need to modify them.

          Comment

          • Francy87
            Member
            • May 2013
            • 28

            #6
            Hi again!
            I downloaded some reference genomes from the ncbi ftp site (about 500). Is there a simple way to cat all of them recursevely without doing it one at the time? I have many directories, each one with one genome in .fna format...

            Thanks

            Comment

            • dpryan
              Devon Ryan
              • Jul 2011
              • 3478

              #7
              If all of the *.fna files are only buried one directory deep, then something like the following bash script should work:

              Code:
              #!/bin/bash
              dirs=`find . -mindepth 1 -maxdepth 1 -type d`
              for d in $dirs
              do
                  cat $d/*.fna >> genomes.fna
              done
              This could be modified to easily handle the presence of subdirectories.

              Comment

              Latest Articles

              Collapse

              • SEQadmin2
                Nine Things a Sample Prep Scientist Thinks About Before Sequencing
                by SEQadmin2


                I’m not a sequencing expert. I’m a purification scientist who uses NGS to evaluate workflows my group develops. With this perspective, we think about the sample first and the NGS workflow second. The sequencer is an exceptionally honest reporter, but it can only report on what you give it, so whether you get clean, interpretable data from an NGS workflow is largely determined before you begin.

                Here are nine questions we think about, in roughly the order they matter, before...
                06-18-2026, 07:11 AM
              • SEQadmin2
                From Collection to Sequencing: Why Sample Preparation and Preservation Define Sequencing Data
                by SEQadmin2


                Data variability is still an issue in sequencing technologies despite the advances in reproducibility and accuracy of these platforms. But the problem does not originate in the sequencing itself, but in the previous steps, before the sample reaches the sequencer.


                The first step is collection, followed by preservation and sample preparation for analysis. Most scientists overlook those steps, but not being careful might just be skewing the experiment’s results.
                ...
                06-02-2026, 10:05 AM

              ad_right_rmr

              Collapse

              News

              Collapse

              Topics Statistics Last Post
              Started by SEQadmin2, 06-17-2026, 06:09 AM
              0 responses
              36 views
              0 reactions
              Last Post SEQadmin2  
              Started by SEQadmin2, 06-09-2026, 11:58 AM
              0 responses
              100 views
              0 reactions
              Last Post SEQadmin2  
              Started by SEQadmin2, 06-05-2026, 10:09 AM
              0 responses
              120 views
              0 reactions
              Last Post SEQadmin2  
              Started by SEQadmin2, 06-04-2026, 08:59 AM
              0 responses
              113 views
              0 reactions
              Last Post SEQadmin2  
              Working...