Seqanswers Leaderboard Ad

Collapse

Announcement

Collapse
No announcement yet.
X
 
  • Filter
  • Time
  • Show
Clear All
new posts

  • bowtie_build input help

    Hello all,

    What is the proper data to use with bowtie_build if I would like to build my own index for the latest mouse genome? For example, on the bowtie site, they have 4 Pre-built index downloads (2 from NCBI, 2 from UCSC) that are 2.4 GB each to download. What were the input files used to generate these indices and than the following commands?

    If from NCBI, were they all the chromosome files located at:

    ftp://ftp.ncbi.nlm.nih.gov/genomes/M...d_chromosomes/

    or something else?

    Could anyone point me to where the files are from and located?

    Thank you so much!

  • #2
    Yes, download all of the FASTA chromosome sequences from your source of choice.

    UCSC: ftp://hgdownload.cse.ucsc.edu/goldenPath/
    Choose one of the mm#, go to bigZips/, and get the chromFa.tar.gz

    Comment


    • #3
      For building an index you'll want to use bowtie-build in the distribution of bowtie. You'll run the command

      Code:
      bowtie-build <fasta_file> <bowtie_index_prefix>
      This will create a set of files starting with <bowtie_index_prefix> that contain all the information in the fasta file, but in a format that makes it easy to look up sequences.

      If you're mapping SOLiD reads you'll need to add -C before <fasta_file>. There are additional options explained in the bowtie user manual.

      Comment

      Latest Articles

      Collapse

      • seqadmin
        Strategies for Sequencing Challenging Samples
        by seqadmin


        Despite advancements in sequencing platforms and related sample preparation technologies, certain sample types continue to present significant challenges that can compromise sequencing results. Pedro Echave, Senior Manager of the Global Business Segment at Revvity, explained that the success of a sequencing experiment ultimately depends on the amount and integrity of the nucleic acid template (RNA or DNA) obtained from a sample. “The better the quality of the nucleic acid isolated...
        03-22-2024, 06:39 AM
      • seqadmin
        Techniques and Challenges in Conservation Genomics
        by seqadmin



        The field of conservation genomics centers on applying genomics technologies in support of conservation efforts and the preservation of biodiversity. This article features interviews with two researchers who showcase their innovative work and highlight the current state and future of conservation genomics.

        Avian Conservation
        Matthew DeSaix, a recent doctoral graduate from Kristen Ruegg’s lab at The University of Colorado, shared that most of his research...
        03-08-2024, 10:41 AM

      ad_right_rmr

      Collapse

      News

      Collapse

      Topics Statistics Last Post
      Started by seqadmin, 03-27-2024, 06:37 PM
      0 responses
      13 views
      0 likes
      Last Post seqadmin  
      Started by seqadmin, 03-27-2024, 06:07 PM
      0 responses
      12 views
      0 likes
      Last Post seqadmin  
      Started by seqadmin, 03-22-2024, 10:03 AM
      0 responses
      53 views
      0 likes
      Last Post seqadmin  
      Started by seqadmin, 03-21-2024, 07:32 AM
      0 responses
      69 views
      0 likes
      Last Post seqadmin  
      Working...
      X