Unconfigured Ad

Collapse
X
 
  • Filter
  • Time
  • Show
Clear All
new posts
  • inesdesantiago
    Member
    • Jan 2009
    • 44

    Reference genome for MAQ - split reference genome by chromosome or not?

    Hello!
    I am a beginner!
    I am trying MAQ..
    My question is about the reference genome! I downloaded the mm9 genome from UCSC, and it comes as separated chr*.fa files (one fasta per chromosome)

    However, the MAQ command to do the alignments points to a single file as the reference genome:

    maq match output.map genome.bfa myreads.bfq

    I converted all my chr.fasta files to bfa files using maq "fasta2bfa".
    Now, I don't know if i am supposed to run the MAQ for each single chromosome individually or if I should have the complete genome in one-single-bfa-file.

    Any of these is a challenge.
    If we the alignment is to run chromosome by chromosome then there should be a way to merge the output files (I think...!?).
    One the other hand, if the genome is supposed to be in just one-file, how do I do that?

    Any thoughts?
    THanks!!
    ines
  • bioinfosm
    Senior Member
    • Jan 2008
    • 483

    #2
    here is a similar discussion => don't split by chromosome
    Discussion of next-gen sequencing related bioinformatics: resources, algorithms, open source efforts, etc
    --
    bioinfosm

    Comment

    • inesdesantiago
      Member
      • Jan 2009
      • 44

      #3
      Good to know!

      I can't find the complete genome in the UCSC database.
      Should I do it myself? merge my chromosome.fa files into one-big file looking like this:

      >chr1
      AACTGTGCACTGTGACAC...
      GTACGCACGTGCGTGCAC...
      >chr2
      ACATTGCCAACACTGTCA...
      ACACGTGCGTGCACACGT...
      >chr xyz

      I don't know if this is the right format...

      Comment

      • inesdesantiago
        Member
        • Jan 2009
        • 44

        #4
        just a quick reply to myself:
        Yes, that's the right format!

        Comment

        • bioinfosm
          Senior Member
          • Jan 2008
          • 483

          #5
          Yes, sometimes eland gives issues with fasta headers, it creates extra columns in the export or eland_extended output. So I also prefer to keep the fasta reference sequence headers small
          --
          bioinfosm

          Comment

          Latest Articles

          Collapse

          • SEQadmin2
            Nine Things a Sample Prep Scientist Thinks About Before Sequencing
            by SEQadmin2


            I’m not a sequencing expert. I’m a purification scientist who uses NGS to evaluate workflows my group develops. With this perspective, we think about the sample first and the NGS workflow second. The sequencer is an exceptionally honest reporter, but it can only report on what you give it, so whether you get clean, interpretable data from an NGS workflow is largely determined before you begin.

            Here are nine questions we think about, in roughly the order they matter, before...
            06-18-2026, 07:11 AM
          • SEQadmin2
            From Collection to Sequencing: Why Sample Preparation and Preservation Define Sequencing Data
            by SEQadmin2


            Data variability is still an issue in sequencing technologies despite the advances in reproducibility and accuracy of these platforms. But the problem does not originate in the sequencing itself, but in the previous steps, before the sample reaches the sequencer.


            The first step is collection, followed by preservation and sample preparation for analysis. Most scientists overlook those steps, but not being careful might just be skewing the experiment’s results.
            ...
            06-02-2026, 10:05 AM

          ad_right_rmr

          Collapse

          News

          Collapse

          Topics Statistics Last Post
          Started by SEQadmin2, 06-26-2026, 11:10 AM
          0 responses
          10 views
          0 reactions
          Last Post SEQadmin2  
          Started by SEQadmin2, 06-17-2026, 06:09 AM
          0 responses
          45 views
          0 reactions
          Last Post SEQadmin2  
          Started by SEQadmin2, 06-09-2026, 11:58 AM
          0 responses
          105 views
          0 reactions
          Last Post SEQadmin2  
          Started by SEQadmin2, 06-05-2026, 10:09 AM
          0 responses
          125 views
          0 reactions
          Last Post SEQadmin2  
          Working...