  • litali
    Member
    • Jul 2010
    • 78

    maq for 454 data?

    Hi,
    Is it possible to use Maq for 454 data? Which input files are needed? If not, is there anything similar for 454?
  • Naujv
    Junior Member
    • Jul 2010
    • 6

    #2
    Hi litali,

    I'm not sure if it's the right thing to do; the MAQ website FAQ section actually says: "Maq maps short reads to the reference and calls the genotypes from the alignment. It is specifically designed for Illumina-Solexa/AB-SOLiD reads, not for 454 or capillary ones." Personally, I'd like to use it too since I'm not too hot on Roche software.

    If you want to run it and see for yourself, you can convert your sff files into sanger fastq (several methods below), make a reference fasta file, and follow the commands shown at the MAQ website (there's an easyrun).

    Join qual and fna file into fastq:
    (a) http://seqanswers.com/forums/showthread.php?t=2775
    Convert sff into fastq:
    (b) There's also sff2fastq on GitHub.

    Both methods produced identical files.
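For anyone who wants to see what the qual + fna join amounts to, here is a minimal sketch in Python. It is not the script from the linked thread or sff2fastq; the function names are made up for illustration, and it assumes the two files list the same reads in the same order and that Sanger FASTQ (Phred+33) is the target encoding.

```python
def read_records(lines):
    """Yield (name, body_lines) from FASTA-style text (.fna or .qual)."""
    name, body = None, []
    for line in lines:
        line = line.strip()
        if not line:
            continue
        if line.startswith(">"):
            if name is not None:
                yield name, body
            # Keep only the read ID, dropping any description after it.
            name, body = line[1:].split()[0], []
        else:
            body.append(line)
    if name is not None:
        yield name, body


def fasta_qual_to_fastq(fasta_lines, qual_lines):
    """Join sequence and quality records into Sanger FASTQ text.

    Assumes both inputs hold the same reads in the same order;
    zip() stops silently at the shorter input.
    """
    out = []
    pairs = zip(read_records(fasta_lines), read_records(qual_lines))
    for (seq_name, seq_body), (qual_name, qual_body) in pairs:
        if seq_name != qual_name:
            raise ValueError(f"read order differs: {seq_name} vs {qual_name}")
        seq = "".join(seq_body)
        scores = [int(x) for x in " ".join(qual_body).split()]
        if len(scores) != len(seq):
            raise ValueError(f"length mismatch for {seq_name}")
        # Sanger encoding: Phred score + 33 as an ASCII character.
        qual = "".join(chr(q + 33) for q in scores)
        out.append(f"@{seq_name}\n{seq}\n+\n{qual}\n")
    return "".join(out)
```

With real data you would pass open file handles for the .fna and .qual files and write the returned text to a .fastq file.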
    Last edited by Naujv; 07-22-2010, 11:39 PM.

    • Naujv
      Junior Member
      • Jul 2010
      • 6

      #3
      Hi litali,

      I just tried using maq on my 454 data. What I'm seeing in the alignment output (maq mapview all.map > $someoutputfile) is that my sequences are being cut off at 34 nt.

      • krobison
        Senior Member
        • Nov 2007
        • 734

        #4
        Try bwasw (a mode of bwa)

        • jgibbons1
          Senior Member
          • Oct 2009
          • 135

          #5
          I'm pretty sure MAQ can only map reads 63bp or smaller.

          • Naujv
            Junior Member
            • Jul 2010
            • 6

            #6
            krobison, thanks! I appreciate your input.

            Took your advice and tried bwasw, but ran into a problem with my alignments: my CIGAR strings have "S" in them. I found another post suggesting there may be a problem with the CIGAR.

            If you have time, I would like your thoughts (and others') on using bwasw with a set of reference sequences rather than a whole genome or whole chromosomes. My reference is made up of 100 non-overlapping sequences in FASTA format.

            • nilshomer
              Nils Homer
              • Nov 2008
              • 1283

              #7
              The "S" character indicates soft-clipping, which is described in the SAM specification. If you still think it is a bug, could you post the SAM record in question?
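To make the soft-clipping idea concrete, here is a small sketch that splits a CIGAR string into its operations and counts how many read bases are soft-clipped versus how many reference bases the alignment covers. The function names and the example CIGAR are made up for illustration; which operations consume query or reference bases follows the SAM specification.

```python
import re

# One CIGAR operation: a length followed by an operation code.
CIGAR_OP = re.compile(r"(\d+)([MIDNSHP=X])")


def parse_cigar(cigar):
    """Return a list of (length, op) tuples, e.g. [(5, 'S'), (20, 'M')]."""
    ops = [(int(n), op) for n, op in CIGAR_OP.findall(cigar)]
    # Reject strings with characters the pattern did not consume.
    if "".join(f"{n}{op}" for n, op in ops) != cigar:
        raise ValueError(f"not a valid CIGAR string: {cigar!r}")
    return ops


def soft_clip_summary(cigar):
    """Return (soft_clipped_bases, read_length, reference_span)."""
    ops = parse_cigar(cigar)
    clipped = sum(n for n, op in ops if op == "S")
    # M, I, S, = and X consume bases of the read as stored in SEQ.
    read_len = sum(n for n, op in ops if op in "MIS=X")
    # M, D, N, = and X consume bases of the reference.
    ref_span = sum(n for n, op in ops if op in "MDN=X")
    return clipped, read_len, ref_span


# Hypothetical CIGAR: 5 bases clipped, 20 aligned, 3 clipped.
print(soft_clip_summary("5S20M3S"))
```

Soft-clipped bases stay in the SEQ field of the record; they are simply not part of the alignment, which is why a local aligner such as bwasw reports them for reads that only partly match.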

              • Naujv
                Junior Member
                • Jul 2010
                • 6

                #8
                Nils, thanks for the help! I'm new (as in today new) to bwa. It may not be an error or bug, though when I tried to look for where the alignment is, I couldn't find it. The sequence looks like one big ugly repeat, so maybe this hit is spurious. Maybe you can help me understand what the 40 in the line is? Mapping quality? Where do good and bad values lie?

                GKTESVC03GKDWH 16 ref|NG_023054.1|:5000-113024 77792 40 46S48M143S * 0 0 ATTCCATTCCATTCCATTCGGTTTNAACGGTATTCCAATCGATTCCATTCCATTCCATTCCATTCCATTCCATTCCATTCCTTTCCATTCCATTACGGATGATTCCATTCCATTGCATTCCATTCCATTCCATTCCCCTGTACTCGGGTTGATTCCATTCCATTCCATTCCAATCCATGCCATTCCACTCGTGTTGATTCCATTCTTTCCATTCCATTCAAGTTGATTCCATTCCAT .199;:992131111:.,.--,,,!--.--17995566999:=BBABBBBBDDDAAA????DAAAAADBBBAA>=<900000..22:;9;;<62444444<<==>=>>>>>AB===A?????DDDDFFDDFFFF;;99<??@@<<44488ABBBBDDDFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFHFDCCCFFFF888CCGFFFFGDFFFFC??@A????FCCDDHF
                AS:i:44 XS:i:0 XF:i:2 XE:i:6 XN:i:0

                • nilshomer
                  Nils Homer
                  • Nov 2008
                  • 1283

                  #9
                  Have a real close read of the SAM specification. You will be going back to this quite a bit. The 5th column is the PHRED-scaled mapping quality. Looking at the CIGAR field (6th column), "46S48M143S", there seem to be 48 bases aligned to your reference (M covers both matches and mismatches), with the first 46 and last 143 soft-clipped.
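As a rough illustration of reading those columns, the sketch below pulls the flag, mapping quality, and CIGAR out of the record and converts the mapping quality using the SAM definition, MAPQ = -10 log10 P(mapping position is wrong). The function names are made up, the record's SEQ and QUAL fields are abbreviated to "*" to keep the example short, and the probability is only as good as the aligner's own estimate.

```python
import re


def mapq_to_error_prob(mapq):
    """Convert a PHRED-scaled mapping quality to an error probability."""
    return 10 ** (-mapq / 10)


def summarize_sam_line(line):
    """Summarize flag, MAPQ, and CIGAR from one tab-separated SAM record."""
    fields = line.rstrip("\n").split("\t")
    flag, mapq, cigar = int(fields[1]), int(fields[4]), fields[5]
    ops = [(int(n), op) for n, op in re.findall(r"(\d+)([MIDNSHP=X])", cigar)]
    return {
        "read": fields[0],
        # Bit 0x10 of the flag marks a read aligned to the reverse strand.
        "reverse_strand": bool(flag & 0x10),
        "mapq": mapq,
        "error_prob": mapq_to_error_prob(mapq),
        # Read length implied by the CIGAR (M, I, S, =, X consume the read).
        "read_len": sum(n for n, op in ops if op in "MIS=X"),
        # Read bases placed against the reference as M, = or X.
        "aligned_bases": sum(n for n, op in ops if op in "M=X"),
    }


# The record from the post above, with SEQ and QUAL replaced by "*".
record = "\t".join([
    "GKTESVC03GKDWH", "16", "ref|NG_023054.1|:5000-113024", "77792",
    "40", "46S48M143S", "*", "0", "0", "*", "*",
])
print(summarize_sam_line(record))
```

Under that definition a mapping quality of 40 corresponds to an estimated 1 in 10,000 chance that the reported position is wrong, while the CIGAR says only 48 of the 237 read bases are actually aligned.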

                  • Naujv
                    Junior Member
                    • Jul 2010
                    • 6

                    #10
                    Thank you! Going through the 2008 MAQ paper now.

                    • geschickten
                      Member
                      • Jul 2009
                      • 31

                      #11
                      Hi All,

                      We have a MAQ that works with 125bp Illumina reads, and we also have a version that works with 454 data. It's not in the open domain. If anybody is interested, please send me a request at [email protected]. Thanks.
