  • Maq problem: 0 raw reads mapped

    Dear all,
    I recently tried Maq on some RNA-seq short-read data, but none of the reads get mapped.

    Here is my code:
    maq fasta2bfa chr1.fa chr1.bfa
    maq fastq2bfq Control-6S.HWI-E4_8_3003J.fastq Control-6S.HWI-E4_8_3003J.bfq
    #so far so good
    maq match Control-6S.HWI-E4_8_3003J.map chr1.bfa Control-6S.HWI-E4_8_3003J.bfq

    output for the last line:
    -- maq-0.7.1
    [ma_load_reads] loading reads...
    [ma_load_reads] set length of the first read as 32.
    [ma_load_reads] 24347196*2 reads loaded.
    [ma_longread2read] encoding reads... 48694392 sequences processed.
    [ma_match] set the minimum insert size as 33.
    [match_core] Total length of the reference: 247249719
    [match_core] round 1/3...
    [match_core] making index...
    [match_search] 0% processed in 271.061 sec: 0 / 0 = 0.000
    ras:SRA002355 luow$ head -n 20 nohup.out
    -- maq-0.7.1
    [ma_load_reads] loading reads...
    [ma_load_reads] set length of the first read as 32.
    [ma_load_reads] 24347196*2 reads loaded.
    [ma_longread2read] encoding reads... 48694392 sequences processed.
    [ma_match] set the minimum insert size as 33.
    [match_core] Total length of the reference: 247249719
    [match_core] round 1/3...
    [match_core] making index...
    [match_search] 0% processed in 271.061 sec: 0 / 0 = 0.000
    [match_search] 1% processed in 271.364 sec: 0 / 0 = 0.000
    ...(output truncated)
    [match_search] 99% processed in 573.003 sec: 0 / 0 = 0.000
    [match_search] 100% processed in 573.304 sec: 0 / 0 = 0.000
    [match_core] sorting the hits and dumping the results...
    [ma_load_reads] loading reads...
    [ma_load_reads] 24347196*2 reads loaded.
    [mapping_count_single] 0, 0, 0, 0
    [maq_indel_pe] the indel detector only works with short-insert mate-pair reads.
    [match_data2mapping] 0 out of 48694392 raw reads are mapped with 0 in pairs.
    -- (total, isPE, mapped, paired) = (24347196, 0, 0, 0)

    If you are interested in trying this out, the raw short read data (you may want to use only part of the data for a quick trial) and reference genome can be downloaded by typing:
    curl -O ftp://hgdownload.cse.ucsc.edu/golden...ps/chromFa.zip
    curl -O ftp://ftp.ncbi.nlm.nih.gov/sra/stati...3003J.fastq.gz

    I got similar results when I used ‘maq.pl easyrun’. align reported similar results earlier here: http://seqanswers.com/forums/showthread.php?t=902, but no solution has been posted. I am on a Mac OS X 10.5 system, with Maq built from the platform-independent release maq-0.7.1.tar.bz2. Any suggestions or thoughts would be greatly appreciated.
    Last edited by pparg; 02-17-2009, 07:38 AM.

  • #2
    I don't see your BFA file in the "maq match" step, so there is no genome to search against.

    The usage is:

    maq map [options] <out.map> <chr.bfa> <reads_1.bfq> [reads_2.bfq]



    • #3
      Thanks Zee,
      Good call. That was a typo in my post; sorry about that, I have corrected it now.
      My original maq match command did include chr1.bfa; otherwise there would have been just an error message and no output.
      Any other thoughts/suggestions?



      • #4
        It does appear to be a strange error. Perhaps try a much smaller chunk of your dataset. You might also check whether your reads match another chromosome, e.g. chr22 or chrX, by making a BFA file of it.
        I would also consider trying another aligner altogether, e.g. Novoalign (www.novocraft.com) or Bowtie (bowtie-bio.sourceforge.net), which export MAQ's .map format.
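
The "much smaller chunk" suggestion could be sketched as follows. This is a minimal sketch, not part of Maq: `fastq_head` is a hypothetical helper name, and it assumes the common FASTQ layout of exactly four lines per read (no wrapped sequence lines).

```shell
# Hypothetical helper (not part of Maq): print the first N reads of a FASTQ file.
# Assumption: every record is exactly 4 lines (header, sequence, '+', qualities).
fastq_head() {
    n_reads=$1
    fastq=$2
    # N reads correspond to the first 4*N lines of the file.
    head -n "$((n_reads * 4))" "$fastq"
}
```

For example, `fastq_head 100000 Control-6S.HWI-E4_8_3003J.fastq > subset.fastq` would write a trial file (the name `subset.fastq` is only an illustration) that can then go through the same `maq fastq2bfq` and `maq map` steps as in the original post.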



        • #5
          I built the same maq from the platform-independent release maq-0.7.1.tar.bz2 on a Linux machine and repeated the job from my original post. It works fine this time. This makes me suspect that maq somehow does not work on Mac machines with the PowerPC architecture (common for Mac systems). This may also explain why other people ran into the same strange problem here: http://seqanswers.com/forums/showthread.php?t=902.



          • #6
            Hi, where did you get SRA002355? I cannot find it on SRA. Did they delete it? I need help, thanks!



            • #7
              Has anybody succeeded in solving this "0 raw reads mapped" problem?
              I have exactly the same problem on an ia64 (Altix) machine. Using the same Typhi data with the same command-line arguments as in this thread fails on the ia64 but succeeds on my i586 desktop (using maq-0.7.1). It looks like there is an architecture problem in the match_search function (match.cc) ...
              The first differences when comparing the different runs show up after following command:
              maq map -n 2 -e 70 -u [email protected] [email protected] ref.bfa [email protected]
              Last edited by tsp; 06-19-2009, 04:23 AM.



              • #8
                MAQ does not work correctly on IA-64 Linux

                The MAQ code seems to be architecture dependent. I have the same problem: zero mapped reads on an IA-64 Linux machine. When I tried it on an x86 Linux machine, it worked fine. It might have something to do with 64-bit compilation. I can't use the m64 or mlp64 option even though it's a 64-bit machine.
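
Nothing in this thread pins down the cause, but since the reports differ by machine (PowerPC is big-endian, x86 is little-endian, and IA-64 is a 64-bit target), it may help to post the basic platform facts alongside the Maq log. A minimal sketch, with `byte_order` and `platform_report` as hypothetical helper names:

```shell
# Hypothetical helpers for describing the machine in a bug report.
byte_order() {
    # od interprets the two bytes 0x01 0x00 as one 16-bit integer in host byte order:
    # a little-endian host prints 0001, a big-endian host prints 0100.
    word=$(printf '\001\000' | od -An -tx2 | tr -d ' \n')
    case "$word" in
        0001) echo little ;;
        0100) echo big ;;
        *)    echo unknown ;;
    esac
}

platform_report() {
    echo "machine: $(uname -m)"             # e.g. i586, ia64
    echo "byte order: $(byte_order)"
    echo "long bits: $(getconf LONG_BIT)"   # width of a C long on this system
}
```

Running `platform_report` on both the failing and the working machine gives three lines that can be compared directly.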

                There is another strange problem. I have only 3 reads in the test file, yet maq map reports 3*2 reads:
                [ma_load_reads] 3*2 reads loaded.
                [mapping_count_single] 4, 4, 4, 4
                [maq_indel_pe] the indel detector only works with short-insert mate-pair reads.
                [match_data2mapping] 3 out of 6 raw reads are mapped with 0 in pairs.

                Has anyone encountered the same problem? I checked the map file etc., and nothing seems wrong apart from the incorrect reporting above.



                • #9
                  Yes, maq reports 2*(#reads) for single-ended reads.
                  I don't know why; maybe it is hardcoded in the message print?
                  I haven't bothered to look yet.

                  Best
                  -Jonathan
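
For checking many runs, the summary line can be pulled out of a saved log. This is a rough sketch based only on the `[match_data2mapping]` line format quoted in this thread; `maq_mapped_counts` and `maq_check_mapped` are hypothetical helper names, not Maq commands.

```shell
# Hypothetical helper: read a Maq log on stdin and print "<mapped> <raw>" taken
# from the last "[match_data2mapping] M out of R raw reads are mapped" line.
maq_mapped_counts() {
    sed -n 's/^ *\[match_data2mapping] \([0-9][0-9]*\) out of \([0-9][0-9]*\) raw reads are mapped.*/\1 \2/p' | tail -n 1
}

# Hypothetical helper: succeed only if the log reports at least one mapped read.
maq_check_mapped() {
    counts=$(maq_mapped_counts < "$1")
    [ -n "$counts" ] || { echo "no summary line found" >&2; return 2; }
    mapped=${counts% *}   # text before the space
    raw=${counts#* }      # text after the space
    echo "mapped=$mapped raw=$raw"
    [ "$mapped" -gt 0 ]
}
```

With the log from the first post saved as nohup.out, `maq_check_mapped nohup.out` would print `mapped=0 raw=48694392` and return a non-zero status. Because the raw count in these logs is twice the number of single-end reads, the sketch only tests for zero mapped reads and does not compute a mapped fraction.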
