Seqanswers Leaderboard Ad

Collapse

Announcement

Collapse
No announcement yet.
X
 
  • Filter
  • Time
  • Show
Clear All
new posts

  • Maq and export format

    Maq is now being distributed with a tool to convert export format files to standard fastq, and I'm wondering if anyone knows how it treats the reads which failed the Illumina pipeline quality threshold test.

    Thanks.

  • #2
    export2std for chastity filtered reads

    This function is implemented in the script fq_all2std.pl. I had assumed it would use all reads but later found out that it only took the read (pairs) that passed chastity filtering (there is a Y/N flag in the export file that marks this). You can comment out the line in fq_all2std.pl if you want to include the reads that failed this check:

    #if ($t[21] eq 'Y') {
    formatting code....
    print "+\n$qual\n";
    #}


    Ryan

    Comment


    • #3
      export2maq

      In a similar vein, I have been hoping to hear from the Maq community whether anyone has had any success running the export2maq function. This is meant to take an export file and reformat it into a .map file (forgoing the need to run maq map or maq assemble). I always get a segmentation fault when I try to use this function (have tried on various platforms). I have gotten no response from the maq-help mailing list, so I thought I would try here.

      Thanks,
      Ryan

      Comment


      • #4
        <bump>

        I get the same segmentation fault. I had another query, do I need to feed in both the export files for the mate pair (I would expect so).. regardless its a seg fault either ways
        --
        bioinfosm

        Comment


        • #5
          I'm glad to see that someone out there is seeing the same issue. I have isolated the problem and made a simple workaround. The converter is expecting certain fields in the export file to be filled, but our pipeline leaves them empty. I pass the exports through a perl script that puts dummy fields in those columns before passing them to export2maq. Note, I pass in both the forward and reverse export files (e.g. cat export_*.txt | add_dummy_values.pl | maq export2maq out.map in.list -)

          Comment


          • #6
            Originally posted by myrna View Post
            I'm glad to see that someone out there is seeing the same issue. I have isolated the problem and made a simple workaround. The converter is expecting certain fields in the export file to be filled, but our pipeline leaves them empty. I pass the exports through a perl script that puts dummy fields in those columns before passing them to export2maq. Note, I pass in both the forward and reverse export files (e.g. cat export_*.txt | add_dummy_values.pl | maq export2maq out.map in.list -)
            Thanks Ryan.
            I had also figured of different formatting for my export output, because of the way my reference sequence for eland was formatted.

            Anyways, I guess you are doing downstream analysis to detect structural variants, could you share how you proceed?

            I wish to use the maq sv script, but am not getting across the usage..
            --
            bioinfosm

            Comment


            • #7
              To bioinfosm:

              The input should be something like:

              maq mapview -b aln.map | awk '$6!=18&&$9>=10&&$6!=64&&$6!=192&&$6!=130' > sv.inp

              You can add more filters in the awk command to control the reads passed through maq.pl sv

              Comment


              • #8
                Originally posted by lh3 View Post
                To bioinfosm:

                The input should be something like:

                maq mapview -b aln.map | awk '$6!=18&&$9>=10&&$6!=64&&$6!=192&&$6!=130' > sv.inp

                You can add more filters in the awk command to control the reads passed through maq.pl sv
                Hi Heng,
                I get nothing when using the above filters. Starangly, my 9th column (Quality ?) is 0! Any suggestion as to what might be going wrong there?

                and btw, happy new year!
                --
                bioinfosm

                Comment


                • #9
                  The 9th column is the smaller single-end mapping quality of the two reads in a pair. It is actually the mapping quality of an abnormal pair. You can reconstruct this column anyway.

                  Comment

                  Latest Articles

                  Collapse

                  • seqadmin
                    Essential Discoveries and Tools in Epitranscriptomics
                    by seqadmin




                    The field of epigenetics has traditionally concentrated more on DNA and how changes like methylation and phosphorylation of histones impact gene expression and regulation. However, our increased understanding of RNA modifications and their importance in cellular processes has led to a rise in epitranscriptomics research. “Epitranscriptomics brings together the concepts of epigenetics and gene expression,” explained Adrien Leger, PhD, Principal Research Scientist...
                    04-22-2024, 07:01 AM
                  • seqadmin
                    Current Approaches to Protein Sequencing
                    by seqadmin


                    Proteins are often described as the workhorses of the cell, and identifying their sequences is key to understanding their role in biological processes and disease. Currently, the most common technique used to determine protein sequences is mass spectrometry. While still a valuable tool, mass spectrometry faces several limitations and requires a highly experienced scientist familiar with the equipment to operate it. Additionally, other proteomic methods, like affinity assays, are constrained...
                    04-04-2024, 04:25 PM

                  ad_right_rmr

                  Collapse

                  News

                  Collapse

                  Topics Statistics Last Post
                  Started by seqadmin, Yesterday, 08:47 AM
                  0 responses
                  14 views
                  0 likes
                  Last Post seqadmin  
                  Started by seqadmin, 04-11-2024, 12:08 PM
                  0 responses
                  60 views
                  0 likes
                  Last Post seqadmin  
                  Started by seqadmin, 04-10-2024, 10:19 PM
                  0 responses
                  60 views
                  0 likes
                  Last Post seqadmin  
                  Started by seqadmin, 04-10-2024, 09:21 AM
                  0 responses
                  54 views
                  0 likes
                  Last Post seqadmin  
                  Working...
                  X