Unconfigured Ad

Collapse
X
 
  • Filter
  • Time
  • Show
Clear All
new posts
  • thinkRNA
    Member
    • Jan 2010
    • 94

    FASTX-Toolkit: quality score value

    I get an error when I use FASTX-Toolkit : Please let me know how to get Fastx to work?

    ./bin/fastx_clipper -a "ATCTCGTATGCCGTCTTCTGCTTGAAAAAAAAAA" -l 5 -k -o test.out -i test.fastq
    fastx_clipper: Invalid quality score value (char '#' ord 35 quality value -29) on line 4

    this is the test.fastq
    @GA6_00012:6:1:5480:933#0/1
    NCTGGGCACTGGGGTGCTGTGGGCAGATATGGGCTACAGAACAAGGAAAGACCCTGGTCCCGGAAGGTGCCAGCTG
    +
    #,,.225425@@5@@22@@@@@@@@(''%%'()'(@@@@@@@@@@@@@@@@@@7@#####################
  • simonandrews
    Simon Andrews
    • May 2009
    • 870

    #2
    There was another thread very recently which covered this. You need to add the -Q33 parameter to tell it that you're using Illumina encoded quality scores, not Sanger encoding.

    Comment

    • maubp
      Peter (Biopython etc)
      • Jul 2009
      • 1544

      #3
      Originally posted by simonandrews View Post
      There was another thread very recently which covered this. You need to add the -Q33 parameter to tell it that you're using Illumina encoded quality scores, not Sanger encoding.
      The other way round

      It assumes Illumina (ASCII offset 64), but you have Sanger (offset 33), hence -Q33

      See: http://seqanswers.com/forums/showthread.php?t=7399

      Comment

      • mgogol
        Senior Member
        • Mar 2008
        • 197

        #4
        Just a note, with Casava 1.8, qualities are switched to offset of 33 in illumina fastq files. So to use these with fastx toolkit, you'll also have to use the -Q33 flag.

        Comment

        • Nanu
          Member
          • Sep 2014
          • 30

          #5
          When I am running the command fastq to fasta format through Fastx toolkit..It was showing the following error in 64 bit computer on Linux.
          fastq_to_fasta: Invalid quality score value (char '(' ord 40 quality value -24) on line 8

          Now please give the solution for this problem.

          Comment

          • Nanu
            Member
            • Sep 2014
            • 30

            #6
            When I am running the command for fastq trimmer in fastx toolkit..it was showing the error :

            fastx_trimmer: Invalid quality score value (char '(' ord 40 quality value -24) on line 8

            Please suggest the solution to resolve this issue.

            Comment

            • maubp
              Peter (Biopython etc)
              • Jul 2009
              • 1544

              #7
              What was the command you ran Nanu, and did you use the -Q33 flag?

              Comment

              • Nanu
                Member
                • Sep 2014
                • 30

                #8
                I ran :
                ./fastx_trimmer -t 30 -i <filename> -o <filename2>
                No I didnt use any flag.
                I am new for this..so please guide me how to use -Q33 flag.

                Comment

                • Nanu
                  Member
                  • Sep 2014
                  • 30

                  #9
                  I ran :
                  ./fastx_trimmer -t 30 -i <filename> -o <filename2>

                  fastq_to_fasta -h -r -v -i <filename.fastq> -o <filename.fasta>

                  No I didnt use any flag.
                  I am new for this..so please guide me how to use -Q33 flag.

                  Comment

                  • maubp
                    Peter (Biopython etc)
                    • Jul 2009
                    • 1544

                    #10
                    With the fastx tool kit you must add -Q33 to the command to tell it to use the Sanger FASTQ encoding (with quality score offset of 33). Unfortunately some tools default to the now obsolete Illumina encoding (which used an offset of 64), see http://en.wikipedia.org/wiki/FASTQ_format and http://dx.doi.org/10.1093%2Fnar%2Fgkp1137

                    Comment

                    • Nanu
                      Member
                      • Sep 2014
                      • 30

                      #11
                      I have Illumina Seq data
                      Data format:

                      @HWI-1KL139:40:C0BW5ACXX:1:1102:19408:132180 1:N:0:
                      TGAACGGACTGGAAACACTGGTCATAATCATGGTGGCGAATAAGTACGCGTTCTTGCAAATCACCAGAAGGCGGTTCCTG
                      AATGAATGGGAAGCCTTCAAG
                      +
                      BCCFFFFDFHGHFIIJJJJJJIIIJGIJJGIJJ?GIHHGHIEGGFHIJJJJGHGHHFFFFFFEEEADDDDDBDB5<?CDD
                      DDDDCCEDDDDDB<ACDDCCC
                      @HWI-1KL139:40:C0BW5ACXX:1:1102:19487:132183 1:N:0:
                      CTGATTCTGAACAGCTTCTTGGGAAGTAGCGACAGCTTGGTTTTTAGTGAGTTGTTCCATTCTTTAGCTCCTAGACCTTT
                      AGCAGCAAGGTCCATATCTGA
                      +
                      @C@FFFDDD?AFDF@GHHHIIGI@@FBEHIIGD:FDFHII??FDG@FHIHE=CGIIE@AFHHEHECA;?=AEFFCBDDED
                      3>AA@BB(;;:>>>CD>CC@A
                      @HWI-1KL139:40:C0BW5ACXX:1:1102:19280:132183 1:N:0:
                      GACTTCACGCCAGAATACGAAAGACCAGGTATATGCACAAAATGAGATGCTTGCTTATCAACAGAAGGAGTCTACTGCTC
                      GCGTTGCGTCTATTATGGAAA
                      +
                      CCCFFFFFHHHHHJJJJJJIIJJJJJJJJFHIIJJIJJJJJJJJJJIJJJJJGIIJJJJGGGJJJJJHHHHEEDFFFFDE
                      DDDDDDDDBDDDEEEEDCACD

                      Where I have to mention it???

                      Comment

                      • maubp
                        Peter (Biopython etc)
                        • Jul 2009
                        • 1544

                        #12
                        From the read naming convention, your Illumina data is using the standard "Sanger" FASTQ encoding.

                        Comment

                        • mgogol
                          Senior Member
                          • Mar 2008
                          • 197

                          #13
                          Just add it as an argument. I wish this was better documented.

                          ./fastx_trimmer -Q33 -t 30 -i <filename> -o <filename2>

                          I don't know if this one needs it as well?
                          fastq_to_fasta -Q33 -h -r -v -i <filename.fastq> -o <filename.fasta>

                          Comment

                          • Brian Bushnell
                            Super Moderator
                            • Jan 2014
                            • 2709

                            #14
                            You might want to try Reformat. It automatically detects quality encoding (though you can override it), is much faster than fastx, does a better job of quality-trimming, and does fastq-to-fasta conversion in the same time as quality-trimming. And it keeps paired reads together.

                            Comment

                            Latest Articles

                            Collapse

                            ad_right_rmr

                            Collapse

                            News

                            Collapse

                            Topics Statistics Last Post
                            Started by SEQadmin2, 06-05-2026, 10:09 AM
                            0 responses
                            19 views
                            0 reactions
                            Last Post SEQadmin2  
                            Started by SEQadmin2, 06-04-2026, 08:59 AM
                            0 responses
                            34 views
                            0 reactions
                            Last Post SEQadmin2  
                            Started by SEQadmin2, 06-02-2026, 12:03 PM
                            0 responses
                            55 views
                            0 reactions
                            Last Post SEQadmin2  
                            Started by SEQadmin2, 06-02-2026, 11:40 AM
                            0 responses
                            24 views
                            0 reactions
                            Last Post SEQadmin2  
                            Working...