Unconfigured Ad

Collapse
X
 
  • Filter
  • Time
  • Show
Clear All
new posts
  • pcantalupo
    Junior Member
    • Sep 2011
    • 1

    SRA to fastq conversion with fastq-dump loses sequences

    Hello,

    I converted an SRA archive (ftp://ftp-trace.ncbi.nlm.nih.gov/sra...953/SRR073769/) to fastq with the fastq-dump program (sratoolkit-2.1.6). The resulting fastq file had ~160,000 less sequences (2% of the total number of spots) than expected. Why does this occur?

    Thank you,

    Paul
  • eeh_021
    Junior Member
    • Aug 2012
    • 3

    #2
    I've also experienced this problem. Did you find a solution?

    Thank you,

    Elizabeth

    Comment

    • ymc
      Senior Member
      • Mar 2010
      • 496

      #3
      how do u know the expected number of sequences?

      Comment

      • GenoMax
        Senior Member
        • Feb 2008
        • 7142

        #4
        I am seeing the same number of sequences as reported on the SRA page:



        in the file I downloaded.

        Code:
        ../sratoolkit.2.1.16-centos_linux64/bin/fastq-dump.2.1.18 SRR073769.sra 
        Written 8175900 spots for SRR073769.sra
        Written 8175900 spots total
        Code:
        $ more SRR073769.fastq | grep "@SRR073769" | wc -l
        8175900

        Comment

        • eeh_021
          Junior Member
          • Aug 2012
          • 3

          #5
          Oh, I see.

          I'm still a little confused about something:

          This file, SRR035116.sra, for example, is 3.9Gb
          When I convert it to fastq, however, it is only 2.2Gb. I checked the number of spots, and it's, surprisingly, the same.

          Usually when I convert sra to fastq, my files get a lot bigger. Help?

          Thank you!!

          Comment

          • GenoMax
            Senior Member
            • Feb 2008
            • 7142

            #6
            .sra file is ~383 Mb and the .fastq file is 1.6 G (on my filesystem). If your .sra file is truly that large then something must be wrong.

            Use the aspera client that SRA provides to download the .sra file.


            Originally posted by eeh_021 View Post

            I'm still a little confused about something:

            This file, SRR035116.sra, for example, is 3.9Gb
            When I convert it to fastq, however, it is only 2.2Gb. I checked the number of spots, and it's, surprisingly, the same.

            Usually when I convert sra to fastq, my files get a lot bigger. Help?

            Thank you!!

            Comment

            • eeh_021
              Junior Member
              • Aug 2012
              • 3

              #7
              383Mb?

              If you go there, it is supposed to be 3.9Gb, and that's about how big it is when I download it...

              Comment

              • seb567
                Senior Member
                • Jul 2008
                • 260

                #8
                Originally posted by eeh_021 View Post
                383Mb?

                If you go there, it is supposed to be 3.9Gb, and that's about how big it is when I download it...
                Perhaps it is more straightforward to fetch it from Europe or Japan.

                Compressed files (.fastq.gz or .fastq.bz2) are just easier to use than those .sra files.


                Sébastien Boisvert

                Comment

                • GenoMax
                  Senior Member
                  • Feb 2008
                  • 7142

                  #9
                  We are talking about two different data sets.

                  My response was for the dataset (SRR073769) that was in pcantalupo's original post.

                  Dataset you are referring to below is indeed 3.9 Gb.


                  Originally posted by eeh_021 View Post
                  383Mb?

                  If you go there, it is supposed to be 3.9Gb, and that's about how big it is when I download it...

                  Comment

                  • csmatyi
                    Member
                    • Oct 2011
                    • 25

                    #10
                    Originally posted by eeh_021 View Post
                    Oh, I see.

                    I'm still a little confused about something:

                    This file, SRR035116.sra, for example, is 3.9Gb
                    When I convert it to fastq, however, it is only 2.2Gb. I checked the number of spots, and it's, surprisingly, the same.

                    Usually when I convert sra to fastq, my files get a lot bigger. Help?

                    Thank you!!
                    I don't think that's a problem if the fastq file gets bigger because the sra file is in binary anyway, which is more compact.

                    Comment

                    • alireda82
                      Junior Member
                      • Jun 2014
                      • 1

                      #11
                      how to convert SRA file to FASTQ?

                      Comment

                      • GenoMax
                        Senior Member
                        • Feb 2008
                        • 7142

                        #12
                        Originally posted by alireda82 View Post
                        how to convert SRA file to FASTQ?
                        Use SRA toolkit: http://eutils.ncbi.nih.gov/Traces/sr...lkit_doc&f=std

                        Comment

                        • VC87
                          Member
                          • Oct 2015
                          • 18

                          #13
                          Hi everyone, i'm new here!
                          Can someone tell-me if it's possible to cenvert a WIG file type to FASTQ?thanks in advance

                          Comment

                          • blancha
                            Senior Member
                            • May 2013
                            • 367

                            #14
                            No, the Wig files do not contain the sequences, just the coverage.

                            Comment

                            Latest Articles

                            Collapse

                            ad_right_rmr

                            Collapse

                            News

                            Collapse

                            Topics Statistics Last Post
                            Started by SEQadmin2, 06-09-2026, 11:58 AM
                            0 responses
                            30 views
                            0 reactions
                            Last Post SEQadmin2  
                            Started by SEQadmin2, 06-05-2026, 10:09 AM
                            0 responses
                            38 views
                            0 reactions
                            Last Post SEQadmin2  
                            Started by SEQadmin2, 06-04-2026, 08:59 AM
                            0 responses
                            42 views
                            0 reactions
                            Last Post SEQadmin2  
                            Started by SEQadmin2, 06-02-2026, 12:03 PM
                            0 responses
                            64 views
                            0 reactions
                            Last Post SEQadmin2  
                            Working...