Seqanswers Leaderboard Ad

Collapse

Announcement

Collapse
No announcement yet.
X
 
  • Filter
  • Time
  • Show
Clear All
new posts

  • problem with sra toolkit fastq-dump sratoolkit.2.1.10-win64

    Hello,

    I was trying to use fastq-dump to convert a .sra file to .fastq format.

    The data contains paired end reads, so when I type in commands,
    fastq-dump --split-files .sra, it gives two files, *_1.fastq and *_2.fastq.

    But the problem is the size of the two files are different, one is a lot bigger than the other. This doesn't seems to be right.

    Does anyone know why this happens? and how to fix it? Shouldn't the two files contain the same number of reads, so that the size should be the same as well?

    Thank you,

    Hui
    Last edited by hui_shi; 06-15-2012, 05:11 AM.

  • #2
    Could you post the run accession please?

    Comment


    • #3
      Thanks for your reply. I didn't use --accession or -A, don't know if that's what you mean. Actually, I don't understand why using -A, except to modify the output name.

      The full command I typed in was:

      fastq-dump --split-files SRR443885.sra

      it generates two files, SRR443885_1.fastq and SRR443885_2.fastq, but they are of different size.

      Thanks,

      Hui
      Last edited by hui_shi; 06-15-2012, 06:01 AM.

      Comment


      • #4
        The spot descriptor claims that the read length is 150bp, 75 forward and 75 reverse. But the actual read length is 100 bases only. I guess fastq-dump just obeys the rules by splitting the read at 75, so the reverse read becomes 3 times smaller than the forward.

        The run metadata is here:


        The experiment metadata is here:


        You can contact them at sra@ncbi about this.

        Comment


        • #5
          regarding small sra format file

          I wanted to convert .sra file of size 289.98 Mb. when i am using sratool fastq-dump.. it is generating error message as:-



          An error occurred during processing.
          A report was generated into the file '.../ncbi_error_report.xml'.
          If the problem persists, you may consider sending the file


          kindly help me out for this ASAP

          Comment


          • #6
            Did you try looking at what was in the file? Given that it's an error report, perhaps it's informative...

            Comment


            • #7
              Respected dpryan,


              The file is in xml format which on opening with google chrome is giving code. It does'nt have any error specified there in...

              Here in is the file which i am trying to convert to fastq.

              Comment


              • #8
                Try upgrading your copy of the SRA toolkit, that usually fixes this sort of issue (I tried to extract the file you linked and couldn't until I upgraded my local copy of fastq-dump).

                Comment


                • #9
                  kindly tell me how can i upgrade sra tool kit??


                  Is it through verbose??

                  Comment


                  • #10
                    Try here, on NCBI

                    Comment


                    • #11
                      Dear hui_shi,

                      I have the same problem with convert the sra to pair-end fastq.
                      Do you remember how to solve this problem?

                      Thank you very much!!

                      Comment


                      • #12
                        Are you using the latest SRAtoolkit? Have you done the steps described in this post: http://seqanswers.com/forums/showpos...6&postcount=7?

                        Comment


                        • #13
                          Thank you GenoMax,

                          I used the latest version.
                          In that way we need to redownload the file?
                          Thank you! If I can't solve this, may be I need to download again!

                          Best wishes

                          Comment


                          • #14
                            If you have downloaded the .sra file have you tried the command this way?

                            Code:
                            c:\> fastq-dump.exe -F --split-files c:\path_to\SRAfile.sra
                            Sometimes people do submit unequal length data sets so if you have mismatched cycle numbers that is a possibility. What SRA# are you looking at?

                            Comment

                            Latest Articles

                            Collapse

                            • seqadmin
                              Current Approaches to Protein Sequencing
                              by seqadmin


                              Proteins are often described as the workhorses of the cell, and identifying their sequences is key to understanding their role in biological processes and disease. Currently, the most common technique used to determine protein sequences is mass spectrometry. While still a valuable tool, mass spectrometry faces several limitations and requires a highly experienced scientist familiar with the equipment to operate it. Additionally, other proteomic methods, like affinity assays, are constrained...
                              04-04-2024, 04:25 PM
                            • seqadmin
                              Strategies for Sequencing Challenging Samples
                              by seqadmin


                              Despite advancements in sequencing platforms and related sample preparation technologies, certain sample types continue to present significant challenges that can compromise sequencing results. Pedro Echave, Senior Manager of the Global Business Segment at Revvity, explained that the success of a sequencing experiment ultimately depends on the amount and integrity of the nucleic acid template (RNA or DNA) obtained from a sample. “The better the quality of the nucleic acid isolated...
                              03-22-2024, 06:39 AM

                            ad_right_rmr

                            Collapse

                            News

                            Collapse

                            Topics Statistics Last Post
                            Started by seqadmin, 04-11-2024, 12:08 PM
                            0 responses
                            25 views
                            0 likes
                            Last Post seqadmin  
                            Started by seqadmin, 04-10-2024, 10:19 PM
                            0 responses
                            27 views
                            0 likes
                            Last Post seqadmin  
                            Started by seqadmin, 04-10-2024, 09:21 AM
                            0 responses
                            24 views
                            0 likes
                            Last Post seqadmin  
                            Started by seqadmin, 04-04-2024, 09:00 AM
                            0 responses
                            52 views
                            0 likes
                            Last Post seqadmin  
                            Working...
                            X