Unconfigured Ad

Collapse
X
 
  • Filter
  • Time
  • Show
Clear All
new posts
  • Retro
    Member
    • Apr 2011
    • 27

    new usage of SRA toolkit/ SRA archive data download

    It seems that the NCBI SRA archive changed the way how files can be downloaded. Up till now we used the link from SRA website to download files with Aspera Connect, then we used SRA toolkit to extract fasta sequences. Now the there is I must say a little confusing description that we are not able to apply. It seems that SRA toolkit can be used to directly process data from NCBI website. Did anybody solve this situation? We are working in Windows environment. Thanks.
    link to SRA description:
  • GenoMax
    Senior Member
    • Feb 2008
    • 7142

    #2
    Can you post an example of an accession # that is not working as expected?

    Comment

    • Retro
      Member
      • Apr 2011
      • 27

      #3
      The change applies for all SRA files. So a random example:


      when I go to the download tab, there used to be links to FTP and Aspera downloads. Now there is only the new description on the use of SRA toolkit.

      Comment

      • GenoMax
        Senior Member
        • Feb 2008
        • 7142

        #4
        There is always the option of getting the fastq files directly from ENA avoiding sratoolkit altogether.

        Corresponding URL for the example you posted above:

        ftp://ftp.sra.ebi.ac.uk/vol1/fastq/SRR617/SRR617107/

        ftp://ftp.sra.ebi.ac.uk/vol1/srr/SRR617/SRR617107

        Comment

        • GenoMax
          Senior Member
          • Feb 2008
          • 7142

          #5
          Corresponding NCBI SRA direct URL (using information from SRA link you included above):

          ftp://ftp-trace.ncbi.nih.gov/sra/sra...617/SRR617107/

          Comment

          • Retro
            Member
            • Apr 2011
            • 27

            #6
            OK, that works, thanks.

            However, it goes through regular download, the Aspera connection was much better. If I understand it correctly, SRA toolkit now allows processing the files directly from the NCBI site without the need to download them. For example, using the fastq dump to transform .sra files to fasta. Base on the description available on NCBI (link bellow), I was not able to do it though.

            Comment

            • GenoMax
              Senior Member
              • Feb 2008
              • 7142

              #7
              Originally posted by Retro View Post
              However, it goes through regular download, the Aspera connection was much better. If I understand it correctly, SRA toolkit now allows processing the files directly from the NCBI site without the need to download them. For example, using the fastq dump to transform .sra files to fasta. Base on the description available on NCBI (link bellow), I was not able to do it though.

              http://www.ncbi.nlm.nih.gov/books/NB...sra_data_using
              After upgrading to the latest sratoolkit (v.2.4.2-1) I tried the new method out. Here is what I discovered.

              In order to get the downloads to work, every user (especially if you are on a shared system/cluster) will have to run the configuration utility (help located at: http://trace.ncbi.nlm.nih.gov/Traces...lkit_doc&f=std) and set an appropriate path for storing configuration directories/files. Remember to save settings before you exit the utility.

              Hint: Do the following in a xterm/X11 window if you want the text to be properly formatted.

              Code:
              $ /path_to/vdb-config -i
              Once this is done then you will be able to download fastq files (and other data) directly from NCBI without downloading the .sra files.

              Following example only prints five reads to screen

              Code:
              $ /path_to/fastq-dump -X 5 -Z SRR390729
              This command will then download the full data file as fastq to the current directory
              Code:
              $ /path_to/fastq-dump SRR390729
              Last edited by GenoMax; 11-20-2014, 10:37 AM.

              Comment

              • fibar
                Member
                • Feb 2013
                • 19

                #8
                I confirm GenoMax last reply. I updated my version to 2.5.2 and it's working with the mentioned commands.

                This new version includes the setting of a proxy at the 'vdb-config -i' window, which in my case I had to enable and add as 'proxyort'. If not, the process remained stuck with no warnings.

                If you don't specify a directory, it will be downloaded at the one you are standing.

                Remember '--split-files' when you are downloading PE reads.

                Comment

                Latest Articles

                Collapse

                • SEQadmin2
                  From Collection to Sequencing: Why Sample Preparation and Preservation Define Sequencing Data
                  by SEQadmin2


                  Data variability is still an issue in sequencing technologies despite the advances in reproducibility and accuracy of these platforms. But the problem does not originate in the sequencing itself, but in the previous steps, before the sample reaches the sequencer.


                  The first step is collection, followed by preservation and sample preparation for analysis. Most scientists overlook those steps, but not being careful might just be skewing the experiment’s results.
                  ...
                  06-02-2026, 10:05 AM
                • SEQadmin2
                  Single-Cell Sequencing at an Inflection Point: Early Impacts of New Platforms and Emerging Trends
                  by SEQadmin2


                  With the launch of new single-cell sequencing platforms in 2026, the field stands at an exciting inflection point. This article surveys the most impactful advances in the field and discusses how they’re reshaping research in cancer, immunology, and beyond.


                  Introduction

                  Single-cell sequencing technologies have undergone remarkable advances over the past decade, transitioning from low-throughput experimental approaches to highly scalable platforms capable of...
                  05-22-2026, 06:42 AM
                • SEQadmin2
                  Environmental Genomics in the Age of NGS: From Microbes to Conservation Strategies
                  by SEQadmin2

                  Studying ecosystems means dealing with complex, multi-species communities that are hard to observe at scale. This complexity, however, hides many important questions to be answered, from how biogeochemical cycles work and how climate change can affect species distribution to how conservation strategies can work best.


                  Genomics, particularly since the expansion of NGS, has transformed ecosystem ecology. By sequencing environmental DNA, we can now assess biodiversity without direct...
                  05-06-2026, 09:04 AM

                ad_right_rmr

                Collapse

                News

                Collapse

                Topics Statistics Last Post
                Started by SEQadmin2, 06-02-2026, 12:03 PM
                0 responses
                19 views
                0 reactions
                Last Post SEQadmin2  
                Started by SEQadmin2, 06-02-2026, 11:40 AM
                0 responses
                14 views
                0 reactions
                Last Post SEQadmin2  
                Started by SEQadmin2, 05-28-2026, 11:40 AM
                0 responses
                29 views
                0 reactions
                Last Post SEQadmin2  
                Started by SEQadmin2, 05-26-2026, 10:12 AM
                0 responses
                31 views
                0 reactions
                Last Post SEQadmin2  
                Working...