Hi,
Recently we've been trying to decrypt some SRA files of the same project to get the FastQ data. As we got the FastQ files, however, we also received some warnings as shown below:
For each SRA file we decrypted "successfully(see below)", we will get exactly 5 copies of such warnings.
A "successful" decryption here means that the FastQ files do have read information, and their sizes also seem to be reasonable. However, we're still not sure whether the decryption has led to any loss of data, especially some important information about the reads themselves (e.g. we have lost some reads).
So here are the problems we'd like to ask:
Here are the details of our decryption:
Thanks in advance!
Yang
Recently we've been trying to decrypt some SRA files of the same project to get the FastQ data. As we got the FastQ files, however, we also received some warnings as shown below:
Code:
2014-01-22T00:04:11 fastq-dump.2.3.2 warn: column not found while opening table within short read archive module - column LABEL 2014-01-22T00:04:11 fastq-dump.2.3.2 warn: column not found while opening table within short read archive module - column LABEL_START 2014-01-22T00:04:11 fastq-dump.2.3.2 warn: column not found while opening table within short read archive module - column LABEL_LEN
A "successful" decryption here means that the FastQ files do have read information, and their sizes also seem to be reasonable. However, we're still not sure whether the decryption has led to any loss of data, especially some important information about the reads themselves (e.g. we have lost some reads).
So here are the problems we'd like to ask:
- Is there any difference with respect to read information between the FastQ files decrypted from SRA files with or without the warnings mentioned above?
- If yes, what are the differences?
Here are the details of our decryption:
- sratoolkit used: version 2.3.2-5-centos_linux64 (the newest version when we downloaded the data and tried to decrypt them)
- the decryption needs a repository key, and we set it up using the GUI started up by sratoolkit.jar
- program used to decrypt SRA files: fastq-dump
- command line used to decrypt SRA files: fastq-dump --outdir $OUTPUT_DIR --bzip2 --split-3 --keep-empty-files --log-level info $SRA_FILE
- each SRA file is a paired-end RNA-Seq data of one biological sample produced by Illumina HiSeq 2000, and the read length is always 76bp.
Thanks in advance!
Yang
Comment