Seqanswers Leaderboard Ad

Collapse

Announcement

Collapse
No announcement yet.
X
 
  • Filter
  • Time
  • Show
Clear All
new posts

  • problem understanding NCBI SRA fastq files

    I downloaded some sra files from NCBI's short read archive and converted them to fastq format. The experiment is described as paired end reads, so I expected to get two fastq files from each sra file. Instead, I only got one fastq file from each. Then I thought that I could find which reads were read1 reads and which ones were read2 reads, but I couldn't see anything to indicate whether it's a read1 or a read2. Here are some lines from one of the files:


    @SRR254172.11 ILLUMINA-20A1B2_0004_FC6282EAAXX:6:1:1921:953 length=160
    NACAAAGGTAATTGCAAGTCCCTTCGTGCCAAAACGTCCAGCCCTTCCAACCCTGTGCAAATAAGTATCAGCTGAGTCTGAATCTGCATTCATTCTGGAATGACTCAGGAAGAAAGGCTAACAAGATATAAGAACTTCAAGGAAGGCCACAAGAGAATTC
    +SRR254172.11 ILLUMINA-20A1B2_0004_FC6282EAAXX:6:1:1921:953 length=160
    #)0+)**2,,@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@:3:::@@@22:<<:8@@:@@@@@@@IIHIIIIIIII?HHIIFIGIIIIIIEGIGHIIIIFAIBDIHHGEHDBEFIIB<IIHHI3EFEDFC@HH@F@2;8<>@0??

    You get one line starting with @, then a line with the sequence, then a line essentially identical to the @ line except starting with + rather than @, and then a line with base quality scores.

    Does anyone understand this format and how I can get fastq files for both read1 and read2?

    Thank you.

    Eric

  • #2
    The basics of FASTQ are described here http://nar.oxfordjournals.org/conten...r.gkp1137.full and http://en.wikipedia.org/wiki/FASTQ_format

    How did you do the conversion? I recall there are extra switches needed at the command line for paired end data...
    Last edited by maubp; 03-30-2012, 12:23 AM.

    Comment


    • #3
      Originally posted by maubp View Post
      How did you do the conversion? I recall there are extra switches needed at the command line for paired end data...
      Hi Maubp,

      I'll bet that that's where I'm making a mistake. I did the conversion in two ways, neither of which gave me the paired end reads I wanted:

      fastq-dump *.sra

      fastq-dump.2 *.sra

      Eric

      Comment


      • #4
        You need to use the --split-3 option.

        fastq-dump --split-3 *.sra

        Comment


        • #5
          Originally posted by jrm5100 View Post
          You need to use the --split-3 option.

          fastq-dump --split-3 *.sra
          Thanks so very much!!

          Eric

          Comment

          Latest Articles

          Collapse

          • seqadmin
            Recent Advances in Sequencing Analysis Tools
            by seqadmin


            The sequencing world is rapidly changing due to declining costs, enhanced accuracies, and the advent of newer, cutting-edge instruments. Equally important to these developments are improvements in sequencing analysis, a process that converts vast amounts of raw data into a comprehensible and meaningful form. This complex task requires expertise and the right analysis tools. In this article, we highlight the progress and innovation in sequencing analysis by reviewing several of the...
            05-06-2024, 07:48 AM
          • seqadmin
            Essential Discoveries and Tools in Epitranscriptomics
            by seqadmin




            The field of epigenetics has traditionally concentrated more on DNA and how changes like methylation and phosphorylation of histones impact gene expression and regulation. However, our increased understanding of RNA modifications and their importance in cellular processes has led to a rise in epitranscriptomics research. “Epitranscriptomics brings together the concepts of epigenetics and gene expression,” explained Adrien Leger, PhD, Principal Research Scientist...
            04-22-2024, 07:01 AM

          ad_right_rmr

          Collapse

          News

          Collapse

          Topics Statistics Last Post
          Started by seqadmin, Yesterday, 06:35 AM
          0 responses
          15 views
          0 likes
          Last Post seqadmin  
          Started by seqadmin, 05-09-2024, 02:46 PM
          0 responses
          21 views
          0 likes
          Last Post seqadmin  
          Started by seqadmin, 05-07-2024, 06:57 AM
          0 responses
          18 views
          0 likes
          Last Post seqadmin  
          Started by seqadmin, 05-06-2024, 07:17 AM
          0 responses
          19 views
          0 likes
          Last Post seqadmin  
          Working...
          X