Seqanswers Leaderboard Ad

Collapse

Announcement

Collapse
No announcement yet.
X
 
  • Filter
  • Time
  • Show
Clear All
new posts

  • sra toolkit fastq-dump for paired end read set

    I've read the threads out there but haven't been able to solve this problem.

    I am trying to separate a paired end sra formatted file to the corresponding two fastq mate pair files.

    I've used both:

    fastq-dump --split-files MyFile.sra
    fastq-dump --split-3 MyFile.sra

    The output consists of 2 files, but one of the files has a read length of 75, while the other has a read length of 9. Obviously, not working correctly.

    Here's what it looks like:

    @SRR514860-split_test.1 SOLEXA3_0001:5:1:1024:2576 length=75
    NTCGTTACAATATCCACCCTGTCCCCGAAGAATGCTCTTGNNNAGNNNNNNNNNNNNNNNNNTNNNNNNNNCTNA
    +SRR514860-split_test.1 SOLEXA3_0001:5:1:1024:2576 length=75
    #))))+++(*7AAAAAA75A33303AAAA##############################################
    @SRR514860-split_test.2 SOLEXA3_0001:5:1:1024:6017 length=75
    NCGCGTTCCGAAGCCATTTTCATCAAAATCTCGTGAAAAANNNATNNNNNNNNNNNNNNNNNANNNNNNNNTTNG
    +SRR514860-split_test.2 SOLEXA3_0001:5:1:1024:6017 length=75
    #++('+****5AAA83353.3388803.35AAAA#########################################
    @SRR514860-split_test.3 SOLEXA3_0001:5:1:1024:19859 length=75
    NGCGCCCCCTGTGCAGAGGTACTATTGCTGCTGCTGCTGCNNNTGNNNNNNNNNNNNNNNNNANNNNNNNNACNG

    ==> SRR514860-split_test_2.fastq <==
    @SRR514860-split_test.1 SOLEXA3_0001:5:1:1024:2576 length=9
    NNNNNNNNN
    +SRR514860-split_test.1 SOLEXA3_0001:5:1:1024:2576 length=9
    #########
    @SRR514860-split_test.2 SOLEXA3_0001:5:1:1024:6017 length=9
    NNNNNNNNN
    +SRR514860-split_test.2 SOLEXA3_0001:5:1:1024:6017 length=9
    #########
    @SRR514860-split_test.3 SOLEXA3_0001:5:1:1024:19859 length=9
    NNNNNNNNN


    Has anyone run into something similar before? Could this be due to the way the file was submitted and not necessarily the sra toolkit?

    Thanks,

    John

  • #2
    I had a similar problem and the only way to fix it was to ask SRA to reformat it on their end. This required the input of the original submitter to verify how it should be split.

    Comment


    • #3
      Thanks aaronh...SRA identified a problem with the file.

      Comment

      Latest Articles

      Collapse

      • seqadmin
        Essential Discoveries and Tools in Epitranscriptomics
        by seqadmin




        The field of epigenetics has traditionally concentrated more on DNA and how changes like methylation and phosphorylation of histones impact gene expression and regulation. However, our increased understanding of RNA modifications and their importance in cellular processes has led to a rise in epitranscriptomics research. “Epitranscriptomics brings together the concepts of epigenetics and gene expression,” explained Adrien Leger, PhD, Principal Research Scientist...
        04-22-2024, 07:01 AM
      • seqadmin
        Current Approaches to Protein Sequencing
        by seqadmin


        Proteins are often described as the workhorses of the cell, and identifying their sequences is key to understanding their role in biological processes and disease. Currently, the most common technique used to determine protein sequences is mass spectrometry. While still a valuable tool, mass spectrometry faces several limitations and requires a highly experienced scientist familiar with the equipment to operate it. Additionally, other proteomic methods, like affinity assays, are constrained...
        04-04-2024, 04:25 PM

      ad_right_rmr

      Collapse

      News

      Collapse

      Topics Statistics Last Post
      Started by seqadmin, Today, 11:49 AM
      0 responses
      8 views
      0 likes
      Last Post seqadmin  
      Started by seqadmin, Yesterday, 08:47 AM
      0 responses
      16 views
      0 likes
      Last Post seqadmin  
      Started by seqadmin, 04-11-2024, 12:08 PM
      0 responses
      61 views
      0 likes
      Last Post seqadmin  
      Started by seqadmin, 04-10-2024, 10:19 PM
      0 responses
      60 views
      0 likes
      Last Post seqadmin  
      Working...
      X