Seqanswers Leaderboard Ad

Collapse

Announcement

Collapse
No announcement yet.
X
 
  • Filter
  • Time
  • Show
Clear All
new posts

  • Trouble with processing paired-end data with Pandaseq, any help appreciated

    Hi all,

    Not sure if this is best off posted in the Illumina forum or here really but here goes.

    I've been trying to use the Pandaseq tool to process some paired end MiSeq data, but I get an error message when I try to run it with my files.

    Output looks like this:

    jimothyh@manager-desktop[Documents] pandaseq -f 0454_Houghton_NoIndex_L001_R1_001.fastq -r 0454_Houghton_NoIndex_L001_R2_001.fastq
    INFO VER pandaseq 2.0 <[email protected]>
    ERR BADID @MISEQ:9:000000000-A1BL8:1:1101:17377:1320 1:N:0:
    STAT TIME Tue Sep 18 11:48:38 2012

    STAT ELAPSED 0
    STAT READS 0
    STAT NOALGN 0
    STAT LOWQ 0
    STAT OK 0
    INFO API 1

    I checked the program using some sample data and it worked fine. Someone else reported an identical error on github also as a result of trying to process MiSeq data with pandaseq. I'm wondering if there is something about the MiSeq ID that the program does not like, and if so if there might be a way of altering it so that it does work?

    My fastq files look like this:

    Code:
    @MISEQ:9:000000000-A1BL8:1:1101:17377:1320 1:N:0:
    NGCCTGACACCTGCCCGGTGCTGGAAGGTTAAAAGGAGAAGTCATCGCAAGAGAAGCTTTGAATTGAAGCCCCAGTAAACGGCGGCCGTAACAATAACGGTCCTAAGGTAGCGAAATTCCTTGTCGGGTAAGTTCCGACCTGCACGAATGG
    +
    #5<???BBDDEDDDDDGGCGGGIIFIIHFHIIIIIHHFHIHHHHHIIHHHHIIIIIIIIIIIIIIIIIHIIIHHHHIIHIEHHHHHGGGGGGGGGGGGEECEGGGHGGGGGGGG;;ECGCEGGGGGDGE<ECC?EGEEDGEEECE??EGGG
    @MISEQ:9:000000000-A1BL8:1:1101:16243:1320 1:N:0:
    NTCTTTTTTTTTTTTTTTTTTCTTTGAACGCCCATTCCACACCCGCTGATCGCCTCGATCCACTCATCCGGCTCAGGAATATTAACCTGATTCCCTTTCGATCGGGTGGGAGACAGCCTTATGACTAGGACGAAGCCTACGAGTCACCTCG
    +
    #5<===>>@@@EE@@@CCCCC-CEE>EE;+<CCCDEDEDD=DEE=ADDEDDDEE+@EEDDD9D*8;@@@:@@)8@@E@;@@7;?6<EE<EEE<EEEEE6-669;E;<;;;6<6<<(666;6;<;;E?=E==;;;6;E6;E696(6?EEE##
    @MISEQ:9:000000000-A1BL8:1:1101:17345:1320 1:N:0:
    NATGATTGATAGGTGCTGACCCCAGTACGAGAGGACCGGGTCGGACTAACCTCTGGTGTACCAGTTGTAACGCCAGTTGCAGTGCTGGGTATCTAAGTTGGGCAGGGATAAGCGCTGAAAGCATCTAAGCGCGAAGCTCTCCTAGAGATGA
    The sample data that run successfully looked like this:

    Code:
    @ILLUMINA-BE9C3F:29:FC:3:1:8146:1021 1:Y:0:GCTATA
    NNNGCNNNNNNNNNNNNNNNNNGNNGAATATTGGACAATGGGCGCANNNNNGATCCAGCAATGCCGCGTGTGTGAAGAAGGCCTGCGGGTTGTAAAGCACTTTAAGCA
    +
    ###+2#################)##)*0*)@@@@@@@@@@@@@@@###############################################################
    @ILLUMINA-BE9C3F:29:FC:3:1:2932:1024 1:Y:0:GCTATA
    NNNTCNNNNNNNNNNNNNNNNNGNGGAATATTGGACAATGGGCGCANNNNNGATCCAGCAATGCCGCGTGTGTGAAGAAGGCCTGCGGGTTGTAAAGCACTTTAAGCA
    +
    ###+0#################+#.+(*,&@@@@@@@@@@@@@@@)#####'((055225@@@@@@@@@@<<<<:@@@@@@@@@@#######################
    @ILLUMINA-BE9C3F:29:FC:3:1:8004:1030 1:Y:0:GCTATA
    CNNCCNNNNNNNNNNNNNNNNTGGGGAATATTGGACAATGGGCGCAAGCCTGATCCAGCAATGCCGCGTGTGTGAAGAAGGCCTGCGGGTTGTAAAGCACTTTAAGCA

    Any help or suggestions appreciated!

Latest Articles

Collapse

  • seqadmin
    Essential Discoveries and Tools in Epitranscriptomics
    by seqadmin




    The field of epigenetics has traditionally concentrated more on DNA and how changes like methylation and phosphorylation of histones impact gene expression and regulation. However, our increased understanding of RNA modifications and their importance in cellular processes has led to a rise in epitranscriptomics research. “Epitranscriptomics brings together the concepts of epigenetics and gene expression,” explained Adrien Leger, PhD, Principal Research Scientist...
    04-22-2024, 07:01 AM
  • seqadmin
    Current Approaches to Protein Sequencing
    by seqadmin


    Proteins are often described as the workhorses of the cell, and identifying their sequences is key to understanding their role in biological processes and disease. Currently, the most common technique used to determine protein sequences is mass spectrometry. While still a valuable tool, mass spectrometry faces several limitations and requires a highly experienced scientist familiar with the equipment to operate it. Additionally, other proteomic methods, like affinity assays, are constrained...
    04-04-2024, 04:25 PM

ad_right_rmr

Collapse

News

Collapse

Topics Statistics Last Post
Started by seqadmin, Yesterday, 11:49 AM
0 responses
15 views
0 likes
Last Post seqadmin  
Started by seqadmin, 04-24-2024, 08:47 AM
0 responses
16 views
0 likes
Last Post seqadmin  
Started by seqadmin, 04-11-2024, 12:08 PM
0 responses
61 views
0 likes
Last Post seqadmin  
Started by seqadmin, 04-10-2024, 10:19 PM
0 responses
60 views
0 likes
Last Post seqadmin  
Working...
X