Seqanswers Leaderboard Ad

Collapse

Announcement

Collapse
No announcement yet.
X
 
  • Filter
  • Time
  • Show
Clear All
new posts

  • how to combine pair end data into one file

    Hi guys,
    I am a new comer for this forum.
    I just run pair end sequencing on Illumina hiseq 2000 and got two files. I was wondering how can I combine each read and output into a single file.
    Thanks!

  • #2
    Search please!

    Comment


    • #3
      Linux: cat or sth...
      Samtools, merge bam...

      Comment


      • #4
        why would you want to do this?

        Comment


        • #5
          I guess if you want to use hmmSplicer, then you'll have to merge the paired end reads. The software doesn't support separately yet.

          You could just use a basic unix command: cat pe1.fq pe2.fq > pe_merged.fq

          Comment


          • #6
            Some assemblers also require paired-end reads in one file, like Velvet and SSAKE. You can check their scripts to combine the paired-end reads into a single file with the scripts shufflesequences.pl and makePairedOutput2UNEQUALfiles.pl.

            Comment


            • #7
              Hi forget1997,
              In an attempt to answer your question, I would imagine your aim is to generate a file to be used by velveth. Should this be the case, there are two sample perl scripts "shuffleSequences_fasta.pl" (FASTA) & "shuffleSequences_fastq.pl" (FASTQ) that come with the velveth pakage. Provide them with your files and create a third file of shuffled sequences. If this is not what you want, try and explain a little more. Do you want to join Read1 & Read2 into one read? Do you want to combine both files (cedance & lewewoo suggest something to do this). A respond to Simon's question would surely bring you help.

              Comment

              Latest Articles

              Collapse

              • seqadmin
                Advancing Precision Medicine for Rare Diseases in Children
                by seqadmin




                Many organizations study rare diseases, but few have a mission as impactful as Rady Children’s Institute for Genomic Medicine (RCIGM). “We are all about changing outcomes for children,” explained Dr. Stephen Kingsmore, President and CEO of the group. The institute’s initial goal was to provide rapid diagnoses for critically ill children and shorten their diagnostic odyssey, a term used to describe the long and arduous process it takes patients to obtain an accurate...
                12-16-2024, 07:57 AM
              • seqadmin
                Recent Advances in Sequencing Technologies
                by seqadmin



                Innovations in next-generation sequencing technologies and techniques are driving more precise and comprehensive exploration of complex biological systems. Current advancements include improved accessibility for long-read sequencing and significant progress in single-cell and 3D genomics. This article explores some of the most impactful developments in the field over the past year.

                Long-Read Sequencing
                Long-read sequencing has seen remarkable advancements,...
                12-02-2024, 01:49 PM

              ad_right_rmr

              Collapse

              News

              Collapse

              Topics Statistics Last Post
              Started by seqadmin, 12-17-2024, 10:28 AM
              0 responses
              32 views
              0 likes
              Last Post seqadmin  
              Started by seqadmin, 12-13-2024, 08:24 AM
              0 responses
              48 views
              0 likes
              Last Post seqadmin  
              Started by seqadmin, 12-12-2024, 07:41 AM
              0 responses
              34 views
              0 likes
              Last Post seqadmin  
              Started by seqadmin, 12-11-2024, 07:45 AM
              0 responses
              46 views
              0 likes
              Last Post seqadmin  
              Working...
              X