Seqanswers Leaderboard Ad

Collapse

Announcement

Collapse
No announcement yet.
X
 
  • Filter
  • Time
  • Show
Clear All
new posts

  • BLASTing or Convert FASTQ: any user-interface programmes out there? Need help!

    BlastStation-Free, acts as a standalone BLAST program which has a user interface rather than having to use command lines: problem is, is that it only accepts FASTA, and not FASTQ.

    We could either:

    1. Find a new standalone user-interface BLAST program that accepts FASTQ.

    or

    2. Find a FASTQ to FASTA converter with a user-interface.

    Does anybody have suggestions of either of the programmes we need?

    Thankyou for reading.

  • #2
    Galaxy (http://main.g2.bx.psu.edu/) is web based and has a FASTQ to FASTA converter. You might want to try that.

    Sam

    Comment


    • #3
      Hey Sam,

      Thanks very much, you're right, but actually there is simply too much data to upload; around 80GB.

      Comment


      • #4
        Possibly try this program --> http://sourceforge.net/projects/ngssequencealig/files/

        Comment


        • #5
          EMBOSS seqret can do FASTQ to FASTA (command line interface).

          Comment


          • #6
            use awk !

            awk 'BEGIN{P=1}{if(P==1||P==2){gsub(/^[@]/,">");print}; if(P==4)P=0; P++}' input.fastq > output.fasta


            very easy

            Comment


            • #7
              Same idea using Perl

              perl -ne '$c++; if ($c==1){$_=~/\@(\S+)/; print ">$1\n"} if($c==2){print "$_"} if ($c==4){$c=0}' input.fastq > output.fasta

              Or if you find yourself wanting to manipulate fastq files in additional ways, the fastq utilities of Galaxy that sdarko mentioned above are available in command line form within the FASTX-Toolkit
              Last edited by malachig; 09-17-2010, 08:20 PM.

              Comment

              Latest Articles

              Collapse

              • seqadmin
                Current Approaches to Protein Sequencing
                by seqadmin


                Proteins are often described as the workhorses of the cell, and identifying their sequences is key to understanding their role in biological processes and disease. Currently, the most common technique used to determine protein sequences is mass spectrometry. While still a valuable tool, mass spectrometry faces several limitations and requires a highly experienced scientist familiar with the equipment to operate it. Additionally, other proteomic methods, like affinity assays, are constrained...
                04-04-2024, 04:25 PM
              • seqadmin
                Strategies for Sequencing Challenging Samples
                by seqadmin


                Despite advancements in sequencing platforms and related sample preparation technologies, certain sample types continue to present significant challenges that can compromise sequencing results. Pedro Echave, Senior Manager of the Global Business Segment at Revvity, explained that the success of a sequencing experiment ultimately depends on the amount and integrity of the nucleic acid template (RNA or DNA) obtained from a sample. “The better the quality of the nucleic acid isolated...
                03-22-2024, 06:39 AM

              ad_right_rmr

              Collapse

              News

              Collapse

              Topics Statistics Last Post
              Started by seqadmin, 04-11-2024, 12:08 PM
              0 responses
              23 views
              0 likes
              Last Post seqadmin  
              Started by seqadmin, 04-10-2024, 10:19 PM
              0 responses
              24 views
              0 likes
              Last Post seqadmin  
              Started by seqadmin, 04-10-2024, 09:21 AM
              0 responses
              21 views
              0 likes
              Last Post seqadmin  
              Started by seqadmin, 04-04-2024, 09:00 AM
              0 responses
              52 views
              0 likes
              Last Post seqadmin  
              Working...
              X