Seqanswers Leaderboard Ad

Collapse

Announcement

Collapse
No announcement yet.
X
 
  • Filter
  • Time
  • Show
Clear All
new posts

  • fasta to fastq

    Hello guys

    I' m working with TopHat and Bowtie to start my Phaseolus' RNAseq analysis, but to use this protocol I need to convert the cDNA sequences .fasta to .fastq.

    What software can I use?

  • #2
    Normally one would start with NGS data already in fastq format when doing Tophat analysis.

    If you want to convert your fasta sequences to fake fastq then you could use reformat.sh from BBMap like so

    Code:
    $ reformat.sh in=test.fa out=fake.fq qfake=35

    Comment


    • #3
      I would recommend not doing this reformatting approach, or at least proceeding with caution. With FASTA, you can be certain that the quality scores won't be used, but with a fake FASTQ you may be introducing artifacts. That is, unless you can be certain that the quality scores won't be used, but I'm not sure you can force this behavior with these tools.

      For a tool that just needs the sequences there is obviously no harm in going to FASTQ so a parser will work. For an aligner, I'd be cautious since they use quality scores.

      Comment

      Latest Articles

      Collapse

      • seqadmin
        Essential Discoveries and Tools in Epitranscriptomics
        by seqadmin




        The field of epigenetics has traditionally concentrated more on DNA and how changes like methylation and phosphorylation of histones impact gene expression and regulation. However, our increased understanding of RNA modifications and their importance in cellular processes has led to a rise in epitranscriptomics research. “Epitranscriptomics brings together the concepts of epigenetics and gene expression,” explained Adrien Leger, PhD, Principal Research Scientist...
        04-22-2024, 07:01 AM
      • seqadmin
        Current Approaches to Protein Sequencing
        by seqadmin


        Proteins are often described as the workhorses of the cell, and identifying their sequences is key to understanding their role in biological processes and disease. Currently, the most common technique used to determine protein sequences is mass spectrometry. While still a valuable tool, mass spectrometry faces several limitations and requires a highly experienced scientist familiar with the equipment to operate it. Additionally, other proteomic methods, like affinity assays, are constrained...
        04-04-2024, 04:25 PM

      ad_right_rmr

      Collapse

      News

      Collapse

      Topics Statistics Last Post
      Started by seqadmin, Today, 08:47 AM
      0 responses
      12 views
      0 likes
      Last Post seqadmin  
      Started by seqadmin, 04-11-2024, 12:08 PM
      0 responses
      60 views
      0 likes
      Last Post seqadmin  
      Started by seqadmin, 04-10-2024, 10:19 PM
      0 responses
      59 views
      0 likes
      Last Post seqadmin  
      Started by seqadmin, 04-10-2024, 09:21 AM
      0 responses
      54 views
      0 likes
      Last Post seqadmin  
      Working...
      X