  • questions on ssaha2

    Hi all,

    I have several questions on ssaha2 after reading through the ssaha2 manual.

    1. How can I report one best hit at random when there are many hits?
    What does bval=2 mean for the parameter -best?

    2. Are the parameters "-solexa" and "-rtype solexa" exactly the same?

    3. Can the output file be written in ".gz" format, to reduce storage?

    4. What does the parameter "-array" stand for? How can I work out how large it should be for my data?

    5. What does the parameter -skip influence?
    From the manual:
    -skip stepsiz Sets the number of nucleotide letters between the starting letter
    of successive words. I.e. With the option -skip 1 every word is hashed,
    with -skip 2 every second word, with -skip 3 very third etc.
    As I understand it, this parameter affects the reported segment: if the terminal n bp (n < skip) of the read are not exactly covered by a k-mer, those n bp will not be reported. A smaller skip will also slow the mapping down. Is there an evaluation of how the skip size affects the mapping probability?
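
    To make the reasoning above concrete, here is a toy sketch. It assumes words of length kmer are taken starting at offset 0 and then every skip bases along a sequence. It is not ssaha2's implementation, and the function names and the example values (36 bp, 12-mers) are made up for illustration only.

```python
def word_starts(seq_len, kmer, skip):
    """Start offsets of the words sampled every `skip` bases (toy model)."""
    return list(range(0, seq_len - kmer + 1, skip))


def uncovered_tail(seq_len, kmer, skip):
    """Number of terminal bases that no sampled word reaches."""
    starts = word_starts(seq_len, kmer, skip)
    if not starts:
        # Sequence shorter than one word: nothing is covered.
        return seq_len
    last_end = starts[-1] + kmer
    return seq_len - last_end


if __name__ == "__main__":
    # Hypothetical 36 bp sequence and 12-mers, several step sizes.
    for skip in (1, 5, 7, 12):
        starts = word_starts(36, 12, skip)
        tail = uncovered_tail(36, 12, skip)
        print(f"skip={skip}: {len(starts)} words, uncovered tail={tail} bp")
```

    In this model the uncovered tail is always shorter than skip, and the number of words grows as skip shrinks, which matches the speed trade-off described above.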


    Any reply or suggestions will be highly appreciated!
    Last edited by pengchy; 09-25-2011, 02:50 AM.
