  • How to use SOAP?

    I can't understand how to use it!

    I don't know:
    is the file given to soap -a the database FASTA or the query FASTA?

    Who can help me?

    I'm only an undergraduate student!

  • #2
    -a <str> is your query file;
    -d <str> is the reference sequences file, i.e. the database file.



    • #3
      But I took a sequence from my database to use as the query file, then ran
      soap -a query file -d mydatabase file -o ......
      No result!!!!!

      So I swapped the database and the query file, like this:
      soap -a mydatabase file -d query file -o ......
      6 sequences in the result!

      Why?
      I can't understand!



      • #4
        Your query should be one or more short reads. The database should contain one or more very large sequences, e.g. contigs or chromosomes.

        So:

        soap -a <short-reads> -d <contigs/chromosome>

        For -a, sequences in FASTA or FASTQ format are accepted. The database should be in FASTA only.
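A quick way to check which file belongs with -a and which with -d is to look at the sequence lengths in each FASTA file. The sketch below is only an illustration: the function names and the 100 bp cutoff for "short reads" are assumptions made here, not anything defined by SOAP.

```python
def fasta_lengths(lines):
    """Yield (name, length) for each record in an iterable of FASTA lines."""
    name, length = None, 0
    for line in lines:
        line = line.strip()
        if not line:
            continue
        if line.startswith(">"):
            if name is not None:
                yield name, length
            parts = line[1:].split()
            name, length = (parts[0] if parts else ""), 0
        elif name is not None:
            length += len(line)
    if name is not None:
        yield name, length


def looks_like_reads(lengths, max_read_len=100):
    """Return True if every sequence is short enough to be a read.

    max_read_len is an arbitrary cutoff chosen for this sketch.
    """
    lengths = list(lengths)
    return bool(lengths) and all(n <= max_read_len for _, n in lengths)


if __name__ == "__main__":
    import sys

    # Usage: python check_fasta.py file1.fa file2.fa
    for path in sys.argv[1:]:
        with open(path) as handle:
            kind = "short reads (-a)" if looks_like_reads(fasta_lengths(handle)) else "reference (-d)"
        print(path, "looks like", kind)
```

If both files come out as "reference", the query probably contains long sequences rather than short reads, which matches the situation described in #3.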



        • #5
          Hello.
          We usually use ELAND to align Illumina data, but for some projects the read length ELAND supports is too short (32 bp maximum). I need to align 40 bp fragments, so I tried SOAP. The problem is that I couldn't get SOAP to take the quality data (*prb files) into account. It only works with the default '40'. I didn't find any information in the official SOAP documentation or here, apart from a mention of FASTQ. So my questions are:
          1. Is it possible to feed a set of *.prb files directly to the SOAP alignment process?
          2. If not, how do I convert *.prb and *.seq files to FASTQ? Are any tools available?

          Thanks in advance!
          Slava, MPIMG Berlin.
          Last edited by amstisla; 08-29-2008, 05:16 AM.



          • #6
            2. If not, how to convert *.prb and *.seq files to fastq? Are some tools available?
            Yup, that's what you need to do. Maq may come with a script that'll do the job. Otherwise, it's pretty straightforward to write one yourself.

            I don't think that the alignment part actually pays attention to the quality scores, though. I think they only come into play in the output columns where the SNPs are listed: the alternate letter is listed with its quality score.

            So if you aren't looking for SNPs, it might not matter much.
            Last edited by swbarnes2; 08-29-2008, 08:52 AM.
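For anyone who wants to write the conversion themselves, here is a minimal sketch. It rests on several assumptions about the old Illumina pipeline files that should be checked against your own data: each *_seq.txt line is taken to hold lane, tile, x, y and the read, with '.' for an uncalled base; each *_prb.txt line is taken to hold one tab-separated group per cycle with four scores (A, C, G, T); the quality of the called base is taken as the largest of the four; and the scores are written Solexa-style with an ASCII offset of 64. The function names are made up for this example, and whether SOAP expects this offset is not something the thread confirms.

```python
def prb_line_to_quals(prb_line):
    """Return one quality per cycle: the maximum of the four per-base scores."""
    quals = []
    for group in prb_line.strip().split("\t"):
        scores = [int(s) for s in group.split()]
        if len(scores) != 4:
            raise ValueError("expected 4 scores per cycle, got %r" % group)
        quals.append(max(scores))
    return quals


def to_fastq_record(seq_line, prb_line, offset=64):
    """Combine one *_seq.txt line and the matching *_prb.txt line into FASTQ text."""
    lane, tile, x, y, bases = seq_line.split()[:5]
    bases = bases.replace(".", "N")  # '.' is assumed to mark an uncalled base
    quals = prb_line_to_quals(prb_line)
    if len(quals) != len(bases):
        raise ValueError("read length and number of quality groups differ")
    qual_str = "".join(chr(q + offset) for q in quals)
    name = "%s_%s_%s_%s" % (lane, tile, x, y)
    return "@%s\n%s\n+\n%s\n" % (name, bases, qual_str)


def convert(seq_path, prb_path, out_path):
    """Walk both files in parallel; they are assumed to list reads in the same order."""
    with open(seq_path) as seqs, open(prb_path) as prbs, open(out_path, "w") as out:
        for seq_line, prb_line in zip(seqs, prbs):
            out.write(to_fastq_record(seq_line, prb_line))
```

Calling convert("s_1_0001_seq.txt", "s_1_0001_prb.txt", "s_1_0001.fastq") would process one tile; the file names here are only examples.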



            • #7
              Maybe you can try ZOOM. It handles *_seq.txt and *_prb.txt automatically.




              • #8
                Has anybody ever used SOAP in color space?

                - First alignment: a bank against itself
                /opt/soap_1.11/soap -a test.fa -d test.reference.fa -s 8 -o soap.out
                -> OK. All sequences matched.

                - Second alignment: the same bank against itself, in color space
                /opt/soap_1.11/soap -a test.csfasta -d test.reference.csfasta -s 8 -o soap.out
                -> 0 alignments!

                Any ideas?
                Thanks.



                • #9
                  Originally posted by nservant
                  Does anybody ever use SOAP in color space?
                  How did you convert your data to color space? I usually use the solid2fastq.pl script provided with MAQ to generate pseudo-base reads. Besides, you need a special reference. It is meaningless to let SOAP align to both strands of a typical reference (because of color space!). Just produce the reference sequence in reverse order, then concatenate the "+" and "-" versions into one FASTA file. When starting SOAP, pass the -n 1 option to prevent the program from looking for matches on the "complementary" strand.
                  Cheers, Slava
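The reference preparation described in #9 can be sketched as follows. This is only one reading of that advice: every record is written twice, once as-is and once reversed (not complemented). The "_plus" and "_minus" suffixes and the function names are invented for this example.

```python
def read_fasta(lines):
    """Yield (name, sequence) for each record in an iterable of FASTA lines."""
    name, chunks = None, []
    for line in lines:
        line = line.strip()
        if not line:
            continue
        if line.startswith(">"):
            if name is not None:
                yield name, "".join(chunks)
            name, chunks = line[1:], []
        elif name is not None:
            chunks.append(line)
    if name is not None:
        yield name, "".join(chunks)


def with_reversed_copies(records):
    """For each record yield the original and a reversed (not complemented) copy."""
    for name, seq in records:
        yield name + "_plus", seq
        yield name + "_minus", seq[::-1]


def format_fasta(records, width=60):
    """Render records as FASTA text, wrapping sequence lines at `width` characters."""
    out = []
    for name, seq in records:
        out.append(">" + name)
        for i in range(0, len(seq), width):
            out.append(seq[i:i + width])
    return "\n".join(out) + "\n"


if __name__ == "__main__":
    import sys

    # Usage: python add_reversed.py reference.fa > reference.both.fa
    with open(sys.argv[1]) as handle:
        sys.stdout.write(format_fasta(with_reversed_copies(read_fasta(handle))))
```

The resulting file holds both orientations explicitly, so the aligner only needs to search one strand.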



                  • #10
                    I made a mistake! I don't remember why, but I was convinced SOAP could work in color space, and it apparently cannot.

                    To answer your question, Slava: I use the convert_to_dibase_encoding.pl program from ABI to convert my sequences from base space to color space.
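For readers unfamiliar with di-base (color space) encoding, the sketch below shows the standard rule: with A=0, C=1, G=2, T=3, each color is the XOR of the codes of two adjacent bases. It is not a reimplementation of convert_to_dibase_encoding.pl, whose exact output format is not described in this thread; the function names are made up, and ambiguous bases such as N are simply not handled.

```python
BASE_CODE = {"A": 0, "C": 1, "G": 2, "T": 3}
COMPLEMENT = {"A": "T", "C": "G", "G": "C", "T": "A"}


def to_colors(bases):
    """Encode a base sequence as a string of colors, one per adjacent base pair."""
    bases = bases.upper()
    return "".join(str(BASE_CODE[a] ^ BASE_CODE[b]) for a, b in zip(bases, bases[1:]))


def reverse_complement(bases):
    """Return the reverse complement of a base sequence (A, C, G, T only)."""
    return "".join(COMPLEMENT[b] for b in reversed(bases.upper()))


if __name__ == "__main__":
    seq = "ATGGC"
    print(to_colors(seq))
    print(to_colors(reverse_complement(seq)))
```

Because complementing both bases of a pair leaves their color unchanged, the colors of the reverse complement are the forward colors read backwards. That is why a reference reversed in color space, as suggested in #9, stands in for the opposite strand.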
