  • Blast 2 multifasta files one against the other

    Hello everyone,
    I have two files. The first is a multifasta where each sequence is a contig of interest extracted from an assembled genome.
    The second is also a multifasta where each sequence is a contig of an assembled genome; this file contains the whole assembled genome.
    What I want to do is blast the first file against the second to identify which contigs in the second file are of interest, so I can extract them and work only on them rather than on the whole genome file.
    I installed Blast on my computer so I can run it locally and not on the website.
    My problem is:
    - I tried to perform a blast2seq, but it doesn't work: it only blasts the first sequence of file 1 against the first sequence of file 2, and then it stops.
    - So I tried to turn my whole genome fasta file into a database (with formatdb -i file.fasta -p F) and ran a blastn, but that didn't work either.

    So my question is: how can you blast two multifasta files one against the other?

    Thank you very much for any help you can provide me.
    Cordially,
    K.
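
The extraction step described in this post (find which genome contigs are hit, then pull only those out) can be sketched in Python. This is a minimal sketch, not a definitive implementation. It assumes the BLAST hits are in tabular format (`-outfmt 6` in BLAST+, where the second column is the subject ID) and that a contig's ID is the header text up to the first whitespace. The function names and the sample data are hypothetical.

```python
def subject_ids_from_tabular(lines):
    """Collect subject IDs (second column) from BLAST tabular lines."""
    ids = set()
    for line in lines:
        line = line.strip()
        if not line or line.startswith("#"):
            continue  # skip blank lines and comment lines
        fields = line.split("\t")
        if len(fields) >= 2:
            ids.add(fields[1])
    return ids


def filter_fasta(fasta_lines, wanted_ids):
    """Yield the lines of every FASTA record whose ID is in wanted_ids.

    The ID is the header text after '>' up to the first whitespace.
    """
    keep = False
    for line in fasta_lines:
        if line.startswith(">"):
            parts = line[1:].split()
            keep = bool(parts) and parts[0] in wanted_ids
        if keep:
            yield line


if __name__ == "__main__":
    # Hypothetical in-memory data standing in for a hits file and a genome file.
    hits = ["query1\tscaf2\t99.0\t500", "query2\tscaf5\t97.5\t320"]
    genome = [">scaf1\n", "ACGT\n", ">scaf2\n", "GGCC\n", ">scaf5\n", "AAAA\n"]
    wanted = subject_ids_from_tabular(hits)
    print("".join(filter_fasta(genome, wanted)), end="")
```

With real files, the same functions can be fed open file handles instead of lists, since both only iterate over lines.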

  • #2
    Originally posted by Kawaccino
    So my question is: how can you blast two multifasta files one against the other?
    You could define one file as the subject and the other as the query. However, it'll be much faster if you create a db from the subject file (i.e. you can then run a multithreaded blast). I have no idea why your attempt failed. Perhaps you could tell us exactly what you did and what the actual error message was. Also, consider reading the blast manual?
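
The two options above can be sketched as BLAST+ command lines, built here in Python so they can be inspected before running. This is a sketch under assumptions: it uses the newer BLAST+ programs (`makeblastdb`, `blastn`) rather than the legacy `formatdb` from the original post, and the file names, database name, output name, and thread count are all hypothetical placeholders.

```python
import shlex


def build_db_search(query_fasta, genome_fasta, db_name="genome_db", threads=4):
    """Return (makeblastdb command, blastn command) as argument lists."""
    make_db = ["makeblastdb", "-in", genome_fasta, "-dbtype", "nucl",
               "-out", db_name]
    search = ["blastn", "-query", query_fasta, "-db", db_name,
              "-outfmt", "6",               # tabular output, one hit per line
              "-num_threads", str(threads),
              "-out", "hits.tsv"]           # hypothetical output file name
    return make_db, search


def build_subject_search(query_fasta, subject_fasta):
    """Return a blastn command that compares two FASTA files directly."""
    return ["blastn", "-query", query_fasta, "-subject", subject_fasta,
            "-outfmt", "6"]


def as_shell(cmd):
    """Render an argument list as a copy-pasteable shell line."""
    return " ".join(shlex.quote(arg) for arg in cmd)


if __name__ == "__main__":
    make_db, search = build_db_search("contigs_of_interest.fasta", "genome.fasta")
    print(as_shell(make_db))
    print(as_shell(search))
    print(as_shell(build_subject_search("contigs_of_interest.fasta", "genome.fasta")))
```

To actually run a command, pass the list to `subprocess.run(cmd, check=True)`, which requires the BLAST+ programs to be installed and on the `PATH`.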


    p.s. A question to all. Why is legacy blast usage still so common?
    savetherhino.org



    • #3
      Blat would likely work well for this too. No need to mess with creating databases etc. You can produce output in a blast-like format if you want to visually check the alignments.

      Last edited by GenoMax; 06-06-2013, 09:43 AM.



      • #4
        Originally posted by rhinoceros

        p.s. A question to all. Why is legacy blast usage still so common?
        Perhaps because some of us are old enough to have started practicing "bioinformatics" (it was not called that back then) with blast.

        On a serious note:

        There is a good bit of history/theory available for blast.
        Statistics are well documented and understood.
        Easy web based interface to do searches.

        I am sure others will add to this list.



        • #5
          Hey!
          Thank you all for the help, I think I will look into Blat.
          Concerning Blast, I read the manual but found nothing that helped me with my problem. By the way, I don't get any error message; the job finishes, and when I open the output file, there is nothing in it.
          I aligned the two files with Mauve, and there are alignments between them, so I should obtain something with Blast...
          Thanks again,
          K.



          • #6
            Originally posted by Kawaccino
            Hey!
            By the way I don't get any Error message; the job finishes and when I open the output file, there is nothing in it.
            How long does the job run? Try to search just one sequence against your db to see if it works at all. Are you sure your files are in proper fasta format - only headers and sequences and nothing else?
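
A quick format check along these lines can be sketched in Python. It is a rough sanity check under stated assumptions, not a full FASTA validator: the function name is hypothetical, the allowed alphabet is the nucleotide IUPAC codes plus `-` and `*`, and blank lines are tolerated.

```python
def check_fasta(lines, alphabet="ACGTUNRYKMSWBDHV"):
    """Return a list of (line_number, message) problems found in FASTA lines.

    An empty list means nothing suspicious was found by these checks.
    """
    allowed = set(alphabet + alphabet.lower() + "-*")
    problems = []
    seen_header = False
    header_has_sequence = True
    header_line = 0
    for number, raw in enumerate(lines, start=1):
        line = raw.rstrip("\r\n")
        if not line.strip():
            continue  # blank lines are tolerated in this sketch
        if line.startswith(">"):
            if seen_header and not header_has_sequence:
                problems.append((header_line, "header with no sequence"))
            if len(line.strip()) == 1:
                problems.append((number, "empty header"))
            seen_header = True
            header_has_sequence = False
            header_line = number
        else:
            if not seen_header:
                problems.append((number, "sequence before first header"))
            bad = sorted(set(line) - allowed)
            if bad:
                problems.append((number, "unexpected characters: " + "".join(bad)))
            header_has_sequence = True
    if seen_header and not header_has_sequence:
        problems.append((header_line, "header with no sequence"))
    if not seen_header:
        problems.append((0, "no FASTA header found"))
    return problems


if __name__ == "__main__":
    # Hypothetical sample; with a real file, pass an open file handle instead.
    sample = [">contig1\n", "ACGTNNacgt\n", ">contig2\n", "ACXT\n"]
    for line_number, message in check_fasta(sample):
        print(line_number, message)
```

Problems such as stray text before the first header or leftover non-sequence characters are the kind of thing this would flag before a search is attempted.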

            p.s. If you're having problems setting up blast properly, I wouldn't recommend blat for the time being.
            Last edited by rhinoceros; 06-07-2013, 01:24 AM.
            savetherhino.org
