Seqanswers Leaderboard Ad

Collapse

Announcement

Collapse
No announcement yet.
X
 
  • Filter
  • Time
  • Show
Clear All
new posts

  • Extracting unique BLAST hits from a multi-hit region

    Hi all,

    I have a blast output file (from BLAST command line tool) and I want to extract from it the unique hits. I have several genomic regions of variable size and I explore them using BLAST against a home made database (there is redundancy in it). What I want to get out is one hit per fragment of the examined region. Any suggestions? I am using perl, so any answers using BioPerl would be highly appreciated.

    Thanks a lot in advance.
    Thanos

  • #2
    If you used tabular output, it would be rather trivial to parse it with cut and sort and then retrieve the sequences from your db with blastdbcmd..
    Last edited by rhinoceros; 07-02-2013, 09:23 AM.
    savetherhino.org

    Comment


    • #3
      Hi rhinoceros and thanks a lot for your immediate response. By tabular output do you mean option 6 in the -outfmt flag?

      I will try what you suggested, seems indeed fairly simple.

      Thanks once again,
      Thanos

      Comment


      • #4
        Yeah, -outfmt 6

        Also, have a look here. It's probably rather relevant (second post)
        savetherhino.org

        Comment


        • #5
          Thanks a lot once again for your help. I really appreciate that.

          Comment


          • #6
            Do you probably know how I can add subject description in the tabular output? I need them for the next steps in my pipeline. :-)

            Thanks in advance.

            Comment


            • #7
              Have a look at the manual

              syntax would be e.g. -outfmt '6 std stitle'

              However, since your db is not provided by ncbi, this will not work (or so I'd reason). Anyway, if you created your db with -parse_seqids and the descriptions are stored there in the headers, you can just take the hits from your blast and retrieve the sequences and descriptions with blastdbcmd.

              cut -f 1 yourTabOutput | sort -u > id_list (or maybe it's cut -f 2..)
              blastdbcmd -entry_batch id_list ..etc
              Last edited by rhinoceros; 07-03-2013, 03:42 AM.
              savetherhino.org

              Comment


              • #8
                Thank you once again for your response rhinoceros. I will go through it right now and check the option you suggested.

                Comment

                Latest Articles

                Collapse

                • seqadmin
                  Recent Innovations in Spatial Biology
                  by seqadmin


                  Spatial biology is an exciting field that encompasses a wide range of techniques and technologies aimed at mapping the organization and interactions of various biomolecules in their native environments. As this area of research progresses, new tools and methodologies are being introduced, accompanied by efforts to establish benchmarking standards and drive technological innovation.

                  3D Genomics
                  While spatial biology often involves studying proteins and RNAs in their...
                  01-01-2025, 07:30 PM
                • seqadmin
                  Advancing Precision Medicine for Rare Diseases in Children
                  by seqadmin




                  Many organizations study rare diseases, but few have a mission as impactful as Rady Children’s Institute for Genomic Medicine (RCIGM). “We are all about changing outcomes for children,” explained Dr. Stephen Kingsmore, President and CEO of the group. The institute’s initial goal was to provide rapid diagnoses for critically ill children and shorten their diagnostic odyssey, a term used to describe the long and arduous process it takes patients to obtain an accurate...
                  12-16-2024, 07:57 AM

                ad_right_rmr

                Collapse

                News

                Collapse

                Topics Statistics Last Post
                Started by seqadmin, Yesterday, 11:18 AM
                0 responses
                16 views
                0 likes
                Last Post seqadmin  
                Started by seqadmin, 12-30-2024, 01:35 PM
                0 responses
                33 views
                0 likes
                Last Post seqadmin  
                Started by seqadmin, 12-17-2024, 10:28 AM
                0 responses
                41 views
                0 likes
                Last Post seqadmin  
                Started by seqadmin, 12-13-2024, 08:24 AM
                0 responses
                57 views
                0 likes
                Last Post seqadmin  
                Working...
                X