Seqanswers Leaderboard Ad

Collapse

Announcement

Collapse
No announcement yet.
X
 
  • Filter
  • Time
  • Show
Clear All
new posts

  • tblastx fmt1 Output Interpretation

    Hello,
    While locally running "blastx" with output option "-outfmt 1", I have a question.

    The below is an example of my result. As seen, throughout the outputs, all pairs of anchored proteins are not matched. How can I interpret these non-identical alignment?

    Query_3 477 GSVEYVHMLNGTMCATTRTVCAL 409
    medicagoRALF3 38 .GM.WI.QTKTAT.EGSIAD.M. 60

    Query_4 34 PYHLLLQKFYKT 69
    medicagoRALF11 104 ..NRGCS.Y.RC 115
    Thank you in advance.
    Attached Files
    Last edited by syintel87; 03-06-2015, 08:56 AM.

  • #2
    Are you searching with very short query sequences (like illumina reads)?

    Comment


    • #3
      My query sequences are contigs that are de novo assembled, some of which are short (e.g. 500) whereas others are very long (e.g. 100,000).

      But database of blast is composed of several peptide proteins whose length is short (e.g. 44).

      Comment


      • #4
        Have you tried to do the search the other way around (using your peptides as query)?

        Try using BLAT too. Especially if you know that you expect the peptides to be there in your data.

        Comment


        • #5
          In addition to "-outfmt 1", I tried other options as well.
          There seem to be different ways of alignment.

          0 = pairwise
          1 = query-anchored showing identities
          2 = query-anchored no identities
          3 = flat query-anchored, show identities
          4 = flat query-anchored, no identities

          [-outfmt 0]
          Query_2 229 KMSFRYLFFAIKKYALSKF 173
          thalianaRALF4 5 ...LTS...VSIVIV..L. 23

          [-outfmt 1]
          Query_2 229 KMSFRYLFFAIKKYALSKF 173
          thalianaRALF4 5 ...LTS...VSIVIV..L. 23

          [-outfmt 2]
          Query_2 229 KMSFRYLFFAIKKYALSKF 173
          thalianaRALF4 5 KMSLTSLFFVSIVIVLSLF 23

          [-outfmt 3]
          Query_2 229 KMSFRYLFFAIKKYALSKF 173
          thalianaRALF4 5 ...LTS...VSIVIV..L. 23

          [-outfmt 4]
          Query_2 229 KMSFRYLFFAIKKYALSKF 173
          thalianaRALF4 5 KMSLTSLFFVSIVIVLSLF 23
          The results of [-outfmt 2] and [-outfmt 4] may be the results that I look forward to getting. However, I still cannot understand the principles and differences that distinguish output format 0 to 4.

          Comment


          • #6
            Ah, now I see! In formats "0, 1, and 3", dots stand for identities between query and target. And differences are shown with protein letters.

            I should have posted after more consideration.
            Thanks GenoMax! I am going to try "BLAT", too.

            Comment

            Latest Articles

            Collapse

            • seqadmin
              Current Approaches to Protein Sequencing
              by seqadmin


              Proteins are often described as the workhorses of the cell, and identifying their sequences is key to understanding their role in biological processes and disease. Currently, the most common technique used to determine protein sequences is mass spectrometry. While still a valuable tool, mass spectrometry faces several limitations and requires a highly experienced scientist familiar with the equipment to operate it. Additionally, other proteomic methods, like affinity assays, are constrained...
              04-04-2024, 04:25 PM
            • seqadmin
              Strategies for Sequencing Challenging Samples
              by seqadmin


              Despite advancements in sequencing platforms and related sample preparation technologies, certain sample types continue to present significant challenges that can compromise sequencing results. Pedro Echave, Senior Manager of the Global Business Segment at Revvity, explained that the success of a sequencing experiment ultimately depends on the amount and integrity of the nucleic acid template (RNA or DNA) obtained from a sample. “The better the quality of the nucleic acid isolated...
              03-22-2024, 06:39 AM

            ad_right_rmr

            Collapse

            News

            Collapse

            Topics Statistics Last Post
            Started by seqadmin, 04-11-2024, 12:08 PM
            0 responses
            24 views
            0 likes
            Last Post seqadmin  
            Started by seqadmin, 04-10-2024, 10:19 PM
            0 responses
            25 views
            0 likes
            Last Post seqadmin  
            Started by seqadmin, 04-10-2024, 09:21 AM
            0 responses
            21 views
            0 likes
            Last Post seqadmin  
            Started by seqadmin, 04-04-2024, 09:00 AM
            0 responses
            52 views
            0 likes
            Last Post seqadmin  
            Working...
            X