
  • blast results

    I want to edit a BLAST output file so that wherever the query sequence has a gap, the reference sequence gets a gap at the same position. For example, if the BLAST output shows:

    query:     aa-gcaa
               || ||||
    reference: aatgcaa
    then I want to remove the t from the reference and put a gap in its place:

    query:     aa-gcaa
               || ||||
    reference: aa-gcaa

  • #2
    Why?

    Do you know any programming languages such as Perl, Python, Ruby, or Java? If so, have a look at BioPerl, Biopython, BioRuby, BioJava, etc. for libraries that work with BLAST files.



    • #3
      Yes, I have tried BioPerl, but I was not able to do it. I can only extract the FASTA sequences from these files.



      • #4
        Why not use regexps to, say, find the gap-containing piece of the query sequence (20 bp or so) in the reference and then remove whatever you want from it?
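One way the regexp idea could look in Python is sketched below. It is only an illustration: the function name `remove_extra_base` is made up for this example, and it assumes that the bases flanking the gap match the reference exactly, which will not hold when the two sequences differ at nearby sites.

```python
import re


def remove_extra_base(query_piece: str, reference: str) -> str:
    """Find a gapped query piece in the reference and delete the base
    that sits where the query has its gap (hypothetical helper)."""
    # Each gap in the query becomes a group capturing exactly one base;
    # every other character must match literally.
    pattern = "".join(
        "([acgtn])" if char == "-" else re.escape(char) for char in query_piece
    )

    def drop_captured(match: re.Match) -> str:
        text = match.group(0)
        offset = match.start()
        # Delete captured bases from right to left so indices stay valid.
        for group in range(match.re.groups, 0, -1):
            index = match.start(group) - offset
            text = text[:index] + text[index + 1:]
        return text

    # Only the first occurrence is edited; no match leaves the input as is.
    return re.sub(pattern, drop_captured, reference, count=1, flags=re.IGNORECASE)


print(remove_extra_base("aa-gcaa", "ccaatgcaagg"))
```

Note that this version deletes the extra base from the ungapped reference rather than writing a gap character into the alignment row.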



        • #5
          Do you want to copy your query over your reference?

          If so, why not just pull your query sequences and use those?
          They already have the gaps you seem to be looking for.

          But it does sound odd; maybe a better explanation of why would help.



          • #6
            Just taking the query won't work, because it can have different nucleotides (but not gaps) at some sites. After some thinking I see that you don't need any regexps; all you need is something like

            {
            reference[i] = '-' if query[i] == '-';
            }

            for each position in the aligned sequences. Hope you can grab some sequences, dude.
            But yes, I'd like to know why he would want to do something like this.
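The per-position rule above can be written out as a short Python sketch. The function name `mask_reference` is hypothetical, and the code assumes the two strings are the aligned query and subject rows of a single BLAST alignment, so that they have equal length.

```python
def mask_reference(query: str, reference: str, gap: str = "-") -> str:
    """Return the reference row with a gap wherever the query row has one."""
    if len(query) != len(reference):
        raise ValueError("aligned rows must have the same length")
    # Walk both rows column by column; keep the reference base unless
    # the query has a gap in that column.
    return "".join(gap if q == gap else r for q, r in zip(query, reference))


print(mask_reference("aa-gcaa", "aatgcaa"))
```

Mismatched bases are left untouched; only columns where the query has a gap are changed.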



            • #7
              Thanks for the reply. I am getting these results because of 454 sequencing errors. The query sequence is a gene from Rhizobium bacteria, and the subject is the sequence from the matching contigs. I want to remove these sequencing errors from the contigs so that I can build a phylogeny from the contigs.



              • #8
                But then you lose actual indels that could exist between these two species, don't you? I think you should first make sure that this particular nucleotide is indeed an error. Maybe its quality is much lower than that of the nearby nucleotides, or it sits in a long repeat like aaaaaaaaa, or something else.
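The repeat check suggested above could be sketched in Python roughly as follows. The function names and the `min_run` threshold of 3 are assumptions made for this illustration, not values from the thread, and the sketch assumes the reference row itself contains no gaps near the positions being checked. Quality scores are not looked at here.

```python
def homopolymer_run_length(seq: str, pos: int) -> int:
    """Length of the run of identical bases in seq that includes index pos."""
    base = seq[pos]
    start = pos
    while start > 0 and seq[start - 1] == base:
        start -= 1
    end = pos
    while end + 1 < len(seq) and seq[end + 1] == base:
        end += 1
    return end - start + 1


def classify_extra_bases(query: str, reference: str, min_run: int = 3):
    """For each column where the query has a gap, report the column index,
    the reference base, its run length, and whether the run is long enough
    (min_run is an arbitrary illustrative threshold) to look like a repeat."""
    report = []
    for i, (q, r) in enumerate(zip(query, reference)):
        if q == "-" and r != "-":
            run = homopolymer_run_length(reference, i)
            report.append((i, r, run, run >= min_run))
    return report


for row in classify_extra_bases("gaaa-cg", "gaaaacg"):
    print(row)
```

An extra base inside a long run of the same nucleotide is a more plausible sequencing error than an isolated extra base, which is why the two cases are reported separately.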



                • #9
                  The contigs are from Rhizobium strains, and the BLAST report shows that at the positions where I am getting this type of result, the neighboring nucleotides are exactly the same as in the query. So I am pretty sure that these are sequencing errors.
