Seqanswers Leaderboard Ad

Collapse

Announcement

Collapse
No announcement yet.
X
 
  • Filter
  • Time
  • Show
Clear All
new posts

  • In silico PCR

    Does anyone know any good in silico PCR algorithm, preferably for offline use. Input would be a multifasta file and forward and reverse primer sequences. Output would preferably be substrings with intact headers.
    savetherhino.org

  • #2
    How about the tool by Jim Kent that's used by UCSC?

    In-Silico PCR was written by Jim Kent. Interactive use on this web server is free to all. Sources and executables to run batch jobs on your own server are available free for academic, personal, and non-profit purposes. Non-exclusive commercial licenses are also available. Contact Jim for details.

    Comment


    • #3
      Yeah, this was one of the algorithms a little bird called Google suggested. I just didn't feel like contacting anyone. I suppose this might end up being the best option though..
      savetherhino.org

      Comment


      • #4
        There's an hgPcr in the lastest copy of jksrc.zip. I've yet to to look into the code yet to determine if it's the same as the hgPcr in silico PCR on the website.

        Edit: Or you can go to

        Comment


        • #5
          Originally posted by winsettz View Post
          There's an hgPcr in the lastest copy of jksrc.zip. I've yet to to look into the code yet to determine if it's the same as the hgPcr in silico PCR on the website.

          Edit: Or you can go to

          http://genome-source.cse.ucsc.edu/gi...91e433;hb=HEAD
          Yeah, thanks. I was just going through the zip file. Problem is though that it has a bunch of dependencies and among other things requires a running mysql server. This is all a bit extensive for wanting to do just one relatively simple thing..
          savetherhino.org

          Comment


          • #6
            mothur has a function called pcr.seqs that supposedly does what you're looking for. http://www.mothur.org/wiki/Pcr.seqs

            Comment


            • #7
              Originally posted by rhinoceros View Post
              Yeah, thanks. I was just going through the zip file. Problem is though that it has a bunch of dependencies and among other things requires a running mysql server. This is all a bit extensive for wanting to do just one relatively simple thing..
              That was my experience as well. I ended up building a scraper for the UCSC PCR page. It took a few seconds per lookup, so doing hundreds of searches took a while, but not an unreasonable amount of time. Unfortunately, if you try to perform multiple queries from the same IP concurrently, the IP is automatically blocked for a day, so there is no way to parallelize that.

              Comment


              • #8
                Originally posted by mcnelson.phd View Post
                mothur has a function called pcr.seqs that supposedly does what you're looking for. http://www.mothur.org/wiki/Pcr.seqs
                Yeah, that ought to get the job done. Thanks
                savetherhino.org

                Comment


                • #9
                  You can also try ipcress



                  Cheers,

                  Scott.

                  Comment


                  • #10
                    Originally posted by id0 View Post
                    That was my experience as well. I ended up building a scraper for the UCSC PCR page. It took a few seconds per lookup, so doing hundreds of searches took a while, but not an unreasonable amount of time. Unfortunately, if you try to perform multiple queries from the same IP concurrently, the IP is automatically blocked for a day, so there is no way to parallelize that.
                    I contacted the author and turns out that source and binaries for many platforms are available via http://users.soe.ucsc.edu/~kent/

                    The provided macIntel binaries (faToTwoBit, gfServer, gfPcr) work almost out of box on recent Mac OS X. I only needed to symlink /opt/X11/lib/libpng14.14.dylib to /usr/local/lib
                    savetherhino.org

                    Comment


                    • #11
                      So, I ended up using pcr_seq from biopieces. The below code yields the results I was expecting:

                      Code:
                      read_fasta -i input.fasta | pcr_seq -f CCTACGGGRSGCAGCAG[1,0,0] -R ACCGCGGCKGCTGGC[1,0,0] | grab -e 'REC_TYPE eq PCR' | write_fasta -x > PCR_products.fasta
                      -f and -R of course depend on what primers you want to use. There are also -F and -r parameters for other primer orientations. Within closed brackets, the number of allowed mismatches/deletions/insertions are specified. I found this to be the least tedious way to get the wanted output, in this case from 50k full bacterial 16S sequences I got 47.6k 'correct' PCR products. With gfPcr there's a huge number of options which makes it rather difficult to obtain desired output.

                      On related note, I hadn't used biopieces before. I have to say, I'm positively surprised. There seems to be a ton of very useful scripts included. E.g. plot_lendist appears to be a very easy (little work) way to plot length distribution with high quality output, i.e. to check the output of previous code (except on bacterial subset of Silva SSURef_NR99_115):

                      Code:
                      read_fasta -i PCR_products.fasta | plot_lendist -xk SEQ_LEN -t post -o plot.ps -L
                      Gives:


                      ..works pretty well
                      Last edited by rhinoceros; 10-27-2013, 04:41 AM.
                      savetherhino.org

                      Comment

                      Latest Articles

                      Collapse

                      • seqadmin
                        Current Approaches to Protein Sequencing
                        by seqadmin


                        Proteins are often described as the workhorses of the cell, and identifying their sequences is key to understanding their role in biological processes and disease. Currently, the most common technique used to determine protein sequences is mass spectrometry. While still a valuable tool, mass spectrometry faces several limitations and requires a highly experienced scientist familiar with the equipment to operate it. Additionally, other proteomic methods, like affinity assays, are constrained...
                        04-04-2024, 04:25 PM
                      • seqadmin
                        Strategies for Sequencing Challenging Samples
                        by seqadmin


                        Despite advancements in sequencing platforms and related sample preparation technologies, certain sample types continue to present significant challenges that can compromise sequencing results. Pedro Echave, Senior Manager of the Global Business Segment at Revvity, explained that the success of a sequencing experiment ultimately depends on the amount and integrity of the nucleic acid template (RNA or DNA) obtained from a sample. “The better the quality of the nucleic acid isolated...
                        03-22-2024, 06:39 AM

                      ad_right_rmr

                      Collapse

                      News

                      Collapse

                      Topics Statistics Last Post
                      Started by seqadmin, 04-11-2024, 12:08 PM
                      0 responses
                      25 views
                      0 likes
                      Last Post seqadmin  
                      Started by seqadmin, 04-10-2024, 10:19 PM
                      0 responses
                      29 views
                      0 likes
                      Last Post seqadmin  
                      Started by seqadmin, 04-10-2024, 09:21 AM
                      0 responses
                      25 views
                      0 likes
                      Last Post seqadmin  
                      Started by seqadmin, 04-04-2024, 09:00 AM
                      0 responses
                      52 views
                      0 likes
                      Last Post seqadmin  
                      Working...
                      X