  • Linux based enzyme restriction tool

    Hi all,
    I guess the answer should be simple, but honestly I googled it and didn't find the answer...
    What I'm looking for is a Linux-based tool to recognize restriction enzyme sites:
    1) over the genome (reference genome)
    2) over a SAM/BAM file

    Right now I'm simply using Bowtie2 to report multiple perfect hits, which solves the first problem (reference genome), but it can't handle the SAM/BAM file.
    Moreover, I want to go over the whole REBASE database (http://rebase.neb.com/rebase/rebase.ftp.html), and I'm sure there is already a tool that does this...

    Any ideas?
    Thanks,
    Pap

  • #2
    I wrote a genomic digester, see https://www.biostars.org/p/86480/#86535

    Code:
    curl -s "http://hgdownload.cse.ucsc.edu/goldenPath/hg19/chromosomes/chr3.fa.gz" |\
    gunzip -c  |\
    java -jar dist/biostar86480.jar -E AarI -E EcoRI 
    
    chr3    60645    60651    GAATTC    1000    +    EcoRI    G^AATTC
    chr3    60953    60959    GAATTC    1000    +    EcoRI    G^AATTC
    chr3    68165    68172    GCAGGTG    1000    -    AarI    CACCTGC(4/8)
    chr3    70263    70269    GAATTC    1000    +    EcoRI    G^AATTC
    chr3    70945    70952    GCAGGTG    1000    -    AarI    CACCTGC(4/8)
    chr3    71140    71146    GAATTC    1000    +    EcoRI    G^AATTC
    chr3    72264    72270    GAATTC    1000    +    EcoRI    G^AATTC
    chr3    74150    74156    GAATTC    1000    +    EcoRI    G^AATTC
    chr3    75063    75069    GAATTC    1000    +    EcoRI    G^AATTC
    chr3    78438    78444    GAATTC    1000    +    EcoRI    G^AATTC
    chr3    81052    81059    CACCTGC    1000    +    AarI    CACCTGC(4/8)
    chr3    84498    84504    GAATTC    1000    +    EcoRI    G^AATTC
    chr3    84546    84552    GAATTC    1000    +    EcoRI    G^AATTC
    chr3    84780    84787    CACCTGC    1000    +    AarI    CACCTGC(4/8)
    chr3    87771    87777    GAATTC    1000    +    EcoRI    G^AATTC
    chr3    95344    95351    GCAGGTG    1000    -    AarI    CACCTGC(4/8)
    chr3    96358    96364    GAATTC    1000    +    EcoRI    G^AATTC
    chr3    96734    96740    GAATTC    1000    +    EcoRI    G^AATTC
    chr3    105956    105962    GAATTC    1000    +    EcoRI    G^AATTC
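For readers who cannot build the Java tool, the kind of scan shown above can be approximated in a few lines of Python. This is only a rough sketch and not the biostar86480 implementation: the function names (`revcomp`, `find_sites`, `read_fasta`) are made up for illustration, the two recognition sequences are hard-coded rather than read from REBASE, degenerate (IUPAC) sites are not handled, and the score and cut-site columns of the output above are omitted. Coordinates are assumed to be 0-based and half-open, which matches the start/end widths in the listing.

```python
import io

# Recognition sequences for the two enzymes used in the example above
# (hard-coded here; a real tool would load them from REBASE).
ENZYMES = {"EcoRI": "GAATTC", "AarI": "CACCTGC"}

_COMPLEMENT = str.maketrans("ACGT", "TGCA")


def revcomp(seq):
    """Reverse-complement an upper-case A/C/G/T string."""
    return seq.translate(_COMPLEMENT)[::-1]


def find_sites(chrom, seq, enzymes=ENZYMES):
    """Yield (chrom, start, end, matched, strand, enzyme) tuples.

    Coordinates are 0-based, half-open. Palindromic sites are reported
    once on '+'; other sites are also searched as their reverse
    complement and reported on '-'.
    """
    seq = seq.upper()
    for enzyme, site in enzymes.items():
        targets = [(site, "+")]
        rc = revcomp(site)
        if rc != site:
            targets.append((rc, "-"))
        for pattern, strand in targets:
            pos = seq.find(pattern)
            while pos != -1:
                yield (chrom, pos, pos + len(pattern), pattern, strand, enzyme)
                pos = seq.find(pattern, pos + 1)  # allow overlapping hits


def read_fasta(handle):
    """Yield (name, sequence) pairs from a FASTA text stream."""
    name, chunks = None, []
    for line in handle:
        line = line.strip()
        if line.startswith(">"):
            if name is not None:
                yield name, "".join(chunks)
            name, chunks = (line[1:].split() or ["unnamed"])[0], []
        elif line:
            chunks.append(line)
    if name is not None:
        yield name, "".join(chunks)


if __name__ == "__main__":
    # Toy in-memory FASTA instead of a real chromosome.
    toy = io.StringIO(">chrT demo\nAAGAATTCAAGCAGG\nTGAACACCTGC\n")
    for chrom, seq in read_fasta(toy):
        for hit in sorted(find_sites(chrom, seq), key=lambda h: h[1]):
            print(*hit, sep="\t")
```

To run it on a real genome, `read_fasta` could be handed `sys.stdin` and fed from the same `curl ... | gunzip -c` pipeline; note that this sketch holds one whole sequence in memory at a time.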


    • #3
      Thanks lindenb!
      I saw your post before, but I get a compilation error there and our system admin is not around, so we can't install anything right now.
      The error is about "Requires apache ANT."

      Anyway, can it also solve the SAM/BAM problem?

      Thanks,
      Pap


      • #4
        I don't think you need a sysadmin: just download Ant from http://ant.apache.org/bindownload.cgi and add the directory containing the ant executable to your PATH.

        Running this for a SAM file would be easy, but I just wonder why you would need this... :-)
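As an illustration of why the SAM case is easy, here is a minimal Python sketch that scans the SEQ column of SAM text lines (for a BAM file, the lines could come from `samtools view`). It is an assumption-laden example, not part of the digester linked above: the names `site_patterns` and `scan_sam` are hypothetical, the enzyme table is hard-coded, and positions are offsets within the read as stored in the SAM record, not reference coordinates (converting them would require walking the CIGAR string).

```python
# Hard-coded recognition sequences (illustrative; not loaded from REBASE).
ENZYMES = {"EcoRI": "GAATTC", "AarI": "CACCTGC"}

_COMPLEMENT = str.maketrans("ACGT", "TGCA")


def site_patterns(enzymes):
    """Return (enzyme, pattern, strand) triples.

    Non-palindromic sites get a second entry for their reverse complement.
    """
    patterns = []
    for enzyme, site in enzymes.items():
        patterns.append((enzyme, site, "+"))
        rc = site.translate(_COMPLEMENT)[::-1]
        if rc != site:
            patterns.append((enzyme, rc, "-"))
    return patterns


def scan_sam(lines, enzymes=ENZYMES):
    """Yield (read_name, ref_name, offset_in_read, enzyme, strand).

    The strand is relative to SEQ as written in the SAM record, which for
    mapped reads is already in reference-forward orientation.
    """
    patterns = site_patterns(enzymes)
    for line in lines:
        if line.startswith("@"):  # header line
            continue
        fields = line.rstrip("\n").split("\t")
        if len(fields) < 11 or fields[9] == "*":  # malformed, or no SEQ stored
            continue
        qname, rname, seq = fields[0], fields[2], fields[9].upper()
        for enzyme, pattern, strand in patterns:
            pos = seq.find(pattern)
            while pos != -1:
                yield (qname, rname, pos, enzyme, strand)
                pos = seq.find(pattern, pos + 1)


if __name__ == "__main__":
    # Toy SAM records instead of a real alignment file.
    toy_sam = [
        "@HD\tVN:1.6\n",
        "r1\t0\tchr3\t100\t60\t12M\t*\t0\t0\tAAGAATTCAAGG\tIIIIIIIIIIII\n",
        "r2\t4\t*\t0\t0\t*\t*\t0\t0\t*\t*\n",
    ]
    for hit in scan_sam(toy_sam):
        print(*hit, sep="\t")
```

With real data, `scan_sam(sys.stdin)` could be used at the end of a `samtools view aln.bam | python script.py` pipeline, where `aln.bam` and `script.py` are placeholder names.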
