  • Linux based enzyme restriction tool

    Hi all,
    I guess the answer should be simple, but honestly I googled it and didn't find the answer...
    What I'm looking for is a Linux-based tool to recognize restriction enzyme sites:
    1) across a reference genome
    2) in a SAM/BAM file

    Right now I'm simply using Bowtie2 to report multiple perfect hits, which solves the first problem (reference genome), but it can't handle the SAM/BAM file.
    Moreover, I want to go through the whole REBASE database (http://rebase.neb.com/rebase/rebase.ftp.html), and I'm sure there is already a tool that does this...

    Any ideas?
    Thanks,
    Pap

  • #2
    I wrote a genomic digester; see https://www.biostars.org/p/86480/#86535

    Code:
    curl -s "http://hgdownload.cse.ucsc.edu/goldenPath/hg19/chromosomes/chr3.fa.gz" |\
    gunzip -c  |\
    java -jar dist/biostar86480.jar -E AarI -E EcoRI 
    
    chr3    60645    60651    GAATTC    1000    +    EcoRI    G^AATTC
    chr3    60953    60959    GAATTC    1000    +    EcoRI    G^AATTC
    chr3    68165    68172    GCAGGTG    1000    -    AarI    CACCTGC(4/8)
    chr3    70263    70269    GAATTC    1000    +    EcoRI    G^AATTC
    chr3    70945    70952    GCAGGTG    1000    -    AarI    CACCTGC(4/8)
    chr3    71140    71146    GAATTC    1000    +    EcoRI    G^AATTC
    chr3    72264    72270    GAATTC    1000    +    EcoRI    G^AATTC
    chr3    74150    74156    GAATTC    1000    +    EcoRI    G^AATTC
    chr3    75063    75069    GAATTC    1000    +    EcoRI    G^AATTC
    chr3    78438    78444    GAATTC    1000    +    EcoRI    G^AATTC
    chr3    81052    81059    CACCTGC    1000    +    AarI    CACCTGC(4/8)
    chr3    84498    84504    GAATTC    1000    +    EcoRI    G^AATTC
    chr3    84546    84552    GAATTC    1000    +    EcoRI    G^AATTC
    chr3    84780    84787    CACCTGC    1000    +    AarI    CACCTGC(4/8)
    chr3    87771    87777    GAATTC    1000    +    EcoRI    G^AATTC
    chr3    95344    95351    GCAGGTG    1000    -    AarI    CACCTGC(4/8)
    chr3    96358    96364    GAATTC    1000    +    EcoRI    G^AATTC
    chr3    96734    96740    GAATTC    1000    +    EcoRI    G^AATTC
    chr3    105956    105962    GAATTC    1000    +    EcoRI    G^AATTC
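
For readers who cannot build the Java tool, the kind of scan that produces rows like those above can be sketched in Python. This is not lindenb's implementation, only a minimal illustration under a few assumptions: the two recognition sequences are copied from the output above, coordinates are taken to be 0-based and half-open (as the 6 bp EcoRI intervals suggest), IUPAC ambiguity codes are not handled, and the names `read_fasta` and `find_sites` are hypothetical.

```python
import io
import re

# Recognition sequences copied from the output above (cut-site notation stripped).
ENZYMES = {"EcoRI": "GAATTC", "AarI": "CACCTGC"}

_COMPLEMENT = str.maketrans("ACGTacgt", "TGCAtgca")


def reverse_complement(seq):
    """Return the reverse complement of a plain A/C/G/T sequence."""
    return seq.translate(_COMPLEMENT)[::-1]


def read_fasta(handle):
    """Yield (name, sequence) pairs from a FASTA text handle."""
    name, chunks = None, []
    for line in handle:
        line = line.strip()
        if line.startswith(">"):
            if name is not None:
                yield name, "".join(chunks)
            # Keep only the first word of the header as the sequence name.
            name, chunks = (line[1:].split() or [""])[0], []
        elif line:
            chunks.append(line)
    if name is not None:
        yield name, "".join(chunks)


def find_sites(name, seq, enzymes=ENZYMES):
    """Return sorted (chrom, start, end, matched, strand, enzyme) tuples."""
    upper = seq.upper()
    hits = []
    for enzyme, site in enzymes.items():
        patterns = [(site, "+")]
        rc = reverse_complement(site)
        if rc != site:  # palindromic sites are reported once, on '+'
            patterns.append((rc, "-"))
        for pattern, strand in patterns:
            # The lookahead lets overlapping occurrences be found too.
            for m in re.finditer("(?=%s)" % re.escape(pattern), upper):
                start = m.start()
                hits.append((name, start, start + len(pattern), pattern, strand, enzyme))
    return sorted(hits, key=lambda h: (h[1], h[5]))


# Small in-memory demo; a real run would pass an open FASTA file instead.
demo = io.StringIO(">demo example\nAAGAATTC\nTTGCAGGTGAA\n")
for chrom, sequence in read_fasta(demo):
    for hit in find_sites(chrom, sequence):
        print("\t".join(str(field) for field in hit))
```

Each sequence is held in memory as one string, which is fine for a single chromosome but may need chunking for a whole genome. Extending `ENZYMES` to all of REBASE would also require translating ambiguity codes such as N or R into regular-expression character classes.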


    • #3
      Thanks lindenb!
      I saw your post before, but I get a compilation error there and our system admin is away, so we can't install anything right now.
      The error says "Requires apache ANT."

      Anyway, can it also solve the SAM/BAM problem?

      Thanks,
      Pap


      • #4
        I don't think you need a sysadmin: just download Ant from http://ant.apache.org/bindownload.cgi and add the executable to your PATH.

        Running this on a SAM file would be easy, but I just wonder why you would need this... :-)
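
As a rough illustration of the SAM case, the read sequence (column 10 of each alignment line) can be scanned the same way. The sketch below is an assumption about what such a scan might look like, not the digester's actual behavior; `sites_in_sam` is a hypothetical name, and a BAM file would first have to be converted to text, for example with `samtools view`. Offsets are 0-based positions within SEQ, which SAM stores in reference-forward orientation.

```python
import re

# Recognition sequences copied from the output earlier in the thread.
SITES = {"EcoRI": "GAATTC", "AarI": "CACCTGC"}

_COMPLEMENT = str.maketrans("ACGT", "TGCA")


def sites_in_sam(lines, sites=SITES):
    """Yield (read_name, ref_name, offset_in_read, enzyme, strand) per site."""
    for line in lines:
        if line.startswith("@"):  # skip header lines
            continue
        fields = line.rstrip("\n").split("\t")
        if len(fields) < 11:  # not a complete alignment record
            continue
        qname, rname, seq = fields[0], fields[2], fields[9].upper()
        if seq == "*":  # sequence not stored
            continue
        for enzyme, site in sites.items():
            patterns = [(site, "+")]
            rc = site.translate(_COMPLEMENT)[::-1]
            if rc != site:  # non-palindromic: also look on the other strand
                patterns.append((rc, "-"))
            for pattern, strand in patterns:
                for m in re.finditer("(?=%s)" % re.escape(pattern), seq):
                    yield qname, rname, m.start(), enzyme, strand


# Tiny hand-written SAM fragment used only for demonstration.
demo_sam = [
    "@HD\tVN:1.6\n",
    "r1\t0\tchr3\t100\t60\t10M\t*\t0\t0\tAAGAATTCTT\tIIIIIIIIII\n",
    "r2\t4\t*\t0\t0\t*\t*\t0\t0\t*\t*\n",
    "r3\t16\tchr3\t200\t60\t9M\t*\t0\t0\tTGCAGGTGA\tIIIIIIIII\n",
]
for record in sites_in_sam(demo_sam):
    print("\t".join(str(field) for field in record))
```

Converting a read offset to a reference coordinate would additionally require walking the CIGAR string, which this sketch leaves out.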
