  • update on bisulfite mapping program: BSMAP-2.4

    This post announces the release of BSMAP v2.4, a powerful bisulfite mapping program. This version substantially improves run-time performance while maintaining the high accuracy and flexibility of previous versions.

    Using 8 threads, BSMAP-2.4 can map 28M 76nt paired-end WGBS reads to the human genome in about 2 hours (allowing up to 6 mismatches), including the genome indexing time. The memory usage is ~9GB. (CPU: Intel Xeon X5690)

    We tested the mapping accuracy using simulated bisulfite reads. BSMAP has significantly higher mapping accuracy than most other bisulfite mapping programs, especially for reads with more than 3 mismatches.

    BSMAP is freely available at http://code.google.com/p/bsmap/

    Main features of BSMAP-2.4:

    1. Reads are mapped directly to the original reference genome sequence; there is no need to preprocess the reads or the reference genome to convert C to T.

    2. Supports both whole genome bisulfite sequencing (WGBS) mode and reduced representation bisulfite sequencing (RRBS) mode. In RRBS mode, reads are guaranteed to be mapped to digestion sites to increase accuracy. The digestion site information can also be changed to support different digestion enzymes.

    3. Supports both the "Lister protocol" (sequencing the 2 forward strands only) and the "Cokus protocol" (sequencing all 4 bisulfite-converted strands).

    4. Supports trimming adapter sequences and low-quality nucleotides from the 3' end of reads.

    5. Allows a trade-off between speed, memory usage, and mapping sensitivity. For the human genome, RRBS mode uses ~3GB. In WGBS mode, the typical memory usage is ~9GB, but it can be as low as 5GB.

    6. Allows alignment for other nucleotide transitions; for example, it can be set to detect the A=>I(G) transition in RNA editing.

    7. Includes a downstream script to extract methylation ratios from mapping results.

    8. FASTA/FASTQ/SAM format input, text/SAM output. Single-end and paired-end mapping. Read length up to 144nt, with a maximum of 15 mismatches allowed.
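
    The idea behind feature 1 (mapping against the unconverted reference) can be illustrated with an asymmetric base comparison: a T in the read is accepted over a C in the reference, because bisulfite treatment reads unmethylated C as T, while a C in the read over a T in the reference still counts as a mismatch. The sketch below is only an illustration of that idea and is not BSMAP's actual implementation. The function names are hypothetical, and it covers only the forward-strand C-to-T case.

```cpp
#include <cstddef>
#include <string>

// Returns true if a read base is compatible with a reference base on the
// forward strand under bisulfite conversion: identical bases match, and a
// read T may also originate from a reference C (unmethylated C read as T).
// The reverse case (read C over reference T) remains a mismatch.
bool bs_compatible(char read_base, char ref_base) {
    return read_base == ref_base || (ref_base == 'C' && read_base == 'T');
}

// Counts mismatches between a read and an equally long reference window.
// Returns -1 if the lengths differ (a convention chosen for this sketch).
int count_bs_mismatches(const std::string& read, const std::string& ref) {
    if (read.size() != ref.size()) return -1;
    int mismatches = 0;
    for (std::size_t i = 0; i < read.size(); ++i) {
        if (!bs_compatible(read[i], ref[i])) ++mismatches;
    }
    return mismatches;
}
```

    With these definitions, count_bs_mismatches("TTGA", "CTGA") is 0, whereas swapping the arguments gives 1, which shows why the comparison has to be asymmetric.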




    We encourage you to try this new version for mapping short bisulfite reads.
    Any comments/suggestions/bug reports will be appreciated.

    Thank you,

    Yuanxin Xi

  • #2
    Dear Xi,
    I have a problem installing BSMAP. My system is openSUSE 12.3 x86_64 GNU/Linux, and when I run make I get the following errors:
    xiao@localhost:~/Software/bsmap-2.74> make
    g++ -DMAXHITS=1000 -DTHREAD -funroll-loops -Lsamtools -Isamtools -Lgzstream -Igzstream -O3 -m64 -c param.cpp -o param.o
    param.cpp: In constructor ‘Param::Param()’:
    param.cpp:8:20: error: ‘_SC_NPROCESSORS_ONLN’ was not declared in this scope
    param.cpp:8:40: error: ‘sysconf’ was not declared in this scope
    make: *** [param.o] Error 1

    Could you tell me what I should do to resolve this error?

    Thank you.

    Xiao



    • #3
      Adding "#include <unistd.h>" at the beginning of param.cpp should solve this problem.
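
      Both names in the error message, sysconf and _SC_NPROCESSORS_ONLN, are declared in the POSIX header <unistd.h>, so the compiler cannot resolve them unless that header is included. A minimal sketch of the pattern follows; the function name online_cpu_count is hypothetical and is not taken from param.cpp.

```cpp
#include <unistd.h>  // declares sysconf() and _SC_NPROCESSORS_ONLN

// Returns the number of processors currently online, or 1 if the value
// cannot be determined (sysconf returns -1 on error).
long online_cpu_count() {
    long n = sysconf(_SC_NPROCESSORS_ONLN);
    return n > 0 ? n : 1;
}
```

      Falling back to 1 is a choice made for this sketch so that a caller always gets a usable thread count.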



      • #4
        Okay, thank you!



        • #5
          Dear Xi,

          I used BSMAP for mapping, and it gave better mapping efficiency than other tools. However, I have a problem extracting methylation information from the .txt file.
          Please suggest how to solve it.
