Seqanswers Leaderboard Ad

**AlexReynolds** · 06-05-2013, 04:39 AM

If you can get your reads and your genes into UCSC BED format, then you can use the BEDOPS bedmap tool to map reads to genes.

http://code.google.com/p/bedops/wiki/bedmap

The bedmap tool comes with a --count operator, which you can use here to count the number of reads that map to a gene.

http://code.google.com/p/bedops/wiki...lap_statistics

If your inputs are not in BED format, BEDOPS offers conversion scripts for converting common genomic formats to sorted BED files, which you can use as inputs to bedmap.

http://code.google.com/p/bedops/wiki/conversion

To briefly demo how this might work for you, let's say your genes are in GFF format and your reads are in BAM format. We can convert like so:

$ gff2bed < genes.gff > genes.bed
$ bam2bed < reads.bam > reads.bed

Now we can map the reads to the genes and count them:

$ bedmap --echo --count genes.bed reads.bed > answer.bed

The file answer.bed is a BED-formatted list of genes. Each line contains a gene and the number of reads that overlap that gene:

$ more answer.bed
chr1 1000 2000 gene-1 ... | 8
chr1 4000 5000 gene-2 ... | 5
...

(In other words, eight reads overlap gene-1, five reads overlap gene-2 — and so on.)

If you want more information than just read counts, there are several operators that bedmap offers. For example, you might add the --echo-map-id operand if you want the IDs of all overlapping reads. The bedmap documentation describes various statistical and element operators in more detail.

The default overlap criterion between read and gene is one or more bases. You can set this to be more stringent with appropriate overlap settings:

http://code.google.com/p/bedops/wiki...erlap_criteria

**Jeremy** · 06-06-2013, 12:29 AM

I think HTSeq will do what you want.

http://www-huber.embl.de/users/anders/HTSeq/doc/count.html

**cumulonimbus** · 06-06-2013, 01:10 AM

Dear Alex,

thank you very much, this is exactly what I was looking for (I tried already and it works great), highly appreciated, very nice program!

Jeremy: Thanks, this looks also good, I will have a look, too!

Thank you for this fast help

cumulonimbus

**cumulonimbus** · 06-12-2013, 07:08 AM

Dear Alex,

I am now working through my datasets with bedmap for counting the reads and for some of my files I get this error when I use

bedmap --echo --count genes.bed reads.bed > hits.tab

Code:

dyld: lazy symbol binding failed: Symbol not found: __ZNSt8__detail15_List_node_base7_M_hookEPS0_
  Referenced from: /usr/local/bin/bedmap
  Expected in: /usr/lib/libstdc++.6.dylib

dyld: Symbol not found: __ZNSt8__detail15_List_node_base7_M_hookEPS0_
  Referenced from: /usr/local/bin/bedmap
  Expected in: /usr/lib/libstdc++.6.dylib

I checked the data and it looks normal. I also split up the files into smaller parts to see whether there is something wrong with the format in the file, but when I find the line where the error disappears, I can see no difference in the format.
For some data files (up to 120 MB in size) it works fine, for some not, do you have a solution why?

I am using Mac OS X 10.8.3

Thank you for your help!

**AlexReynolds** · 06-12-2013, 08:27 AM

Can you indicate what version of bedmap you are running? This is an error usually seen with an older version of BEDOPS for OS X, and upgrading to a current version may help resolve this.

**cumulonimbus** · 06-12-2013, 08:49 AM

Oh, I forgot to mention this, it is: version: 2.2.0
which is the current version, right?

**AlexReynolds** · 06-12-2013, 09:20 AM

That's the current version. Can you post the results from the following command?

$ otool -L /usr/local/bin/bedmap

You may need to install otool via Xcode or Apple's command-line developer tools installer.

Also, do you have MacPorts and GCC installed?

**cumulonimbus** · 06-12-2013, 10:11 AM

$ otool -L /usr/local/bin/bedmap:

Code:

/usr/local/bin/bedmap:
	/Library/Application Support/libstdc++.6.dylib (compatibility version 7.0.0, current version 7.17.0)
	/usr/lib/libSystem.B.dylib (compatibility version 1.0.0, current version 169.3.0)
	/Library/Application Support/libgcc_s.1.dylib (compatibility version 1.0.0, current version 1.0.0)

$ gcc --version:

Code:

i686-apple-darwin11-llvm-gcc-4.2 (GCC) 4.2.1 (Based on Apple Inc. build 5658) (LLVM build 2336.11.00)
Copyright (C) 2007 Free Software Foundation, Inc.
This is free software; see the source for copying conditions.  There is NO
warranty; not even for MERCHANTABILITY or FITNESS FOR A PARTICULAR PURPOSE.

XCode Command Line tools are installed, GCC as well and I just installed MacPorts (MacPorts-2.1.3-10.8-MountainLion). XCode version 4.6.2.

So far the error is still occurring.

**AlexReynolds** · 06-12-2013, 10:29 AM

Thanks, the paths that otool are reporting are wrong. I'm researching why that is.

In the meantime, to resolve this issue please run the following commands:

$ sudo install_name_tool -change /Library/Application\ Support/libstdc++.6.dylib /Library/Application\ Support/BEDOPS/libstdc++.6.dylib /usr/local/bin/bedmap

$ sudo install_name_tool -change /Library/Application\ Support/libgcc_s.1.dylib /Library/Application\ Support/BEDOPS/libgcc_s.1.dylib /usr/local/bin/bedmap

These two commands tell these binaries where to find the required library files.

Assuming this is a problem with the 2.2 installer, you will likely want to repeat these commands for the following binaries, replacing /usr/local/bin/bedmap with:

/usr/local/bin/bedops
/usr/local/bin/bedextract
/usr/local/bin/closest-features
/usr/local/bin/sort-bed
/usr/local/bin/starch
/usr/local/bin/unstarch
/usr/local/bin/starchcat

Alternatively, you could wait until I put out a 2.2.1 installer, once I find the cause of this issue.

Thanks for the report!

**cumulonimbus** · 06-12-2013, 10:47 AM

Thanks a lot for your fast support, this solved the problem!

**AlexReynolds** · 06-12-2013, 10:53 AM

For others, I posted a new OS X installer ("2.2.0b") that fixes this issue:

http://code.google.com/p/bedops/down....2.0b.mpkg.zip

This patches some scripts that do the equivalent of the aforementioned fix.

**AlexReynolds** · 10-02-2013, 08:07 AM

We have posted new builds of BEDOPS v2.3:

Release BEDOPS v2.3.0 · bedops/bedops

https://github.com/bedops/bedops/releases/tag/v2.3.0

Downloads are available at the bottom of this page. Please read the BEDOPS v2.3.0 revision history, which summarizes new features and fixes in this release. Linux bedops_linux_x86_64-v2.3.0.tar.bz...

A more complete revision history is available here:

3. Revision history — BEDOPS v2.4.41

https://bedops.readthedocs.org/en/latest/content/revision-history.html#v2-3-0

Feel free to send us feedback at: [email protected]

Topics	Statistics	Last Post
Expanding the Horizons of Cellular Research with the Single Cell Atlas by seqadmin Started by seqadmin, Yesterday, 11:49 AM	0 responses 15 views 0 likes	Last Post by seqadmin Yesterday, 11:49 AM
Genetic Variants and Diabetes Risk in Childhood Cancer Survivors by seqadmin Started by seqadmin, 04-24-2024, 08:47 AM	0 responses 16 views 0 likes	Last Post by seqadmin 04-24-2024, 08:47 AM
Cancer Metastasis: A Deep Dive into Cellular Plasticity by seqadmin Started by seqadmin, 04-11-2024, 12:08 PM	0 responses 61 views 0 likes	Last Post by seqadmin 04-11-2024, 12:08 PM
Proteogenomic Profiles Offer New Clues in Prostate Cancer by seqadmin Started by seqadmin, 04-10-2024, 10:19 PM	0 responses 60 views 0 likes	Last Post by seqadmin 04-10-2024, 10:19 PM

Seqanswers Leaderboard Ad

Announcement

Mapping reads to reference genome + count reads of genes

Comment

Comment

Comment

Comment

Comment

Comment

Comment

Comment

Comment

Comment

Comment

Comment

Latest Articles

ad_right_rmr

News