Does anyone know a program which can split BED file according to the chromosome? I have generate a BED file which contains the data for all chromosome, but it is not sorted. When I did sorting using BedSort, the output was not ordered according the numeric order, it always give chr10 on the top and then followed chr11, up to chr19. It seems I have to do the sorting for each chr respectively, I wonder whether there is a program which can split BED file according to the chromosome. Thanks
Seqanswers Leaderboard Ad
Collapse
Announcement
Collapse
No announcement yet.
X
-
You could try the following with your bed file:
Code:sort -k 1V,1 -k 2n,2 file.bed -o file.sorted.bed
Code:mkdir -p split_results for chr in `cut -f 1 file.bed | sort | uniq`; do grep -w $chr file.bed > split_results/$chr.output.bed done
-
Similar to adamdeluca's suggestion, here is another simple awk solution. Note that the ">>" creates and appends to files named CHROM.bed, where CHROM is column 1 of the bed input bed file (in this case, example.bed).
So, in plain English, the awk command prints each entire line ($0) from example.bed to distinct files that are each named by the chrom field ($1).
This strategy is useful in many other cases where you want to do a context-based "grep", and route the results to distinct files.
Code:$ awk '{print $0 >> $1".bed"}' example.bed $ ls -1 *.bed chr1.bed chr2.bed ... (snip) chrY.bed example.bed
Comment
Latest Articles
Collapse
-
by seqadmin
The sequencing world is rapidly changing due to declining costs, enhanced accuracies, and the advent of newer, cutting-edge instruments. Equally important to these developments are improvements in sequencing analysis, a process that converts vast amounts of raw data into a comprehensible and meaningful form. This complex task requires expertise and the right analysis tools. In this article, we highlight the progress and innovation in sequencing analysis by reviewing several of the...-
Channel: Articles
Today, 07:48 AM -
-
by seqadmin
The field of epigenetics has traditionally concentrated more on DNA and how changes like methylation and phosphorylation of histones impact gene expression and regulation. However, our increased understanding of RNA modifications and their importance in cellular processes has led to a rise in epitranscriptomics research. “Epitranscriptomics brings together the concepts of epigenetics and gene expression,” explained Adrien Leger, PhD, Principal Research Scientist...-
Channel: Articles
04-22-2024, 07:01 AM -
ad_right_rmr
Collapse
News
Collapse
Topics | Statistics | Last Post | ||
---|---|---|---|---|
Started by seqadmin, Today, 07:17 AM
|
0 responses
9 views
0 likes
|
Last Post
by seqadmin
Today, 07:17 AM
|
||
Started by seqadmin, 05-02-2024, 08:06 AM
|
0 responses
19 views
0 likes
|
Last Post
by seqadmin
05-02-2024, 08:06 AM
|
||
Started by seqadmin, 04-30-2024, 12:17 PM
|
0 responses
20 views
0 likes
|
Last Post
by seqadmin
04-30-2024, 12:17 PM
|
||
Started by seqadmin, 04-29-2024, 10:49 AM
|
0 responses
28 views
0 likes
|
Last Post
by seqadmin
04-29-2024, 10:49 AM
|
Comment