Seqanswers Leaderboard Ad

**ETHANol** · 05-09-2012, 10:50 AM

Script for extracting random sub-set of sequences - SEQanswers

http://seqanswers.com/forums/showthread.php?t=16845

Discussion of next-gen sequencing related bioinformatics: resources, algorithms, open source efforts, etc

If you have no headers or you convert to BED the attached perl script should work as well. Warning: I'm not really sure what the script does, but it seems to work.

Attached Files

randomLines.pl (768 Bytes, 79 views)

**pbluescript** · 05-09-2012, 01:35 PM

You could use bamtools random for this as well.
What errors is Picard giving you?

**KathrineBL** · 05-11-2012, 05:34 AM

Thank you so much for your input!!

I've tried the code, but i doesn't seem to work - I'm not that much up for converting it to a bed file as I need the BAM format later on.

I'e tried bamtools random, but keep on getting the same error message

bamtools random ERROR: could not load index data for all input BAM file(s)... Aborting.

My code line is as follows:

bamtools random -in Input_file.bam -out output_reduced.bam -n 1000000

The input bam file originates from the SAM file produced when mapping with bowtie - it is converted to BAM with "samtools view", and sorted with "samtools sort" - then I extract all mapped reads with "samtools view -b -F 4"

As you might can imagine I'm a newbie in this field and all help is very much appreciated!

**KathrineBL** · 05-11-2012, 05:35 AM

Regarding the Picard errors - I think it relates to the (mac) version of my java (which otherwise is up to date):

Exception in thread "main" java.lang.NoClassDefFoundError: jvm-argsCaused by: java.lang.ClassNotFoundException: jvm-args
at java.net.URLClassLoader$1.run(URLClassLoader.java:202)
at java.security.AccessController.doPrivileged(Native Method)
at java.net.URLClassLoader.findClass(URLClassLoader.java:190)
at java.lang.ClassLoader.loadClass(ClassLoader.java:306)
at sun.misc.Launcher$AppClassLoader.loadClass(Launcher.java:301)
at java.lang.ClassLoader.loadClass(ClassLoader.java:247)

**pbluescript** · 05-11-2012, 11:10 AM

You need to index the bam file first. You can do this with bamtools:

bamtools index -in Input_file.bam

**dkolbe** · 05-31-2012, 11:16 AM

Originally posted by KathrineBL View Post

Regarding the Picard errors - I think it relates to the (mac) version of my java (which otherwise is up to date):

Exception in thread "main" java.lang.NoClassDefFoundError: jvm-args
[...]

You're using this as a template, I'm guessing:
java jvm-args -jar PicardCommand.jar OPTION1=value1 OPTION2=value2...

Like the command and the options, jvm-args needs to be substituted for actual java arguments. A typical example is -Xmx2g (specifying 2G of memory allocated for the run)

Topics	Statistics	Last Post
Cancer Metastasis: A Deep Dive into Cellular Plasticity by seqadmin Started by seqadmin, 04-11-2024, 12:08 PM	0 responses 59 views 0 likes	Last Post by seqadmin 04-11-2024, 12:08 PM
Proteogenomic Profiles Offer New Clues in Prostate Cancer by seqadmin Started by seqadmin, 04-10-2024, 10:19 PM	0 responses 57 views 0 likes	Last Post by seqadmin 04-10-2024, 10:19 PM
Novel Diagnostic Assay Enhances Ovarian Cancer Detection by seqadmin Started by seqadmin, 04-10-2024, 09:21 AM	0 responses 51 views 0 likes	Last Post by seqadmin 04-10-2024, 09:21 AM
Evolutionary Dynamics of Centromeres: A Comparative Genomic Analysis by seqadmin Started by seqadmin, 04-04-2024, 09:00 AM	0 responses 56 views 0 likes	Last Post by seqadmin 04-04-2024, 09:00 AM

Seqanswers Leaderboard Ad

Announcement

Downsampling a BAM file

Comment

Comment

Comment

Comment

Comment

Comment

Latest Articles

ad_right_rmr

News