Reducing amount of sequences

graham90978

Junior Member

Join Date: Oct 2010

Posts: 6
- Share
- Tweet
#1

Reducing amount of sequences

10-09-2010, 04:23 PM

Hi folks, recently started looking at de novo sequence assembly. I have a file of illumina reads in raw format but have converted them to fastq. Im afraid I have no idea whether this sequencing was achieved using paired or mate end sequencing, its more of a training data set to get to grips with assembly. I have 17092779 reads of 85bp length. Running velvet, this creates a huge graph. I ran it using k=51 and got a really low n50 value. I think there is a huge level of redundancy in my files and would like to remove such sequences. Does velvet do this itself as I ran velvet all night and it was eating up 95% of the server I am running it on. Can anyone point me towards some scripts for filtering out the useless reads from my file? I tried searching the forums but havent turned up much?
Thank you
Tags: None

Previous template Next

Topics	Statistics	Last Post
Genetic Variants and Diabetes Risk in Childhood Cancer Survivors by seqadmin Started by seqadmin, Yesterday, 08:47 AM	0 responses 13 views 0 likes	Last Post by seqadmin Yesterday, 08:47 AM
Cancer Metastasis: A Deep Dive into Cellular Plasticity by seqadmin Started by seqadmin, 04-11-2024, 12:08 PM	0 responses 60 views 0 likes	Last Post by seqadmin 04-11-2024, 12:08 PM
Proteogenomic Profiles Offer New Clues in Prostate Cancer by seqadmin Started by seqadmin, 04-10-2024, 10:19 PM	0 responses 60 views 0 likes	Last Post by seqadmin 04-10-2024, 10:19 PM
Novel Diagnostic Assay Enhances Ovarian Cancer Detection by seqadmin Started by seqadmin, 04-10-2024, 09:21 AM	0 responses 54 views 0 likes	Last Post by seqadmin 04-10-2024, 09:21 AM

Seqanswers Leaderboard Ad