Seqanswers Leaderboard Ad

**allo** · 07-17-2009, 12:58 PM

Run the program "fasta2bfa" and on another terminal check the memory usage every 15 - 20 seconds with the command: free -m (in megabytes) or free -g (in gigabytes) if the memory usage approaches the total Ram that your machine has just before the segmentation fault, That's your problem...

**luisczul** · 07-17-2009, 01:03 PM

According to my network administrator, I am using a node in the cluster that has 32 gigs of ram.

So I find that the cluster is not the problem, it is more like a C++ limitation in the memory pointing.

What other tests do you think i can run?

I will try your post.

I did though a little experiment splitting the file and 7 gigs is also too big. :S

**luisczul** · 07-21-2009, 06:54 AM

Hello,

apparently is not a memory problem, since I checked for the memory resources just before the program crashes and there is a lot free.

The program crashes just after line 166 in the fasta2bfa.c source file.

Can somebody help me out?

**simonandrews** · 07-21-2009, 07:50 AM

Originally posted by luisczul View Post

I am trying to convert my reference fasta file to bfa using maq with the command fasta2bfa.

My script has worked before with other old references but I just downloaded the new one from Ensmble and this one it doesn't work.

In another forum there was a thread about someone having problems indexing the latest human assembly with blat. The problem was that the length of the new haplotype chromosomes pushed the overall genome length above what could be handled by a 32 bit pointer. There may be a similar issue with maq.

Since the extra haplotype sequences are mostly poly-N (to keep the positions the same as the originals) you could either delete them altogether, or remove the Ns and see if things start working again.

**luisczul** · 07-21-2009, 08:23 AM

Solved

static void ma_fasta2csfa_core(FILE *fpout, FILE *fpin)
{
seq_t seq;
int i, c1, c2, c;
char name[256], comment[4096];
INIT_SEQ(seq);

So finally the problem got solved.

The reference file that I was downloading had a very big header file information.

As you can see in the code i pasted from maq, there was a limitation of char name[256]. When the header went over that then a segmentation fault was created.

Cheers hope this helps somebody.

Topics	Statistics	Last Post
Cancer Metastasis: A Deep Dive into Cellular Plasticity by seqadmin Started by seqadmin, 04-11-2024, 12:08 PM	0 responses 55 views 0 likes	Last Post by seqadmin 04-11-2024, 12:08 PM
Proteogenomic Profiles Offer New Clues in Prostate Cancer by seqadmin Started by seqadmin, 04-10-2024, 10:19 PM	0 responses 52 views 0 likes	Last Post by seqadmin 04-10-2024, 10:19 PM
Novel Diagnostic Assay Enhances Ovarian Cancer Detection by seqadmin Started by seqadmin, 04-10-2024, 09:21 AM	0 responses 45 views 0 likes	Last Post by seqadmin 04-10-2024, 09:21 AM
Evolutionary Dynamics of Centromeres: A Comparative Genomic Analysis by seqadmin Started by seqadmin, 04-04-2024, 09:00 AM	0 responses 55 views 0 likes	Last Post by seqadmin 04-04-2024, 09:00 AM

Seqanswers Leaderboard Ad

Announcement

is my reference too big for maq?

Comment

Comment

Comment

Comment

Comment

Latest Articles

ad_right_rmr

News