Seqanswers Leaderboard Ad

**danwiththeplan** · 11-19-2014, 01:45 PM

MIRA4:

Sequence assembly and mapping with MIRA 5

http://mira-assembler.sourceforge.net/docs/DefinitiveGuideToMIRA.html#chap_denovo

Can handle hybrid assemblies, and also can handle paired-end reads in the orientation you describe (there's an example of this in the link above)..

It's a bit of a memory hog though. You may require a high-memory system.

**ssully** · 11-19-2014, 02:38 PM

That helps a lot (for MIRA) , thanks!

So as long as I convert the 454 Paired End sff to Fastq with sff_extract, MIRA will process and assemble them correctly *as paired ends*? It still understands that those reads are 'paired end' and not single end? I'm curious as to how it does that -- does it create an interleaved Fastq or two separate -1 and -2 fastq files?

**danwiththeplan** · 11-19-2014, 02:48 PM

It still understands that those reads are 'paired end' and not single end?

My understanding is yes

I'm curious as to how it does that -- does it create an interleaved Fastq or two separate -1 and -2 fastq files?

Actually don't know.

MIRA4 works by using a manifest file that defines the data to go into the program.

look at section 5.3.3. Manifest for data sets with paired reads (in the link above).

There's a parameter called segment_placement that defines how the pairs are arranged (ie >> or <> or >< or << or whatever) and (I think) the expected separation.

As for separate FASTQ files for left and right reads, I think MIRA expects Illumina data to be this way, but I don't know how 454 data works.In the example in the link above the data is defined as 454 data and only one file is given, so maybe you don't have to split the pairs. not sure about this one.

**ssully** · 11-19-2014, 03:31 PM

The example in the mulitple platform manifest *seems* to indicate the 454 PEs can be left as one fastq file. The insert size and SD need to be provided (which I can do) or 'autorefine' to let MIRA figure it out. Using 'autopairing' would mean not even having to tell MIRA the direction of the reads in a pair.

Looks good, but I'll write the author and see if I can get a clearer view.

**wanghao** · 11-20-2014, 03:06 AM

Originally posted by ssully View Post

I'll write the author and see if I can get a clearer view.

Can you also post the reply here if you get it from the author.

**flxlex** · 11-20-2014, 06:24 AM

Originally posted by ssully View Post

So, can Newbler (I have v3.0) 'understand' Illumina paired-end reads (i.e. know that they represent of span of X bp)? Can it do trimming of adapters and low-quality bases?

Yes, Newbler will figure out the pairing from the fastq files, provided the read IDs conform to the 'standards' (see the fastq entry on wikipedia). No, you cannot tell Newbler the span, as it figures this out for itself, regardless of where the data came from. It maps pairs and determines the mode and stdev of the distribution based on that.

Newbler will trim low-quality bases. It can do adaptor trimming through the -vt flag (you have to provide it with a fasta file of adapter sequences, probably both in forward and reverse complement orientation).

**ssully** · 11-21-2014, 10:20 AM

Originally posted by wanghao View Post

Can you also post the reply here if you get it from the author.

That was the wrong way to do it -- instead I've joined the MIRA user forum, which is what the author recommends.

**bastianwur** · 12-04-2014, 04:45 AM

With that data, I'd use GapFiller with both the reads, and throw the filled up 454 contigs together with the Illumina data into an assembler which can take both, like e.g. Ray.

Topics	Statistics	Last Post
The Adaptation of the Cell Cycle in Multiciliated Cells by seqadmin Started by seqadmin, Today, 06:58 AM	0 responses 1 view 0 likes	Last Post by seqadmin Today, 06:58 AM
New Method for DNA Sequence Amplification by seqadmin Started by seqadmin, Yesterday, 08:18 AM	0 responses 14 views 0 likes	Last Post by seqadmin Yesterday, 08:18 AM
New Tools Enhance Single-Molecule DNA Analysis with Minimal Samples by seqadmin Started by seqadmin, Yesterday, 08:04 AM	0 responses 12 views 0 likes	Last Post by seqadmin Yesterday, 08:04 AM
SIX2 Protein Identified as a Key Player in Prostate Cancer Treatment Resistance by seqadmin Started by seqadmin, 06-03-2024, 06:55 AM	0 responses 13 views 0 likes	Last Post by seqadmin 06-03-2024, 06:55 AM

Seqanswers Leaderboard Ad

Announcement

de novo assembly including Illumina and 454 paired-end reads

Comment

Comment

Comment

Comment

Comment

Comment

Comment

Comment

Latest Articles

ad_right_rmr

News