Seqanswers Leaderboard Ad

**Brian Bushnell** · 04-18-2014, 12:09 AM

To download all of them, in Linux, you can do something like this:

wget "http://www.site.com/stuff/*.fa"

Then:

cat mito*.fasta > dir/mitoCombined.fasta
cat chloro*.fasta > dir/chloroCombined.fasta
(you have to be careful that cat does not try to combine your output file with the input files).

Then:

I suggest using BBSplit for this kind of problem.

bbsplit.sh ref=dir/mitoCombined.fasta,dir/chloroCombined.fasta in1=reads1.fq in2=reads2.fq basename=out_%.fq outu=out_unmapped.fq -Xmx4g

(-Xmx4g should be adequate in this case but you may need to adjust it up or down based on available memory)

This will write reads to 3 files:
mitochondrial reads go to out_mitoCombined.fq
chloroplast reads go to out_chloroCombined.fq
unmapped reads go to out_unmapped.fq

These will all be interleaved, for paired reads. You can de-interleave them like this:
reformat.sh in=out_mitoCombined.fq out1=mito1.fq out2=mito2.fq

**maubp** · 04-18-2014, 08:47 AM

I think you can do this with NCBI Entrez which offers some quite advanced filtering, e.g. properties like completeness - see for example: http://blastedbio.blogspot.co.uk/201...-chimeras.html

Topics	Statistics	Last Post
Cancer Metastasis: A Deep Dive into Cellular Plasticity by seqadmin Started by seqadmin, 04-11-2024, 12:08 PM	0 responses 30 views 0 likes	Last Post by seqadmin 04-11-2024, 12:08 PM
Proteogenomic Profiles Offer New Clues in Prostate Cancer by seqadmin Started by seqadmin, 04-10-2024, 10:19 PM	0 responses 32 views 0 likes	Last Post by seqadmin 04-10-2024, 10:19 PM
Novel Diagnostic Assay Enhances Ovarian Cancer Detection by seqadmin Started by seqadmin, 04-10-2024, 09:21 AM	0 responses 28 views 0 likes	Last Post by seqadmin 04-10-2024, 09:21 AM
Evolutionary Dynamics of Centromeres: A Comparative Genomic Analysis by seqadmin Started by seqadmin, 04-04-2024, 09:00 AM	0 responses 53 views 0 likes	Last Post by seqadmin 04-04-2024, 09:00 AM

Seqanswers Leaderboard Ad

Announcement

Mitochondria & plastid genomes

Comment

Comment

Latest Articles

ad_right_rmr

News