Seqanswers Leaderboard Ad

**adamdeluca** · 07-09-2010, 06:33 AM

Originally posted by R diggity View Post

My reference consists of multiple candidate sequences varying in size and location across the genome.

Careful, if you are only aligning to your regions of interest you will often end up with false mappings. Generally the best approach is to map to the entire genome, and filter the results to your regions of interest.

10% mapping is not surprising for a hybridization based capture of a small region (I am assuming this is what you are doing). I did an Agilent capture / GA2 sequencing in human and got 16% mapping to the 0.3Mbase of target regions.

**R diggity** · 07-10-2010, 11:40 AM

Thanks for the advice. I suppose I will have to construct my reference genome from quite a few separate linkage groups. Given that my reads are 75bp in length, will I have to manually manipulate the reference sequence such that it has gaps greater that 75bp between chromosomes?

Edit: I found a FASTA file containing the entire genome with the linkage groups treated as separate sequences. Does Maq understand this?

Edit2: I used easyrun to map paired ends to the genome, and only mapped 18.24%. I'm fairly certain I'm doing something incorrectly.

**Nomijill** · 07-10-2010, 12:26 PM

multiple reference sequences

I do not know if you have tried the CLC bio software at all, but it should be able to handle your data in a variety of ways. First, you can easily map your Illumina reads to multiple reference sequences. If these reference sequences are a subset of a larger genome, you can also use our targeted resequencing tool to get a report of the mapping of your reads to the targeted area vs the non targeted area. The tools are pretty flexible, so there are a lot of different ways that you can apply them to your data. The software is commercial, but you can use the trial for two weeks to see if it is able to solve any of your problems. The download is available from the CLC web site: http://clcbio.com/index.php?id=1240 I hope you'll try it.

Note: I work for CLC.

Topics	Statistics	Last Post
AI Tool Creates High-Resolution 3D Maps of the Mouse Brain by seqadmin Started by seqadmin, 03-20-2025, 05:03 AM	0 responses 49 views 0 reactions	Last Post by seqadmin 03-20-2025, 05:03 AM
Studying Microbial Gene Transfer with RNA Barcoding by seqadmin Started by seqadmin, 03-19-2025, 07:27 AM	0 responses 57 views 0 reactions	Last Post by seqadmin 03-19-2025, 07:27 AM
Mapping the snoRNAome in Zebrafish to Advance Disease Research by seqadmin Started by seqadmin, 03-18-2025, 12:50 PM	0 responses 50 views 0 reactions	Last Post by seqadmin 03-18-2025, 12:50 PM
TIGR Systems Offer a Compact Alternative to CRISPR for Gene Editing by seqadmin Started by seqadmin, 03-03-2025, 01:15 PM	0 responses 201 views 0 reactions	Last Post by seqadmin 03-03-2025, 01:15 PM

Seqanswers Leaderboard Ad

Aligning numerous reads to several small references

Comment

Comment

Comment

Latest Articles

ad_right_rmr

News