**HESmith** · 06-17-2011, 10:25 AM

Transposon mapping with paired-end reads is straightforward.

1) Create a reference file that contains the sequence of each transposon.
2) Align read one and read two separately to the transposon reference.
3) Align read one and read two separately to the genome reference, using repeat masking (so you won't align to transposons).
4) Filter the read one genome alignments with the read two transposon alignments, using the unique read identifier.
5) Repeat with read two genome and read one transposon alignments.
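Steps 4 and 5 can be sketched roughly as below. This is only an illustration, assuming the alignments are available as SAM-format text and that mates share a read name apart from an optional `/1` or `/2` suffix. The function names, the `Tn5` reference name, and the example records are hypothetical, not taken from any particular pipeline. Only the set of transposon-matching read IDs is held in memory; the genome alignments are streamed line by line.

```python
UNMAPPED = 0x4  # SAM flag bit meaning "this read is unmapped"


def normalize(qname):
    """Strip a trailing /1 or /2 so both mates share one identifier."""
    if qname.endswith(("/1", "/2")):
        return qname[:-2]
    return qname


def mapped_read_ids(sam_lines):
    """Collect the identifiers of reads that aligned (e.g. to the transposon reference)."""
    ids = set()
    for line in sam_lines:
        if line.startswith("@") or not line.strip():
            continue  # skip header and blank lines
        fields = line.rstrip("\n").split("\t")
        if int(fields[1]) & UNMAPPED:
            continue
        ids.add(normalize(fields[0]))
    return ids


def filter_by_mate(genome_sam_lines, transposon_ids):
    """Yield (read name, chromosome, position) for mapped genome reads
    whose mate is in the set of transposon-aligned identifiers."""
    for line in genome_sam_lines:
        if line.startswith("@") or not line.strip():
            continue
        fields = line.rstrip("\n").split("\t")
        if int(fields[1]) & UNMAPPED:
            continue
        if normalize(fields[0]) in transposon_ids:
            yield fields[0], fields[2], int(fields[3])


# Made-up example records: read two against the transposon reference,
# read one against the (repeat-masked) genome reference.
read2_vs_transposon = [
    "@SQ\tSN:Tn5\tLN:5800",
    "r1/2\t0\tTn5\t100\t60\t36M\t*\t0\t0\t*\t*",
    "r2/2\t4\t*\t0\t0\t*\t*\t0\t0\t*\t*",
]
read1_vs_genome = [
    "@SQ\tSN:chr1\tLN:1000000",
    "r1/1\t0\tchr1\t5000\t60\t36M\t*\t0\t0\t*\t*",
    "r2/1\t16\tchr1\t7000\t60\t36M\t*\t0\t0\t*\t*",
    "r3/1\t4\t*\t0\t0\t*\t*\t0\t0\t*\t*",
]

transposon_ids = mapped_read_ids(read2_vs_transposon)
hits = list(filter_by_mate(read1_vs_genome, transposon_ids))
print(hits)
```

For step 5, the same two functions would be called with the read-two genome alignments and the read-one transposon alignments.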

There are more sophisticated strategies, but this works relatively well given adequate read depth.

-Harold

**giror** · 06-17-2011, 10:34 AM

Thanks, Harold.

This is generally the strategy I imagined. Unfortunately, I am on a Mac with 8 GB of RAM and a 1 TB hard drive, and I am not sure I could efficiently read through the entire BAM files, which are 51 and 71 GB. The reads have already been mapped back to the genome, but I'm not sure which parameters were used. Do you know of a way I could get this information from the BAM?

If not, could you recommend an alignment program given the hardware constraints that I am under?

**HESmith** · 06-17-2011, 10:55 AM

The approach I suggested would almost certainly require repeating the alignments. I don't know which aligner was used to generate your existing dataset, but the repeats were either masked (yielding no matches) or not (yielding multiple matches). Most aligners return only the unique matches, so either way the transposon reads would be missing.
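On the question of recovering the aligner and its parameters: the SAM/BAM format allows optional `@PG` header lines that record the program name (`PN`), version (`VN`), and command line (`CL`). Whether they are present depends on the tool that wrote the file. The header can be printed with `samtools view -H`, which does not require reading through the alignments. A minimal sketch of pulling out those records follows; the function name and the example header are made up for illustration.

```python
def program_records(header_lines):
    """Return one dict of tag -> value for each @PG line in a SAM header."""
    records = []
    for line in header_lines:
        if not line.startswith("@PG"):
            continue
        tags = {}
        for field in line.rstrip("\n").split("\t")[1:]:
            # Split on the first colon only, so values containing colons survive.
            key, _, value = field.partition(":")
            tags[key] = value
        records.append(tags)
    return records


# Hypothetical header text, as it might be printed by `samtools view -H`.
example_header = [
    "@HD\tVN:1.0\tSO:coordinate",
    "@SQ\tSN:chr1\tLN:1000000",
    "@PG\tID:bwa\tPN:bwa\tVN:0.5.9\tCL:bwa sampe ref.fa r1.sai r2.sai r1.fq r2.fq",
]

records = program_records(example_header)
for rec in records:
    print(rec.get("PN"), rec.get("VN"), rec.get("CL"))
```

If no `@PG` lines come back, the file simply does not carry that information and the parameters would have to be found elsewhere.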

Our aligners run on a server cluster, so I can't offer any software recommendations for your system. A cloud solution might be your best option.


Help picking up an abandoned sequencing project
