SEQanswers

Go Back   SEQanswers > Applications Forums > RNA Sequencing



Similar Threads
Thread Thread Starter Forum Replies Last Post
VCF to disease-causing variants Liam_Gallagher Bioinformatics 6 10-22-2013 01:40 PM
masking vcf for brain expressed variants willMD Bioinformatics 1 07-09-2013 10:57 AM
multiple vcf files to one multisampled vcf file Jetse Bioinformatics 2 06-27-2013 05:34 AM
Filtering VCF variants based on sequencing coverage elfuser Bioinformatics 0 02-19-2013 07:59 PM
Conservation scores for variants in VCF format Rubal7 Bioinformatics 0 05-16-2012 12:33 PM

Reply
 
Thread Tools
Old 02-07-2014, 06:47 AM   #1
atelford
Junior Member
 
Location: UK

Join Date: May 2013
Posts: 3
Default vcf files contain variants from a narrow range of comps from assembly

I have been playing around with vcf-tools, and have noticed something strange. When I used vcf-isec to create a complement vcf file containing variants only found in ALL my input files, the result is a list of variants found in components that are within a very narrow window (comps 22,450 to 69,079) whereas the full assembly contains 3,113,715 comps.

The vcf files from which I created the complement do contain a greater range of comps, but the vast majority also come from this window as explained above. Obviously when reducing the number of variants to those common in all files loses the comps at the edges.

I can't understand why all the common variants would come from such a narrow range of comps from the reference assembly. Is the assembly (which I did de novo in Trinity) arranged so that similar contigs are closer to one another in the assembly? Is there another explanation for why my variants seem to be restricted to a narrow range within the assembly?

Any advice or suggestions are very gratefully received!!
atelford is offline   Reply With Quote
Reply

Thread Tools

Posting Rules
You may not post new threads
You may not post replies
You may not post attachments
You may not edit your posts

BB code is On
Smilies are On
[IMG] code is On
HTML code is Off




All times are GMT -8. The time now is 06:03 PM.


Powered by vBulletin® Version 3.8.9
Copyright ©2000 - 2020, vBulletin Solutions, Inc.
Single Sign On provided by vBSSO