Getting a full annotation onto a consensus sequence in CLC Genomics Workbench

Dapip33

Junior Member

Join Date: Feb 2010

Posts: 5
#1

10-15-2010, 06:45 AM

Hi all,

I'm having a little problem with my analysis using CLC tools. I am trying to assemble Illumina reads from clinical isolates of P. aeruginosa and do SNP analysis between strains that have evolved over time. My problem is that I can do the assemblies just fine against a reference genome or de novo, but I cannot get good annotation files to do SNP analysis. Here's my procedure:

1. Assemble the early strain against an annotated reference genome. Copy the annotations onto the consensus sequence and save that as my Early genome.
2. Assemble the late strain against the Early genome, and then run a SNP report.
3. Get the SNP report and realize that the most useful comparator data is missing from the SNP table, specifically the PAxxx gene numbers used in the updated Pseudomonas annotations (www.pseudomonas.com).

When I look at the consensus sequence copied over from step #1, it looks like the PAxxx numbers are there, but the SNP report and the assembly in step 2 do not include them. Does anybody know how to get the full annotation from a reference genome onto a consensus in CLC Genomics Workbench?

Last edited by ECO; 10-15-2010, 06:46 AM. Reason: Fixed typo in link
Tags: annotation, clc
cement_head

Senior Member

Join Date: Mar 2012

Posts: 261
#2

09-19-2013, 07:02 AM

I'd ask the company (CLC) directly; sometimes the export options aren't obvious in the CLC products.
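
In the meantime, one possible workaround outside CLC (a rough sketch, not a CLC feature): export the SNP table and look up each SNP position against the gene coordinates from the reference annotation. The Python below assumes the consensus keeps the same coordinates as the reference (no indels shifting positions), which may not hold for your assemblies. The `Gene` record, the helper names, and the locus tags and coordinates in the example are all made-up placeholders, not real annotation data.

```python
import bisect
from typing import List, NamedTuple, Tuple


class Gene(NamedTuple):
    start: int      # 1-based, inclusive (assumed convention)
    end: int        # 1-based, inclusive
    locus_tag: str  # e.g. a PAxxx-style identifier


def build_index(genes: List[Gene]) -> Tuple[List[Gene], List[int]]:
    """Sort genes by start coordinate and return them with their start list."""
    genes_sorted = sorted(genes, key=lambda g: g.start)
    starts = [g.start for g in genes_sorted]
    return genes_sorted, starts


def locus_tags_at(pos: int, genes_sorted: List[Gene], starts: List[int]) -> List[str]:
    """Return the locus tags of every gene whose interval contains pos.

    Genes can overlap, so all genes starting at or before pos are checked.
    """
    idx = bisect.bisect_right(starts, pos)
    return [g.locus_tag for g in genes_sorted[:idx] if g.end >= pos]


if __name__ == "__main__":
    # Placeholder annotation; in practice these rows would come from the
    # reference annotation file (GenBank or GFF export).
    genes = [
        Gene(100, 500, "PA0001"),
        Gene(450, 900, "PA0002"),
        Gene(1200, 1500, "PA0003"),
    ]
    genes_sorted, starts = build_index(genes)

    # Placeholder SNP positions; in practice, a column of the exported SNP table.
    for snp_pos in (120, 470, 1000):
        tags = locus_tags_at(snp_pos, genes_sorted, starts)
        print(snp_pos, ",".join(tags) if tags else "intergenic")
```

If the consensus does contain insertions or deletions relative to the reference, the positions would need to be converted first, so treat this only as a starting point.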