**Brian Bushnell** · 03-16-2014, 10:15 AM

'M' means 'A' or 'C'.

http://www.bioinformatics.org/sms/iupac.html

I was under the impression that degenerate nucleotides other than N were not allowed in NCBI submissions, but maybe human is different.
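For reference, the IUPAC table linked above can be written down as a small lookup. This is only a minimal sketch in Python; the names `IUPAC` and `expand` are made up for illustration and do not come from any particular library.

```python
# Standard IUPAC nucleotide ambiguity codes, mapped to the bases they stand for.
# The dictionary and function names here are hypothetical, for illustration only.
IUPAC = {
    "A": "A", "C": "C", "G": "G", "T": "T",
    "R": "AG", "Y": "CT", "S": "CG", "W": "AT",
    "K": "GT", "M": "AC",
    "B": "CGT", "D": "AGT", "H": "ACT", "V": "ACG",
    "N": "ACGT",
}


def expand(code):
    """Return the bases a single IUPAC code can represent (case-insensitive)."""
    return IUPAC[code.upper()]


print(expand("M"))  # the bases that 'M' can stand for
```

So an 'M' in a reference means the base at that position is either A or C.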

**yunhuang** · 03-16-2014, 03:48 PM

Thanks for your reply.
I also downloaded the hg19 reference from http://hgdownload.cse.ucsc.edu/golde...Zips/hg19.2bit
I converted it to FASTA format, and it contained no nucleotide codes other than A, C, G, T, and N. Maybe human_g1k is different.
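One way to run this kind of check is to tally every symbol in the sequence lines and report anything outside A, C, G, T, and N. The following is a rough sketch rather than the exact procedure used here; the function names are hypothetical, and it assumes a plain-text (uncompressed) FASTA file.

```python
from collections import Counter


def count_symbols(lines):
    """Tally sequence characters in FASTA lines, skipping '>' header lines."""
    counts = Counter()
    for line in lines:
        if line.startswith(">"):
            continue
        # Upper-case so soft-masked (lower-case) bases are counted together.
        counts.update(line.strip().upper())
    return counts


def unexpected_symbols(counts, allowed="ACGTN"):
    """Return only the symbols that are not in the allowed alphabet."""
    return {sym: n for sym, n in counts.items() if sym not in allowed}


# Tiny in-memory example; a real file would be passed as an open file handle.
example = [">chr_demo\n", "ACGTM\n", "nnMa\n"]
print(unexpected_symbols(count_symbols(example)))
```

For a real reference, the same functions can be given an open file object, for example `count_symbols(open("human_g1k_v37.fasta"))`, since iterating over a file yields its lines one at a time.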

**mbblack** · 04-07-2014, 11:55 AM

The TRACE repository is old and out of date; those files date from 2009/2010. They are what was to be used as the reference genome for the build-out of the 1000 Genomes Project data.

What you pulled from UCSC is derived from the files in ftp://ftp.ncbi.nlm.nih.gov/genomes/H_sapiens/ (the current, up-to-date build of hg19), where the ambiguous calls have been edited by the curation staff.

**yunhuang** · 04-07-2014, 05:59 PM

Got it, thanks for your guidance.

'M' in human_g1k_v37.fasta, is that normal