SEQanswers > Bioinformatics

08-06-2014, 08:19 AM   #1
Carrot Scientist
Location: Madison WI USA

Join Date: Nov 2009
Posts: 42
Definition of Scaftig

There appears to be no clear definition of the term "scaftig".
How do you use this term? Is there a good definition somewhere?

I think it should be clearly distinguished from the term "contig", and I would define it as
"All portions of a final assembly consisting of contiguous sequence, with sequences split at every occurrence of gaps of unknown bases (Ns)."

For example, if my final assembly is

My scaftigs are

Additionally, you could use the term "scaffold scaftigs" if you wanted to make clear that the left-over contigs are not to be included in the set of scaftigs.
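
To illustrate the definition above, here is a minimal Python sketch using made-up sequences (these are not the sequences from my actual assembly). The function names, the `scaffolds_only` switch, the `_1`, `_2` naming of pieces, and the choice to treat lowercase `n` as a gap are all assumptions made for the illustration.

```python
import re

# A gap is a run of one or more unknown bases.
# Matching lowercase "n" as well is an assumption of this sketch.
GAP = re.compile(r"[Nn]+")


def split_scaftigs(sequence):
    """Split one assembled sequence at every run of Ns, dropping empty pieces."""
    return [piece for piece in GAP.split(sequence) if piece]


def scaftigs(assembly, scaffolds_only=False):
    """Return {name: sequence} for every scaftig in an assembly.

    assembly: dict mapping sequence name -> sequence string.
    scaffolds_only: if True, skip sequences that contain no Ns, i.e. the
    "scaffold scaftigs" reading that leaves out the left-over contigs.
    """
    result = {}
    for name, sequence in assembly.items():
        if scaffolds_only and GAP.search(sequence) is None:
            continue
        for index, piece in enumerate(split_scaftigs(sequence), start=1):
            result[f"{name}_{index}"] = piece
    return result


# Made-up sequences, for illustration only.
assembly = {
    "scaffold1": "ACGTACGTNNNNNGGCATTNNTTAG",
    "contig7": "GATTACA",
}

print(scaftigs(assembly))
print(scaftigs(assembly, scaffolds_only=True))
```

With `scaffolds_only=False` the left-over contig is kept whole as a single scaftig; with `scaffolds_only=True` only the pieces of `scaffold1` are returned.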

I have found only a few, somewhat inconsistent, definitions. In order of publication:
  1. "A scaftig refers to a continuous sequence formed by multiple initial contigs lined up in a scaffold with putative sequence overlaps."
    State of the art de novo assembly of human genomes from massively parallel sequencing data
    Hum Genomics. 2010; 4(4): 271–277. (April 1, 2010)
    Can a sequence that is not in a scaffold be considered a scaftig? This definition implies that it cannot.
  2. "New word of the day: #scaftigs RT @assemblathon 'scaftigs' intra-scaffold gaps between contigs. #gaw"
    Twitter (March 15, 2011)
    This strangely seems to define the gaps rather than the sequence. I include this because there are few definitions to be found.
  3. "scaftigs can be constructed by extracting the contiguous sequences that lack unknown bases (Ns)."
    Mende DR, Waller AS, Sunagawa S, Järvelin AI, Chan MM, et al. (2012) Assessment of Metagenomic Assembly Using Simulated Next Generation Sequencing Data. PLoS ONE 7(2): e31386. doi:10.1371/journal.pone.0031386 (February 23, 2012)
    This is closest to my definition.
  4. "The resulting high-quality reads were assembled into scaftigs using SOAPdenovo 1.05 and genes predicted on scaftigs longer than 500 nt using MetaGeneMark v1.0"
    Country-specific antibiotic use practices impact the human gut resistome
    Genome Res. 2013. 23: 1163-1169 (April 8, 2013)
    Supplementary Materials and Methods
    I would call these contigs, since they are the primary product of an assembly.
  5. ABySS version 1.5.0 introduced a command (May 1, 2014)
    "New command, `scaftigs`. Breaks scaffold sequences at 'N's and produce a scaftigs.fa file."

Other references that I found do not define the term.
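
Definitions 3 and 5 both amount to breaking scaffold sequences at Ns, and definition 4 additionally keeps only scaftigs longer than a length cutoff (500 nt there). As a rough sketch of that workflow on FASTA text: this is not the actual implementation in ABySS or SOAPdenovo, and `read_fasta`, `write_scaftigs`, `min_length`, and the output naming are hypothetical names chosen for the example.

```python
import io
import re


def read_fasta(handle):
    """Yield (name, sequence) pairs from FASTA-formatted lines."""
    name, chunks = None, []
    for line in handle:
        line = line.strip()
        if line.startswith(">"):
            if name is not None:
                yield name, "".join(chunks)
            # Use the first word of the header as the name ("" if header is empty).
            name, chunks = (line[1:].split() or [""])[0], []
        elif line and name is not None:
            chunks.append(line)
    if name is not None:
        yield name, "".join(chunks)


def write_scaftigs(in_handle, out_handle, min_length=0):
    """Break each record at runs of N; write pieces longer than min_length.

    Returns the number of scaftigs written.
    """
    count = 0
    for name, sequence in read_fasta(in_handle):
        pieces = [p for p in re.split(r"[Nn]+", sequence) if p]
        for index, piece in enumerate(pieces, start=1):
            if len(piece) > min_length:
                out_handle.write(f">{name}_{index}\n{piece}\n")
                count += 1
    return count


# Made-up input; a real run would pass open file handles and min_length=500.
fasta = io.StringIO(">scaffold1 made-up\nACGTACGTNN\nNNNGGCATT\n>contig7\nGATTACA\n")
out = io.StringIO()
written = write_scaftigs(fasta, out, min_length=6)
print(written)
print(out.getvalue(), end="")
```

Note that a gap spanning a line break in the FASTA file is handled correctly here, because each record is joined into one string before it is split.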
dsenalik
08-19-2014, 07:17 AM   #2
Junior Member
Location: Berlin

Join Date: Aug 2012
Posts: 2

If I understand correctly, scaftigs are the contigs (contiguous sequences) from scaffolds, right? (That is, the same set as the original contigs, minus those that are not included in any scaffold.)
vitorpiro
