SlackBuilds Repository

15.0 > Academic > clark-ugene (git_6917bf9)

This is Ugene's ( fork of the CLARK tool
(, with supports building DB directly from
gzip & 7z packed RefSeq files

CLARK: CLAssifier based on Reduced K-mers

The problem of DNA sequence classification is central to several
application domains in molecular biology, genomics, metagenomics and
genetics. The problem is computationally challenging due to the size of
datasets generated by modern sequencing instruments and the growing size
of reference sequence databases.

CLARK is a novel method for supervised sequence classification based on
discriminative k-mers. Somewhat unique among other metagenomic and
genomic classification methods, CLARK provides a confidence score for
its assignments which can be used in downstream analysis. The utility of
CLARK is demonstrated on two distinct specific classification problems:

1) the assignment of metagenomic reads to known bacterial genomes
2) the assignment of BAC clones and transcript to chromosome arms (in
the absence of a finished assembly for the reference genome).

Three classifiers or variants in the CLARK framework are provided :
CLARK (default): created for powerful workstation, it may require a
significant amount of RAM to run with large database (e.g., all
bacterial genomes from NCBI/RefSeq). This classifier queries k-mers
with exact matching.

CLARK-l (light): created for workstations with limited memory, this
software tool provides precise classification on small metagenomes.
Indeed, for metagenomics analysis, CLARK-l works with a sparse or
"light" database (up to 4 GB of RAM) that is built using distant and
non-overlapping k-mers. This classifier queries k-mers with exact

CLARK-S (spaced): created for powerful workstation exploiting spaced k-
mers, this classifier requires a higher RAM usage than CLARK or CLARK-l,
but it does offer a higher sensitivity. CLARK-S completes the CLARK
series of classifiers.

Maintained by: Petar Petrov
Keywords: sequence classification,k-mer
ChangeLog: clark-ugene


Source Downloads (64bit):
clark-6917bf93df9a8c2b80155db367f6672c1b143d9b.tar.gz (c777c1ec34816c145bcd0a40b72cb69b)

Download SlackBuild:
clark-ugene.tar.gz.asc (FAQ)

(the SlackBuild does not include the source)

Individual Files:

Validated for Slackware 15.0

See our HOWTO for instructions on how to use the contents of this repository.

Access to the repository is available via:
ftp git cgit http rsync

© 2006-2022 Project. All rights reserved.
Slackware® is a registered trademark of Patrick Volkerding
Linux® is a registered trademark of Linus Torvalds