summaryrefslogtreecommitdiffstats
path: root/academic/jellyfish_k-mer/README
blob: 7988cded9ce144bb50f6bfd4cd1cb4500755df6c (plain)
Jellyfish is a tool for fast, memory-efficient counting of k-mers in
DNA. A k-mer is a substring of length k, and counting the occurrences of
all such substrings is a central step in many analyses of DNA sequence.
Jellyfish can count k-mers quickly by using an efficient encoding of a
hash table and by exploiting the "compare-and-swap" CPU instruction to
increase parallelism.

Jellyfish is a command-line program that reads FASTA and multi-FASTA
files containing DNA sequences. It outputs its k-mer counts in an binary
format, which can be translated into a human-readable text format using
the "jellyfish dump" command. See the documentation below for more
details.