summaryrefslogtreecommitdiffstats
path: root/python/PyStemmer/README
diff options
context:
space:
mode:
author Nikos Giotis2017-03-05 05:02:08 +0100
committer Willy Sudiarto Raharjo2017-03-05 05:02:08 +0100
commit66f974475ff3c17432c32653626a225cade03294 (patch)
treec5a9b2e4dc70d7995d20dc3e6c921fee720c84cc /python/PyStemmer/README
parent847c66a1c5036590b10db27f936220a5f6cdcdf8 (diff)
downloadslackbuilds-66f974475ff3c17432c32653626a225cade03294.tar.gz
python/PyStemmer: Added (Snowball stemming algorithms).
Signed-off-by: Willy Sudiarto Raharjo <willysr@slackbuilds.org>
Diffstat (limited to 'python/PyStemmer/README')
-rw-r--r--python/PyStemmer/README18
1 files changed, 18 insertions, 0 deletions
diff --git a/python/PyStemmer/README b/python/PyStemmer/README
new file mode 100644
index 0000000000..161b13c630
--- /dev/null
+++ b/python/PyStemmer/README
@@ -0,0 +1,18 @@
+Snowball stemming algorithms, for information retrieval
+
+Stemming algorithms
+
+PyStemmer provides access to efficient algorithms for calculating a "stemmed"
+form of a word. This is a form with most of the common morphological endings
+removed; hopefully representing a common linguistic base form. This is most
+useful in building search engines and information retrieval software;
+for example, a search with stemming enabled should be able to find a document
+containing "cycling" given the query "cycles".
+
+PyStemmer provides algorithms for several (mainly european) languages, by
+wrapping the libstemmer library from the Snowball project in a Python module.
+
+It also provides access to the classic Porter stemming algorithm for english:
+although this has been superceded by an improved algorithm, the original
+algorithm may be of interest to information retrieval researchers wishing
+to reproduce results of earlier experiments.