summaryrefslogtreecommitdiffstats
path: root/python/w3lib/README
diff options
context:
space:
mode:
Diffstat (limited to 'python/w3lib/README')
-rw-r--r--python/w3lib/README10
1 files changed, 10 insertions, 0 deletions
diff --git a/python/w3lib/README b/python/w3lib/README
new file mode 100644
index 0000000000..b2e25c80c4
--- /dev/null
+++ b/python/w3lib/README
@@ -0,0 +1,10 @@
+This is a Python library of web-related functions, such as:
+
+remove comments, or tags from HTML snippets
+extract base url from HTML snippets
+translate entites on HTML strings
+convert raw HTTP headers to dicts and vice-versa
+construct HTTP auth header
+converting HTML pages to unicode
+sanitize urls (like browsers do)
+extract arguments from urls