HTML cleaner from lxml project
Project description
lxml_html_clean
Motivation
This project was initially a part of lxml. Because HTML cleaner is designed as blocklist-based, many reports about possible security vulnerabilities were filed for lxml and that make the project problematic for security-sensitive environments. Therefore we decided to extract the problematic part to a separate project.
Installation
You can install this project directly via pip install lxml_html_clean
or soon as an extra of lxml
via pip install lxml[html_clean]
. Both ways installs this project together with lxml itself.
Documentation
https://lxml-html-clean.readthedocs.io/
License
BSD-3-Clause
Project details
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
lxml_html_clean-0.1.0.tar.gz
(14.0 kB
view hashes)
Built Distribution
Close
Hashes for lxml_html_clean-0.1.0-py3-none-any.whl
Algorithm | Hash digest | |
---|---|---|
SHA256 | f21a33d279eb3bddd336ebfa0c73d3d5b359dbfa8113014f7d1f2d8738fdc305 |
|
MD5 | be40b85b2920c3302161090ec9d1a26c |
|
BLAKE2b-256 | f93612f319a5cb41b0d1ced556d175a3ae878193fd1c769038dfb66fae6d2e89 |