Index index by Group index by Distribution index by Vendor index by creation date index by Name Mirrors Help Search

python311-lxml_html_clean-0.4.3-1.1 RPM for noarch

From OpenSuSE Ports Tumbleweed for noarch

Name: python311-lxml_html_clean Distribution: openSUSE Tumbleweed
Version: 0.4.3 Vendor: openSUSE
Release: 1.1 Build date: Fri Oct 10 07:38:18 2025
Group: Unspecified Build host: reproducible
Size: 108996 Source RPM: python-lxml_html_clean-0.4.3-1.1.src.rpm
Packager: http://bugs.opensuse.org
Url: https://github.com/fedora-python/lxml_html_clean/
Summary: HTML cleaner from lxml project
Separate project for HTML cleaning functionalities copied from lxml.html.clean.

Provides

Requires

License

BSD-3-Clause

Changelog

* Fri Oct 10 2025 Steve Kowalik <steven.kowalik@suse.com>
  - Update to 0.4.3:
    * Tests updated to work correctly with new lxml and libxml2 releases.
    * Python 3.6 and 3.7 are no longer tested.
* Fri Apr 11 2025 Dirk Müller <dmueller@suse.com>
  - update to 0.4.2:
    * lxml_html_clean now correctly handles HTML input as bytes as
      it did before the 0.2.0 release.
* Thu Nov 21 2024 ecsos <ecsos@opensuse.org>
  - Update to 0.4.1
    * Bugs fixed
    - Removed superfluous debug prints.
  - Changes from 0.4.0
    * Bugs fixed
    - The Cleaner() now scans for hidden JavaScript code embedded
      within CSS comments. In certain contexts, such as within
      <svg> or <math> tags, <style> tags may lose their intended
      function, allowing comments like /* foo */ to potentially be
      executed by the browser. If a suspicious content is detected,
      only the comment is removed.
  - Changes from 0.3.1
    * Features added
    - Do not parse URL addresses when it is not necessary.
  - Changes from 0.3.0
    * Features added
    - Parsing of URL addresses has been enhanced and Cleaner
      removes ambiguous URLs.
  - Changes from 0.2.2
    * Bugs fixed
    - sdist now includes all test files and changelog.
  - Changes from 0.2.1
    * Bugs fixed
    - Memory efficiency is now much better for HTML pages where
      cleaner removes a lot of elements. (#14)
  - Changes from 0.2.0
    * Features added
    - ASCII control characters (except HT, VT, CR and LF) are now
      removed from string inputs before they're parsed by lxml/libxml2.
  - Fix boo#1233541
* Sun Jun 23 2024 ecsos <ecsos@opensuse.org>
  - Initial version 0.1.1

Files

/usr/lib/python3.11/site-packages/lxml_html_clean
/usr/lib/python3.11/site-packages/lxml_html_clean-0.4.3.dist-info
/usr/lib/python3.11/site-packages/lxml_html_clean-0.4.3.dist-info/INSTALLER
/usr/lib/python3.11/site-packages/lxml_html_clean-0.4.3.dist-info/METADATA
/usr/lib/python3.11/site-packages/lxml_html_clean-0.4.3.dist-info/RECORD
/usr/lib/python3.11/site-packages/lxml_html_clean-0.4.3.dist-info/REQUESTED
/usr/lib/python3.11/site-packages/lxml_html_clean-0.4.3.dist-info/WHEEL
/usr/lib/python3.11/site-packages/lxml_html_clean-0.4.3.dist-info/licenses
/usr/lib/python3.11/site-packages/lxml_html_clean-0.4.3.dist-info/licenses/LICENSE.txt
/usr/lib/python3.11/site-packages/lxml_html_clean-0.4.3.dist-info/top_level.txt
/usr/lib/python3.11/site-packages/lxml_html_clean/__init__.py
/usr/lib/python3.11/site-packages/lxml_html_clean/__init__.pyi
/usr/lib/python3.11/site-packages/lxml_html_clean/__pycache__
/usr/lib/python3.11/site-packages/lxml_html_clean/__pycache__/__init__.cpython-311.opt-1.pyc
/usr/lib/python3.11/site-packages/lxml_html_clean/__pycache__/__init__.cpython-311.pyc
/usr/lib/python3.11/site-packages/lxml_html_clean/__pycache__/clean.cpython-311.opt-1.pyc
/usr/lib/python3.11/site-packages/lxml_html_clean/__pycache__/clean.cpython-311.pyc
/usr/lib/python3.11/site-packages/lxml_html_clean/clean.py
/usr/lib/python3.11/site-packages/lxml_html_clean/clean.pyi
/usr/lib/python3.11/site-packages/lxml_html_clean/py.typed
/usr/share/doc/packages/python311-lxml_html_clean
/usr/share/doc/packages/python311-lxml_html_clean/README.md
/usr/share/licenses/python311-lxml_html_clean
/usr/share/licenses/python311-lxml_html_clean/LICENSE.txt


Generated by rpm2html 1.8.1

Fabrice Bellet, Thu Oct 23 22:37:43 2025