Skip to content

⬆ Bump unstructured from 0.16.23 to 0.21.5#8

Open
dependabot[bot] wants to merge 1 commit into
developmentfrom
dependabot/uv/unstructured-0.21.5
Open

⬆ Bump unstructured from 0.16.23 to 0.21.5#8
dependabot[bot] wants to merge 1 commit into
developmentfrom
dependabot/uv/unstructured-0.21.5

Conversation

@dependabot
Copy link
Copy Markdown

@dependabot dependabot Bot commented on behalf of github Mar 8, 2026

Bumps unstructured from 0.16.23 to 0.21.5.

Release notes

Sourced from unstructured's releases.

0.21.5

What's Changed

New Contributors

Full Changelog: Unstructured-IO/unstructured@0.21.2...0.21.5

0.21.2

No release notes provided.

0.21.1

What's Changed

Full Changelog: Unstructured-IO/unstructured@0.21.0...0.21.1

0.21.0

Fixes

  • Replace NLTK with spaCy to remediate CVE-2025-14009: NLTK's downloader uses zipfile.extractall() without path validation, enabling RCE via malicious packages (CVSS 10.0, no patch available). spaCy models install as pip packages, eliminating the vulnerable downloader entirely.

0.20.8

What's Changed

Full Changelog: Unstructured-IO/unstructured@0.20.6...0.20.8

0.20.6

What's Changed

New Contributors

Full Changelog: Unstructured-IO/unstructured@0.20.1...0.20.6

0.20.2

Release 0.20.2

... (truncated)

Changelog

Sourced from unstructured's changelog.

0.21.5

Fixes

  • Lower the requirement for pdfminer.six to >=20251230

0.21.4

Enhancements

  • Add a github action for testing time regressions

0.21.3

Enhancements

  • Custom fallback for language detection (fixes #4091): Add optional language_fallback callable for short ASCII text (e.g. when detection would default to English). Callable receives the text and may return a list of ISO 639-3 codes or None to leave language unspecified; return value is validated and invalid entries are filtered out. language_fallback is passed through partition(), PDF/image partitioners, and partition_html; partition_md now accepts languages (use [""] to disable detection). Language-related parameters (languages, detect_language_per_element, language_fallback) are documented as top-level options and exposed explicitly on partition_html.

0.21.2

Fixes

  • Self-install pinned spaCy model at runtime with SHA256 verification: Replace the en-core-web-sm direct URL dependency in pyproject.toml with the installer library. The spaCy model is now downloaded and installed on first use with hash verification, removing the need for [tool.uv.sources] and making the install more portable.

0.21.1

  • Bump version to create a new release

0.21.0

Fixes

  • Replace NLTK with spaCy to remediate CVE-2025-14009: NLTK's downloader uses zipfile.extractall() without path validation, enabling RCE via malicious packages (CVSS 10.0, no patch available). spaCy models install as pip packages, eliminating the vulnerable downloader entirely.

0.20.8

Fixes

  • downgrade wrapt so it is compatible with opentelemetry-instrumentation-httpx
  • resolve lock issue with windows and python 3.13

0.20.7

Fixes

  • Cap size when decompressing elements JSON file: Prevents situations where decompression can consume an arbitrarily large portion in memory and on the filesystem.

0.20.6

Fixes

  • fix: remap parent id after hashing to preserve right reference

0.20.5

Fixes

  • Gracefully handle invalid text_as_html during chunking: _TableChunker now catches parse errors (e.g. lxml.etree.ParserError when text_as_html contains a markdown code-fence like ```html\n) and returns None instead of raising, allowing chunking to continue using plain-text fallback. A WARNING log is emitted with a truncated preview of the offending value.

... (truncated)

Commits

Dependabot compatibility score

Dependabot will resolve any conflicts with this PR as long as you don't alter it yourself. You can also trigger a rebase manually by commenting @dependabot rebase.


Dependabot commands and options

You can trigger Dependabot actions by commenting on this PR:

  • @dependabot rebase will rebase this PR
  • @dependabot recreate will recreate this PR, overwriting any edits that have been made to it
  • @dependabot show <dependency name> ignore conditions will show all of the ignore conditions of the specified dependency
  • @dependabot ignore this major version will close this PR and stop Dependabot creating any more for this major version (unless you reopen the PR or upgrade to it yourself)
  • @dependabot ignore this minor version will close this PR and stop Dependabot creating any more for this minor version (unless you reopen the PR or upgrade to it yourself)
  • @dependabot ignore this dependency will close this PR and stop Dependabot creating any more for this dependency (unless you reopen the PR or upgrade to it yourself)

Bumps [unstructured](https://github.com/Unstructured-IO/unstructured) from 0.16.23 to 0.21.5.
- [Release notes](https://github.com/Unstructured-IO/unstructured/releases)
- [Changelog](https://github.com/Unstructured-IO/unstructured/blob/main/CHANGELOG.md)
- [Commits](Unstructured-IO/unstructured@0.16.23...0.21.5)

---
updated-dependencies:
- dependency-name: unstructured
  dependency-version: 0.21.5
  dependency-type: direct:production
  update-type: version-update:semver-minor
...

Signed-off-by: dependabot[bot] <support@github.com>
@dependabot dependabot Bot added dependencies Pull requests that update a dependency file python:uv Pull requests that update python:uv code labels Mar 8, 2026
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

dependencies Pull requests that update a dependency file python:uv Pull requests that update python:uv code

Projects

None yet

Development

Successfully merging this pull request may close these issues.

0 participants