CVE-2025-6638: Hugging Face Transformers is vulnerable to ReDoS through its MarianTokenizer
(updated )
A Regular Expression Denial of Service (ReDoS) vulnerability was discovered in the Hugging Face Transformers library, specifically affecting the MarianTokenizer’s remove_language_code() method. This vulnerability is present in version 4.52.4 and has been fixed in version 4.53.0. The issue arises from inefficient regex processing, which can be exploited by crafted input strings containing malformed language code patterns, leading to excessive CPU consumption and potential denial of service.
References
- github.com/advisories/GHSA-59p9-h35m-wg4g
- github.com/huggingface/transformers
- github.com/huggingface/transformers/commit/47c34fba5c303576560cb29767efb452ff12b8be
- github.com/huggingface/transformers/commit/d37f7517972f67e3f2194c000ed0f87f064e5099
- huntr.com/bounties/6a6c933f-9ce8-4ded-8b3b-2c1444c61f36
- nvd.nist.gov/vuln/detail/CVE-2025-6638
Code Behaviors & Features
Detect and mitigate CVE-2025-6638 with GitLab Dependency Scanning
Secure your software supply chain by verifying that all open source dependencies used in your projects contain no disclosed vulnerabilities. Learn more about Dependency Scanning →