Skip to content

Pull requests: huggingface/tokenizers

Author
Filter by author
Loading
Label
Filter by label
Loading
Use alt + click/return to exclude labels
or + click/return for logical OR
Projects
Filter by project
Loading
Milestones
Filter by milestone
Loading
Reviews
Assignee
Filter by who’s assigned
Assigned to nobody Loading
Sort

Pull requests list

Fix identity comparison in toctree_tags.py
#1949 opened Feb 13, 2026 by llukito Loading…
Update to PyO3 0.28 to automatically disable GIL
#1948 opened Feb 11, 2026 by ngoldbaum Loading…
Add get_special_tokens and is_special_token methods
#1945 opened Feb 5, 2026 by ArthurZucker Loading…
2 tasks done
Add post_process_tokens and post_process_ids methods
#1944 opened Feb 5, 2026 by ArthurZucker Loading…
3 tasks done
feat: add unk_token property to Unigram model
#1943 opened Feb 5, 2026 by ArthurZucker Loading…
4 tasks done
feat: add role_to_token field for special token metadata
#1942 opened Feb 5, 2026 by ArthurZucker Loading…
5 tasks done
Fix broken source links in documentation
#1934 opened Jan 21, 2026 by Shivam-Bhardwaj Loading…
fix: added type hints in .py files
#1932 opened Jan 20, 2026 by ashmi8 Loading…
Include license file into python wheels
#1931 opened Jan 20, 2026 by justeph Loading…
feat: add progress_format option for machine-readable JSON output
#1921 opened Dec 26, 2025 by podarok Loading…
6 tasks done
Upgrade GitHub Actions for Node 24 compatibility
#1916 opened Dec 20, 2025 by salmanmkc Loading…
Fix undefined names in docs/source/_ext/entities.py
#1895 opened Nov 28, 2025 by cclauss Loading…
Python: Add ruff rules for asyncio and performance
#1894 opened Nov 28, 2025 by cclauss Loading…
Implement Append normalizer
#1893 opened Nov 28, 2025 by ArthurZucker Loading…
Mark Python tests that need network access
#1872 opened Oct 2, 2025 by gordonmessmer Loading…
feat: whitespace optimize Feature Request
#1841 opened Aug 6, 2025 by b00f Loading…
Unused Unicode Character Filter
#1832 opened Jul 23, 2025 by sanderland Loading…
Add enforce_utf8_boundaries option to BpeTrainer
#1830 opened Jul 22, 2025 by sanderland Loading…
Faster Whitespace PreTokenizer (Drop-in Replacement)
#1822 opened Jul 7, 2025 by 8ria Loading…
ProTip! Type g p on any issue or pull request to go back to the pull request listing page.