feat(fetchers): enhance ArXivFetcher with PDF binary indication by chaliy · Pull Request #89 · everruns/fetchkit

chaliy · 2026-04-03T02:58:06Z

What

Enhance ArXivFetcher with binary content indication for PDF URLs.

Why

Closes #57. When agents request /pdf/ URLs, the fetcher should indicate that the original content is binary (PDF) and that only metadata is returned, consistent with the core binary handling behavior.

How

Added is_pdf_url() helper to detect /pdf/ vs /abs/ URLs
Added binary content note in metadata section for PDF URLs
Added tests for PDF detection, DOI/journal ref extraction
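
The PR text does not include the diff, so the following is only a minimal sketch of how the /pdf/ vs /abs/ detection and the binary note could work. The signatures, the `binary_note` helper, and the note wording are assumptions for illustration, not the code merged in this PR.

```rust
/// Hypothetical sketch of an `is_pdf_url` helper: returns true when the URL
/// path starts with `/pdf/` (an arXiv PDF link) rather than `/abs/`.
/// Uses only the standard library; the real helper may use a URL parser.
fn is_pdf_url(url: &str) -> bool {
    // Drop the scheme ("https://") if one is present.
    let rest = url.split_once("://").map_or(url, |(_, r)| r);
    // Drop any query string or fragment so they cannot affect the match.
    let rest = rest
        .split(|c: char| c == '?' || c == '#')
        .next()
        .unwrap_or(rest);
    // The path begins at the first '/' after the host.
    match rest.find('/') {
        Some(i) => rest[i..].starts_with("/pdf/"),
        None => false,
    }
}

/// Hypothetical helper: the informational note appended to the metadata
/// section for PDF URLs. The wording here is assumed, not taken from the PR.
fn binary_note(url: &str) -> Option<&'static str> {
    if is_pdf_url(url) {
        Some("Note: original content is binary (PDF); only metadata is returned.")
    } else {
        None
    }
}
```

Under these assumptions, `binary_note("https://arxiv.org/pdf/2301.00001")` yields a note, while the matching `/abs/` URL yields `None`, which keeps the change purely informational as described under Risk.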

Risk

Low
Only adds informational note to output for PDF URLs

Checklist

Unit tests pass
Smoke tests pass
Specs are up to date and not in conflict

- Add is_pdf_url() to detect /pdf/ URLs
- Show binary content note for PDF URLs (metadata-only response)
- Add tests for PDF URL detection, DOI/journal extraction
- Verify ar5iv HTML link is included in output

Closes #57

feat(fetchers): enhance ArXivFetcher with PDF binary indication

b0eed12


chaliy merged commit 9ce3234 into main Apr 3, 2026
11 checks passed

chaliy deleted the fix/issue-57-arxiv-fetcher branch April 3, 2026 03:04

chaliy merged 1 commit into main from fix/issue-57-arxiv-fetcher
