feat(filesystem): add PageIndex FileSystem and PIFS CLI#302
Open
BukeLy wants to merge 50 commits into
Open
Conversation
274af6c to
d7d3cb8
Compare
Remove the synchronous=OFF pragma from PIFS catalog inserts so SQLite remains the durable source of truth.
Route default semantic search to the summary projection when summary is the only populated semantic channel.
Only use the fresh event loop fallback for missing running-loop detection, so RuntimeError from a threaded agent run is not retried.
Raise on summary projection dimension mismatch instead of resetting an existing index.
Do not emit source-file grep fallback candidates unless an actual source line matches the query.
Avoid eager optional dependency imports when importing PageIndexFileSystem or filesystem semantic exports.
Resolve root virtual file paths correctly and raise a clear error for ambiguous file targets.
Escape wildcard characters in recursive folder LIKE filters and metadata contains queries.
Persist PageIndex tree build failure details in metadata_status and surface them through stat and structural reads.
Write projection and raw side effects only after a successful catalog insert, and clean owned artifacts when registration fails.
d7d3cb8 to
346eb0a
Compare
76a3b62 to
c13cb20
Compare
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
Add this suggestion to a batch that can be applied as a single commit.This suggestion is invalid because no changes were made to the code.Suggestions cannot be applied while the pull request is closed.Suggestions cannot be applied while viewing a subset of changes.Only one suggestion per line can be applied in a batch.Add this suggestion to a batch that can be applied as a single commit.Applying suggestions on deleted lines is not supported.You must change the existing code in this line in order to create a valid suggestion.Outdated suggestions cannot be applied.This suggestion has been applied or marked resolved.Suggestions cannot be applied from pending reviews.Suggestions cannot be applied on multi-line comments.Suggestions cannot be applied while the pull request is queued to merge.Suggestion cannot be applied right now. Please check back later.
Summary
This PR adds the first PageIndex FileSystem (PIFS) MVP: a PageIndex-backed virtual filesystem, a shell-like
pifsCLI, an agent chat/ask loop over a workspace, and an example demo that registers documents into a PIFS workspace and retrieves long-document evidence through PageIndex structure/page/node reads.Why
PIFS gives agents a concrete filesystem-style interface for browsing, filtering, searching, and reading PageIndex documents. It keeps workspace catalog metadata, PageIndex extraction artifacts, semantic recall, and folder projections as product concepts instead of benchmark-only scripts.
What Changed
pageindex.filesystemcore model, SQLite workspace store, metadata policy/status handling, and summary projection indexing.pifsCLI withls,tree,find,grep,stat,cat,search-summary,chat, andaskworkflows.metadata_statusfor provenance/state.examples/pifs_demo.pyand a reusableexamples/pifs_workspacedemo flow for agent retrieval overexamples/documents.Verification
examples/pifs_workspace.tests/test_pageindex_filesystem_scope.py,tests/test_pifs_cli.py,tests/test_pageindex_structural_read.py, andtests/test_pifs_find_maxdepth.py.