Custom prompt configuration feature and Pyright fixes #297
Thirunayan22 merged 37 commits into buerokratt:wip from
Conversation
Pulling changes from Burokratt WIP to rootcodelabs/RAG-Module wip
Get update from RAG-201-Fix into encrypt-llm-keys
update cron manager vault script
Streaming response formatting
Encrypt llm keys
…G-Module into encrypt-llm-keys
Sync rootcodelabs/RAG-Module wip with buerokratt/RAG-Module wip
… encrypt-llm-keys
…G-Module into encrypt-llm-keys
Refactor docker-compose-ec2.yml file with new vault agent containers
Pull request overview
Adds support for centrally managed “custom prompt instructions” loaded from Ruuter/Resql and applied during response generation, alongside several Pyright/type-safety fixes across the codebase.
Changes:
- Introduces a `PromptConfigurationLoader` with TTL caching + retry logic and wires it into `LLMOrchestrationService` + `ResponseGeneratorAgent` (a minimal sketch of the caching/retry pattern follows this list).
- Adds an API + DSL flow to force-refresh the prompt cache after admin updates (`POST /prompt-config/refresh` and a Ruuter save hook).
- Fixes/adjusts typing and runtime contracts (Pyright) in metrics, guardrails streaming/provider registration, and contextual retrieval.
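The loader itself is not reproduced in this overview. A minimal sketch of the TTL-cache-plus-retry pattern it describes, assuming an HTTP endpoint that returns a `custom_instructions` field (the endpoint shape, field name, and retry/TTL defaults are illustrative, not the PR's actual code):

```python
# Hypothetical sketch of a TTL-cached prompt-configuration loader.
# Endpoint URL, response field names, and retry/TTL values are
# assumptions, not the PR's actual implementation.
import time
from typing import Optional

import requests


class PromptConfigurationLoader:
    def __init__(self, endpoint: str, cache_ttl_seconds: int = 300, max_retries: int = 3):
        self.endpoint = endpoint
        self.cache_ttl_seconds = cache_ttl_seconds
        self.max_retries = max_retries
        self._cached_prefix: Optional[str] = None
        self._cached_at: float = 0.0

    def get_custom_instructions(self, force_refresh: bool = False) -> str:
        """Return the cached prompt prefix, refetching when the TTL expires."""
        expired = (time.monotonic() - self._cached_at) > self.cache_ttl_seconds
        if force_refresh or expired or self._cached_prefix is None:
            self._cached_prefix = self._fetch_with_retry()
            self._cached_at = time.monotonic()
        return self._cached_prefix

    def _fetch_with_retry(self) -> str:
        for attempt in range(self.max_retries):
            try:
                response = requests.post(self.endpoint, json={}, timeout=10)
                response.raise_for_status()
                # Field name "custom_instructions" is a placeholder.
                return response.json().get("custom_instructions", "")
            except requests.RequestException:
                time.sleep(2 ** attempt)  # simple exponential backoff
        # Fall back to the last known prefix (or empty) if Ruuter/Resql is unreachable.
        return self._cached_prefix or ""
```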
Reviewed changes
Copilot reviewed 14 out of 14 changed files in this pull request and generated 12 comments.
| File | Description |
|---|---|
| src/utils/prompt_config_loader.py | New HTTP loader for custom prompt config with caching/retry and monitoring stats. |
| src/llm_orchestrator_config/llm_ochestrator_constants.py | Adds constants for prompt-config endpoint and cache TTL. |
| src/llm_orchestration_service.py | Initializes loader, warms cache, and injects custom prefix into response generator. |
| src/response_generator/response_generate.py | Adds custom_instructions_prefix and prepends it in streaming + non-streaming generation (see the sketch after this table). |
| src/llm_orchestration_service_api.py | Adds /prompt-config/refresh endpoint to force refresh of the cache. |
| DSL/Ruuter.public/.../get-prompt.yml | Adds Ruuter public endpoint to fetch prompt config from Resql. |
| DSL/Ruuter.private/.../prompt-configuration/save.yml | Triggers refresh endpoint after saving prompt configuration. |
| constants.ini | Adds RAG_SEARCH_PROMPT_REFRESH used by the DSL save flow. |
| docs/CUSTOM_PROMPT_CONFIGURATION.md | Documents the new prompt configuration flow end-to-end. |
| src/optimization/metrics/generator_metrics.py | Pyright-friendly import/use of SemanticF1 and ensures float typing. |
| src/llm_orchestrator_config/exceptions.py | Uses Optional[str] for error_id args for typing correctness. |
| src/guardrails/nemo_rails_adapter.py | Adjusts provider registration to satisfy typing (but has a runtime risk). |
| src/guardrails/dspy_nemo_adapter.py | Updates streaming return types to GenerationChunk for LangChain compatibility. |
| src/contextual_retrieval/contextual_retriever.py | Improves exception type narrowing for asyncio.gather(..., return_exceptions=True). |
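The response-generator change is described only at a high level in the table above. A minimal sketch of how a `custom_instructions_prefix` could be prepended to the existing system prompt; the class shape, the `_build_prompt` helper, and the default argument are assumptions, not the PR's actual `ResponseGeneratorAgent` code:

```python
# Minimal sketch (not the PR's actual code): prepending an admin-managed
# custom_instructions_prefix ahead of the normal system prompt.
class ResponseGeneratorAgent:
    def __init__(self, custom_instructions_prefix: str = "") -> None:
        self.custom_instructions_prefix = custom_instructions_prefix

    def _build_prompt(self, system_prompt: str) -> str:
        # When an admin-configured prefix exists, place it before the
        # base system prompt; otherwise return the prompt unchanged.
        if self.custom_instructions_prefix:
            return f"{self.custom_instructions_prefix}\n\n{system_prompt}"
        return system_prompt


# Usage: routing both streaming and non-streaming paths through the same
# helper keeps the injected prefix consistent across the two code paths.
agent = ResponseGeneratorAgent(custom_instructions_prefix="Answer in Estonian.")
print(agent._build_prompt("You are a helpful RAG assistant."))
```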
```python
@app.post("/prompt-config/refresh")
def refresh_prompt_config(http_request: Request) -> Dict[str, Any]:
    """
```
The new endpoint `/prompt-config/refresh` is not covered by the existing integration tests (there are tests for `/health` and the inference flows). Add an integration test that saves a prompt via the Ruuter save route and asserts that the refresh succeeds and the updated instructions are reflected (or at least that the cached prompt length changes); a sketch follows below.
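A possible shape for that test, assuming a FastAPI `TestClient` against the orchestration service app; the import path and the response fields (`status`, `prompt_length`) mirror the hypothetical handler sketch above and are not taken from the PR:

```python
# Sketch of the suggested integration test. The import path and response
# fields are assumptions; a fuller test would also drive the Ruuter save
# route and compare prompt lengths before and after the refresh.
from fastapi.testclient import TestClient

from src.llm_orchestration_service_api import app  # assumed module path


def test_prompt_config_refresh_reports_cached_prompt():
    client = TestClient(app)

    response = client.post("/prompt-config/refresh")

    assert response.status_code == 200
    body = response.json()
    assert body["status"] == "refreshed"
    # At minimum, the refreshed prompt length should be reported as an int.
    assert isinstance(body["prompt_length"], int)
```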
No description provided.