Streaming response formatting by erangi-ar · Pull Request #280 · buerokratt/LLM-Module

erangi-ar · 2026-02-05T03:12:32Z

No description provided.

Pulling changes from Burokratt WIP to rootcodelabs/RAG-Module wip

Get update from RAG-201-Fix into encrypt-llm-keys

update cron manager vault script

Streaming response formatting

Encrypt llm keys

…G-Module into encrypt-llm-keys

Sync rootcodelabs/RAG-Module wip with buerokratt/RAG-Module wip

BUG fixes buerokratt#192 buerokratt#189 (buerokratt#211)

… encrypt-llm-keys

…G-Module into encrypt-llm-keys

… settings

Refactor docker-compose-ec2.ym l file with new vault agent containers

… streaming-response-formatting

Wip

… streaming-response-formatting

Copilot

Pull request overview

This PR refactors the streaming response handling for the production LLM testing feature by replacing custom message parsing with a standardized markdown rendering approach and improving the streaming architecture.

Changes:

Replaces custom message content parsing logic with react-markdown library for better markdown rendering support
Refactors TestProductionLLM component to use useStreamingResponse hook for real-time message streaming
Updates notification server URL from port 3005 to 4040 in development environment
Adds new translation keys for the test production LLM interface in English and Estonian

Reviewed changes

Copilot reviewed 8 out of 9 changed files in this pull request and generated 8 comments.

Show a summary per file

File	Description
GUI/translations/et/common.json	Adds Estonian translations for test production LLM interface
GUI/translations/en/common.json	Adds English translations for test production LLM interface
GUI/src/pages/TestProductionLLM/index.tsx	Refactors to use streaming hook, removes retry logic, implements token-by-token message building
GUI/src/hooks/useStreamingResponse.tsx	Updates to use environment variable for notification node URL
GUI/src/components/MessageContent/index.tsx	Replaces custom parsing with ReactMarkdown and remark-gfm for GFM support
GUI/src/components/MessageContent/MessageContent.scss	Adds comprehensive markdown styling including code blocks, lists, and links
GUI/package.json	Adds react-markdown ^10.1.0 and remark-gfm ^4.0.1 dependencies
GUI/package-lock.json	Updates lock file with new markdown-related dependencies
GUI/.env.development	Changes notification node URL from port 3005 to 4040

Files not reviewed (1)

GUI/package-lock.json: Language not supported

Comments suppressed due to low confidence (1)

GUI/src/hooks/useStreamingResponse.tsx:113

There's a potential race condition in the streaming setup. A 500ms delay is used to allow the SSE connection to establish before triggering the stream, but this is an arbitrary timeout that may not be sufficient in all network conditions. If the POST request (line 110-113) completes before the SSE connection is fully established, the server might start sending events that the client isn't ready to receive. Consider implementing a more robust handshake mechanism, such as waiting for a 'connected' event from the SSE connection before triggering the stream, or having the server return a success response only after verifying the client is connected.

        await new Promise(resolve => setTimeout(resolve, 500));

        // Step 3: POST to trigger streaming
        const postUrl = `${notificationNodeUrl}/channels/${channelId}/orchestrate/stream`;
        console.log('[API] Triggering stream:', postUrl);

        await axios.post(postUrl, {
          message,
          options,
        });

💡 Add Copilot custom instructions for smarter, more guided reviews. Learn how to get started.

Copilot · 2026-02-05T08:14:56Z

GUI/src/components/MessageContent/index.tsx

+          a: ({ node, ...props }) => (
+            <a {...props} target="_blank" rel="noopener noreferrer" />
+          ),
+          // Style strong/bold text
+          strong: ({ node, ...props }) => (
+            <strong {...props} className="markdown-bold" />
+          ),
+          // Style ordered lists
+          ol: ({ node, ...props }) => (
+            <ol {...props} className="markdown-list" />
+          ),
+          // Style list items
+          li: ({ node, ...props }) => (


The node prop is destructured but never used in the custom component renderers. While this doesn't cause runtime issues, it creates unused variable warnings. Consider using rest parameter syntax without explicitly mentioning node, or prefix it with an underscore to indicate it's intentionally unused:

Instead of:

a: ({ node, ...props }) => (...)

Use:

a: ({ ...props }) => (...)

Or if you need to keep the node parameter for future use:

a: ({ node: _node, ...props }) => (...)

Suggested change

a: ({ node, ...props }) => (

<a {...props} target="_blank" rel="noopener noreferrer" />

),

// Style strong/bold text

strong: ({ node, ...props }) => (

<strong {...props} className="markdown-bold" />

),

// Style ordered lists

ol: ({ node, ...props }) => (

<ol {...props} className="markdown-list" />

),

// Style list items

li: ({ node, ...props }) => (

a: ({ ...props }) => (

<a {...props} target="_blank" rel="noopener noreferrer" />

),

// Style strong/bold text

strong: ({ ...props }) => (

<strong {...props} className="markdown-bold" />

),

// Style ordered lists

ol: ({ ...props }) => (

<ol {...props} className="markdown-list" />

),

// Style list items

li: ({ ...props }) => (

Copilot · 2026-02-05T08:14:57Z

GUI/.env.development

 REACT_APP_SERVICE_ID=conversations,settings,monitoring
-REACT_APP_NOTIFICATION_NODE_URL=http://localhost:3005
+REACT_APP_NOTIFICATION_NODE_URL=http://localhost:4040
 REACT_APP_CSP=upgrade-insecure-requests; default-src 'self'; font-src 'self' data:; img-src 'self' data:; script-src 'self' 'unsafe-eval' 'unsafe-inline'; style-src 'self' 'unsafe-inline'; object-src 'none'; connect-src 'self' http://localhost:8086 http://localhost:8088 http://localhost:3004 http://localhost:3005 ws://localhost;


The Content Security Policy (CSP) configuration still references the old notification node URL port (http://localhost:3005) in the connect-src directive, but REACT_APP_NOTIFICATION_NODE_URL has been changed to http://localhost:4040. This mismatch will cause CSP violations and prevent the application from connecting to the notification server. Update the CSP to include http://localhost:4040 in the connect-src directive.

Copilot · 2026-02-05T08:14:57Z

GUI/src/pages/TestProductionLLM/index.tsx

-    messagesEndRef.current?.scrollIntoView({ behavior: 'smooth' });
-  };
+  // Generate a unique channel ID for this session
+  const channelId = useMemo(() => `channel-${Math.random().toString(36).substring(2, 15)}`, []);


The channel ID is generated once per component mount using useMemo with an empty dependency array. This means the same channel ID will be reused across multiple conversations within the same session. If a user sends multiple messages or clears the chat, they'll continue using the same channel ID. Consider whether this is the intended behavior, or if a new channel ID should be generated for each conversation/session. If channel IDs should be unique per conversation, consider generating a new ID when the chat is cleared or creating a function to regenerate the channel ID.

Copilot · 2026-02-05T08:14:57Z

GUI/src/pages/TestProductionLLM/index.tsx

+    const onError = (error: string) => {
+      console.error('[Component] Stream error:', error);
+      setIsLoading(false);

-      const errorMessage: Message = {
-        id: `error-${Date.now()}`,
-        content: 'Failed to send message. Please check your connection and try again.',
-        isUser: false,
-        timestamp: new Date().toISOString(),
-      };
-
-      setMessages(prev => [...prev, errorMessage]);
-
      toast.open({
        type: 'error',
-        title: 'Connection Error',
-        message: 'Unable to connect to the production LLM service.',
+        title: t('testProductionLLM.streamingErrorTitle'),
+        message: error,
      });
-    } finally {
+    };


When an error occurs during streaming or the component unmounts while streaming is active, the bot message that was partially populated may remain in an incomplete state. The onError callback sets isLoading to false but doesn't clean up the partial bot message from the messages array. Consider adding logic to remove incomplete bot messages when streaming fails, or add an error state to the message to indicate it's incomplete.

Copilot · 2026-02-05T08:14:57Z

GUI/src/pages/TestProductionLLM/index.tsx

+    const onToken = (token: string) => {
+      console.log('[Component] Received token:', token);
+
+      setMessages(prev => {
+        // Find the bot message
+        const botMsgIndex = prev.findIndex(msg => msg.id === botMessageId);

-        if (response.questionOutOfLlmScope) {
-          botContent += ' (Note: This question appears to be outside the LLM scope)';
+        if (botMsgIndex === -1) {
+          // First token - add the bot message
+          console.log('[Component] Adding bot message with first token');
+          setIsLoading(false);
+          return [
+            ...prev,
+            {
+              id: botMessageId,
+              content: token,
+              isUser: false,
+              timestamp: new Date().toISOString(),
+            }
+          ];
+        } else {
+          // Append token to existing message
+          console.log('[Component] Appending token to existing message');
+          const updated = [...prev];
+          updated[botMsgIndex] = {
+            ...updated[botMsgIndex],
+            content: updated[botMsgIndex].content + token,
+          };
+          return updated;
        }
-      }
-
-      const botMessage: Message = {
-        id: `bot-${Date.now()}`,
-        content: botContent,
-        isUser: false,
-        timestamp: new Date().toISOString(),
-      };
-
-      setMessages(prev => [...prev, botMessage]);
+      });
+    };

-      // Show toast notification
-      // toast.open({
-      //   type: botMessageType,
-      //   title: t('errorOccurred'),
-      //   message: t('errorMessage'),
-      // });
+    const onComplete = () => {
+      console.log('[Component] Stream completed');
+      setIsLoading(false);
+    };


The setIsLoading(false) is called inside the onToken callback when the first token arrives (line 83). However, if no tokens are received before stream completion or an error occurs, isLoading may not be properly reset. The loading state should be managed more consistently - either always set to false in onComplete and onError callbacks, or let the parent component handle it more explicitly. Currently, the loading state management is split between the token callback and error handling, which could lead to inconsistent UI states.

Copilot · 2026-02-05T08:14:57Z

GUI/src/pages/TestProductionLLM/index.tsx

+    // Prepare conversation history (exclude the current user message)
+    const conversationHistory = messages.map(msg => ({
+      authorRole: msg.isUser ? 'user' : 'bot',
+      message: msg.content,
+      timestamp: msg.timestamp,
+    }));


The conversation history is prepared at line 60-64 by mapping over the existing messages array, but this happens before the user message is added to the messages state (which occurs at line 52). This means the current user message is not included in the conversation history sent to the backend. The comment on line 59 states "exclude the current user message", but this appears to be intentional. However, verify that the backend expects the current message to be sent separately (via the message parameter) and doesn't need it in the conversation history.

Copilot · 2026-02-05T08:14:58Z

GUI/src/hooks/useStreamingResponse.tsx

+const notificationNodeUrl = import.meta.env.REACT_APP_NOTIFICATION_NODE_URL;
+


The notificationNodeUrl environment variable is read but there's no validation to ensure it's defined. If the environment variable is missing, the URLs will be constructed with undefined, leading to malformed URLs like "undefined/sse/stream/...". Add a validation check to ensure the environment variable is set before using it, or provide a fallback value with a helpful error message.

Suggested change

const notificationNodeUrl = import.meta.env.REACT_APP_NOTIFICATION_NODE_URL;

const getNotificationNodeUrl = (): string => {

const value = import.meta.env.REACT_APP_NOTIFICATION_NODE_URL;

if (!value) {

throw new Error(

'Environment variable REACT_APP_NOTIFICATION_NODE_URL is not defined. ' +

'Please set it to the base URL of the notification service to enable streaming responses.'

);

}

return value;

};

const notificationNodeUrl = getNotificationNodeUrl();

Copilot · 2026-02-05T08:14:58Z

GUI/src/pages/TestProductionLLM/index.tsx

+              onChange={(e) => setInputMessage(e.target.value)}
              onKeyDown={handleKeyPress}
-              placeholder="Type your message here... (Press Enter to send, Shift+Enter for new line)"
+              placeholder={t('testProductionLLM.messagePlaceholder')??""}


The nullish coalescing operator (??"") is used as a fallback for the placeholder translation, but this is unnecessary. The t() function from react-i18next returns the key itself as a string if the translation is missing, so it will never return null or undefined. This fallback can be removed to simplify the code.

Suggested change

placeholder={t('testProductionLLM.messagePlaceholder')??""}

placeholder={t('testProductionLLM.messagePlaceholder')}

Thirunayan22 and others added 30 commits December 16, 2025 12:33

Merge pull request #97 from buerokratt/wip

f635768

Pulling changes from Burokratt WIP to rootcodelabs/RAG-Module wip

updated docker compose ec2

b674b5e

integrate streaming endpoint with test prodction connection page

9fec475

formatted response with markdown

9caa51d

fe logic for the encryption

49a78eb

vault secret update after fixing issues

e8af3fa

fixed formatting issue

dd3fa8b

Merge pull request #100 from rootcodelabs/RAG-201-Fix

023d53a

Get update from RAG-201-Fix into encrypt-llm-keys

integration with be

6e7e45f

update cron manager vault script

620af8c

Merge pull request #101 from rootcodelabs/RAG-201-Fix

509d0f0

update cron manager vault script

tested integration of vault security update

c6351eb

fix security issues

30f05bb

Merge pull request #102 from rootcodelabs/streaming-response-formatting

8b54764

Streaming response formatting

Merge branch 'RAG-206' into encrypt-llm-keys

7b1c830

Merge pull request #103 from rootcodelabs/encrypt-llm-keys

4fe08d1

Encrypt llm keys

creation success model changes

9af1a1e

Merge branch 'encrypt-llm-keys' of https://github.com/rootcodelabs/RA…

6e5234c

…G-Module into encrypt-llm-keys

clean vite config generated files

6830670

fixed issue references are not sending with streming tokens

a416995

complete buerokratt#192 and buerokratt#206 bug fixes

0352184

production inference display logic change

925af1c

change production inference display logic

b584e44

fixed requested issue

ce0916d

Merge pull request #105 from buerokratt/wip

6f95769

Sync rootcodelabs/RAG-Module wip with buerokratt/RAG-Module wip

Merge branch 'RAG-192' into wip

5cc3963

refactor: update SSE connection URL to use environment variable

e8acb62

foramt markdown of the llm response

e1d9681

feat: add markdown support to MessageContent component

f54a325

Merge pull request #108 from buerokratt/wip

f6ea894

BUG fixes buerokratt#192 buerokratt#189 (buerokratt#211)

erangi-ar and others added 9 commits January 19, 2026 12:50

Merge branch 'wip' of https://github.com/rootcodelabs/RAG-Module into…

08ce120

… encrypt-llm-keys

Merge branch 'encrypt-llm-keys' of https://github.com/rootcodelabs/RA…

c854c94

…G-Module into encrypt-llm-keys

Refactor Docker Compose configuration for vault agents and update CSP…

9f8074e

… settings

Remove obsolete Vite configuration files and associated plugins

fb08609

Merge pull request #109 from rootcodelabs/encrypt-llm-keys

355ca1c

Refactor docker-compose-ec2.ym l file with new vault agent containers

Merge branch 'wip' of https://github.com/rootcodelabs/RAG-Module into…

fb6d57b

… streaming-response-formatting

title fix

8131217

Merge pull request #112 from buerokratt/wip

fadd2f3

Wip

Merge branch 'wip' of https://github.com/rootcodelabs/RAG-Module into…

15934e8

… streaming-response-formatting

erangi-ar requested a review from Copilot February 5, 2026 08:10

Copilot started reviewing on behalf of erangi-ar February 5, 2026 08:10 View session

Copilot AI reviewed Feb 5, 2026

View reviewed changes

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Streaming response formatting#280

Streaming response formatting#280
erangi-ar wants to merge 39 commits intobuerokratt:wipfrom
rootcodelabs:streaming-response-formatting

erangi-ar commented Feb 5, 2026

Uh oh!

Copilot AI left a comment

Uh oh!

Copilot AI Feb 5, 2026

Uh oh!

Copilot AI Feb 5, 2026

Uh oh!

Copilot AI Feb 5, 2026

Uh oh!

Copilot AI Feb 5, 2026

Uh oh!

Copilot AI Feb 5, 2026

Uh oh!

Copilot AI Feb 5, 2026

Uh oh!

Copilot AI Feb 5, 2026

Uh oh!

Copilot AI Feb 5, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

		const notificationNodeUrl = import.meta.env.REACT_APP_NOTIFICATION_NODE_URL;

-const notificationNodeUrl = import.meta.env.REACT_APP_NOTIFICATION_NODE_URL;
+const getNotificationNodeUrl = (): string => {
+  const value = import.meta.env.REACT_APP_NOTIFICATION_NODE_URL;
+  if (!value) {
+    throw new Error(
+      'Environment variable REACT_APP_NOTIFICATION_NODE_URL is not defined. ' +
+        'Please set it to the base URL of the notification service to enable streaming responses.'
+    );
+  }
+  return value;
+};
+const notificationNodeUrl = getNotificationNodeUrl();

	placeholder={t('testProductionLLM.messagePlaceholder')??""}
	placeholder={t('testProductionLLM.messagePlaceholder')}

Conversation

erangi-ar commented Feb 5, 2026

Uh oh!

Copilot AI left a comment

Choose a reason for hiding this comment

Pull request overview

Reviewed changes

Uh oh!

Copilot AI Feb 5, 2026

Choose a reason for hiding this comment

Uh oh!

Copilot AI Feb 5, 2026

Choose a reason for hiding this comment

Uh oh!

Copilot AI Feb 5, 2026

Choose a reason for hiding this comment

Uh oh!

Copilot AI Feb 5, 2026

Choose a reason for hiding this comment

Uh oh!

Copilot AI Feb 5, 2026

Choose a reason for hiding this comment

Uh oh!

Copilot AI Feb 5, 2026

Choose a reason for hiding this comment

Uh oh!

Copilot AI Feb 5, 2026

Choose a reason for hiding this comment

Uh oh!

Copilot AI Feb 5, 2026

Choose a reason for hiding this comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants