* Refactor codebase to package utility modules for distribution
* Remove unnecessary getenv assertion from langfuse client setup
* Fix formatting issues in prompts and update README paths for consistency
* Upgrade packages flagged by pip-audit and ignore ones that can't be upgraded
* Rename `src` dir to `implementations`
* Update dependencies
* Restructure reference implementations to fix gradio hot-reload issue
* Fix extra trailing quote
* Rename `aieng-agents-utils` to `aieng-agents`
README.md: 24 additions & 24 deletions (+24 -24)
@@ -11,31 +11,31 @@ This repository includes several modules, each showcasing a different aspect of
 **2. Frameworks: OpenAI Agents SDK**
 Showcases the use of the OpenAI agents SDK to reduce boilerplate and improve readability.

--**[2.1 ReAct Agent for RAG - OpenAI SDK](src/2_frameworks/1_react_rag/README.md)**
+-**[2.1 ReAct Agent for RAG - OpenAI SDK](implementations/2_frameworks/1_react_rag/README.md)**
 Implements the same Reason-and-Act agent using the high-level abstractions provided by the OpenAI Agents SDK. This approach reduces boilerplate and improves readability.
 The use of langfuse for making the agent less of a black-box is also introduced in this module.

--**[2.2 Multi-agent Setup for Deep Research](src/2_frameworks/2_multi_agent/README.md)**
+-**[2.2 Multi-agent Setup for Deep Research](implementations/2_frameworks/2_multi_agent/README.md)**
 Demo of a multi-agent architecture to improve efficiency on long-context inputs, reduce latency, and reduce LLM costs. Two versions are available- "efficient" and "verbose". For the build days, you should start from the "efficient" version as that provides greater flexibility and is easier to follow.

 **3. Evals: Automated Evaluation Pipelines**
 Contains scripts and utilities for evaluating agent performance using LLM-as-a-judge and synthetic data generation. Includes tools for uploading datasets, running evaluations, and integrating with [Langfuse](https://langfuse.com/) for traceability.
 Automated evaluation pipelines using LLM-as-a-judge with Langfuse integration.

--**[3.2 Evaluation on Synthetic Dataset](src/3_evals/2_synthetic_data/README.md)**
+-**[3.2 Evaluation on Synthetic Dataset](implementations/3_evals/2_synthetic_data/README.md)**
 Showcases the generation of synthetic evaluation data for testing agents.

 We also provide "basic" no-framework implementations. These are meant to showcase how agents work behind the scene and are excessively verbose in the implementation. You should not use these as the basis for real projects.

 **1. Basics: Reason-and-Act RAG**
 A minimal Reason-and-Act (ReAct) agent for knowledge retrieval, implemented without any agent framework.
 A simple demo showing the capabilities (and limitations) of a knowledgebase search.

--**[1.1 ReAct Agent for RAG](src/1_basics/1_react_rag/README.md)**
+-**[1.1 ReAct Agent for RAG](implementations/1_basics/1_react_rag/README.md)**
 Basic ReAct agent for step-by-step retrieval and answer generation.

 ## Getting Started
@@ -48,7 +48,7 @@ In that case you can verify that the API keys work by running integration tests
 uv run --env-file .env pytest -sv tests/tool_tests/test_integration.py
 ```

-## Reference Implementations
+## Running the Reference Implementations

 For "Gradio App" reference implementations, running the script would print out a "public URL" ending in `gradio.live` (might take a few seconds to appear.) To access the gradio app with the full streaming capabilities, copy and paste this `gradio.live` URL into a new browser tab.

@@ -74,48 +74,48 @@ These warnings can be safely ignored, as they are the result of a bug in the ups
 Interactive knowledge base demo. Access the gradio interface in your browser to see if your knowledge base meets your expectations.

 ```bash
-uv run --env-file .env gradio src/1_basics/0_search_demo/app.py
+uv run --env-file .env gradio implementations/1_basics/0_search_demo/app.py
 ```

 Basic Reason-and-Act Agent- for demo purposes only.

 As noted above, these are unnecessarily verbose for real applications.

 ```bash
-# uv run --env-file .env src/1_basics/1_react_rag/cli.py
-# uv run --env-file .env gradio src/1_basics/1_react_rag/app.py
+# uv run --env-file .env implementations/1_basics/1_react_rag/cli.py
+# uv run --env-file .env gradio implementations/1_basics/1_react_rag/app.py
 ```

 ### 2. Frameworks

 Reason-and-Act Agent without the boilerplate- using the OpenAI Agent SDK.

 ```bash
-uv run --env-file .env src/2_frameworks/1_react_rag/cli.py
-uv run --env-file .env gradio src/2_frameworks/1_react_rag/langfuse_gradio.py
+uv run --env-file .env implementations/2_frameworks/1_react_rag/cli.py
+uv run --env-file .env gradio implementations/2_frameworks/1_react_rag/langfuse_gradio.py
 ```

 Multi-agent examples, also via the OpenAI Agent SDK.

 ```bash
-uv run --env-file .env gradio src/2_frameworks/2_multi_agent/efficient.py
+uv run --env-file .env gradio implementations/2_frameworks/2_multi_agent/efficient.py
 # Verbose option - greater control over the agent flow, but less flexible.
-# uv run --env-file .env gradio src/2_frameworks/2_multi_agent/verbose.py
+# uv run --env-file .env gradio implementations/2_frameworks/2_multi_agent/verbose.py
 ```

-Python Code Interpreter demo- using the OpenAI Agent SDK, E2B for secure code sandbox, and LangFuse for observability. Refer to [src/2_frameworks/3_code_interpreter/README.md](src/2_frameworks/3_code_interpreter/README.md) for details.
+Python Code Interpreter demo- using the OpenAI Agent SDK, E2B for secure code sandbox, and LangFuse for observability. Refer to [implementations/2_frameworks/3_code_interpreter/README.md](implementations/2_frameworks/3_code_interpreter/README.md) for details.

-MCP server integration example also via OpenAI Agents SDK with Gradio and Langfuse tracing. Refer to [src/2_frameworks/4_mcp/README.md](src/2_frameworks/4_mcp/README.md) for more details.
+MCP server integration example also via OpenAI Agents SDK with Gradio and Langfuse tracing. Refer to [implementations/2_frameworks/4_mcp/README.md](implementations/2_frameworks/4_mcp/README.md) for more details.
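
Since this change renames `src` to `implementations` across many README links and commands, a small check for leftover old-style paths may be useful when reviewing. The following is only a sketch and is not part of this change: the function name `find_stale_paths`, the `STALE_PATH` regex, and the sample text are hypothetical, and the pattern assumes the old paths always begin with a bare `src/` that is not preceded by another path component.

```python
import re

# Hypothetical pattern: a path starting with the old top-level "src/" directory,
# not preceded by a word character, slash, dot, or hyphen (so "foo/src/x" is skipped).
STALE_PATH = re.compile(r"(?<![\w/.-])src/[\w./-]+")


def find_stale_paths(text: str, pattern: re.Pattern = STALE_PATH) -> list[tuple[int, str]]:
    """Return (line_number, path) pairs for every leftover old-style path in text."""
    hits = []
    for lineno, line in enumerate(text.splitlines(), start=1):
        for match in pattern.finditer(line):
            hits.append((lineno, match.group(0)))
    return hits


# Illustrative input: one already-renamed command and one link that was missed.
sample = "\n".join([
    "uv run --env-file .env gradio implementations/1_basics/0_search_demo/app.py",
    "Refer to [README](src/2_frameworks/4_mcp/README.md) for more details.",
])

for lineno, path in find_stale_paths(sample):
    print(f"line {lineno}: stale path {path}")
```

To apply it to the real file, one could pass the contents of `README.md` (for example via `pathlib.Path("README.md").read_text()`) in place of `sample`; an empty result would suggest that no `src/` paths remain.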