docs: update introduction (#10255)

keydunov · web-flow · commit 109fb78ccace · 2025-12-15T17:29:33.000-08:00
diff --git a/docs/pages/product/introduction.mdx b/docs/pages/product/introduction.mdx
@@ -1,8 +1,6 @@
 # Introduction
 
-Cube is the [agentic analytics](#agentic-analytics) platform built on top of the [semantic layer](#semantic-layer).
-
-## Agentic analytics
+Cube is the [agentic analytics](#agentic-analytics) platform built on top of the [open-source semantic layer](#semantic-layer).
 
 Cube enables AI agents and users to query, explore, and manipulate data models — transforming the semantic layer into a dynamic, governed workspace for generating insights, automating workflows, and building data products.
 
@@ -20,127 +18,73 @@ Cube is a new generation of a BI platform built to be used by both humans and AI
 
 With Cube, you can power copilots, automate data workflows, and create interactive analytics experiences—all grounded in a consistent and governed data model.
 
-<InfoBox>
-
-End users can access Cube's agentic analytics capabilities in [Workbooks][ref-workbooks].
-
-You can also bring agentic analytics to your own applications by using the [embedding][ref-embedding]
-capabilities, the [Chat API][ref-chat-api], and the [MCP server][ref-mcp-server].
-
-To familiarize yourself with the core concepts behind agentic analytics, take a look at our guides on
-[spaces, agents, models][ref-spaces-agents-models], [agent rules][ref-agent-rules], and [agent memories][ref-agent-memories].
-
-</InfoBox>
-
 ## Semantic layer
 
-Cube is a universal semantic layer that represents the next evolution of OLAP technology for the cloud data platform era. Born in the cloud, Cube bridges the gap left when traditional OLAP capabilities from legacy specialized servers were not fully translated to modern cloud data platforms.
-
-As data infrastructure evolved from traditional relational databases to cloud data warehouses, the need for multidimensional analysis, consistent metrics, and performance optimization remained. Cube addresses these challenges by making it easy to connect data silos, create consistent metrics, and make them accessible to any data experience your business or your customers needs.
+At the foundation of Cube's agentic analytics platform is an [open-source semantic layer](https://github.com/cube-js/cube)—the critical infrastructure that enables both AI agents and humans to work with trusted, consistent data.
 
-Data engineers and application developers use Cube's developer-friendly platform to organize data from your cloud data warehouses into centralized, consistent definitions, and deliver it to every downstream tool via its APIs.
+The semantic layer provides the governed data foundation that makes agentic analytics possible. It organizes data from your cloud data warehouses into centralized, consistent definitions that AI agents can reliably query, explore, and reason about. Without a semantic layer, AI agents would struggle with inconsistent metrics, scattered business logic, and ungoverned data access—making their outputs unreliable and potentially dangerous.
 
-Your business data becomes consistent, accurate, easy to access, and, most importantly, trusted.
-Once trusted, the use of data accelerates throughout your organization, delivering better experiences
-to your customers and driving intelligence back into the business.
+By establishing a single source of truth for metrics, relationships, and business logic, the semantic layer ensures that AI agents and users work with the same trusted definitions. This consistency is essential for agentic analytics: when an AI agent generates insights or automates workflows, it relies on the semantic layer's data model to understand what metrics mean, how entities relate, and what data users are authorized to access.
 
-<Diagram src="https://ucarecdn.com/8d945f29-e9eb-4e7f-9e9e-29ae7074e195/" />
+The semantic layer also provides the performance and governance infrastructure needed for agentic workflows. Through caching and pre-aggregations, it ensures AI agents can respond quickly without overwhelming your data warehouse. Through access controls, it guarantees that agents respect the same data security policies as human users.
 
-With Cube, you can build a data model, manage access control and caching, and expose your data to every application
-via REST, GraphQL, and SQL APIs. With these APIs, you can use any charting library to build custom UI, connect existing dashboarding and reporting tools, and build AI-powered data applications.
+Data engineers use Cube's semantic layer to build and maintain data models, manage access control and caching, and expose data through REST, GraphQL, and SQL APIs—creating the governed foundation that powers agentic analytics experiences, traditional BI tools, and custom data applications.
 
 ### Code-first
 
-Throughout the evolution of software engineering, numerous tools and methodologies have been developed to effectively handle codebases of all sizes.
-These include [version control systems](https://git-scm.com/) for seamless collaboration and code reviews,
-infrastructure for testing and documentation, as well as [established patterns](https://en.wikipedia.org/wiki/Design_Patterns) and
-best practices to structure codebases for reusability and maintainability.
+A code-first approach is essential for both traditional data engineering and agentic analytics. Managing data models, configurations, and policies as code enables the same proven practices that power modern software development: version control for collaboration and code reviews, automated testing and documentation, and established patterns for reusability and maintainability.
 
-At Cube, we firmly believe that the future of data engineering lies in the application of these proven practices and tools to data management.
-By doing so, we can facilitate collaboration at scale and create high-quality data products that are easily maintainable.
+For agentic analytics specifically, a code-first semantic layer creates new possibilities. AI agents can help curate and maintain data models themselves, accelerating development while maintaining quality through git workflows. The structured, version-controlled nature of code makes it easier for agents to understand changes, suggest improvements, and even implement modifications autonomously.
 
-The foundation of this approach lies in adopting a code-first workflow.
-That's why everything within Cube, from configurations to data models, is meticulously managed through code.
+Everything within Cube—from configurations to data models to access control policies—is managed through code. This foundation enables both human data engineers and AI agents to collaborate on building and maintaining the semantic layer that powers agentic analytics.
 
 ### Four pillars of semantic layer
 
-We believe that a complete, universal semantic layer should have the following four pillars: data model, caching, access controls, and APIs. These pillars address the core challenges that OLAP technology was originally designed to solve, but in a modern, cloud-native way.
+The semantic layer that powers Cube's agentic analytics platform is built on four essential pillars: data modeling, access control, caching, and APIs. Each pillar plays a critical role in enabling AI agents and users to work with data reliably, securely, and efficiently.
 
 #### Data Modeling
 
-**Data modeling framework is a foundational piece of the universal semantic layer.** It helps data teams to centralize data models upstream from
-data consumption tools, such as BIs, embedded analytics applications, or AI agents. It makes your data architecture DRY
-([Don't Repeat Yourself](https://en.wikipedia.org/wiki/Don%27t_repeat_yourself)) by reducing the repetition of data modeling across multiple presentation layers.
+**The data model provides the knowledge graph that AI agents use to understand your business.** It centralizes metric definitions, entity relationships, and business logic upstream from all consumption tools—whether those are AI agents, BI tools, or custom applications. This centralization is critical for agentic analytics: AI agents need a structured understanding of what metrics mean, how entities relate, and what calculations are valid.
 
-While modern cloud data platforms excel at processing large volumes of data, they lack native support for multidimensional analysis and modeling that traditional OLAP servers provided. Cube brings OLAP-style analytics to these platforms, enabling consistent metric definitions and multidimensional analysis.
+When an AI agent analyzes sales performance or answers questions about customer behavior, it relies on the semantic layer's data model to understand that "revenue" is calculated consistently, that customers have orders, and that orders contain line items. This structured knowledge enables agents to generate reliable insights and navigate complex data relationships autonomously.
 
-**Cube data model is code-first.** Data teams define data models with YAML or JavaScript code.
-The codebase is commonly managed with a version control system. Cube enables git flow for
-changes to data model and managing multiple isolated environments per project.
+**Cube's data model is code-first.** Data teams define data models with YAML or JavaScript code, managed through version control systems. This enables AI-assisted development where agents can help curate and maintain the semantic layer itself, accelerating model development while maintaining quality through git workflows and multiple isolated environments.
 
-**Cube data model is dataset-centric.** It is inspired by and expands upon dimensional modeling.
-Cube provides a practical framework for implementing dataset-centric data modeling.
+**Cube's data model is dataset-centric**, inspired by and expanding upon dimensional modeling. You work with two types of objects:
 
-When building a data model in Cube, you work with two dataset-centric objects: **cubes** and **views**.
-**Cubes** represent business entities such as customers, line items, and orders. In cubes,
-you define all the calculations within the measures and dimensions of these entities.
-Additionally, you define relationships between cubes, such as "an order has many line items" or "a user may place multiple orders."
+**Cubes** represent business entities such as customers, line items, and orders. They define all calculations within measures and dimensions, as well as relationships between entities. These relationships form the knowledge graph that AI agents traverse when exploring data and generating insights.
 
-**Views** sit on top of a data graph of cubes and create a facade of your entire data model,
-with which data consumers can interact. You can think of views as the final data products for your
-data consumers - BI users, data apps, AI agents, etc. When building views, you select measures and dimensions
-from different connected cubes and present them as a single dataset to BI or data apps.
+**Views** sit on top of the data graph of cubes, creating facades that data consumers interact with. Think of views as the final data products for AI agents, BI users, and applications. Views select measures and dimensions from connected cubes and present them as unified datasets, providing AI agents with the right context and scope for specific analytical tasks.
 
 #### Access Control
 
-**One of the benefits of semantic layer is the active security layer.**
-Semantic layer provides a comprehensive real-time understanding and governance of your data.
-When all your data consumption tools access data through the semantic layer, it becomes an ideal place to enforce access control policies.
+**Access control ensures that AI agents respect the same data security policies as human users.** This is critical for agentic analytics: when AI agents autonomously query and analyze data, they must enforce the same governance rules that apply to human users—whether that's row-level security, column-level restrictions, or data masking.
+
+By centralizing access control in the semantic layer, you ensure that all data consumption—whether by AI agents, BI tools, or custom applications—goes through a single governed checkpoint. This provides comprehensive oversight and prevents agents from inadvertently exposing sensitive data or violating security policies.
 
-Cube provides infrastructure to define different access control policies and patterns,
-including row-level and column-level security, data masking and more. Being a code-first,
-Cube enables data teams to **define access control policies with Python or JavaScript.**
-They can range from simple row-level access rules to completely custom data models per tenants backed by different data sources.
+Cube's code-first approach enables data teams to **define access control policies with Python or JavaScript**, ranging from simple row-level access rules to completely custom data models per tenant backed by different data sources. These policies apply uniformly to all consumers of the semantic layer, ensuring AI agents operate within the same security boundaries as human users.
 
 #### Caching
 
-The semantic layer can serve as a buffer to the data sources, protecting the cloud data warehouses from unnecessary and redundant load.
-Caching optimizes performance and can reduce the cloud data warehouse cost.
+**Caching enables AI agents to deliver fast, interactive experiences without overwhelming your data infrastructure.** For agentic analytics to be effective, AI agents must respond quickly to user questions, iteratively explore data, and generate insights in real-time. Without caching, every agent query would hit your data warehouse directly, creating latency issues and potentially significant costs.
 
-While cloud data warehouses have improved query performance through column-oriented storage and distributed processing, they still struggle with complex analytical workloads. This is where Cube's caching layer addresses the performance challenge that traditional OLAP servers were designed to solve.
+The semantic layer acts as a performance buffer between AI agents and your data sources. Through intelligent caching, it ensures agents can work interactively while protecting your cloud data warehouse from unnecessary and redundant load.
 
-Cube implements caching through the **aggregate awareness framework called pre-aggregations.**
-Data teams can define pre-aggregates in the data model as rollup tables, including measures and dimensions.
-Cube builds and refreshes these pre-aggregates in the background by executing queries in your cloud data warehouse
-and storing results in Cube Store, Cube's purpose-built caching engine backed by distributed file storage, such as S3.
-Pre-aggregations can be refreshed on schedule or as a part of the workflow orchestration DAG.
+Cube implements caching through an **aggregate awareness framework called pre-aggregations.** Data teams define pre-aggregates in the data model as rollup tables, including measures and dimensions. Cube builds and refreshes these pre-aggregates in the background by querying your cloud data warehouse and storing results in Cube Store, Cube's purpose-built caching engine backed by distributed file storage such as S3. Pre-aggregations can be refreshed on schedule or as part of workflow orchestration.
 
-When you send a query to Cube, it will use aggregate awareness to see if an existing and fresh pre-aggregate is
-available to serve that query. It can significantly speed up queries and reduce the load and cost of cloud data warehouses.
+When an AI agent sends a query to Cube, the aggregate awareness engine determines if an existing and fresh pre-aggregate can serve that query. This significantly accelerates agent responses and reduces both latency and data warehouse costs—essential for enabling the iterative, exploratory workflows that characterize agentic analytics.
 
 #### APIs
 
-One of the key requirements of the semantic layer is **interoperability with data consumption tools**: BIs, embedded analytics, and AI agents.
-The universal semantic layer cannot require one-off integration with every tool, framework, or library.
-It is not feasible to support the ever-growing number of data consumption tools in a one-to-one model.
-
-Legacy OLAP tools were limited in how they exposed data. Cube provides both modern APIs and support for traditional OLAP interfaces, making it a truly universal semantic layer.
-
-Rather than inventing its own communication language or protocol, **the semantic layer must adhere to existing protocols and
-API standards** to ensure universal interoperability.
+**APIs enable AI agents, applications, and tools to interact with the semantic layer through standard protocols.** For agentic analytics to work across diverse use cases—from AI-powered workbooks to embedded analytics to traditional BI—the semantic layer must provide universal interoperability. AI agents need to query data, introspect the data model, and integrate with other systems without requiring custom integrations for every tool or framework.
 
-Cube embraces and implements the three most commonly used protocols and API standards: **REST, GraphQL, and SQL.**
+Rather than inventing proprietary protocols, Cube implements widely adopted standards: **REST, GraphQL, and SQL.**
 
-**REST and GraphQL** are commonly used in software development as a communication layer between the backend server and the frontend visualization layer.
+**REST and GraphQL** provide modern API interfaces for building custom applications and enabling programmatic access. These APIs power agentic workflows, allowing AI agents to query data, retrieve results, and build interactive experiences.
 
-**SQL** is universally adopted across all the tools in the data stack. Every BI and visualization tool can query a SQL data source.
-That makes SQL an obvious choice for a communication layer to ensure interoperability. Cube implements Postgres SQL and extends
-it to support data modeling in the semantic layer. Cube adds the notion of **measure** to SQL spec, a special type that knows how to
-evaluate itself based on the definition in the data model. Every BI and visualization tool that can connect to Postgres or Redshift can connect to Cube.
+**SQL** is universally adopted across the data stack. Every BI tool, visualization platform, and data application can query a SQL data source. Cube implements Postgres-compatible SQL and extends it to support semantic layer concepts like measures—special types that know how to evaluate themselves based on data model definitions. Any tool that can connect to Postgres or Redshift can connect to Cube, making the semantic layer accessible to both AI agents and traditional analytics tools.
 
-Finally, Cube exposes **robust meta API for data model introspection.** It is vital to achieve interoperability because
-it enables other tools to inspect the data model definitions and take actions, e.g. provide context to the AI agents querying the semantic
-layer or create the necessary mappings in a BI tool to data model objects.
+**Data model introspection through the meta API** is essential for agentic analytics. It enables AI agents to discover available metrics, understand entity relationships, and determine valid queries—providing the context agents need to navigate the semantic layer autonomously. This same introspection capability allows BI tools to automatically map to data model objects and helps applications build dynamic interfaces.
 
 
 [ref-workbooks]: /product/workspace/workbooks