From 960d25f7f3632c3bdecf0565b30081d96fc7e00b Mon Sep 17 00:00:00 2001 From: Ahmed <76680009+AhmedRadwan02@users.noreply.github.com> Date: Wed, 4 Feb 2026 13:30:00 -0500 Subject: [PATCH 1/2] Updating AIXpert website content (#54) * Updating AIXpert website content * Removing Git Icon top right and updating papers --- docs/index.md | 83 ++++++---------- docs/papers.md | 63 +++++++++++++ docs/stylesheets/extra.css | 68 ++++++++++---- docs/team.md | 95 ++++++------------- docs/updates.md | 21 +++++ docs/user_guide.md | 187 ++++++------------------------------- mkdocs.yml | 9 +- 7 files changed, 224 insertions(+), 302 deletions(-) create mode 100644 docs/papers.md create mode 100644 docs/updates.md diff --git a/docs/index.md b/docs/index.md index 24a1460..2a825e7 100644 --- a/docs/index.md +++ b/docs/index.md @@ -1,21 +1,21 @@ -# AI Fairness Data Generation and Question Answering System +# AIXpert at Vector Institute -_Transparent tools and standardized benchmarks for **fair**, **explainable**, and **accountable** generative AI._ +_**[Vector Institute's](https://vectorinstitute.ai)** contribution to the [AIXpert Project](https://aixpert-project.eu/): tools, benchmarks, and research for **explainable**, **accountable**, and **fair** AI._ -> The rapid growth of generative AI brings powerful capabilities—but it also magnifies long-standing concerns around **bias, fairness, and representation**. Many models reproduce stereotypes embedded in training data, especially around demographic attributes (e.g., gender, ethnicity, age). -> This project enables **systematic, controlled experimentation** so researchers and practitioners can pinpoint _when_ and _why_ bias occurs—and what actually mitigates it. +> The AIXpert project aims to transform how AI is developed, deployed, and trusted by society. Vector’s work within AIXpert focuses on **responsible AI**: fairness-aware data generation and evaluation, multimodal benchmarks (audio-video, vision-language), factuality and transparency in agentic systems, and open tools for reproducible, governance-ready research. +--- -## 🌍 What is the project about? +## What we do -The **AI Fairness Data Generation and Question Answering System** is part of **[Vector Institute's](https://vectorinstitute.ai)** contribution to the broader [AIXPERT Project](https://aixpert-project.eu/), a multi-institutional initiative, to develop tools and benchmarks for **fairness-aware data generation and evaluation** in generative AI. +Vector’s contribution to AIXpert aligns with the project’s **vision and objectives**: -It provides: +- **Build an adaptable, explainable AI-agentic platform** — Develop interoperable tools and modules that connect explainability, accountability, and fairness. +- **Define and assess AI trustworthiness** — Establish measurable criteria and indicators for evaluating the reliability and ethical alignment of AI systems. +- **Advance explainable multimodal foundation models** — Drive research in interpretable vision–language–reasoning and multimodal understanding. +- **Demonstrate real-world impact** — Validate the framework across sectors including healthcare, employment, and education. -- **Controlled synthetic datasets** to isolate bias-inducing factors safely and reproducibly. -- **Agentic automation** (CrewAI + custom LLM agents) for prompt generation, content creation, metadata, and QC. -- **Fairness metrics & explainers** to visualize model behavior and surface disparities. -- **Open, configurable pipelines** aligned with responsible AI practices and emerging governance needs. +For the full **AIXpert vision, consortium, and funding**, see [About](about.md). --- @@ -23,61 +23,38 @@ It provides:
-- **Develop a Controlled Data Pipeline** - Create a reproducible, configurable pipeline for generating **text, image, and video** with precise control over **demographic** and **contextual** variables. +- **Explainable, accountable AI** + Develop tools and benchmarks for **interpretability**, **fairness**, and **transparency** in generative and multimodal AI, aligned with AIXpert’s vision. -- **Enable Fairness-Aware Benchmarking** - Provide tools to build matched **baseline vs. fairness-aware** datasets for bias diagnosis and mitigation experiments. +- **Trust, risk, and security in agentic AI** + Advance **TRiSM** (Trust, Risk, and Security Management) and transparency frameworks for safe, explainable agentic and multi-agent systems. -- **Support Multi-Domain Risk Analysis** - Generate multimodal data for **hiring, healthcare, legal, education**, and more, covering risks like **bias, toxicity, misinformation**. +- **Multimodal and real-world evaluation** + Create benchmarks and datasets for **audio-video understanding**, **vision-language** assessment, and **fairness** across domains and demographics. -- **Integrate Agentic AI for Automation** - Orchestrate generation and QC with **CrewAI** and **custom LLM agents** (prompts, assets, annotations, validation). +- **Define and assess AI trustworthiness** + Establish **measurable criteria** and evaluation suites for reliability, factuality, and ethical alignment of AI systems. -- **Advance Interpretability & Explainability** - Combine **zero-shot LLM explainers** and fairness metrics to produce **interpretable** assessments and visualizations. +- **Real-world impact** + Validate approaches across **healthcare, employment, education**, and high-stakes domains through pilots, benchmarks, and open releases. -- **Foster Open Research & Collaboration** - Share configs, tools, and docs openly to enable **reproducible research** and **transparent governance**. +- **Open, reproducible research** + Share **code**, **datasets**, and **documentation** openly to support reproducible research, benchmarking, and transparent governance.
--- -## Pipeline - -![Project Pipeline](assets/fairness_pipeline.jpg) - ---- - ## Recent updates -- :material-rocket-launch: **Released data generation pipeline** (multimodal, configurable, agent-orchestrated). -- :material-robot: **Single-agent pipeline** prototype for rapid dataset bootstrapping. -- :material-file-document: NeurIPS 2025 LLM-eval Workshop paper: [_Bias in the Picture: Benchmarking VLMs with Social-Cue News Images and LLM-as-Judge Assessment_](https://arxiv.org/abs/2509.19659) -- :material-file-document: Preprint: [_TRiSM for Agentic AI: A Review of Trust, Risk, and Security Management in LLM-based Agentic Multi-Agent Systems_](https://arxiv.org/abs/2506.04133) -- :material-file-document-edit-outline: TechRxiv article: [_Responsible Agentic Reasoning and AI Agents—A Critical Survey_](https://www.techrxiv.org/users/574774/articles/1329333-responsible-agentic-reasoning-and-ai-agents-a-critical-survey?mode=edit) -- :material-post-outline: Poster: **Single-Agent TRiSM** (NeurIPS LAW) - ---- +- :material-newspaper: **AIXpert news** — Our work was highlighted on the [AIXpert project website](https://aixpert-project.eu/2026/01/28/advancing-trustworthy-explainable-and-responsible-ai-at-neurips-2025/): *Advancing Trustworthy, Explainable, and Responsible AI at NeurIPS 2025* (Bias in the Picture, HumaniBench, Carbon Literacy, and more). +- :material-play-circle: **SONIC-O1** — Paper: [_A Real-World Benchmark for Evaluating MLLMs on Audio-Video Understanding_](https://arxiv.org/abs/2601.21666) (arXiv). +- :material-database: **SONIC-O1** — Dataset on [Hugging Face](https://huggingface.co/datasets/vector-institute/sonic-o1) (231 videos, ~60h, 4,958 QAs, 13 domains, demographic metadata). +- :material-github: **SONIC-O1** — [Code](https://github.com/VectorInstitute/sonic-o1) and evaluation pipeline (summarization, MCQ, temporal localization). +- :material-medal: **SONIC-O1** — [Leaderboard](https://huggingface.co/spaces/vector-institute/sonic-o1-leaderboard) for model comparisons and fairness analysis. - - -> Have feedback or want to contribute? See the [:material-account-group: Team](team.md) page and open an issue or pull request. +[:material-arrow-right: **View full list**](updates.md){ .md-button .md-button--primary } --- -## License - -This code in this repo is released under the **MIT License**. +> Have feedback or want to contribute? See the [:material-account-group: Team](team.md) page and open an issue or pull request. diff --git a/docs/papers.md b/docs/papers.md new file mode 100644 index 0000000..3061943 --- /dev/null +++ b/docs/papers.md @@ -0,0 +1,63 @@ +# Papers + +Selected publications and preprints from the AIXpert project. Each entry links to arXiv (or equivalent) where available. + +--- + +## AIXpert project papers + +### SONIC-O1: A Real-World Benchmark for Evaluating Multimodal Large Language Models on Audio-Video Understanding + +**Paper** · arXiv **Code** · GitHub **Dataset** · Hugging Face **Leaderboard** · Leaderboard + +**Authors:** Ahmed Y. Radwan, Christos Emmanouilidis, Hina Tabassum, Deval Pandya, Shaina Raza. + +SONIC-O1, a fully human-verified real-world audio-video benchmark with 4,958 annotations across 13 conversational domains. We evaluate multimodal models on video summarization, evidence-grounded QA, and temporal event localization, and release an extensible evaluation suite to support reproducible benchmarking and robustness analysis. + +### Reducing Hallucinations in LLMs via Factuality-Aware Preference Learning + +**Paper** · arXiv **Code** · GitHub **Dataset** · Hugging Face + +**Authors:** Sindhuja Chaduvula, Ahmed Y. Radwan, Azib Farooq, Yani Ioannou, Shaina Raza. + +Preference-learning method (F-DPO) that targets factuality directly, improving factuality scores while reducing hallucination rates across multiple open-weight LLMs. + +### Bias in the Picture: Benchmarking VLMs with Social-Cue News Images and LLM-as-Judge Assessment + +**Paper** (NeurIPS 2025 LLM-eval Workshop) · arXiv **Code** · GitHub + +**Authors:** Aravind Narayanan, Vahid Reza Khazaie, Shaina Raza. + +Benchmarking vision-language models with social-cue news images and LLM-as-judge assessment. + +### TRiSM for Agentic AI: A Review of Trust, Risk, and Security Management in LLM-based Agentic Multi-Agent Systems + +**Paper** · arXiv + +**Authors:** Shaina Raza, Ranjan Sapkota, Manoj Karkee, Christos Emmanouilidis. + +A review of trust, risk, and security management (TRiSM) in LLM-based agentic and multi-agent systems. + +### Responsible Agentic Reasoning and AI Agents—A Critical Survey + +**Paper** (TechRxiv) · Paper + +**Authors:** Shaina Raza (Vector Institute), Ranjan Sapkota, Manoj Karkee (Cornell University), Christos Emmanouilidis (University of Groningen). + +Critical survey of responsible agentic reasoning and AI agents. + +### Evaluating and Regulating Agentic AI: A Study of Benchmarks, Metrics and Regulation + +**Paper** (TechRxiv) · Paper **Code** · GitHub + +**Authors:** Azib Farooq, Shaina Raza, Nazmul Karim, Hasan Iqbal, Athanasios V. Vasilakos, Christos Emmanouilidis. + +Reviews recent progress in developing and assessing agentic AI along three dimensions: benchmarks, metrics, and governance. Analyzes how evaluation frameworks capture reasoning, planning, collaboration, and ethical alignment in single- and multi-agent systems. Aims to establish a unified foundation for trustworthy, auditable, and human-aligned AI agents. + +--- + + diff --git a/docs/stylesheets/extra.css b/docs/stylesheets/extra.css index 5c79bfd..3a56adb 100644 --- a/docs/stylesheets/extra.css +++ b/docs/stylesheets/extra.css @@ -1,3 +1,26 @@ +/* Inline badge icons (shields.io) for arXiv, GitHub, Hugging Face */ +.md-typeset img[src*="img.shields.io"] { + height: 1.15em; + vertical-align: middle; + margin-right: 0.15em; +} + +/* User guide: floating card for each project section */ +.md-content .md-typeset div.user-guide-card { + background-color: #f5f5f5 !important; + border-radius: 12px !important; + box-shadow: 0 2px 12px rgba(0, 0, 0, 0.08) !important; + padding: 1.25rem 1.5rem !important; + margin: 0.5rem 0 1rem 0 !important; + border: none !important; + display: block !important; +} + +[data-md-color-scheme="slate"] .md-content .md-typeset div.user-guide-card { + background-color: rgba(255, 255, 255, 0.06) !important; + box-shadow: 0 2px 12px rgba(0, 0, 0, 0.3) !important; +} + [data-md-color-primary="vector"] { --md-primary-fg-color: #eb088a; --md-primary-fg-color--light: #f252a5; @@ -91,47 +114,56 @@ } -/* Reduce space between team members */ +/* Team cards: compact, name + LinkedIn only, 3 per line */ .team-grid { display: grid; - grid-template-columns: repeat(auto-fill, minmax(280px, 1fr)); - gap: 1.5rem; + grid-template-columns: repeat(3, 1fr); + gap: 0.75rem; margin-top: 1rem; } .team-card { background-color: var(--md-surface); - border-radius: 12px; - box-shadow: 0 1px 4px rgba(0, 0, 0, 0.1); - padding: 1rem 1.25rem; + border-radius: 8px; + box-shadow: 0 1px 3px rgba(0, 0, 0, 0.08); + padding: 0.5rem 0.75rem; text-align: left; transition: transform 0.15s ease; + min-width: 0; } .team-card:hover { - transform: translateY(-3px); + transform: translateY(-2px); } .team-card h3 { - margin-top: 0; - margin-bottom: 0.25rem; + margin: 0 0 0.25rem 0; + font-size: 0.95em; + font-weight: 600; + white-space: nowrap; + overflow: hidden; + text-overflow: ellipsis; } .team-card p { - margin: 0.25rem 0; - line-height: 1.4; -} - -.team-links .twemoji, -.team-links svg { - width: 18px; - height: 18px; - vertical-align: text-bottom; + margin: 0; + line-height: 1.3; } .team-links a { text-decoration: none; color: var(--md-primary-fg-color); + font-size: 0.85em; +} + +.team-links .fab { + margin-right: 0.25em; +} + +@media (max-width: 600px) { + .team-grid { + grid-template-columns: 1fr; + } } /* Center logo */ diff --git a/docs/team.md b/docs/team.md index 870700b..ff582c5 100644 --- a/docs/team.md +++ b/docs/team.md @@ -2,100 +2,65 @@ The team at the Vector Institute behind the development of this project focuses on ethical AI practices, promoting fairness, accountability, and sustainability. -For inquiries or support, contact: - ---
-
-

Shaina Raza, PhD

-

Applied ML Scientist – Responsible AI

-

shaina.raza@vectorinstitute.ai

- +
+

Shaina Raza

+
-

Aravind Narayanan

-

Associate Applied ML Specialist

-

aravind.narayanan@vectorinstitute.ai

- +

Ahmed Radwan

+

Ananya Raval

-

Applied ML Specialist

-

ananya.raval@vectorinstitute.ai

- +
-

Shweta Khushu

-

Manager, AI Engineering

-

shweta.khushu@vectorinstitute.ai

- +

Aravind Narayanan

+

Edward Chang

-

Senior Technical Program Manager

-

edward.chang@vectorinstitute.ai

- + +
+ +
+

Karanpal Sekhon

+ +
+ +
+

Mahshid Alinoori

+ +
+ +
+

Shweta Khushu

+

Sindhuja Chaduvula

-

Associate Applied ML Specialist

-

sindhuja.chaduvula@vectorinstitute.ai

- +
-

Ahmed Radwan

-

Applied Machine Learning Intern

-

ahmed.radwan@vectorinstitute.ai

- +

Vahid Khazaie

+
-

Karanpal Sekhon

-

Applied Machine Learning Intern

-

karanpal.sekhon@vectorinstitute.ai

- +

Deval Pandya

+
-
+
--- diff --git a/docs/updates.md b/docs/updates.md new file mode 100644 index 0000000..3cb1818 --- /dev/null +++ b/docs/updates.md @@ -0,0 +1,21 @@ +# Updates + +Full list of recent papers, releases, and news. + +--- + +- :material-newspaper: **AIXpert news** — Our work was highlighted on the [AIXpert project website](https://aixpert-project.eu/2026/01/28/advancing-trustworthy-explainable-and-responsible-ai-at-neurips-2025/): *Advancing Trustworthy, Explainable, and Responsible AI at NeurIPS 2025* (Bias in the Picture, HumaniBench, Carbon Literacy, and more). +- :material-play-circle: **SONIC-O1** — Paper: [_A Real-World Benchmark for Evaluating MLLMs on Audio-Video Understanding_](https://arxiv.org/abs/2601.21666) (arXiv). +- :material-database: **SONIC-O1** — Dataset on [Hugging Face](https://huggingface.co/datasets/vector-institute/sonic-o1) (231 videos, ~60h, 4,958 QAs, 13 domains, demographic metadata). +- :material-github: **SONIC-O1** — [Code](https://github.com/VectorInstitute/sonic-o1) and evaluation pipeline (summarization, MCQ, temporal localization). +- :material-medal: **SONIC-O1** — [Leaderboard](https://huggingface.co/spaces/vector-institute/sonic-o1-leaderboard) for model comparisons and fairness analysis. +- :material-file-document: **F-DPO** — Paper: [_Reducing Hallucinations in LLMs via Factuality-Aware Preference Learning_](https://arxiv.org/abs/2601.03027) (arXiv). +- :material-database: **F-DPO** — Factuality-aware preference [dataset](https://huggingface.co/datasets/vector-institute/Factuality_Alignment) on Hugging Face. +- :material-github: **F-DPO** — [Code](https://github.com/VectorInstitute/Factual-Preference-Alignment) and [project page](https://vectorinstitute.github.io/Factual-Preference-Alignment/) (factuality-aware DPO, no reward model). +- :material-book-open-variant: **Survey (forthcoming)** — _Transparency in Agentic AI: A Survey of Interpretability, Explainability, and Governance_. Surveys traditional methods, aligns them to agent layers, and proposes a five-axis framework for organizing agentic transparency. +- :material-rocket-launch: **Released data generation pipeline** (multimodal, configurable, agent-orchestrated). +- :material-robot: **Single-agent pipeline** prototype for rapid dataset bootstrapping. +- :material-file-document: **Paper:** [_Bias in the Picture: Benchmarking VLMs with Social-Cue News Images and LLM-as-Judge Assessment_](https://arxiv.org/abs/2509.19659) (NeurIPS 2025 LLM-eval Workshop) +- :material-file-document: **Paper:** [_TRiSM for Agentic AI: A Review of Trust, Risk, and Security Management in LLM-based Agentic Multi-Agent Systems_](https://arxiv.org/abs/2506.04133) +- :material-file-document-edit-outline: **Paper:** [_Responsible Agentic Reasoning and AI Agents—A Critical Survey_](https://www.techrxiv.org/users/574774/articles/1329333-responsible-agentic-reasoning-and-ai-agents-a-critical-survey?mode=edit) (TechRxiv) +- :material-post-outline: **Paper:** **Single-Agent TRiSM** (Poster, NeurIPS LAW) diff --git a/docs/user_guide.md b/docs/user_guide.md index 84202cc..643e7a4 100644 --- a/docs/user_guide.md +++ b/docs/user_guide.md @@ -1,203 +1,70 @@ # User Guide -Transparent tools and standardized benchmarks for **fair**, **explainable**, and **accountable** generative AI. This guide introduces modules, setup, and usage patterns for fairness-aware data generation and analysis. - ---- - -## Table of Contents - -- [Getting Started](#getting-started) -- [Core Concepts](#core-concepts) -- [Contributing and Documentation](#contributing-and-documentation) +This guide points you to **each project’s repo and docs**. Installation and quick start are in the linked READMEs and project pages. --- ## Getting Started -**Four Essential Questions** +- **What is this?** — A research framework and toolkit for generating, evaluating, and mitigating bias in multimodal AI systems. +- **Why fairness-aware data?** — Controlled datasets isolate bias-inducing factors for targeted experiments on fairness and representation. +- **Why agentic AI?** — We use LLM agents (e.g. CrewAI) to scale prompt, image, and metadata generation. +- **Who is it for?** — Researchers and practitioners studying or benchmarking bias in generative models. -| Question | Answer | -|-----------|---------| -| **What is the project about?** | A research framework and toolkit for generating, evaluating, and mitigating bias in multimodal AI systems. | -| **Why Fairness-Aware Synthetic Data?** | Controlled datasets isolate bias-inducing factors, allowing targeted experiments on fairness, explainability, and representation. | -| **Why Agentic AI?** | We use autonomous LLM agents (via **CrewAI**) to scale prompt, image, and metadata generation. | -| **Who is project for?** | Researchers, data scientists, and fairness practitioners studying or benchmarking bias in generative models. | +Each project has its **own repository or module**; use the sections below for summaries and links. --- -### Installation - -#### From source (recommended) - -```bash -git clone https://github.com/VectorInstitute/vector-aixpert.git -cd vector-aixpert -uv sync -``` - -#### Optional groups +## SONIC-O1 -```bash -# Development extras -uv sync --dev -# Documentation build -uv sync --no-group docs -``` +Real-world benchmark for evaluating multimodal LLMs on **audio-video understanding**: short to long-form videos across 13 conversational domains (job interviews, medical, legal, etc.), with three tasks—summarization, multiple-choice QA, and temporal localization—and demographic metadata for fairness analysis. -#### Verify installation - -```bash -pytest -q -mkdocs serve -``` - ---- - -### Quick Starts - -Minimal commands to explore modules. - -```bash -# 1. Set up environment -uv sync -source .venv/bin/activate - -# 2. Run controlled image generation -cd src/aixpert/controlled_images/ -uv run python src/main.py \ - --config configs/img_gen_config.yaml - -# 3. Generate synthetic data (text) -cd src/aixpert/data_generation/synthetic_data_generation/nlp/ -uv run main.py \ - --config config.yaml \ - --stage all - -# 4. Generate synthetic data (images) -cd src/aixpert/data_generation/synthetic_data_generation/images/ -uv run main.py all_stages \ - --config_file ../../config.yaml \ - --prompt_yaml prompt_paths.yaml \ - --domain hiring \ - --risk security_risks - -# 5. Compute fairness metrics -cd src/aixpert/toxicity_fairness_analysis/ - -uv run python scripts/download_data.py \ - --dataset jigsaw \ - --out data/jigsaw.parquet \ - --sample 50000 - -uv run python scripts/llm_zero_shot_explain.py \ - --in data/jigsaw.parquet \ - --text_col comment_text \ - --task toxicity \ - --out outputs/zs_preds.parquet \ - --model distilgpt2 \ - --max_rows 1000 \ - --ig_rows 25 \ - --ig_steps 32 \ - --save_heatmaps \ - --force_float32 \ - --label_col target - --id_cols male female black white muslim jewish -``` - -Each module provides a focused README with configuration details and output examples. +**Links:** [GitHub](https://github.com/VectorInstitute/sonic-o1) · [Project page](https://vectorinstitute.github.io/sonic-o1/) · [Dataset](https://huggingface.co/datasets/vector-institute/sonic-o1) · [Leaderboard](https://huggingface.co/spaces/vector-institute/sonic-o1-leaderboard) --- -### Standard Usage +## F-DPO (Factual Preference Alignment) -Typical workflow for fairness-aware data generation: +Factuality-aware Direct Preference Optimization (F-DPO): extends DPO with binary factuality labels and a factuality-aware margin to reduce LLM hallucinations without an auxiliary reward model. Single-stage and compute-efficient. -1. **Generate controlled data** - Create matched datasets (e.g., gender, occupation, or ethnicity pairs). -2. **Run agentic generation pipeline** - Use CrewAI agents for multimodal prompt, image, and metadata generation. -3. **Perform fairness analysis** - Compute bias metrics such as Statistical Parity or Equal Opportunity. -4. **Visualize or export results** - Generate structured outputs or Hugging Face datasets for benchmarking. +**Links:** [GitHub](https://github.com/VectorInstitute/Factual-Preference-Alignment) · [Project page](https://vectorinstitute.github.io/Factual-Preference-Alignment/) · [Dataset](https://huggingface.co/datasets/vector-institute/Factuality_Alignment) --- -## Core Concepts +## Modules in AIXpert + +These modules live in the main [AIXpert](https://github.com/VectorInstitute/AIXpert) repository. Clone once, run `uv sync`, then use the READMEs below for setup and commands. ### Controlled Images -Generates baseline vs fairness-aware image sets for occupations or social groups. -Supports configurable attributes, matched prompts, and consistent random seeds for reproducibility. +Baseline vs fairness-aware image sets for occupations or social groups; configurable attributes, matched prompts, and reproducible seeds. -### Synthetic Data Generation +**Links:** [Module README](https://github.com/VectorInstitute/AIXpert/blob/main/src/aixpert/controlled_images/README.md) -Multi-modal data synthesis modules under: +### Synthetic Data Generation -* `synthetic_data_generation/images` — image + VQA pairs -* `synthetic_data_generation/nlp` — textual scenes and MCQs -* `synthetic_data_generation/videos` — Veo/Gemini video generation +Multi-modal synthesis: image + VQA pairs, textual scenes and MCQs, and video generation (Veo/Gemini). Driven by LLM-designed prompts and metadata templates. -Each generator is driven by LLM-designed prompts and metadata templates. +**Links:** [Images README](https://github.com/VectorInstitute/AIXpert/blob/main/src/aixpert/data_generation/synthetic_data_generation/images/README.md) · [NLP README](https://github.com/VectorInstitute/AIXpert/blob/main/src/aixpert/data_generation/synthetic_data_generation/nlp/README.md) ### Agent Pipeline (CrewAI) -Implements single-agent orchestration to chain prompt → image → metadata generation. -Enables autonomous large-scale data creation using structured JSON task definitions. +Single-agent orchestration for prompt → image → metadata generation and large-scale data creation with structured JSON task definitions. +**Links:** [Module README](https://github.com/VectorInstitute/AIXpert/blob/main/src/aixpert/data_generation/agent_pipeline/README.md) ### Fairness & Explainability -Evaluates generated data and model outputs via: - -* **Statistical metrics** — Statistical Parity, Equal Opportunity -* **Zero-shot explainers** — integrated gradients, concept attributions -* **Visualization tools** — disparity plots, attribution maps - -### Module Quick Start (one-liners + deep links) - -Each core module has its own README with commands, configurations, and sample outputs. - -* **Controlled Images** — Generate matched baseline vs fairness-aware images across professions. - ➜ [`View Module README`](https://github.com/VectorInstitute/vector-aixpert/blob/main/src/aixpert/controlled_images/README.md) - -* **Agent Pipeline (CrewAI)** — Single-agent orchestration for prompt, image, and metadata generation. - ➜ [`View Module README`](https://github.com/VectorInstitute/vector-aixpert/blob/main/src/aixpert/data_generation/agent_pipeline/README.md) - -* **Synthetic Data · Images** — Domain- and risk-specific image prompts and VQA pairs. - ➜ [`View Module README`](https://github.com/VectorInstitute/vector-aixpert/blob/main/src/aixpert/data_generation/synthetic_data_generation/images/README.md) - -* **Synthetic Data · NLP** — Scene descriptions and MCQ generation for text pipelines. - ➜ [`View Module README`](https://github.com/VectorInstitute/vector-aixpert/blob/main/src/aixpert/data_generation/synthetic_data_generation/nlp/README.md) +Statistical metrics (e.g. Statistical Parity, Equal Opportunity), zero-shot explainers (integrated gradients, concept attributions), and visualization (disparity plots, attribution maps). Lives under `toxicity_fairness_analysis` in the AIXpert repo. +**Links:** [AIXpert](https://github.com/VectorInstitute/AIXpert) --- ## Contributing and Documentation -See [CONTRIBUTING.md](https://github.com/VectorInstitute/vector-aixpert/blob/main/CONTRIBUTING.md) for: +- [CONTRIBUTING.md](https://github.com/VectorInstitute/AIXpert/blob/main/CONTRIBUTING.md): coding standards (PEP8, Google docstrings), pre-commit (`ruff`, `mypy`, `typos`, `nbQA`), branching, and tests. -* Coding standards and style guide (PEP8 + Google docstrings) -* Pre-commit setup (`ruff`, `mypy`, `typos`, `nbQA`) -* Branching and PR workflow -* Test coverage requirements +**Docs locally:** `uv sync --no-group docs` then `mkdocs serve` → [http://127.0.0.1:8000](http://127.0.0.1:8000). -### Docs build - -```bash -uv sync --no-group docs -mkdocs serve -``` - -The site will be live at [http://127.0.0.1:8000](http://127.0.0.1:8000). - -### Testing and Standards - -```bash -pytest -v -pre-commit run --all-files -``` - -Continuous integration runs these via GitHub Actions (`code_checks.yml`, `unit_tests.yml`, `integration_tests.yml`). - ---- +**CI:** GitHub Actions (`code_checks.yml`, `unit_tests.yml`, `integration_tests.yml`). diff --git a/mkdocs.yml b/mkdocs.yml index 30b8485..ecdf7f0 100644 --- a/mkdocs.yml +++ b/mkdocs.yml @@ -5,8 +5,6 @@ extra: social: - icon: fontawesome/brands/discord link: 404.html - - icon: fontawesome/brands/github - link: https://github.com/VectorInstitute/vector-aixpert markdown_extensions: - attr_list - admonition @@ -39,8 +37,9 @@ markdown_extensions: nav: - Home: index.md - User Guide: user_guide.md + - Updates: updates.md # - API Reference: api.md - - Technical Report: assets/aixpert-technical-report.pdf + - Papers: papers.md - Team: team.md - About: about.md plugins: @@ -59,9 +58,7 @@ plugins: show_root_toc_entry: false show_symbol_type_heading: true show_symbol_type_toc: true -repo_url: https://github.com/VectorInstitute/vector-aixpert -repo_name: VectorInstitute/vector-aixpert -site_name: vector-aixpert +site_name: AIXpert theme: custom_dir: docs/overrides favicon: assets/favicon-48x48.svg From 00c4e3a84902d4f54aaea9950dd40d8e3a2e25a0 Mon Sep 17 00:00:00 2001 From: aravind-3105 Date: Wed, 4 Feb 2026 14:16:41 -0500 Subject: [PATCH 2/2] Update repository links in user guide to point to vector-aixpert --- docs/user_guide.md | 12 ++++++------ 1 file changed, 6 insertions(+), 6 deletions(-) diff --git a/docs/user_guide.md b/docs/user_guide.md index 643e7a4..3db65f8 100644 --- a/docs/user_guide.md +++ b/docs/user_guide.md @@ -33,37 +33,37 @@ Factuality-aware Direct Preference Optimization (F-DPO): extends DPO with binary ## Modules in AIXpert -These modules live in the main [AIXpert](https://github.com/VectorInstitute/AIXpert) repository. Clone once, run `uv sync`, then use the READMEs below for setup and commands. +These modules live in the main [AIXpert](https://github.com/VectorInstitute/vector-aixpert) repository. Clone once, run `uv sync`, then use the READMEs below for setup and commands. ### Controlled Images Baseline vs fairness-aware image sets for occupations or social groups; configurable attributes, matched prompts, and reproducible seeds. -**Links:** [Module README](https://github.com/VectorInstitute/AIXpert/blob/main/src/aixpert/controlled_images/README.md) +**Links:** [Module README](https://github.com/VectorInstitute/vector-aixpert/blob/main/src/aixpert/controlled_images/README.md) ### Synthetic Data Generation Multi-modal synthesis: image + VQA pairs, textual scenes and MCQs, and video generation (Veo/Gemini). Driven by LLM-designed prompts and metadata templates. -**Links:** [Images README](https://github.com/VectorInstitute/AIXpert/blob/main/src/aixpert/data_generation/synthetic_data_generation/images/README.md) · [NLP README](https://github.com/VectorInstitute/AIXpert/blob/main/src/aixpert/data_generation/synthetic_data_generation/nlp/README.md) +**Links:** [Images README](https://github.com/VectorInstitute/vector-aixpert/blob/main/src/aixpert/data_generation/synthetic_data_generation/images/README.md) · [NLP README](https://github.com/VectorInstitute/vector-aixpert/blob/main/src/aixpert/data_generation/synthetic_data_generation/nlp/README.md) ### Agent Pipeline (CrewAI) Single-agent orchestration for prompt → image → metadata generation and large-scale data creation with structured JSON task definitions. -**Links:** [Module README](https://github.com/VectorInstitute/AIXpert/blob/main/src/aixpert/data_generation/agent_pipeline/README.md) +**Links:** [Module README](https://github.com/VectorInstitute/vector-aixpert/blob/main/src/aixpert/data_generation/agent_pipeline/README.md) ### Fairness & Explainability Statistical metrics (e.g. Statistical Parity, Equal Opportunity), zero-shot explainers (integrated gradients, concept attributions), and visualization (disparity plots, attribution maps). Lives under `toxicity_fairness_analysis` in the AIXpert repo. -**Links:** [AIXpert](https://github.com/VectorInstitute/AIXpert) +**Links:** [AIXpert](https://github.com/VectorInstitute/vector-aixpert) --- ## Contributing and Documentation -- [CONTRIBUTING.md](https://github.com/VectorInstitute/AIXpert/blob/main/CONTRIBUTING.md): coding standards (PEP8, Google docstrings), pre-commit (`ruff`, `mypy`, `typos`, `nbQA`), branching, and tests. +- [CONTRIBUTING.md](https://github.com/VectorInstitute/vector-aixpert/blob/main/CONTRIBUTING.md): coding standards (PEP8, Google docstrings), pre-commit (`ruff`, `mypy`, `typos`, `nbQA`), branching, and tests. **Docs locally:** `uv sync --no-group docs` then `mkdocs serve` → [http://127.0.0.1:8000](http://127.0.0.1:8000).