trust-and-safety

Here are 67 public repositories matching this topic...

roostorg / osprey

Automate the obvious and investigate the ambiguous. High-performance safety rules engine for real-time event processing at scale.

trust-and-safety roost roost-tools roost-osprey

Updated May 4, 2026
Python

roostorg / awesome-safety-tools

Star

Directory of open source tools for online safety

safety trust-and-safety roost online-safety roost-tools

Updated Apr 3, 2026

roostorg / model-community

Star

Making open safety AI models accessible and beneficial to the safety community

trust-and-safety roost roost-tools

Updated Apr 26, 2026
Jupyter Notebook

tattle-made / Uli

Sponsor

Star

Software and Resources for Mitigating Online Gender Based Violence in India

nlp machine-learning ml browser-extension india social-impact sdg indic-languages indic indian-languages trust-and-safety gender-based-violence extension-chrome content-moderation ogbv sdg-10 sdg-5

Updated Apr 23, 2026
JavaScript

swicg / activitypub-trust-and-safety

Star

ActivityPub Trust and Safety Taskforce

activitypub fediverse trust-and-safety

Updated Apr 1, 2026
HTML

roostorg / coop

Star

Review and moderation, your way. Online safety dashboard, queues, routing and automatic enforcement rules, and integrations.

trust-and-safety roost child-safety content-safety roost-tools roost-coop

Updated May 2, 2026
TypeScript

haileyok / phoebe

Star

A trust and safety agent that interacts with Osprey for investigation, real-time analysis, and prevention implementations

agent trust-and-safety atproto

Updated Feb 7, 2026
Python

disciplinedware / swiftward

Star

Self-hosted Trust & Safety policy engine with A/B testing, replay, and full audit trails

golang yaml self-hosted deterministic ai-safety fraud-detection policy-engine audit-trail trust-and-safety content-moderation

Updated Feb 9, 2026

roostorg / community

Star

Documentation and policies for the ROOST organization and open source community. File non-technical or ROOST-wide issues here.

trust-and-safety roost roost-tools

Updated Apr 27, 2026
CSS

aceteam-ai / safeclaw

Star

The safe version of OpenClaw. Same agent, with safety and accountability built in. Powered by AEP.

proxy safety ai-safety accountability ai-agents aep trust-and-safety llm enterprise-ai openclaw

Updated Apr 30, 2026
TypeScript

mpessis / supply-ops-agent

Star

Supply-side ad ops diagnostic agent built on AAMP standards. MCP server with bid rate analysis, IVT detection, ads.txt compliance, and demand diagnostics.

python mcp supply-chain adtech openrtb ai-agents trust-and-safety aamp programmatic-advertising

Updated Apr 20, 2026
Python

gian-gg / sabot

Star

Your third-party safety layer for verified, transparent, and scam-free online transactions.

security typescript ai nextjs smart-contracts blockchain p2p transactions web3 fraud-prevention escrow trust-and-safety

Updated Apr 10, 2026
TypeScript

haileyok / gopdq

Star

A Go implementation of Facebook's PDQ

trust-and-safety pdq

Updated Jan 15, 2026
Go

crispthinking / PdqHash

Star

A .NET implementation of the PDQ hashing algorithm to make integrating Trust and Safety tools for digital service providers easier.

hashing security dotnet trust-and-safety pdq

Updated Apr 29, 2026
C#

MOSTRE / discord-exploit-research

Star

⚠️ Active 0-day exploit allowing any Discord server to be permanently terminated via automated pipeline abuse. Research, attack analysis, and protection guide for server owners. Unpatched as of April 2026.

discord cybersecurity vulnerability infosec 0day responsible-disclosure security-research trust-and-safety server-security discord-security-bot discord-exploit discord-bug

Updated Apr 13, 2026

prysaic-labs / OpenSiteTrust

Star

OpenSiteTrust is an open, explainable, and reusable website scoring ecosystem

open-data browser-extension crowdsourcing open-api trust-and-safety explainable-ai privacy-by-design security-headers phishing-detection reputation-system risk-scoring trust-score url-analysis brand-impersonation community-moderation scam-detection misinformation-detection domain-intelligence website-trust

Updated Aug 20, 2025
Python

jordanstarrk / mcp-preflight

Star

ls -la for MCP servers. See tools, resources, and risky capabilities before you connect or trust a server.

mcp developer-tools inspection ai-agents trust-and-safety model-context-protocol mcp-server agent-infrastructure

Updated Feb 15, 2026
Python

PRADUMAN-KR / Multimodal-Lip-Sync-Deepfake-Detection-System

Star

Production-ready Multimodal Lip Sync Detection & Deepfake Detection System. Detects audio-video synchronization mismatches using deep learning (PyTorch) with a scalable FastAPI-based inference pipeline. Optimized for real-time processing,low false positives, and robust performance on noisy speech segments. Built for video forensics,synthetic media

computer-vision neural-network detection pytorch resnet multimodal-learning trust-and-safety ai-security deepfakes content-moderation temporal-modeling cross-modal-learning forensic-tools media-forensics real-time-inference lip-sync-detection audio-video-sync

Updated Mar 25, 2026
Python

aloth / origin-lens

Star

Combat fake news with cryptographic image verification. Origin Lens analyzes C2PA Content Credentials and EXIF metadata to detect AI-generated content, verify digital signatures, and reveal complete edit history. Privacy-first open source iOS app with on-device verification. (arXiv:2602.03423)

Updated Mar 7, 2026
Dart

w-henderson / ProjectPositiveVibes

Star

🤝 Using large language models to seamlessly help content moderators make better decisions, faster.

trust-and-safety content-moderation gpt-3

Updated Mar 29, 2023
TypeScript

Improve this page

Add a description, image, and links to the trust-and-safety topic page so that developers can more easily learn about it.

Curate this topic

Add this topic to your repo

To associate your repository with the trust-and-safety topic, visit your repo's landing page and select "manage topics."

Learn more

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

trust-and-safety

Here are 67 public repositories matching this topic...

roostorg / osprey

roostorg / awesome-safety-tools

roostorg / model-community

tattle-made / Uli

swicg / activitypub-trust-and-safety

roostorg / coop

haileyok / phoebe

disciplinedware / swiftward

roostorg / community

aceteam-ai / safeclaw

mpessis / supply-ops-agent

gian-gg / sabot

haileyok / gopdq

crispthinking / PdqHash

MOSTRE / discord-exploit-research

prysaic-labs / OpenSiteTrust

jordanstarrk / mcp-preflight

PRADUMAN-KR / Multimodal-Lip-Sync-Deepfake-Detection-System

aloth / origin-lens

w-henderson / ProjectPositiveVibes

Improve this page

Add this topic to your repo