#

content-safety

Here are 19 public repositories matching this topic...

xiangxinai / xiangxin-guardrails

Xiangxin Guardrails is an open-source, context-aware AI guardrails platform that provides protection against prompt injection attacks, content safety risks, and data leakage. It can be deployed as a security gateway or integrated via API, offering enterprise-grade, fully private deployment options.

security ai dlp content-filter security-gateway guardrails prompt-injection data-leakage-prevention content-safety ai-guardrails llm-firewalls ai-application-firewall

Updated Oct 24, 2025
Python

openguardrails / openguardrails

OpenGuardrails: A Configurable, Unified, and Scalable Guardrails Platform for Large Language Models

jailbreaking guardrails prompt-injection llm-security data-leakage-prevention llm-safety content-safety

Updated Oct 30, 2025
Python

darkwaves-ofc / nude-detect

NudeDetect is a Python-based tool for detecting nudity and adult content in images. This project combines the capabilities of the NudeNet library, EasyOCR for text detection, and the Better Profanity library for identifying offensive language in text.

Updated Jan 1, 2025
Python

Azure-Samples / rai-content-safety-workshop

Step-by-Step tutorial that teaches you how to use Azure Safety Content - the prebuilt AI service that helps ensure that content sent to user is filtered to safeguard them from risky or undesirable outcomes

azure aiml workshop-materials responsible-ai content-safety

Updated Jul 1, 2024
Jupyter Notebook

vibheksoni / jailbench

Benchmark LLM jailbreak resilience across providers with standardized tests, adversarial mode, rich analytics, and a clean Web UI.

Updated Aug 12, 2025
Python

cristofima / TaskAgent-AgenticAI

An intelligent task management assistant built with .NET, Microsoft Agentic AI Framework and Azure OpenAI, demonstrating Clean Architecture and autonomous AI agent capabilities

sql-server mvc dotnet clean-architecture ai-agent azure-open-ai content-safety agentic-framework

Updated Oct 29, 2025
C#

OrenGrinker / contentSafetyFilter

A Chrome extension that uses Claude AI to protect users under 18 from inappropriate content by analyzing webpage content in real-time.

nodejs chrome-extension typescript chrome-extensions claude-ai claude-api content-safety claude-3-sonnet

Updated Nov 30, 2024
TypeScript

joemathew2004 / Study-Buddy

Study Buddy is a user-friendly AI-powered web app that helps students generate safe, factual study notes and Q&A on any topic. It features user accounts, study history, and strong content safety filters—making learning interactive and secure.

python learning education flask ai study chatbot project webapp qna groq content-safety

Updated Jul 5, 2025
HTML

cristofima / Demo-AzureAIContentSafety

Content moderation (text and image) in a social network demo

angular dot-net azure-storage content-moderation content-safety

Updated Jan 5, 2025
C#

RafaelParonis / jailbench

🔍 Benchmark jailbreak resilience in LLMs with JailBench for clear insights and improved model defenses against jailbreak attempts.

python flask analytics openai alignment model-evaluation ai-safety security-testing red-teaming model-robustness anthropic litellm content-safety llm-jailbreaks tool-calling llm-benchmark ai-evals textual-tui

Updated Oct 31, 2025
Python

khanovico / sentinelshield-ai-guard

SentinelShield: Advanced AI content moderation combining Llama Prompt Guard 2, rule-based filtering, and real-time analysis. Protect your applications from harmful content, prompt injection attacks, and inappropriate material with sub-second response times.

python nlp security machine-learning ai moderation content-filtering ai-safety content-moderation fastapi hate-speech-detection prompt-injection content-safety llama-guard prompt-security

Updated Aug 7, 2025
Python

sammydeprez / presentations

Technical presentations with hands-on demos

python machine-learning ai jupyter-notebook presentations educational ai-agents azure-ai responsible-ai azure-openai llm prompt-engineering langchain content-safety langgraph

Updated Oct 26, 2025
Jupyter Notebook

amafjarkasi / hsx-context-hygiene-engine

Context hygiene & risk adjudication for LLM pipelines: secrets, PII, prompt-injection, policy redaction & tokenization.

nodejs cli redaction security typescript compliance tokenization policy-engine secret-scanning data-sanitization pii-redaction llm prompt-injection llm-security content-safety context-hygiene

Updated Sep 4, 2025
TypeScript

hdprajwal / Guardscribe

Real-time speech-to-text system with toxic content detection and filtering. Transcribes live audio using multiple ASR options while automatically detecting and masking harmful language.

natural-language-processing speech-to-text content-filtering content-safety toxic-content-moderation

Updated Aug 21, 2025
Python

jonathan-vella / SAIF

A 3-tier diagnostic application designed for hands-on learning about securing AI systems across identity, network, application, and content safety domains.

security ai azure foundations zero-trust responsible-ai content-safety

Updated Sep 5, 2025
PowerShell

Impact-Analyzer

Tailwind-Stocker / Impact-Analyzer

Impact Analyzer is a web app that helps you detect toxicity and analyze nuance in your writing before publishing, ensuring your content is respectful, clear, and aligned with your intent.

sentiment-analysis web-app toxicity-detection content-safety nuance-analysis content-review

Updated May 29, 2024
HTML

CharlyProgrammer / Text-content-analyzer

Azure safety content example using python for text analysis

python ai azure azure-sdk-for-python content-safety

Updated Sep 22, 2024
Python

ashleysally00 / google-cloud-natural-language-text-safety-explained

This tutorial demonstrates how to use the Google Cloud Natural Language API for text moderation. It provides a step-by-step guide to detecting and managing harmful content while promoting responsible AI practices.

ai-education responsible-ai text-sentiment-analysis text-moderation content-safety google-cloud-nlp natural-language-processing-education text-moderation-demo text-analysis-guide

Updated Jan 6, 2025
JavaScript

miozilla / promptshield

promptshield 🛡️ : Tech & Social Media #Content-Safety

java csharp py content-safety

Updated May 19, 2025
Python

Improve this page

Add a description, image, and links to the content-safety topic page so that developers can more easily learn about it.

Curate this topic

Add this topic to your repo

To associate your repository with the content-safety topic, visit your repo's landing page and select "manage topics."