Skip to content
Change the repository type filter

All

    Repositories list

    • PRISM

      Public
      PRISM: Robust VLM Alignment with Principled Reasoning for Integrated Safety in Multimodality
      Python
      0200Updated Aug 27, 2025Aug 27, 2025
    • MetaAgent

      Public
      Offical Repository of MetaAgent Program
      Python
      01730Updated Aug 26, 2025Aug 26, 2025
    • [ACL 2025] The official code for "AGrail: A Lifelong Agent Guardrail with Effective and Adaptive Safety Detection".
      Python
      12500Updated Aug 4, 2025Aug 4, 2025
    • llm-armor

      Public
      JavaScript
      0000Updated Jul 23, 2025Jul 23, 2025
    • armor

      Public
      Python
      0300Updated Jul 23, 2025Jul 23, 2025
    • [COLM 2024] JailBreakV-28K: A comprehensive benchmark designed to evaluate the transferability of LLM jailbreak attacks to MLLMs, and further assess the robustness and safety of MLLMs against a variety of jailbreak attacks.
      Python
      67320Updated May 9, 2025May 9, 2025
    • OET

      Public
      Python
      1800Updated May 5, 2025May 5, 2025
    • [ICLR 2025 Spotlight] The official implementation of our ICLR2025 paper "AutoDAN-Turbo: A Lifelong Agent for Strategy Self-Exploration to Jailbreak LLMs".
      Python
      4529840Updated Apr 15, 2025Apr 15, 2025
    • FIUBench

      Public
      A Task of Fictitious Unlearning for VLMs
      Jupyter Notebook
      12160Updated Apr 6, 2025Apr 6, 2025
    • Dolphins

      Public
      [ECCV 2024] The official code for "Dolphins: Multimodal Language Model for Driving“
      Python
      128050Updated Feb 10, 2025Feb 10, 2025
    • The homepage of SaFo Lab
      HTML
      0200Updated Nov 25, 2024Nov 25, 2024
    • List of T2I safety papers, updated daily, welcome to discuss using Discussions
      16300Updated Aug 12, 2024Aug 12, 2024
    • AdaShield

      Public
      [ECCV 2024] The official code for "AdaShield: Safeguarding Multimodal Large Language Models from Structure-based Attack via Adaptive Shield Prompting."
      Python
      26243Updated Jul 11, 2024Jul 11, 2024
    • .github

      Public
      Open codes from SaFoLab at University of Wisconsin–Madison
      0100Updated Jul 3, 2024Jul 3, 2024