ComfyUI-Sa2VAWrapper

A custom ComfyUI node that implements the ByteDance Sa2VA model, enabling video captioning and segmentation capabilities within ComfyUI.

Description

This extension integrates Sa2VA into ComfyUI, allowing you to generate detailed descriptions of video frames. Sa2VA-8B is a multimodal model developed by ByteDance that can understand video content and generate natural language descriptions.

WIP:

Add a node that can take a gif
Add node that implements segmentation

Features

Process sequences of images to generate detailed captions
Customizable prompting to guide the model's description
Seamless integration with ComfyUI workflow
GPU-accelerated inference with Flash Attention support

Installation

Prerequisites

ComfyUI installation

Method 1: Via ComfyUI Manager

Open ComfyUI Manager
Search for "Sa2VAWrapper"
Click Install

Method 2: Manual Installation

WORKDIR /comfyui/custom_nodes
RUN git clone https://github.com/pablerdo/ComfyUI-Sa2VAWrapper.git --recursive
WORKDIR /comfyui/custom_nodes/ComfyUI-Sa2VAWrapper
RUN git reset --hard (commit hash)
RUN if [ -f requirements.txt ]; then python -m pip install -r requirements.txt; fi
RUN if [ -f install.py ]; then python install.py || echo "install script failed"; fi

Name		Name	Last commit message	Last commit date
Latest commit History 26 Commits
nodes		nodes
utility		utility
.gitignore		.gitignore
LICENSE		LICENSE
README.md		README.md
__init__.py		__init__.py
pyproject.toml		pyproject.toml
requirements.txt		requirements.txt

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

ComfyUI-Sa2VAWrapper

Description

Features

Installation

Prerequisites

Method 1: Via ComfyUI Manager

Method 2: Manual Installation

About

Uh oh!

Releases

Packages

Uh oh!

Languages

License

Pablerdo/ComfyUI-Sa2VAWrapper

Folders and files

Latest commit

History

Repository files navigation

ComfyUI-Sa2VAWrapper

Description

Features

Installation

Prerequisites

Method 1: Via ComfyUI Manager

Method 2: Manual Installation

About

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Languages

Packages