Skip to content

Pablerdo/ComfyUI-Sa2VAWrapper

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

26 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

ComfyUI-Sa2VAWrapper

A custom ComfyUI node that implements the ByteDance Sa2VA model, enabling video captioning and segmentation capabilities within ComfyUI.

Description

This extension integrates Sa2VA into ComfyUI, allowing you to generate detailed descriptions of video frames. Sa2VA-8B is a multimodal model developed by ByteDance that can understand video content and generate natural language descriptions.

WIP:

  • Add a node that can take a gif
  • Add node that implements segmentation

Features

  • Process sequences of images to generate detailed captions
  • Customizable prompting to guide the model's description
  • Seamless integration with ComfyUI workflow
  • GPU-accelerated inference with Flash Attention support

Installation

Prerequisites

  • ComfyUI installation

Method 1: Via ComfyUI Manager

  1. Open ComfyUI Manager
  2. Search for "Sa2VAWrapper"
  3. Click Install

Method 2: Manual Installation

WORKDIR /comfyui/custom_nodes
RUN git clone https://github.com/pablerdo/ComfyUI-Sa2VAWrapper.git --recursive
WORKDIR /comfyui/custom_nodes/ComfyUI-Sa2VAWrapper
RUN git reset --hard (commit hash)
RUN if [ -f requirements.txt ]; then python -m pip install -r requirements.txt; fi
RUN if [ -f install.py ]; then python install.py || echo "install script failed"; fi

About

No description, website, or topics provided.

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages