Description:
I've noticed that when using DeepSeek-VL2-tiny, predictions for the same input differ significantly between single inference (processing one input at a time) and batch inference (processing multiple inputs together): the output for an item in a batch is notably different from its output when that same item is run alone. I suspect this might relate to how batch normalization or dropout is handled at inference time, or to a difference in how the data is preprocessed (e.g. padding). Any guidance on why this discrepancy occurs, or how to align the results, would be greatly appreciated.
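
For reference, here is a minimal, model-agnostic sketch of the comparison I have in mind. It uses GPT-2 from `transformers` purely as a stand-in for the actual model (the `deepseek_vl2` processor/model API differs, so this is an assumption about the comparison, not the repo's code): the model is put in eval mode to disable dropout, decoding is greedy, and the shorter prompt is left-padded inside the batch with its attention mask passed through.

```python
# Minimal stand-in repro: compare greedy output for one prompt run alone
# vs. inside a padded batch. GPT-2 is a placeholder; the same comparison
# would apply to the language backbone of deepseek-vl2-tiny.
import torch
from transformers import AutoModelForCausalLM, AutoTokenizer

model_name = "gpt2"  # stand-in; swap in your actual model
tokenizer = AutoTokenizer.from_pretrained(model_name)
tokenizer.pad_token = tokenizer.eos_token
tokenizer.padding_side = "left"  # keeps the prompt flush against the generated tokens

model = AutoModelForCausalLM.from_pretrained(model_name).eval()  # eval() disables dropout

prompts = [
    "The capital of France is",
    "A much longer prompt that forces the shorter one to be padded in the batch:",
]

with torch.no_grad():
    # Single inference for the first prompt.
    single = tokenizer(prompts[0], return_tensors="pt")
    out_single = model.generate(**single, max_new_tokens=20, do_sample=False)

    # Batch inference: same prompt, now padded alongside a longer one.
    batch = tokenizer(prompts, return_tensors="pt", padding=True)
    out_batch = model.generate(**batch, max_new_tokens=20, do_sample=False)

print("single: ", tokenizer.decode(out_single[0], skip_special_tokens=True))
print("batched:", tokenizer.decode(out_batch[0], skip_special_tokens=True))
```

With these settings the single and batched decodes of the first prompt should agree token-for-token, apart from tiny floating-point differences introduced by batched kernels. If the DeepSeek-VL2 outputs still diverge substantially under greedy decoding and eval mode, my guess would be the preprocessing path (padding side, truncation, or how images are batched), though I'd appreciate confirmation.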