-
-
Notifications
You must be signed in to change notification settings - Fork 408
Closed
Labels
enhancementNew feature or requestNew feature or request
Description
Description
With the advent of multimodal modals like Phi-4-multimodal (5.6b parameters), it might be interesting to enable interacting with these models in the app.
Use Case
Would be great to upload an image and talk about it. As Phi-4-multimodal also support speech, that could also be included in the feature request. However, I think images might have a higher priority, for instance for visually impaired pocketpal users who could start relying on offline on-device image recognition and descriptions.
temsa, Yomghi29, LedyBacer, theprashant-one, Amusingdock25 and 4 morearthurcavalcant
Metadata
Metadata
Assignees
Labels
enhancementNew feature or requestNew feature or request