Expand description
Vision model captioning for images.
This module provides infrastructure for generating captions from images
using vision models. With the vision feature enabled, a BLIP-based
captioner is available. Otherwise, only a placeholder implementation exists.
Structs§
- Blip
Captioner - BLIP-based image captioner using Candle.
- Caption
Config - Configuration for vision captioning.
- Placeholder
Captioner - Placeholder vision captioner that returns no captions.
Enums§
- Caption
Error - Error type for vision captioning operations.
Constants§
- BLIP_
IMAGE_ 🔒SIZE - Image size for BLIP preprocessing.
- BLIP_
MODEL_ 🔒ID - BLIP model identifier on HuggingFace Hub.
Traits§
- Image
Captioner - Trait for vision-based image captioning.