etri-vilab/SafeLLaVA-7B
Image-Text-to-Text
ā¢
7B
ā¢
Updated
ā¢
66
ā¢
2
Visual Intelligence, Pretrained Vision-and-Language Model, Embodied AI, Collaborative Agents, Vision Task(Object Detection, Segmentation)