HomeGuard-8B
HomeGuard-8B is an 8B-parameter vision-language safeguard model for identifying contextual risk in household tasks. It is introduced in the paper HomeGuard: VLM-based Embodied Safeguard for Identifying Contextual Risk in Household Task and is designed to help embodied agents detect subtle, implicit hazards that arise from environmental context rather than explicit malicious intent.
This checkpoint corresponds to the 8B step-RFT model used in the HomeGuard framework. It is built on top of Qwen3-VL-8B-Thinking and further optimized for grounded household risk reasoning with reinforcement fine-tuning.
Model Summary
HomeGuard focuses on scenarios where a seemingly benign instruction becomes unsafe because of object attributes, spatial relations, or latent environmental conditions.
Compared with generic VLMs, HomeGuard is specialized for:
- contextual risk identification
- grounded multimodal safety reasoning
- safety-aware support for downstream planning and trajectory generation
Training Recipe
This model is derived from Qwen3-VL-8B-Thinking and trained within the HomeGuard pipeline.
Training setup summarized from the released training configuration:
- Base model:
Qwen/Qwen3-VL-8B-Thinking - Training stage: step-level RFT + GRPO-style optimization in the HomeGuard pipeline
- Training data: HomeSafe
Intended Use
HomeGuard-8B is intended for research and development on:
- safety assessment for embodied agents
- contextual risk identification in household tasks
- grounded VLM reasoning with visual context
- safe planning and downstream robotics pipelines
Usage
This repository contains the inference-ready model weights and tokenizer assets. A typical Transformers loading pattern is:
from transformers import AutoProcessor, Qwen3VLForConditionalGeneration
model_id = "Ursulalala/HomeGuard-8B"
processor = AutoProcessor.from_pretrained(model_id)
model = Qwen3VLForConditionalGeneration.from_pretrained(
model_id,
torch_dtype="auto",
device_map="auto",
)
For full prompting, evaluation, and application examples, please refer to the HomeGuard project repository.
Resources
- Paper: HomeGuard: VLM-based Embodied Safeguard for Identifying Contextual Risk in Household Task
- Code: AI45Lab/HomeGuard
- Dataset: Ursulalala/HomeSafe
- Base model: Qwen/Qwen3-VL-8B-Thinking
Citation
If you use this model, please cite the HomeGuard paper:
@article{lu2026homeguard,
title={HomeGuard: VLM-based Embodied Safeguard for Identifying Contextual Risk in Household Task},
author={Lu, Xiaoya and Zhou, Yijin and Chen, Zeren and Wang, Ruocheng and Sima, Bingrui and Zhou, Enshen and Sheng, Lu and Liu, Dongrui and Shao, Jing},
journal={arXiv preprint arXiv:2603.14367},
year={2026}
}
- Downloads last month
- 107