AMUSE: Adaptive Multi-Segment Encoding for Dataset Watermarking Paper • 2403.05628 • Published Mar 8, 2024
ArchBERT: Bi-Modal Understanding of Neural Architectures and Natural Languages Paper • 2310.17737 • Published Oct 26, 2023
From Segments to Scenes: Temporal Understanding in Autonomous Driving via Vision-Language Model Paper • 2512.05277 • Published 10 days ago • 4
LaWa: Using Latent Space for In-Generation Image Watermarking Paper • 2408.05868 • Published Aug 11, 2024 • 3
DivPrune: Diversity-based Visual Token Pruning for Large Multimodal Models Paper • 2503.02175 • Published Mar 4 • 3