Our Publications

Selected work

WACV 2026 2 papers

Reverse Personalization

Safe Vision-Language Models via Unsafe Weights Manipulation

Moreno D'Incà, Elia Peruzzo, Xingqian Xu, Humphrey Shi, Nicu Sebe, Massimiliano Mancini

AAAI 2026 3 papers

Masked Clustering Prediction for Unsupervised Point Cloud Pre-training

Bin Ren, Xiaoshui Huang, Mengyuan Liu, Hong Liu, Fabio Poiesi, Nicu Sebe, Guofeng Mei

Open-World Deepfake Attribution via Confidence-Aware Asymmetric Learning

Haiyang Zheng, Nan Pu, Wenjing Li, Teng Long, Nicu Sebe, Zhun Zhong

Wasserstein-Aligned Hyperbolic Multi-View Clustering

Rui Wang, Yuting Jiang, Xiaoqing Luo, Xiaojun Wu, Nicu Sebe, Ziheng Chen

NeurIPS 2025 6 papers

ConViS-Bench: Estimating Video Similarity Through Semantic Concepts

ImageNet-trained CNNs are not biased towards texture: Revisiting feature reliance through controlled suppression
Oral

Tom Burgert, Oliver Stoll, Paolo Rota, Begüm Demir

Increasing the Utility of Synthetic Images through Chamfer Guidance

Nicola Dall'Asen, Xiaofeng Zhang, Reyhane Askari Hemmat, Melissa Hall, Jakob Verbeek, Adriana Romero-Soriano, Michal Drozdzal

SceneSplat++: A Large Dataset and Comprehensive Benchmark for Language Gaussian Splatting

Mengjiao Ma, Qi Ma, Yue Li, Jiahuan Cheng, Runyi Yang, Bin Ren, Nikola Popovic, Mingqiang Wei, Nicu Sebe, Ender Konukoglu, Luc Van Gool, Theo Gevers, Martin R. Oswald, Danda Pani Paudel

Towards a General Attention Framework on Gyrovector Spaces for Matrix Manifolds

Rui Wang, Chen Hu, Xiaoning Song, Xiaojun Wu, Nicu Sebe, Ziheng Chen

When Semantics Mislead Vision: Mitigating Large Multimodal Models Hallucinations in Scene Text Spotting and Understanding

Yan Shu, Hangui Lin, Yexin Liu, Yan Zhang, Gangyan Zeng, Yan Li, Yu Zhou, Ser-Nam Lim, Harry Yang, Nicu Sebe

ACM Multimedia 2025 4 papers

AlignCAT: Visual-Linguistic Alignment of Category and Attribute for Weakly Supervised Visual Grounding

Yidan Wang, Chenyi Zhuang, Wutao Liu, Pan Gao, Nicu Sebe

Dynamic Scoring with Enhanced Semantics for Training-Free Human-Object Interaction Detection

FreeInsert: Disentangled Text-Guided Object Insertion in 3D Gaussian Scene without Spatial Priors

Chenxi Li, Weijie Wang, Qiang Li, Nicu Sebe, Bruno Lepri, Weizhi Nie

Unveiling Open-set Noise: Theoretical Insights into Label Noise

Chen Feng, Nicu Sebe, Georgios Tzimiropoulos, Miguel R. D. Rodrigues, Ioannis Patras

ICCV 2025 8 papers

FedMVP: Federated Multimodal Visual Prompt Tuning for Vision-Language Models

Mainak Singha, Subhankar Roy, Sarthak Mehrotra, Ankit Jha, Moloud Abdar, Biplab Banerjee, Elisa Ricci

Generate, Refine, and Encode: Leveraging Synthesized Novel Samples for On-the-Fly Fine-Grained Category Discovery

Xiao Liu, Nan Pu, Haiyang Zheng, Wenjing Li, Nicu Sebe, Zhun Zhong

Hierarchical Visual Prompt Learning for Continual Video Instance Segmentation

Jiahua Dong, Hui Yin, Wenqi Liang, Hanbin Zhao, Henghui Ding, Nicu Sebe, Salman Khan, Fahad Shahbaz Khan

On Large Multimodal Models as Open-World Image Classifiers

Pseudo-SD: Pseudo Controlled Stable Diffusion for Semi-Supervised and Cross-Domain Semantic Segmentation

Dong Zhao, Qi Zang, Shuang Wang, Nicu Sebe, Zhun Zhong

SceneSplat: Gaussian Splatting-based Scene Understanding with Vision-Language Pretraining

Yue Li, Qi Ma, Runyi Yang, Huapeng Li, Mengjiao Ma, Bin Ren, Nikola Popovic, Nicu Sebe, Ender Konukoglu, Theo Gevers, Luc Van Gool, Martin R. Oswald, Danda Pani Paudel

Superpowering Open-Vocabulary Object Detectors for X-ray Vision

What Changed? Detecting and Evaluating Instruction-Guided Image Edits with Multimodal Large Language Models

Lorenzo Baraldi, Davide Bucciarelli, Federico Betti, Marcella Cornia, Lorenzo Baraldi, Nicu Sebe, Rita Cucchiara