Our Publications

Selected work

NeurIPS 2025 6 papers
ConViS-Bench: Estimating Video Similarity Through Semantic Concepts

ImageNet-trained CNNs are not biased towards texture: Revisiting feature reliance through controlled suppression
Oral

Tom Burgert, Oliver Stoll, Paolo Rota, Begüm Demir

Increasing the Utility of Synthetic Images through Chamfer Guidance

Nicola Dall'Asen, Xiaofeng Zhang, Reyhane Askari Hemmat, Melissa Hall, Jakob Verbeek, Adriana Romero-Soriano, Michal Drozdzal

SceneSplat++: A Large Dataset and Comprehensive Benchmark for Language Gaussian Splatting

Mengjiao Ma, Qi Ma, Yue Li, Jiahuan Cheng, Runyi Yang, Bin Ren, Nikola Popovic, Mingqiang Wei, Nicu Sebe, Ender Konukoglu, Luc Van Gool, Theo Gevers, Martin R. Oswald, Danda Pani Paudel

Towards a General Attention Framework on Gyrovector Spaces for Matrix Manifolds

Rui Wang, Chen Hu, Xiaoning Song, Xiaojun Wu, Nicu Sebe, Ziheng Chen

When Semantics Mislead Vision: Mitigating Large Multimodal Models Hallucinations in Scene Text Spotting and Understanding

Yan Shu, Hangui Lin, Yexin Liu, Yan Zhang, Gangyan Zeng, Yan Li, Yu Zhou, Ser-Nam Lim, Harry Yang, Nicu Sebe

ACM Multimedia 2025 4 papers
AlignCAT: Visual-Linguistic Alignment of Category and Attribute for Weakly Supervised Visual Grounding

Yidan Wang, Chenyi Zhuang, Wutao Liu, Pan Gao, Nicu Sebe

Dynamic Scoring with Enhanced Semantics for Training-Free Human-Object Interaction Detection

FreeInsert: Disentangled Text-Guided Object Insertion in 3D Gaussian Scene without Spatial Priors

Chenxi Li, Weijie Wang, Qiang Li, Nicu Sebe, Bruno Lepri, Weizhi Nie

Unveiling Open-set Noise: Theoretical Insights into Label Noise

Chen Feng, Nicu Sebe, Georgios Tzimiropoulos, Miguel R. D. Rodrigues, Ioannis Patras

ICCV 2025 8 papers
FedMVP: Federated Multimodal Visual Prompt Tuning for Vision-Language Models

Mainak Singha, Subhankar Roy, Sarthak Mehrotra, Ankit Jha, Moloud Abdar, Biplab Banerjee, Elisa Ricci

Generate, Refine, and Encode: Leveraging Synthesized Novel Samples for On-the-Fly Fine-Grained Category Discovery

Xiao Liu, Nan Pu, Haiyang Zheng, Wenjing Li, Nicu Sebe, Zhun Zhong

Hierarchical Visual Prompt Learning for Continual Video Instance Segmentation

Jiahua Dong, Hui Yin, Wenqi Liang, Hanbin Zhao, Henghui Ding, Nicu Sebe, Salman Khan, Fahad Shahbaz Khan

On Large Multimodal Models as Open-World Image Classifiers

Pseudo-SD: Pseudo Controlled Stable Diffusion for Semi-Supervised and Cross-Domain Semantic Segmentation

Dong Zhao, Qi Zang, Shuang Wang, Nicu Sebe, Zhun Zhong

SceneSplat: Gaussian Splatting-based Scene Understanding with Vision-Language Pretraining

Yue Li, Qi Ma, Runyi Yang, Huapeng Li, Mengjiao Ma, Bin Ren, Nikola Popovic, Nicu Sebe, Ender Konukoglu, Theo Gevers, Luc Van Gool, Martin R. Oswald, Danda Pani Paudel

Superpowering Open-Vocabulary Object Detectors for X-ray Vision

What Changed? Detecting and Evaluating Instruction-Guided Image Edits with Multimodal Large Language Models

Lorenzo Baraldi, Davide Bucciarelli, Federico Betti, Marcella Cornia, Lorenzo Baraldi, Nicu Sebe, Rita Cucchiara

ICIAP 2025 2 papers
Automatic benchmarking of large multimodal models via iterative experiment programming

Diversified in-domain synthesis with efficient fine-tuning for few-shot classification

Nicola Dall'Asen, Victor G Turrisi da Costa, Yiming Wang, Nicu Sebe, Elisa Ricci
CVPR 2025 4 papers
Classifier-to-Bias: Toward Unsupervised Automatic Bias Detection for Visual Classifiers

Compositional Caching for Training-free Open-vocabulary Attribute Detection
Highlight

Multi-focal Conditioned Latent Diffusion for Person Image Synthesis

Jiaqi Liu, Jichao Zhang, Paolo Rota, Nicu Sebe
Not Only Text: Exploring Compositionality of Visual Representations in Vision-Language Models
Highlight
