Papers published at top conferences and journals.
Democratizing Fine-grained Visual Recognition with Large Language Models
Conditioned Prompt-Optimization for Continual Deepfake Detection
Text-Enhanced Zero-Shot Action Recognition: A training-free approach
Frustratingly Easy Test-Time Adaptation of Vision-Language Models
Improving Fairness using Vision-Language Driven Image Augmentation
SpectralCLIP: Preventing Artifacts in Text-Guided Style Transfer from a Spectral Perspective
StyLIP: Multi-Scale Style-Conditioned Prompt Learning for CLIP-based Domain Generalization