- INP-Former, CVPR 25
- Dinomaly, cvpr 25 (https://arxiv.org/abs/2405.14325): The Less Is More Philosophy in Multi-Class Unsupervised Anomaly Detection.
- 在 GitHub 上,项目 README 中称其为 “the first multi-class UAD model that can compete with single-class SOTAs”
- minimalistic / simple design: only Transformer (Attention + MLP), no much dependency on complex modules or tricks.
- reconstruction-based
- 但它并不是典型的自编码器式重构,而是对重构机制做了“弱化”、控制重构能力、防止过拟合,以使其在多类统一模型的场景下更稳健。⇒
- loose reconstruction instead of tight reconstruction (traditional):
- do not force layer-to-layer and point-to-point reconstruction
- layer-to-layer ⇒ use high/coarse level features only
- point-to-point ⇒ cosine similarity instead of L2 loss
- Noisy Bottleneck
- Dropouts do all the noise injection tricks
- Selective masking / skipping of easy regions in training
- ShortcutBreaker: Low-Rank Noisy Bottleneck with Global Perturbation Attention for Multi-Class Unsupervised Anomaly Detection, 25
- CostFilter-AD, ICML 25 (https://arxiv.org/pdf/2505.01476): Enhancing Anomaly Detection through Matching Cost Filtering
- Unified Unsupervised Anomaly Detection via Matching Cost Filtering, 25
- https://arxiv.org/pdf/2510.03363
- builds upon CostFilter-AD [42] with four advances.
- Unsupervised Continual Anomaly Detection with Contrastively-Learned Prompt, AAAI24 (https://arxiv.org/abs/2401.01010)
- 新交叉方向:Continual Learning(持续学习) + Anomaly Detection(异常检测) + Prompt Learning(提示学习)
- Generalist Multi-Class Anomaly Detection via Distillation to Two Heterogeneous Student Networks, 25
- Patch-as-Decodable-Token: Towards Unified Multi-Modal Vision Tasks in MLLMs, 25
- 做LLM based anomaly应该看看这个