Domain-Adaptive YOLOv9 for Foggy-Weather Object Detection Using Partial Spatial Self-Attention

Abstract

Cross-domain object detection remains challenging because a detector trained on a labeled source domain often generalizes poorly to a target domain with different visual characteristics. The problem is especially evident under adverse weather, where degraded visibility alters contrast, texture, and object boundaries, and target-domain annotations are typically unavailable. This thesis develops a domain-adaptive YOLOv9 framework for foggy-weather object detection that combines image-level appearance adaptation with feature-level refinement. At the image level, Contrastive Unpaired Translation (CUT) translates labeled source images into pseudo target-style samples while preserving the original annotations. At the feature level, a Partial Spatial Self-Attention (PSSA) module refines deep feature representations by applying spatial contextual modeling to only part of the channel dimension. The framework is evaluated on the Cityscapes → Foggy Cityscapes benchmark. Experimental results show that both components improve target-domain performance, but by different amounts: CUT produces the larger gain by reducing the appearance discrepancy between the source and target domains, while PSSA provides an additional improvement through deep feature refinement. Combining the two components yields the best overall performance among the evaluated configurations. These results show that substantial improvement under foggy cross-domain conditions can be obtained without abandoning the inference structure of a one-stage detector. The thesis therefore provides a practical domain-adaptive detection framework that improves robustness under adverse visibility while remaining compatible with deployment-oriented YOLO-style detection.
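To make the idea of "spatial contextual modeling over only part of the channel dimension" concrete, the following is a minimal NumPy sketch, not the thesis implementation. It assumes that the first fraction of channels is refined by single-head self-attention across spatial positions, with a residual connection, and that the remaining channels pass through unchanged. The function name `partial_spatial_self_attention`, the `ratio` parameter, the residual connection, and the random projection matrices (stand-ins for learned weights) are all illustrative assumptions.

```python
import numpy as np


def partial_spatial_self_attention(x, ratio=0.5, seed=0):
    """Illustrative sketch only: attend over spatial positions on a channel subset.

    x     : feature map of shape (C, H, W)
    ratio : assumed fraction of channels that receive attention
    seed  : seeds the random projections that stand in for learned weights
    """
    c, h, w = x.shape
    c_att = max(1, int(c * ratio))  # number of channels routed through attention

    # Hypothetical query/key/value projections (random here, learned in practice).
    rng = np.random.default_rng(seed)
    wq, wk, wv = (
        rng.standard_normal((c_att, c_att)) / np.sqrt(c_att) for _ in range(3)
    )

    # Split channels: one part is refined, the other is passed through untouched.
    x_att, x_skip = x[:c_att], x[c_att:]

    # Treat each spatial location as a token with c_att features.
    tokens = x_att.reshape(c_att, h * w).T  # shape (H*W, c_att)
    q, k, v = tokens @ wq, tokens @ wk, tokens @ wv

    # Scaled dot-product attention over all H*W positions.
    scores = q @ k.T / np.sqrt(c_att)  # shape (H*W, H*W)
    scores = scores - scores.max(axis=1, keepdims=True)  # numerical stability
    weights = np.exp(scores)
    weights = weights / weights.sum(axis=1, keepdims=True)

    # Assumed residual connection around the attended channels.
    refined = x_att + (weights @ v).T.reshape(c_att, h, w)

    # Re-join refined and untouched channels in their original order.
    return np.concatenate([refined, x_skip], axis=0)


if __name__ == "__main__":
    feats = np.random.default_rng(1).standard_normal((8, 4, 5))
    out = partial_spatial_self_attention(feats, ratio=0.5)
    print(out.shape)
```

Under these assumptions, the attention matrix has size (H·W) × (H·W) regardless of how many channels are attended, while the projection cost shrinks with `ratio`; this is one plausible reason to restrict attention to a channel subset in a deployment-oriented detector.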

Keywords

Foggy Cityscapes, Cityscapes, Contrastive Unpaired Translation (CUT), Partial Spatial Self-Attention (PSSA), YOLOv9, unsupervised domain adaptation, cross-domain detection
