Explicit Invariant Feature Induced Cross-Domain Crowd Counting
DOI:
https://doi.org/10.1609/aaai.v37i1.25098Keywords:
CV: ApplicationsAbstract
Cross-domain crowd counting has shown progressively improved performance. However, most methods fail to explicitly consider the transferability of different features between source and target domains. In this paper, we propose an innovative explicit Invariant Feature induced Cross-domain Knowledge Transformation framework to address the inconsistent domain-invariant features of different domains. The main idea is to explicitly extract domain-invariant features from both source and target domains, which builds a bridge to transfer more rich knowledge between two domains. The framework consists of three parts, global feature decoupling (GFD), relation exploration and alignment (REA), and graph-guided knowledge enhancement (GKE). In the GFD module, domain-invariant features are efficiently decoupled from domain-specific ones in two domains, which allows the model to distinguish crowds features from backgrounds in the complex scenes. In the REA module both inter-domain relation graph (Inter-RG) and intra-domain relation graph (Intra-RG) are built. Specifically, Inter-RG aggregates multi-scale domain-invariant features between two domains and further aligns local-level invariant features. Intra-RG preserves taskrelated specific information to assist the domain alignment. Furthermore, GKE strategy models the confidence of pseudolabels to further enhance the adaptability of the target domain. Various experiments show our method achieves state-of-theart performance on the standard benchmarks. Code is available at https://github.com/caiyiqing/IF-CKT.Downloads
Published
2023-06-26
How to Cite
Cai, Y., Chen, L., Guan, H., Lin, S., Lu, C., Wang, C., & He, G. (2023). Explicit Invariant Feature Induced Cross-Domain Crowd Counting. Proceedings of the AAAI Conference on Artificial Intelligence, 37(1), 259-267. https://doi.org/10.1609/aaai.v37i1.25098
Issue
Section
AAAI Technical Track on Computer Vision I