Publications

You can also find my articles on my Google Scholar profile.

Journal Articles


TC-MGC: Text-Conditioned Multi-Grained Contrastive Learning for Text–Video Retrieval

Published in Information Fusion, 2025

In this paper, we propose a novel Text-Conditioned Multi-Grained Contrast (TC-MGC) framework to explore multi-grained contrasts between textual and semantic-relevant visual representations.

Recommended citation: Xiaolun Jing, Genke Yang, Jian Chu. TC-MGC: Text-conditioned multi-grained contrastive learning for text-video retrieval[J]. Information Fusion, 2025, 121:103151.
Download Paper

An Empirical Study of Excitation and Aggregation Design Adaptions in CLIP4Clip for Video–Text Retrieval

Published in Neurocomputing, 2024

In this work, we rethink the inherent limitation of widely-used mean pooling operation in the frame features aggregation and investigate the adaptions of excitation and aggregation design for discriminative video representation generation.

Recommended citation: Xiaolun Jing, Genke Yang, Jian Chu. An empirical study of excitation and aggregation design adaptions in CLIP4Clip for video-text retrieval[J]. Neurocomputing, 2024, 596:127905.
Download Paper

Conference Papers


Text-Video Retrieval With Global-Local Contrastive Consistency Learning

Published in China Automation Congress (CAC), 2025

In this paper, we propose a simple yet effective method called Global-Local Contrastive Consistency Learning (GLCCL) to achieve texts and videos semantics alignment.

Recommended citation: X. Jing, X. Yang and G. Yang, "Text-Video Retrieval With Global-Local Contrastive Consistency Learning," 2025 China Automation Congress (CAC), Harbin, China, 2025, pp. 1621-1626.
Download Paper