Algorithm R&D Intern @ ByteDance Douyin Search Group - 字节跳动抖音搜索
Develop searching strategies for Douyin.
Develop searching strategies for Douyin.
Short description of portfolio item number 1
Short description of portfolio item number 2
Yuxuan Wang, Zilong Zheng, Xueliang Zhao, Jinpeng Li, Yueqian Wang, Dongyan Zhao
Published in Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2023
A large-scale video dialogue corpus collected from TV series with scene and segment transistion annotation.
Download here
Yuxuan Wang, Yueqian Wang, Pengfei Wu, Jianxin Liang, Dongyan Zhao, Zilong Zheng
Published in arXiv:2402.16050, 2024
Intergrating optical flow for relevant content selection to improve video-text LLMs’ abilities on videoqa.
Download here
Yueqian Wang, Chang Liu, Kai Chen, Xi Wang, Dongyan Zhao
Published in Findings of the Association for Computational Linguistics: EMNLP, 2025
Prompt-based learning and distillation for small transformer encoder-based language models.
Download here
Yueqian Wang, Yuxuan Wang, Dongyan Zhao
Published in Natural Language Processing and Chinese Computing, 2025
Hosted a shared task about video dialogue understanding and prediction at NLPCC 2023.
Download here
Yueqian Wang, Jianxin Liang, Yuxuan Wang, Huishuai Zhang, Dongyan Zhao
Published in arXiv, 2025
Proposed a parameter-free and training-free method for analyzing the quantity of information within image representations, and its applications in multimodal hallucination ascription.
Download here
Yueqian Wang, Yuxuan Wang, Kai Chen, Dongyan Zhao
Published in AAAI Conference on Artificial Intelligence, 2025
A neural module network (NMN) based method for videoqa with long videos and complicated questions.
Download here
Yueqian Wang, Xiaojun Meng, Yuxuan Wang, Jianxin Liang, Jiansheng Wei, Huishuai Zhang, Dongyan Zhao
Published in arXiv, 2025
Proposed MMDuet, a video-text MLLM for real-time interaction, which autonomously decides its response timing during video playback, and its training dataset MMDuetIT.
Download here
Yueqian Wang, Xiaojun Meng, Yuxuan Wang, Jianxin Liang, Qun Liu, Dongyan Zhao
Published in AAAI 2025, 2025
Defined tasks related to ``multimodal multi-party conversation understanding’, collected the Friends-MMC dataset from TV series, and introduced baseline models..
Download here
Yueqian Wang, Xiaojun Meng, Jianxin Liang, Yuxuan Wang, Qun Liu, Dongyan Zhao
Published in arXiv, 2025
One of the first video-text LLMs that can perform temporal video grounding in a fully text-to-text manner, and InternVid-G, a large-scale video-text dataset for video grounding training.
Download here
Published:
This is a description of your talk, which is a markdown files that can be all markdown-ified like any other post. Yay markdown!
Published:
Published:
Published:
This is a description of your conference proceedings talk, note the different field in type. You can put anything in this field.
Undergraduate course, Department of Information Management, Peking University
Data Analysis in Python is a programming course for undergraduates. The syllabus of this course includes commonly used statistical methods, machine learning algorithms, and data visualization methods, as well as their implementations in Python.
Undergraduate course, School of Electronics Engineering and Computer Science, Peking University
Introduction to Computation (计算概论) is a programming course for junior undergraduates. The syllabus of this course includes basic programming skills in Python.