Portfolio item number 1
Short description of portfolio item number 1
Short description of portfolio item number 1
Short description of portfolio item number 2 
Published in ICML 2026 Position Track (Under Review), 2025
A position paper arguing that the focus of efficient AI research is shifting from model-centric compression to data-centric compression, with a systematic review of token compression methods.
@article{liu2026shifting,
title={Shifting AI Efficiency From Model-Centric to Data-Centric Compression},
author={Liu, Xuyang and Wen, Zichen and Wang, Shaobo and Chen, Junjie and Tao, Zhishan and Wang, Yubo and Jin, Xiangqi and Zou, Chang and Wang, Yiyu and Liao, Chenfei and Zheng, Xu and Chen, Honggang and Li, Weijia and Hu, Xuming and He, Conghui and Zhang, Linfeng},
journal={arXiv preprint arXiv:2505.19147},
year={2026}
}Published in EMNLP 2025 Main Conference, 2025
We propose VidCom2, a plug-and-play inference acceleration framework for VideoLLMs that adaptively adjusts compression intensity across frames, effectively preserving essential information while reducing redundancy in video sequences.
@inproceedings{liu2025vidcom2,
title={Video Compression Commander: Plug-and-Play Inference Acceleration for Video Large Language Models},
author={Liu, Xuyang and Wang, Yiyu and Ma, Junpeng and Zhang, Linfeng},
booktitle={Proceedings of the Conference on Empirical Methods in Natural Language Processing},
year={2025}
}Published in CVPR 2026, 2025
We propose V2Drop, a variation-aware method that identifies and progressively drops lazy tokens based on their intrinsic behavioral patterns, eliminating positional bias while maintaining compatibility with efficient operators.
@inproceedings{chen2026v2drop,
title={Variation-aware Vision Token Dropping for Faster Large Vision-Language Models},
author={Chen, Junjie and Liu, Xuyang and Wen, Zichen and Wang, Yiyu and Huang, Siteng and Chen, Honggang},
booktitle={Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition},
year={2026}
}Published in ACL 2026 (Under Review), 2025
We propose VTC-Bench, the first comprehensive evaluation framework for visual token compression methods across image and video understanding tasks, revealing critical insights about current benchmarks.
@inproceedings{liao2026vtc,
title={Are We Using the Right Benchmark: An Evaluation Framework for Visual Token Compression Methods},
author={Liao, Chenfei and Wang, Wensong and Wen, Zichen and Zheng, Xu and Wang, Yiyu and He, Haocong and Lyu, Yuanhuiyi and Jiang, Lutao and Zou, Xin and Fu, Yuqian and Ren, Bin and Zhang, Linfeng and Hu, Xuming},
booktitle={Proceedings of the Annual Meeting of the Association for Computational Linguistics},
year={2026}
}Published in arXiv Technical Report, 2025
We propose Alpha-Service, a proactive assistance system with AI glasses that enables real-time, context-aware service through multimodal perception and agentic decision making.
@article{wen2025aiforservice,
title={AI for Service: Proactive Assistance with AI Glasses},
author={Wen, Zichen and Wang, Yiyu and Liao, Chenfei and Yang, Boxue and Li, Junxian and Liu, Weifeng and He, Haocong and Feng, Bolong and Liu, Xuyang and Lyu, Yuanhuiyi and others},
journal={arXiv preprint arXiv:2510.14359},
year={2025}
}Published in CVPR 2026, 2025
We propose STC, the first plug-and-play hierarchical token compression framework for streaming VideoLLMs, optimizing both ViT encoding and LLM pre-filling stages to accelerate real-time video understanding.
@inproceedings{wang2026stc,
title={Accelerating Streaming Video Large Language Models via Hierarchical Token Compression},
author={Wang, Yiyu and Liu, Xuyang and Gui, Xiyan and Lin, Xinying and Yang, Boxue and Liao, Chenfei and Chen, Tailai and Zhang, Linfeng},
booktitle={Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition},
year={2026}
}Published in ECCV 2026 (Under Review), 2026
We propose KAWHI, a plug-and-play reward reweighting mechanism that explicitly incorporates structured visual information into uniform reward policy optimization methods for LVLMs.
@article{han2026kawhi,
title={Bridging Visual Representation and Reinforcement Learning from Verifiable Rewards in Large Vision-Language Models},
author={Han, Yuhang and Wu, Yuyang and Jiao, Zhengbo and Wang, Yiyu and Liu, Xuyang and Wang, Shaobo and Xu, Hanlin and Hu, Xuming and Zhang, Linfeng},
journal={arXiv preprint arXiv:2603.27375},
year={2026}
}Published in ECCV 2026 (Under Review), 2026
We propose V-CAST, a training-free plug-and-play pruning policy for long-context video inference that casts token compression as a trajectory approximation problem with curvature-guided temporal allocation.
@article{lin2026vcast,
title={V-CAST: Video Curvature-Aware Spatio-Temporal Pruning for Efficient Video Large Language Models},
author={Lin, Xinying and Liu, Xuyang and Wang, Yiyu and Ma, Teng and Ren, Wenqi},
journal={arXiv preprint arXiv:2603.27650},
year={2026}
}Published:
This is a description of your talk, which is a markdown file that can be all markdown-ified like any other post. Yay markdown!
Published:
This is a description of your conference proceedings talk, note the different field in type. You can put anything in this field.
Undergraduate course, University 1, Department, 2014
This is a description of a teaching experience. You can use markdown like any other post.
Workshop, University 1, Department, 2015
This is a description of a teaching experience. You can use markdown like any other post.