副教授
您当前所在位置是: 首页 >> 师资队伍 >> 副教授 >> 正文
赵斌 副教授
 

个人简介

赵斌:副教授,博士生导师

学术主页:[Homepage]

研究方向:具身智能

联系地址:陕西省西安市碑林区友谊西路127号西北工业大学

邮政编码:710072

电子邮箱:bin@nwpu.edu.cn

个人简介:聚焦具身智能研究,致力于实现人形机器人、无人机、机器狗等异构智能体的自主协同,构建大模型驱动的机器人社区。在国际期刊和会议发表学术论文 50 余篇,包括TPAMI/CVPR/ICCV/ICML/NeurIPS/RSS/ICRA/CoRL等。获中国科协青年托举人才工程、陕西省高校优秀青年人才。相关成果应用于航空航天、安防巡检、应急救援任务中,公开技术被 Asia Times、South China Morning Post、The SUN、人民日报、新华网、科学网等国内外媒体广泛报道。长期欢迎对大模型、具身智能、机器人硬件(含无人机)等感兴趣的本科生和硕博士研究生前来实习和交流。

 

代表成果

 

代表性论文

  1. 李学龙* and 赵斌, “视频萃取,” 中国科学: 信息科学, 2021, 51(5): 695-734. [BibTeX] | [PDF]

  2. B. Zhao, P. Han, and X. Li, "Vehicle perception from satellite," IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), vol. 46, no. 4, pp. 2545-2554, 2023, IEEE. [BibTeX] | [PDF]

  3. B. Zhao, H. Li, X. Lu, and X. Li*, "Reconstructive Sequence-Graph Network for Video Summarization," IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), vol. 44, no. 5, pp. 2793-2801, 2022. [BibTeX] | [PDF]

  4. B. Zhao, X. Li, and X. Lu, "HSA-RNN: Hierarchical Structure-Adaptive RNN for Video Summarization," Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR), pp. 7405-7414, 2018. [BibTeX] | [PDF]

  5. K. Xu, C. Bai, X. Ma, D. Wang, B. Zhao, Z. Wang, X. Li, and W. Li, "Cross-domain policy adaptation via value-guided data filtering," Advances in Neural Information Processing Systems (NeurIPS), vol. 36, pp. 73395-73421, 2023. [BibTeX] | [PDF]

  6. C. Yan, D. Qu, D. Xu, B. Zhao, Z. Wang, D. Wang, and X. Li, "Gs-slam: Dense visual slam with 3d gaussian splatting," Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), pp. 19595-19604, 2024. [BibTeX] | [PDF]

  7. D. Qu, C. Yan, D. Wang, J. Yin, Q. Chen, D. Xu, Y. Zhang, B. Zhao, and X. Li, "Implicit event-rgbd neural slam," Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), pp. 19584-19594, 2024. [BibTeX] | [PDF]

  8. Y. Tang, R. Zhang, Z. Guo, X. Ma, B. Zhao, Z. Wang, D. Wang, and X. Li, "Point-peft: Parameter-efficient fine-tuning for 3d pre-trained models," Proceedings of the AAAI Conference on Artificial Intelligence (AAAI), vol. 38, no. 6, pp. 5171-5179, 2024. [BibTeX] | [PDF]

  9. W. Xia, D. Wang, X. Pang, Z. Wang, B. Zhao, D. Hu, and X. Li, "Kinematic-aware prompting for generalizable articulated object manipulation with llms," 2024 IEEE International Conference on Robotics and Automation (ICRA), pp. 2073-2080, 2024, IEEE. [BibTeX] | [PDF]

  10. M. Cui, Z. Wang, D. Wang, B. Zhao, and X. Li, "Color event enhanced single-exposure HDR imaging," Proceedings of the AAAI Conference on Artificial Intelligence (AAAI), vol. 38, no. 2, pp. 1399-1407, 2024. [BibTeX] | [PDF]

  11. Y. Tang, R. Zhang, J. Liu, Z. Guo, B. Zhao, Z. Wang, P. Gao, H. Li, D. Wang, and X. Li, "Any2point: Empowering any-modality large models for efficient 3d understanding," European Conference on Computer Vision (ECCV), pp. 456-473, 2024. [BibTeX] | [PDF]

  12. Z. Li, B. Zhao, and Y. Yuan, "Cyclic learning for binaural audio generation and localization," Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), pp. 26669-26678, 2024. [BibTeX] | [PDF]

  13. X. Gao, P. Zhang, D. Qu, D. Wang, Z. Wang, Y. Ding, and B. Zhao, "Learning 2d invariant affordance knowledge for 3d affordance grounding," Proceedings of the AAAI Conference on Artificial Intelligence (AAAI), vol. 39, no. 3, pp. 3095-3103, 2025. [BibTeX] | [PDF]

  14. K. Liu, Z. Tang, D. Wang, Z. Wang, X. Li, and B. Zhao, "Coherent: Collaboration of heterogeneous multi-robot system with large language models," 2025 IEEE International Conference on Robotics and Automation (ICRA), pp. 10208-10214, 2025, IEEE. [BibTeX] | [PDF]

  15. L. Jing, Y. Xue, X. Yan, C. Zheng, D. Wang, R. Zhang, Z. Wang, H. Fang, B. Zhao, and Z. Li, "X4d-sceneformer: Enhanced scene understanding on 4d point cloud videos through cross-modal knowledge transfer," Proceedings of the AAAI Conference on Artificial Intelligence (AAAI), vol. 38, no. 3, pp. 2670-2678, 2024. [BibTeX] | [PDF]

  16. L. Jing, Y. Ding, Y. Gao, Z. Wang, X. Yan, D. Wang, G. Schaefer, H. Fang, B. Zhao, and X. Li, "HPL-ESS: hybrid pseudo-labeling for unsupervised event-based semantic segmentation," Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), pp. 23128-23137, 2024. [BibTeX] | [PDF]

  17. H. He, C. Bai, L. Pan, W. Zhang, B. Zhao, and X. Li, "Learning an actionable discrete diffusion policy via large-scale actionless video pre-training," Advances in Neural Information Processing Systems (NeurIPS), vol. 37, pp. 31124-31153, 2024. [BibTeX] | [PDF]

  18. D. Qu, Q. Chen, P. Zhang, X. Gao, B. Zhao, Z. Wang, D. Wang, and X. Li, "LiveScene: Language Embedding Interactive Radiance Fields for Physical Scene Control and Rendering," Advances in Neural Information Processing Systems (NeurIPS), vol. 37, pp. 12271-12292, 2024. [BibTeX] | [PDF]

  19. G. Li, B. Zhao, and X. Li, "Low-light image enhancement with sam-based structure priors and guidance," IEEE Transactions on Multimedia (TMM), vol. 26, pp. 10854-10866, 2024, IEEE. [BibTeX] | [PDF]

  20. G. Lan, Q. Ma, Y. Yang, Z. Wang, D. Wang, X. Li, and B. Zhao, "Efficient Diffusion as Low Light Enhancer," Proceedings of the Computer Vision and Pattern Recognition Conference (CVPR), pp. 21277-21286, 2025. [BibTeX] | [PDF]

  21. Y. Yuan, Z. Li, and B. Zhao, "A survey of multimodal learning: Methods, applications, and future," ACM Computing Surveys (ACM Comput. Surv.), vol. 57, no. 7, pp. 1-34, 2025, ACM New York, NY. [BibTeX] | [PDF]

  22. K. Liu, C.Guan, Z Jia, ..., B. Zhao, and X. Li, "FastUMI: A Scalable and Hardware-Independent Universal Manipulation Interface with Dataset," arXiv preprint arXiv:2409.19499, 2024. [BibTeX] | [PDF]

  23. Y. Yao, S. Liu, H. Song, D. Qu, Q. Chen, Y. Ding, B. Zhao, Z. Wang, X. Li, and D. Wang, "Think Small, Act Big: Primitive Prompt Learning for Lifelong Robot Manipulation," Proceedings of the Computer Vision and Pattern Recognition Conference (CVPR), pp. 22573-22583, 2025. [BibTeX] | [PDF]

  24. H. Song, D. Qu, Y. Yao, Q. Chen, Q. Lv, Y. Tang, M. Shi, G. Ren, M. Yao, and B. Zhao, "Hume: Introducing System-2 Thinking in Visual-Language-Action Model," arXiv preprint arXiv:2505.21432, 2025. [BibTeX] | [PDF]

  25. P. Zhang, Y. Su, P. Wu, D. An, L. Zhang, Z. Wang, D. Wang, Y. Ding, B. Zhao, and X. Li, "Cross from Left to Right Brain: Adaptive Text Dreamer for Vision-and-Language Navigation," arXiv preprint arXiv:2505.20897, 2025. [BibTeX] | [PDF]

  26. Z. Wang, Y. Su, C. Li, D. Wang, Y. Huang, X. Li, and B. Zhao, "Open-Vocabulary Octree-Graph for 3D Scene Understanding," Proceedings of the IEEE/CVF International Conference on Computer Vision (ICCV), pp. 7037-7047, 2025. [BibTeX] | [PDF]

  27. Y. Gao, C. Li, Z. You, ..., B. Zhao, and X. Li, "OpenFly: A Comprehensive Platform for Aerial Vision-Language Navigation," arXiv preprint arXiv:2502.18041, 2025. [BibTeX] | [PDF]

  28. J. Liu, Q. Chen, Z. Wang, Y. Tang, Y. Zhang, C. Yan, D. Wang, X. Li, and B. Zhao, "AerialVG: A Challenging Benchmark for Aerial Visual Grounding by Exploring Positional Relations," arXiv preprint arXiv:2504.07836, 2025. [BibTeX] | [PDF]

  29. D. Qu, H. Song, Q. Chen, ..., B. Zhao, and D. Wang, "Embodiedonevision: Interleaved vision-text-action pretraining for general robot control," arXiv e-prints, arXiv:2508.21112, 2025. [BibTeX] | [PDF]

  30. D. Qu, H. Song, Q. Chen, ..., B. Zhao, D. Wang, and X. Li, "SpatialVLA: Exploring Spatial Representations for Visual-Language-Action Model," arXiv preprint arXiv:2501.15830, 2025. [BibTeX] | [PDF]

 

科研项目

  • 军口国家级纵向项目,物理先验与统计学习联合的XXXXXXXXXXXXX技术,150万元,主持。

  • XXXX创新团队课题,复杂地面背景下XXXXXXXXXXXX,100万元,主持。

  • 陕西电信联合实验室基金项目,无人机远程激光供能技术,100万元,主持。

  • 国家自然科学基金委面上项目,极端环境下视觉感知的关键问题研究,57.5万元,主持。

  • 国家重点研发计划子课题,基于应急大模型的灾害现场影像全要素提取与维态势快速精准重构技术及装备,55万元,主持。

  • 国家自然科学基金委青年科学基金项目,认知驱动的视频多模态信息萃取研究,20万元,主持。

  • 国家重点研发计划项目,恶劣环境下视觉信息的主动探测与感知,参与。

  • 国家重点研发计划项目,定向到通用推理的泛化机制,参与。

 

荣誉获奖

  • 中国科协青年人才托举工程,中国科协,2023。

  • 陕西省高校优秀青年,陕西省教育厅,2024。

  • 试飞环境下多模态传感器故障分析关键技术与应用,排名9/15,中国航空学会,科技进步,省部一等奖,2022。

  • 全国光学工程学科优秀博士学位论文奖,中国光学工程学会,国家一级学会优博,2021。

  • 陕西省优秀博士学位论文奖,陕西省教育厅,省优博,2022。

 

学术活动

  • 中国模式识别与计算机视觉大会领域主席、中国具身智能大会(首届)-大模型与具身智能论坛主席。

  • 国际会议SPC/TPC/PC/Reviewer:CVPR/ICCV/ECCV/NeurIPS/AAAI/ICRA等30余个国际知名会议。

  • 期刊审稿人:TPAMI/TNNLS/TCYB/TIP/TMI等20余个国际高水平期刊。

  • Pattern Recognition编委,SCI一区期刊,2024.10-至今

  • Neural Networks编委,SCI二区期刊,2025.01-至今

  • Computer Systems Science and Engineering编委,SCI三区期刊,2024.07-至今

 

其他信息

  • 欢迎有兴趣开展大模型、具身智能和机器人硬件等方面研究的本科生同学提前进入实验室。对于自驱力强的同学,可以提供学术资源进行科研训练,请联系bin@nwpu.edu.cn。

  • 更新时间:2025年10月29日