VIPL-VSU’s Website

Our group focuses on comprehensive scene understanding to enable intelligent perception and understanding of natural visual environment in the open world. More specifically, we aim to propose a vision-based robot system that has the basic capability just like human visual processing system for real world visual scene understanding, mainly including perceptual tasks such as object detection, object recognition, semantic segmentation, scene classification, attribute learning, relationship extraction, and so on. To facilitate more advanced natural language based visual concept semantic description, the system can also incorporate language models and knowledge-based reasoning for cognitive tasks like image/video captioning (description) and visual question answering.

Highlights

Our Research

Research topics of our group mainly cover three aspects: 1) Object recognition, 2) Scene understanding, and 3) Language/knowledge-based cognition.

See our publications

Our Projects

The datasets, systems, and other resources developed by our research group.

Browse our projects

Our Team

The current members and alumni of our research group.

Meet our team