摘要: |
Multi-view video surveillanee is a highly valuable tool to ensure the safety of the crowd in large public space. By utilizing complementary information captured by multiple cameras, the issue of limited views and occlusion in single views can be addressed to gain better insight into the whole monitored space. However, multi-view surveillanee has been widely applied to microscopic crowd analysis, for example pedestrian detection and tracking, while macroscopic level analysis, which deals with the whole crowd, has received little attention. We propose a multi-view framework for the gen eratio n of level of service maps, which are the most comm only used measure of con gestion at macroscopic level, based on an ens em-ble of state-of-the-art Convolutional Neural Networks (CNNs). Several combination rules are compared and evaluated on two datasets, both in sparse and dense seenarios. Our results show that this fusio n framework improves the accuracy of level of service map gen eration, from 83.2% to 89.8%, and eliminates blind spots in single views. Our framework is imple-mented on a 3 D GIS platform, which provides a suitable interface for multi-view crowd con-gestion management. The results of a loading test show that a maximum of 48 cameras can be processed at a map refresh rate of 2 seconds. |