Abstract
Purpose
Human group behavior is the driving force behind many complex social and economic phenomena. Few studies have integrated multi-dimensional travel patterns and city interest points to construct urban security risk indicators. This paper combines traffic data and urban alarm data to analyze the safe travel characteristics of the urban population. The research results are helpful to explore the diversity of human group behavior, grasp the temporal and spatial laws and reveal regional security risks. It provides a reference for optimizing resource deployment and group intelligence analysis in emergency management.
Design/methodology/approach
Based on the dynamics index of group behavior, this paper mines the data of large shared bikes and ride-hailing in a big city of China. We integrate the urban interest points and travel dynamic characteristics, construct the urban traffic safety index based on alarm behavior and further calculate the urban safety index.
Findings
This study found significant differences in the travel power index among ride-sharing users. There is a positive correlation between user shared bike trips and the power-law bimodal phenomenon in the logarithmic coordinate system. It is closely related to the urban public security index.
Originality/value
Based on group-shared dynamic index integrated alarm, we innovatively constructed an urban public safety index and analyzed the correlation of travel alarm behavior. The research results fully reveal the internal mechanism of the group behavior safety index and provide a valuable supplement for the police intelligence analysis.
Keywords
Citation
Deng, X., Li, X., Chen, Z., Zhu, M., Xiong, N. and Shen, L. (2024), "Construction of public security indicators based on characteristics of shared group behavior patterns", Data Technologies and Applications, Vol. 58 No. 3, pp. 345-362. https://doi.org/10.1108/DTA-12-2021-0389
Publisher
:Emerald Publishing Limited
Copyright © 2022, XiYue Deng, Xiaoming Li, Zhenzhen Chen, Mengli Zhu, Naixue Xiong and Li Shen
License
Published by Emerald Publishing Limited. This article is published under the Creative Commons Attribution (CC BY 4.0) licence. Anyone may reproduce, distribute, translate and create derivative works of this article (for both commercial and non-commercial purposes), subject to full attribution to the original publication and authors. The full terms of this licence may be seen at http://creativecommons.org/licences/by/4.0/legalcode
1. Introduction
In recent years, the research on characteristics of group behavior patterns has attracted extensive attention in the academic circle of human behavior dynamics. Characteristics of group behavior patterns are highly complex and diverse, making it full of great challenges to study them (the elements of group behavior patterns mentioned here mainly focus on the behavior patterns of groups in daily life). The purpose of the study on the parts of group behavior patterns is to establish the corresponding feature model by analyzing and mining the data related to group daily behavior. In terms of research methods, the model of group behavior research is to quantitatively analyze the episodic, memorizing and power-law distribution of group behavior through real and objective data. It follows the cycle of “data collection, feature analysis, mathematical model reappearance” data rules “without human intervention and objectively reflects the law revealed by the data.
So far, studies on model group behavior have been based on general cases, such as (Katzenbach and Smith, 2015) explaining Poisson's characteristics based on people's timely processing of E-mail. Wuchty et al. (2007) found that European mobile phone users (based on a population level) follow the long-back exponential distribution. Another example is literature (Gowers and Nielsen, 2009), based on the analysis of Chinese mobile phone users through the memory of call interval and parody research. It is found that the distribution of call time intervals obeys power-law distribution and that most human behaviors obey characteristics of strong retention and weak memory. Subsequently, a large number of practical models were put forward for statistical analysis of group characteristics of human behavior, including priority queue (Naimi and Westreich, 2014), Poisson process (Lazer et al., 2009), adaptive model (Shmueli et al., 2014), etc. In order to reveal characteristics of different traffic behavior patterns of groups, Jing et al. (2021) holds that according to the human travel distance and residence time calculated at the group level, it can be inferred that the travel distance and residence time of each individual in the group also obeys the power-law distribution.
However, some scholars have begun to study the behavior pattern based on mobile phone calls in the process of abnormal events due to characteristics of group behavior patterns in unconventional cases (Gao et al., 2014). They found that emergencies can lead to a significant increase in eyewitness social activities (making phone calls, sending text messages, etc.).
Therefore, based on the research method of human behavior dynamics, this paper analyzes the data of large-scale shared bicycles and ride-hailing sensors in Chinese metropolises. With the integration of urban points of interest, the multi-dimensional travel characteristics of group traffic passengers are mined and a multi-spatial resolution urban regional association network based on alarm behavior is constructed. Behavioral dynamics characteristics of different travel tools are discussed and the urban regional safety risk index is further analyzed.
The rest of this paper is organized as follows: in the second section, we review characteristics of human behavior, urban points of interest and their applications in traffic behavior. Section 3 gives some preliminary ideas, including related concepts and terminology. In Section 4, we will show and analyze the experimental results based on the data set of shared bikes and online car-hailing and the fusion of related alarm data sets. The first part analyzes the community attributes of a large city in China and further constructs the urban community of street safety index. In Section 5, we present our conclusions and future work.
2. Related work
2.1 Group behavior analysis
Group behavior refers to the sum of the behavior of multiple individuals under one spatio-temporal environment. These individuals may achieve a particular goal, or they may just be limited by time or space. Their actions may influence each other, or they may be independent. At first, this concept was put forward and studied by anthropologists, sociologists, biologists and psychologists. In the following decades, researchers primarily analyzed group behavior from sociology and psychology and their research results were applied to explain behavior characteristics and patterns of humans and other groups of animals (Gowers and Nielsen, 2009). With the big data era (Naimi and Westreich, 2014), more and more computing scientists have joined group behavior research. For example, in 2009, some computing scientists headed by Professor Alex Pentland of the Massachusetts Institute of Technology put forward the concept of Computational Social Sciences (Computational Social Science) (Lazer et al., 2009), that is, to study social behavior by collecting and analyzing large amounts of data. Group behavior analysis is the focus of this discipline.
In recent years, communication technology has made relationships between people, people and other organisms closer. In contrast, data acquisition and storage technology progress makes the group target behavior data obtained by researchers more and more abundant. This stimulates the social and commercial needs of group behavior research and provides a technical guarantee and data basis for the development of group behavior research. From the perspective of business and social value, group behavior modeling and analysis technology have extensive applications, so it is highly valued in the information age. In fact, at present, there is an increasing demand for group behavior analysis technology in many fields, such as network analysis (Scott, 1988), urban planning (Burgess et al., 2017), financial management (Charles and Sguotti, 2021), biological conservation (Laughlin et al., 2014) and so on. The results of group behavior analysis are more and more widely used. In the research of virtual media information networks such as Weibo and Wechat, individual users form groups through information exchange such as the posting and forwarding of Weibo. Through the network structure (that is, user influence network) and network events (that is, user posting and other behavior) in the information network, the group behavior is modeled (Du et al., 2014). It will solve practical problems such as identifying and mining opinion leaders, discovering critical paths of information dissemination and estimating the influence of the information dissemination window (Du et al., 2013). It directly applies to network marketing (Zhang et al., 2020), public opinion analysis (Chen et al., 2020), advertising push, etc.
Group behavior analysis technology often needs to deal with massive heterogeneous data and design corresponding models and algorithms for different applications. To determine the group behavior more accurately, further reveal the diversity and correlation characteristics of regional security risks and provide a reference for the application of emergency management such as optimizing resource deployment.
2.2 Temporal characteristics of human behavior dynamics
A series of human behaviors, such as communication, mobile, shopping, social and so on, are the direct or indirect driving forces of a large number of typical complex systems, such as economic system, financial system, transportation system, social system and so on. The dynamics of human behavior, initiated by statistical physicists, is a new interdisciplinary research direction, mainly from human behavior through the quantitative statistics of many individual human behaviors and group behavior. Study its complexity in time and space, reveal hidden statistical rules and explore the mechanism of these rules. And the effects on all kinds of external systems, such as society, economy, technology and ecology.
In 2005, Barabasi's study of non-Poisson characteristics of the temporal distribution of human activities based on task priority mechanism (Brabási, 2005) and the quantitative research of spatial scale law of human movement by Brockmann et al. (2006) was published in Nature in 2006. These studies have become a historical turning point in analyzing human behavior's temporal and spatial complexity. The time characteristic of human behavior is the statistical law of time when people often engage in a specific event. Researchers tend to Poisson distribution in studying the time characteristics of human behavior dynamics in the early years. Poisson distribution was initially being proposed and named by Poisson. Then it is widely used in the quantitative model of all kinds of human activities to describe the statistical regularity of the number of specific events within a specified range or per unit of time.
However, at present, the academic circles believe that the interval time distribution corresponding to these behaviors of different individuals has a noticeable fat tail (Wang et al., 2015), which a power function can better fit, that is p(τ)∝τ –α. The research on the time characteristics of human communication behavior has also been carried out in many human empirical data. The interval time between the occurrence of communication behaviors, such as e-mail (Wang et al., 2014), mobile phone text messages (Jeon and Kim, 2018), mobile phone calls (Brockmann et al., 2006), financial activities (Ghosh et al., 2021) and so on, all obeys the power-law distribution. This paper mainly refers to the time characteristics of behavior patterns, the power index of an interval probability distribution, etc.
2.3 Spatial characteristics of human behavior dynamics
In 2010, the Barabási research team published an article on the predictability of human behavior in Science (Song et al., 2010). They studied the behavior patterns of anonymous mobile phone users. They found that human daily travel activities are highly regular and independent of distance and everyone's regular travel (distances of more than 10 kilometers) indicates that 93% of activities are predictable. Kang et al. (2012) analyzed the correlation between human mobility patterns and city shape using mobile phone data from 8 cities in Northeast China for nine consecutive days (5 working days, two weekends). Becker et al. (2013) analyzed human mobility patterns and participation in group events using mobile phone data from Los Angeles, San Francisco and New York from April 1 to June 30, 2011. Csáji et al. (2012) analyzed human mobility patterns using mobile phone data of 100000 anonymous users in Portugal for 15 months and proposed a variety of evaluation indicators. The main statistical indicators of spatial pattern analysis include: single-step displacement (Li et al., 2008), gyration radius (Ma et al., 2015), convex hull diameter (Csáji et al., 2012), path Euclidean distance and true distance (Ali and Shah, 2008), spatial location access frequency and ranking (Csáji et al., 2012), mobile pattern diversity (Song et al., 2010), predictability (Maciel et al., 2021) and so on. This paper mainly refers to the path length probability distribution power index and other indicators. To increase the comprehensibility different from previous studies, the summarizing relevant work is shown in Table 1.
3. Data
In recent years, many scholars at home and abroad have devoted themselves to group behavior analysis techniques. They have made many achievements in many fields, such as biology, medicine, finance, management, electronic information, etc. Many different branches have been derived from group behavior research for various applications. Group movement behavior analysis and trajectory inference, such as crowd activity, species migration and so on, are classical problems in group behavior analysis. The data source of the problem here is mainly based on the information network built by all kinds of sensors. The most typical ones include video surveillance networks (Ali and Shah, 2008), access control systems based on radio frequency identification technology (Wang et al., 2012), wireless sensor networks (Oh, 2012), radar detection systems (Lin et al., 2019) and so on. The main form of data is the detailed trajectory of each individual or the aggregate data of groups in different Spatio-temporal positions (how many individuals appear in what Spatio-temporal place). Here we use the sensor data of shared bikes and online ride-hailing in a large city in China as an example to explore the multidimensional travel characteristics of group traffic passengers.
Table 2 shows a detailed description of the data set in this paper. The data set includes online car Hailing data set, shared bicycle data set and alarm data set. As shown, the online car-hailing data set contains the ride-hailing data of December 20, 2019. The travel conditions of 158,718 ride-hailing users of different companies on that day were recorded and the total number of trips recorded was 18,287,688. Each trip record contains the departure time, arrival time and order number of the trip and the speed and starting position of each trip. Then, the shared bike data of 203,577 users from May 25 to June 1, 2017, are recorded and the total number of trips recorded is 455,411. Each travel record contains the order number of the journey, the bike number used, the departure time and the geographic location code at the time of departure. The dataset comprises shared bike data from two different companies. Last, the alarm data set is used in the 110 alarm data of a city in China. The data set records the alarm data of various cases in the city within a year.
In addition, Figure 1 shows the data set analysis of urban points of interest. With the rapid development of urban construction in China, a complex urban system has been gradually formed and the metropolitan center system has become an essential part of urban geography research. The urban center system is an organic whole that is interdependent and related to each function and scale of different service areas within the city. The traditional urban spatial analysis is through on-the-spot investigation and other methods, by a certain amount of sample law to represent the prevailing law, suitable for small-scale regional research. Nowadays, the urban spatial structure is becoming more and more complex and the shortcomings of traditional methods, such as heavy workload and intense subjectivity, are becoming increasingly prominent. With the rapid development of Internet communication technology, big data has become a new research hotspot in urban geography, which provides a new idea for urban planning problem discovery and essential data acquisition and integration. At present, there have been some research achievements in the field of big urban data, such as the use of bus card swiping data to study the occupation-residence relationship and commuting law of first-tier urban residents and the use of big data to study urban place name information and so on. However, the research is still in its infancy and it needs to be further developed to obtain a large amount of adequate data and apply it to the actual planning.
Urban Point of interest (POI) data is a method to represent the spatial and location attributes of geographical entities by querying and downloading the location big data through the API provided by the electronic network map written by computer technology. It mainly includes geographical entities that are closely related to people's lives in most cities, such as catering services, schools, housing, shopping malls, stations, government agencies, financial institutions, etc.
4. Characteristics of group behavior patterns
Firstly, this paper extracts the feature set, including group behavior pattern feature definition, feature extraction of urban interest points and definition of related features in the police field. Then analyzing correlation and constructing safety index. The last part demonstrates behavioral characteristics of alarm and urban safety index. The details of research methods have been shown in Figure 2.
4.1 Statistical analysis of group time interval
4.1.1 Travel rules of ride-hailing passengers
The calculation method of travel frequency of online ride-hailing passengers is as follows: first, the travel time interval of all online ride-hailing passengers is segmented according to hours and the time interval within one hour is treated as 1 h. Statistics of the travel frequency of passengers in each time interval. Taking the time interval as the abscissa and the passenger travel frequency as the ordinate, the frequency double logarithm coordinate diagram of the time interval of four types of passengers is drawn, as shown in Figure 3.
Figure 3 above shows the travel interval distribution of ride-hailing passengers of four different companies. In addition, there is a prominent peak in the morning working hours, during which users use online ride-hailing more frequently.
4.1.2 Travel rules of shared bike users
The calculation method of travel frequency of shared bike users is as follows: first, the travel time interval of all shared bike users is divided into hours and the time interval within 1 h is treated as 1 h. Statistics of the travel frequency of users in each time interval. Take the time interval as the Abscissa and the user travel frequency as the ordinate, draw the frequency double logarithm coordinate diagram of the time interval of two different groups of shared bicycle users, as shown in Figure 4.
The above picture is the time interval distribution map of the two types of shared bikes of type1, type2. From the chart, we can see that the travel rules of the two types of shared bike users are relatively similar. We can see that the image shows a bimodal distribution from the picture. From the picture, we can see that the first peak is about going to work and school, while the second peak is the time of off work and school, both of which are the peak periods of traffic. Shared bikes are used more frequently. This paper also makes a further classification of shared bike users and analyzes their user activity, as shown in Figure 5 below.
Figure 5 above divides the users who share bikes into four groups, reflecting the active distribution of users sharing bikes. Group 1 and group 2 are users of shared bike type1, group 1 is users who use shared bikes on weekdays, group 2 is users on rest days, group 3 and group 4 are users of shared bike type2, group 3 is working days and group 4 is rest days. We also can see that the graphics of shared bikes in each group are similar, but by comparison, group 4 is more active than group 3 and group 4 uses shared bikes more frequently at the same time interval. Shared bikes are used more regularly on weekdays than on rest days.
4.2 Burst and memory of time interval
This section uses the indexes of burst and memory in human behavior dynamics (Brockmann et al., 2006) to characterize the time interval distribution, which converts the frequency corresponding to the passenger time interval into a specific sequence. Thus, the statistical characteristics of passenger travel time intervals can be described by mathematical formulas.
4.2.1 Burst
Burst refers to a unique phenomenon in which human behavior occurs intensively in a short time and is silent for an extended period. Its definition is shown in Formula (1).
4.2.2 Memory
Memory can describe the correlation of time series of human behavior. The passenger travel time interval is arranged into a time interval sequence according to the line of time occurrence and the arrangement is set to have nτ time interval, then the series has nτ+1 travel behavior. The first nτ−1 time interval is taken as sequence 1 and the last nτ−1 time interval is taken as sequence 2. The correlation between the two-time series is defined as for Formula (2).
Memory and burst describe the characteristics of human behavior time series from two dimensions by counting the explosion and memory of the travel time interval of each passenger group.
Figure 6 above shows the burst and memory distribution of shared bike users. The main distribution range of user travel Burst is −0.6 to 0.4, mainly concentrated in −0.2 to 0.2. Similarly, the main distribution range of user travel Memory is between −0.8 and 0.8. The travel time interval of users will not continue to increase or decrease, so most individual users have weak memory or no memory. From the above analysis, it can be concluded that the user group travel behavior is also a common human behavior characteristic of “strong matrix method and weak memory”, but it is difficult to depict the travel mode of individual users.
4.3 Analysis of the correlation between urban points of interest and travel rules
POI represents the spatial and location attributes of geographical entities. Among them, the color from light to deep in the illustration in Figure 7a shows the change of POI density from sparse to dense. It can be seen from the map that the road network density of the central urban area of the city is the highest, followed by the Binhai New area, the density of the central urban area is the highest and concentrated in a relatively small range, the outward diffusion space is small and the density changes rapidly and there is little difference in the POI density of the central urban areas of each district and county. Figure 7b mainly shows the dense and sparse POI distribution in the city. As can be seen from the picture, the POI distribution in the central area of the city and the main urban area of Binhai New area is dense, while the distribution of POI in the north and south of the city is sparse. The overall distribution of POI is similar to that of the road network. Figure 7c is a road network distribution map, which is mainly a map of urban traffic structure, which is mainly composed of urban roads connected with each other and intertwined into a network, which can reflect the degree of road concentration in various areas of the city. Figure 7d mainly shows the distribution of the city's road network and the analysis can be carried out from the dense and sparse distribution of the road network. As can be seen from the picture, the road network in the central urban area of the city is dense, while that in the north and south is sparse. Because the road network density can be used to describe the road length and average distribution in the region, reflect the level of urban road supply and provide a reference basis for the analysis of urban traffic operation, it also provides basic data for the formulation of urban road management and control measures. Figure 7d the color from light to deep in the illustration shows the change of road network density from sparse to dense. It can be seen from the map that the road network density in the central urban area of the city is the highest, followed by the Binhai New area. There is a relatively dense road network connection between the central urban area and the Binhai New area and there is little difference in the road network density between the central urban areas of each district and county.
4.4 Construction of urban safety indicators
In this paper, community safety analysis is carried out by studying the correlation between urban points of interest and alarm data and combined with the travel rules of human groups.
The community safety index is defined as
In Formula (3), n represents the number of cases that call the police,
Finally, the security analysis suggestions of different communities are given. First of all, the security index
Finally, the normalized safety index of each community is classified to judge the security of the community. Then carry on the correlation analysis; Tables 3 and 4 give the corresponding code of the relevant attributes in this paper, convenient for the correlation analysis results.
From the basic attributes of the community and alarm calls related to attribute Table 5, we can see the following phenomena:
There is a high positive correlation between the total population (Y1) and the number of traffic cases reported (X4). The reason may be that the more the total population, the greater the probability of accidents, such as traffic accidents. Similarly, the larger the area, the greater the likelihood of accidents and the corresponding number of alarms will be more.
According to the statistical analysis, the probability distribution of the number of alarm calls obeys the power-law distribution of α = 3.84127 in the community and an extensive power index indicates that the probability decreases rapidly with the increase of the number of alarm calls. So that is to say, most of the alarm volume is concentrated in the case of a small number of alarm calls and the per capita alarm volume is stable between 2 and 4 in each community, regardless of the total population and population density—the area size, etc. Therefore, the basic attribute data of different communities can not be used as the prediction index of per capita alarm. Similarly, there is a similar conclusion for the average daily alarm, which also aligns with the power-law distribution. We can draw a similar conclusion with the per capita alarm according to Table 5.
As can be seen in Table 5, there is a high correlation between the number of extortion alarms (X9) and the number of shared bicycles (Y2). The reason may be that people gather in places with a large number of shared bicycles and the area is well developed and has a large flow of people. The possibility of extortion cases is relatively high.
It can be seen that there is a certain positive correlation between the number of hospitals, the number of bus stops and the total number of 110 alarms (X1). The reason may be that the more hospitals and bus stations there are, the higher the population density is and the type of alarm often causes the type of alarm. There is a strong correlation between the number of police stations (Y9) and the total number of 110 alarms. The reason for this situation can be considered on the basis of practical experience. According to the accumulated experience over the years, the relevant 110 police situation handling personnel will place a relatively large number of police stations in areas with many alarms. However, the correlation between the number of police stations and the number of police stations in Table 5 can be demonstrated by the correlation value of 0.530. The number of police stations here is also an important independent variable index for estimating alarm volume.
It can be found that there is a significant correlation between criminal cases (X2) and the number of police stations (Y10). This may indicate that many factors will be integrated with the process of selecting the location of the police station. The choice of the address of the police station will undoubtedly be arranged according to the number of criminal cases in history. It can also be seen that there is a significant correlation between the number of police stations and almost all cases, which further shows that the location of police stations will comprehensively consider the occurrence of various types of historical alarm cases.
It can also be found in Table 5 that there is a significant positive correlation between the number of traffic cases (X4) and the number of hospitals (Y8), bus stations (Y7), banks (Y11), restaurants (Y13) and supermarkets (Y14). It can be considered that there is more traffic in hospitals, bus stations, banks, restaurants and supermarkets and the possibility of traffic accidents is relatively greater.
There is no correlation between the number of cases (X8) and all community attributes except the number of police stations (Y9). The reason is that the number of cases is rare and what happens is random from a macro point of view, so all community attributes except the number of police stations have nothing to do with it.
Next, the human alarm time characteristics are described by the alarm interval time burst coefficient and memory coefficient of each street. It can be seen in Figure 8 that the mean point is Bamboo 0.08607 and memory 0.009544 and the paroxysmal memory is close to zero. It is found that the alarm behavior of street people has the characteristic of weak memory.
The reason for calculating the community safety index through the number of different police cases is that according to the correlation analysis, this paper evaluates the community safety index results of 60 community streets. It calculates each community safety index according to formula (3). Table 6 is obtained according to formula (4) normalization.
As can be seen from Table 6, it is found that about 60% of the community streets are in average safety condition. In contrast, only a tiny number of community streets are in good condition and the relevant street security personnel need to take further safety measures to ensure community safety.
5. Conclusions
According to characteristics of urban spatial structure, it is found that there is a close relationship between people's behavior patterns and urban public safety indicators. And through the analysis of the behavior model, the experimental results can provide a reference for urban public safety indicators. Group behavior modeling and analysis technology are widely used in various fields, especially in social security. Based on the sensor data of shared bikes and ride-hailing in a large city in China, this paper makes an in-depth study on the internal structure period and volatility of group behavior, time intervals (such as the relationship between individuals) and urban points of interest and travel rules through the observation and analysis of traffic behavior dynamics under various traffic patterns, such as urban spatial structure and human traffic behavior patterns. It is found that there is a difference in power index when users choose additional ride-hailing services. There is a positive correlation between the shared bike user activity and the power law bimodal phenomenon in the double logarithmic coordinate system. Finally, we analyzed the city.
For the future work in this paper, Although this paper analyzes the dynamic behavior of multiple traffic in two shared datasets and obtains more meaningful dynamic results, some problems still have to be solved. For example, we only analyzed two shared vehicles. Our focus will be to study multi-level traffic behavior based on more shared sub-tools to serve the city's public safety better.
Figures
Characteristics of group behavior patterns
Characteristics of group behavior patterns | Feature definition |
---|---|
Social Interactions Online (Chen et al., 2020) | A closer match between individuals' interests and those exhibited by the Web site should result in a stronger tie between a Web site and user |
Social network analysis model (Scott, 1988) | It is argued that the concept of social network provides a powerful model for social structure and that a number of important formal methods of social network analysis can be discerned |
Diffusion model (Scott, 1988) | A diffusion model is often associated with a directed graph G = (V, E) and a cascade from a model is just a set of influenced nodes according to the model given a set of source nodes S ⊆ V |
J-A Models (Shmueli et al., 2014) | It not only considers the assimilation effect in the social judgment theory but also considers the situations of repulsion and neutrality |
Interval time probability distribution power exponent (Wang et al., 2014) | The time interval between the occurrence of two adjacent times of the same individual obeys the power law distribution |
Power Exponent of Probability Distribution of Waiting Time (Wang et al., 2014) | The time interval of the waiting time of the same individual obeys the power law distribution |
Power Exponent of Duration Probability Distribution (Wang et al., 2014) | The duration of the same event obeys a power law distribution |
Intermittent coefficient (Jeon and Kim, 2018) | Quantified indicator of the time interval between events |
Memory factor (Jeon and Kim, 2018) | Memory characteristics of the time interval between events |
Time domain diversity index (entropy) (Jeon and Kim, 2018) | Time series predictability quantitative index |
Single step displacement probability distribution power exponent (Li et al., 2008) | The displacement between the coordinates of adjacent positions of an individual obeys a power law distribution |
Power Exponent of Probability Distribution of Radius of Gyration (Li et al., 2008) | A quantitative indicator of the distance between the individual position coordinates and the center position |
Power Exponent of Probability Distribution of Convex Hull Diameter (Li et al., 2008) | A quantitative indicator of the distance between the individual position coordinates and the center position |
Path length probability distribution power exponent (Wang et al., 2015) | The length between the individual start coordinate and the end coordinate position obeys a power law distribution |
Spatial diversity index (entropy) (Wang et al., 2015) | Quantitative Index of Predictability of Hotspot Spatial Sequence |
Detailed description of the data set
Data set entry | Data set value | |
---|---|---|
Detailed description of the ride-hailing data set | CompanyID | 9 |
OrderID | 158718 | |
DateTime | 20/12/2019 00:00:00–20/12/2019 23:59:49 | |
Detailed description of the shared bicycle data set | OrderID | 455411 |
UserID | 203577 | |
BikeID | 268922 | |
BikeType | 2 | |
StartTime | 2017–05–25 00:00:03–2017–06–01 00:00:00 | |
Detailed description of alarm data | Time Limit | 365 Days |
Location Distribution | A City in China | |
Number of Alarms | About one million | |
Alarm interval time memory coefficient | Memory coefficient of alarm interval time for different types of cases | |
Interval coefficient of alarm interval time | Interval coefficient of alarm interval time for different types of cases |
Alarm data code table
Alarm related attributes | Code | Alarm related attributes | Code |
---|---|---|---|
Total Number of Alarms | X1 | Total Number of Robbery Alarms | X7 |
Total Number of Criminal Alarms | X2 | Total Number of Homicide Alarms | X8 |
Total Number of Public Security Alarms | X3 | Total Number of Fraud Alarms | X9 |
Total Number of Traffic Alarms | X4 | Total Number of Extortion Alarms | X10 |
Total Number of Alarms by the Masses for Help | X5 | Total Number of Kidnapping Alarms | X11 |
Total Number of Theft Alarms | X6 | Total Number of Fight Alarms | X12 |
Community attribute code table
Community attribute name | Code | Community attribute name | Code |
---|---|---|---|
Population | Y1 | Number of banks | Y11 |
Number of shared bicycles | Y2 | Number of attractions | Y12 |
Area | Y3 | Number of restaurants | Y13 |
The population density | Y4 | Number of supermarkets | Y14 |
Area per capita | Y5 | Percentage of residential land | Y15 |
Housing Prices | Y6 | Percentage of road facilities | Y16 |
Number of bus stops | Y7 | Percentage of Greenland Plaza | Y17 |
Number of hospitals | Y8 | Percentage of hospital land | Y18 |
Number of schools | Y9 | Distance from city center | Y19 |
Number of police stations | Y10 |
Basic attributes of the community and properties related to alarm calls
X1 | X2 | X3 | X4 | X5 | X6 | X7 | X8 | X9 | X10 | X11 | X12 | |
---|---|---|---|---|---|---|---|---|---|---|---|---|
Y1 | 0.130 | 0.053 | 0.118 | 0.492 | 0.134 | 0.049 | 0.016 | 0.094 | 0.081 | 0.054 | −0.109 | 0.105 |
Y2 | 0.112 | 0.092 | 0.100 | 0.191 | 0.118 | 0.073 | 0.194 | 0.087 | 0.187 | 0.063 | 0.044 | 0.087 |
Y3 | −0.061 | −0.071 | −0.058 | −0.001 | −0.061 | −0.052 | −0.189 | 0.014 | −0.167 | −0.106 | −0.086 | −0.062 |
Y4 | −0.041 | 0.011 | −0.048 | −0.177 | −0.041 | 0.005 | 0.072 | −0.032 | 0.041 | −0.033 | 0.051 | −0.055 |
Y5 | −0.005 | 0.020 | −0.010 | −0.241 | 0.008 | 0.032 | −0.063 | −0.202 | −0.024 | −0.020 | −0.078 | −0.010 |
Y6 | 0.268 | 0.255 | 0.252 | 0.402 | 0.263 | 0.244 | 0.232 | 0.159 | 0.320 | 0.190 | 0.091 | 0.250 |
Y7 | 0.276 | 0.218 | 0.274 | 0.611 | 0.258 | 0.209 | 0.114 | 0.062 | 0.281 | 0.088 | −0.050 | 0.259 |
Y8 | 0.057 | 0.060 | 0.067 | 0.160 | 0.035 | 0.055 | 0.048 | −0.050 | 0.101 | 0.092 | −0.099 | 0.071 |
Y9 | 0.530 | 0.488 | 0.540 | 0.444 | 0.508 | 0.473 | 0.373 | 0.320 | 0.549 | 0.220 | 0.318 | 0.525 |
Y10 | 0.064 | 0.019 | 0.041 | 0.266 | 0.083 | 0.008 | −0.161 | −0.158 | 0.191 | −0.143 | 0.041 | 0.008 |
Y11 | 0.056 | 0.007 | 0.053 | −0.032 | 0.079 | −0.003 | −0.051 | −0.151 | 0.112 | 0.047 | 0.140 | 0.047 |
Y12 | 0.206 | 0.128 | 0.195 | 0.427 | 0.215 | 0.118 | −0.036 | −0.142 | 0.273 | 0.012 | −0.021 | 0.160 |
Y13 | 0.229 | 0.150 | 0.223 | 0.540 | 0.225 | 0.135 | 0.072 | −0.024 | 0.265 | 0.154 | −0.081 | 0.187 |
Y14 | 0.131 | 0.157 | 0.134 | 0.093 | 0.113 | 0.165 | 0.130 | 0.192 | 0.058 | 0.057 | 0.036 | 0.139 |
Y15 | −0.115 | −0.132 | −0.113 | −0.065 | −0.107 | −0.138 | −0.054 | 0.079 | −0.099 | 0.070 | 0.080 | −0.102 |
Y16 | 0.107 | 0.119 | 0.100 | 0.014 | 0.110 | 0.114 | 0.079 | 0.019 | 0.115 | −0.041 | 0.015 | 0.080 |
Y17 | −0.216 | −0.202 | −0.202 | 0.048 | −0.240 | −0.200 | −0.262 | −0.219 | −0.149 | −0.143 | −0.185 | −0.204 |
Y18 | −0.014 | 0.061 | −0.039 | 0.012 | −0.016 | 0.064 | 0.171 | 0.096 | 0.024 | 0.026 | −0.017 | −0.038 |
Y19 | 0.162 | 0.094 | 0.171 | 0.178 | 0.166 | 0.079 | 0.050 | −0.076 | 0.214 | 0.045 | 0.023 | 0.158 |
Evaluation form of calculation results of community safety index
Community normalized safety index range | Number of communities |
---|---|
0< | 36 |
0.3< | 18 |
0.6< | 6 |
References
Ali, S. and Shah, M. (2008), “Floor fields for tracking in high density crowd scenes”, in Forsyth, D., Torr, P. and Zisserman, A.F. (Eds), Computer Vision –European Conference on Computer Vision (ECCV) 2008, Springer, Marseille, France, pp. 1-14, doi: 10.1007/978-3-540-88688-4_1.
Becker, R., Cáceres, R., Hanson, K., Isaacman, S., Loh, J.M., Martonosi, M., Rowland, J., Urbanek, S., Varshavsky, A. and Volinsky, C. (2013), “Human mobility characterization from cellular network data”, Communications of the ACM, Vol. 56 No. 1, pp. 74-82, doi: 10.1145/2398356.2398375.
Brabási, A.L. (2005), “The origin of bursts and heavy tails in human dynamics”, Nature, Vol. 435 No. 7039, pp. 207-211, doi: 10.1038/nature03459.
Brockmann, D., Hufnagel, L. and Geisel, T. (2006), “The scaling laws of human travel”, Nature, Vol. 439 No. 7075, pp. 462-465, doi: 10.1038/nature04292.
Burgess, H.K., DeBey, L.B., Froehlich, H.E., Schmidt, N., Theobald, E.J., Ettinger, A.K., HilleRisLambers, J., Tewksbury, J. and Parrish, J.K. (2017), “The Science of citizen science: exploring barriers to use as a primary research tool”, Biological Conservation, Vol. 208, pp. 113-120, doi: 10.1016/j.biocon.2016.05.014.
Charles, A. and Sguotti, D. (2021), “Sustainable earnings: how can herd behavior in financial accumulation feed into a resilient economic system?”, Sustainability, Vol. 13 No. 11, p. 5776, doi: 10.3390/su13115776.
Chen, T., Shi, J., Yang, J., Cong, G. and Li, G. (2020), “Modeling public opinion polarization in group behavior by integrating SIRS-based information diffusion process”, Complexity. doi: 10.1155/2020/4791527.
Csáji, B.C., Browet, A., Traag, V.A., Delvenne, J.C., Huens, E., Van Dooren, P., Smoreda, Z. and Blondel, V.D. (2012), “Exploring the mobility of mobile phone users”, Physica A: Statistical Mechanics and Its Applications, Vol. 392 No. 6, pp. 1459-1473, doi: 10.1016/j.physa.2012.11.040.
Du, N., Liang, Y., Balcan, M. and Song, L. (2014), “Influence function learning in information diffusion networks”, in Proceedings of the 31st International Conference on Machine Learning, Beijing, China, pp. 2016-2024.
Du, N., Song, L., Rodriguez, M.G. and Zha, H. (2013), “Scalable influence estimation in continuous-time diffusion networks”, Advances in Neural Information Processing Systems, Vol. 26 No. 3, pp. 3147-3155, doi: 10.48550/arXiv.1612.02712.
Gao, L., Song, C., Gao, Z., Barabási, A.L., Bagrow, J.P. and Wang, D. (2014), “Quantifying information flow during emergencies”, Scientific Reports, Vol. 4 No. 1, pp. 1-6, doi: 10.1038/srep03997.
Ghosh, B., Kenourgios, D., Francis, A. and Bhattacharyya, S. (2021), “How well the log periodic power law works in an emerging stock market?”, Applied Economics Letters, Vol. 28 No. 14, pp. 1174-1180, doi: 10.1080/13504851.2020.1803484.
Gowers, T. and Nielsen, M. (2009), “Massively collaborative mathematics”, Nature, Vol. 461 No. 7266, pp. 879-881, doi: 10.1038/461879a.
Jeon, J.H. and Kim, P. (2018), “Recurrent synchronization of coupled oscillators with spontaneous phase reformation”, Chaos, Vol. 28 No. 10, 103113, doi: 10.1063/1.5029285.
Jing, H., Li, X., Xu, G., Zhu, M., Shen, L., Liu, F. and Peng, H. (2021), “Dynamic study of intelligent traffic behavior based on multiple traffic modes”, Scientific Programming. doi: 10.1155/2021/7254907.
Kang, C., Ma, X., Tong, D. and Liu, Y. (2012), “Intra-urban human mobility patterns: an urban morphology perspective”, Physica A: Statistical Mechanics and Its Applications, Vol. 391 No. 4, pp. 1702-1717, doi: 10.1016/j.physa.2011.11.005.
Katzenbach, J.R. and Smith, D.K. (2015), The Wisdom of Teams: Creating the High-Performance Organization, Harvard Business Review Press, Boston, MA.
Laughlin, A.J., Sheldon, D.R., Winkler, D.W. and Taylor, C.M. (2014), “Behavioural drivers of communal roosting in a songbird: a combined theoretical and empirical approach”, Behavioural Ecology, Vol. 25 No. 4, pp. 734-743, doi: 10.1093/beheco/aru044.
Lazer, D., Pentland, A.S., Adamic, L., Aral, S., Barabasi, A.L., Brewer, D., Christakis, N., Contractor, N., Fowler, J., Gutmann, M., Jebara, T., King, J., Macy, M., Roy, D. and Van Alstyne, M. (2009), “Life in the network: the coming age of computational social science”, Science, Vol. 323 No. 5915, pp. 721-723, doi: 10.1126/science.1167742.
Li, N.N., Zhang, N. and Zhou, T. (2008), “Empirical analysis on temporal statistics of human correspondence patterns”, Physica A: Statistical Mechanics and Its Applications, Vol. 387 No. 25, pp. 6391-6394, doi: 10.1016/j.physa.2008.07.021.
Lin, T.Y., Winner, K., Bernstein, G., Mittal, A., Dokter, A.M., Horton, K.G., Nilsson, C., Van Doren, B.M., Farnsworth, A., La Sorte, F.A., Maji, S. and Sheldon, D. (2019), “MistNet: measuring historical bird migration in the US using archived weather radar data and convolutional neural networks”, Methods in Ecology and Evolution, Vol. 10 No. 11, pp. 1908-1922, doi: 10.1111/2041-210X.13280.
Ma, K., Wang, W., Yuan, N. and Chi, X. (2015), “Study on temporal patterns of medical emergency call”, in 2015 10th International Conference on Intelligent Systems and Knowledge Engineering (ISKE), pp. 403-407, doi: 10.1109/ISKE.2015.96.
Maciel, R.F.C., Pinheiro, M.M.K. and Bayerl, P.S. (2021), “Social forecasting: a literature review of research promoted by the United States national security system to model human behavior”, Revista Brasileira de Ciências Policiais, Vol. 12 No. 4, pp. 23-52, doi: 10.31412/rbcp.v12i4.612.
Naimi, A.I. and Westreich, D.J. (2014), “Big data: a revolution that will transform how we live, work, and think”, American Journal of Epidemiology, Vol. 179 No. 9, pp. 1143-1144, doi: 10.1093/aje/kwu085.
Oh, S. (2012), “A scalable multi-target tracking algorithm for wireless sensor networks”, International Journal of Distributed Sensor Networks, Vol. 8 No. 9, 938521, doi: 10.1155/2012/938521.
Scott, J. (1988), “Social network analysis”, Sociology, Vol. 22 No. 1, pp. 109-127, doi: 10.1177/0038038588022001007.
Shmueli, E., Singh, V.K., Lepri, B. and Pentland, A. (2014), “Sensing, Understanding, and shaping social behavior”, IEEE Transactions on Computational Social Systems, Vol. 1 No. 1, pp. 22-34, doi: 10.1109/TCSS.2014.2307438.
Song, C., Qu, Z., Blumm, N. and Barabási, A.L. (2010), “Limits of predictability in human mobility”, Science, Vol. 327 No. 5968, pp. 1018-1021, doi: 10.1126/science.1177170.
Wang, P., Zhou, T., Han, X.P. and Wang, B.H. (2014), “Modeling correlated human dynamics with temporal preference”, Physica A: Statistical Mechanics and Its Applications, Vol. 398, pp. 145-151, doi: 10.1016/j.physa.2013.12.014.
Wang, W., Yuan, N., Pan, L., Jiao, P., Dai, W., Xue, G. and Liu, D. (2015), “Temporal patterns of emergency calls of a metropolitan city in China”, Physica A: Statistical Mechanics and Its Applications, Vol. 436, pp. 846-855, doi: 10.1016/j.physa.2015.05.028.
Wuchty, S., Jones, B.F. and Uzzi, B. (2007), “The increasing dominance of teams in production of knowledge”, Science, Vol. 316 No. 5827, pp. 1036-1039, doi: 10.1126/science.1136099.
Zhang, X., Gong, Y. and Peng, L. (2020), “The impact of interdependence on behavioral engagement in online communities”, Marketing Intelligence & Planning, doi: 10.1108/MIP-05-2019-0285.
Further reading
Jiang, Z.Q., Xie, W.J., Li, M.X., Podobnik, B., Zhou, W.X. and Stanley, H.E. (2013), “Calling patterns in human communication dynamics”, Proceedings of the National Academy of Sciences, Vol. 110 No. 5, pp. 1600-1605, doi: 10.1073/pnas.1220433110.
Kwon, O., Son, W.S. and Jung, W.S. (2016), “The double power law in human collaboration behavior: the case of Wikipedia”, Physica A: Statistical Mechanics and Its Applications, Vol. 461, pp. 85-91, doi: 10.1016/j.physa.2016.05.010.
Wang, C., Huo, X. and Song, W.Z. (2013), “Integer programming based approach for multiple-targets trajectory identification in WSNs”, IEEE International Conference on Networking IEEE, pp. 106-111, doi: 10.1109/ICNSC.2013.6548720.
Acknowledgements
This work was supported in part by the Key R&D program of Zhejiang Province (2022C01083), The National Science Foundation of China (62102262) and Major Science and Technology Project of the 12th Division of Xinjiang Production and Construction Corps (SR202103).