
2016 Intl. Conference on Advances in Computing, Communications and Informatics (ICACCI), Sept. 21-24, 2016, Jaipur, India

Strategies to Handle Big Data for Traffic Management in Smart Cities

Satyanarayana V Nandury1,2 Beneyaz A Begum1,2


1- Academy of Scientific & Innovative Research 1- Academy of Scientific & Innovative Research
2- CSIR-Indian Institute of Chemical Technology 2- CSIR-Indian Institute of Chemical Technology
Hyderabad, India Hyderabad, India
satyanarayana.v.nandury@ieee.org beneyaz.cet@gmail.com

Abstract: The myriad sensors deployed across a smart city serve as a major source of Big data, which can potentially be used for various applications like smart governance, smart energy, smart traffic, smart environment, etc. However, issues related to the handling of such huge volumes of data, originating from thousands of heterogeneous sensor and IoT devices placed across the length and breadth of the city, emerge as a major challenge. While it may be relatively easy to identify the IT devices necessary for processing Big data, in the absence of clear strategies and a robust platform architecture for handling Big data, the deployment of these resources attains limited success. The SWIFT architecture introduced by us in an earlier work provides a ubiquitous platform for seamless interaction of various smart objects, devices and systems, and hence may prove to be an ideal architecture to capture, process and assimilate information from Big data. In this paper we discuss issues related to the implementation of SWIFT architecture for handling Big data for traffic management in smart cities. Various strategies to provide Big data solutions for smart traffic in terms of profiling traffic density, traffic signaling, managing parking lots, smart navigation and monitoring vehicular pollution are discussed in the paper.

Keywords: Big data; context; smart cities; SWIFT architecture; WSN

I. INTRODUCTION

Due to better lifestyle and employment opportunities, cities have become an attractive place for living, and the world is witnessing an increasing trend of rural folk moving over to cities. As per a recent study, about 60% of the global population is expected to live in cities by 2025 [1]. Cities the world over need to emerge as smart cities to meet this massive influx of rural masses into urban areas. To meet the infrastructure and socio-economic requirements of smart cities, the National Smart Cities mission of India has earmarked a budget of $160 billion for the next five years to develop 100 smart cities across the country [2]. The ever increasing demand for better living conditions in smart cities has forced the civil authorities to address issues related to the triple bottom line for sustainable development in smart cities [2], [3], viz. economic prosperity, environment integrity and social equity. Since smart and optimal utilization of resources is a key ingredient of any attempt toward achieving sustainable development, the resources need to be closely monitored. To ensure their optimization, smart cities need to deploy thousands of wireless sensors and actuators across the city to monitor various phenomena and to control & audit the usage of scarce resources.

The myriad sensors and actuators, however, keep generating huge amounts of data every second. Sitting on mounds of untapped data, smart cities are likely to encounter the herculean challenge of mining credible information from data being generated across the smart city. However, to handle such voluminous amounts of data, the traditional data processing approaches may not be adequate. To get over this issue, we introduce concepts of Big data analytics for processing Big data generated in smart cities for its potential application in various fields like smart infrastructure, smart environment, smart power & energy, smart traffic, smart health services, smart waste, etc. To illustrate the use of these concepts, we propose a few strategies to handle Big data for smart traffic management in smart cities.

Traffic congestion and delays in travel are a serious problem in urban locales. A congested route often leads to other problems such as longer waiting times, wastage of fuel, air and noise pollution, etc. A prerequisite for cities to have smart traffic management is a smart ICT infrastructure that is open, platform independent and can handle real-time Big data. To facilitate seamless connectivity of various heterogeneous sensors, actuators, and computing platforms, a robust IT backbone infrastructure needs to be created. To handle Big data and to meet challenges in providing QoS to a host of services, the foremost requirement for smart cities is an IT infrastructure that is pervasive to cover the entire city; ubiquitous to provide 24x7 services; reliable to tolerate faults and provide QoS; flexible to host different types of devices & systems; scalable to accommodate city expansion; adaptable to embrace newer technologies, regulations and policies; and more importantly, versatile to handle the vagaries of humankind and their never ending quest for better living conditions. To this end, we have proposed a platform architecture named Smart WSN-based Infrastructural Framework for smart Transactions (SWIFT) for smart cities in [3]. We discuss the application of SWIFT architecture for handling Big data for smart traffic applications in smart cities.

The rest of the work is organized as follows: In Section II, we present an overview of the global market for Big Data, available

978-1-5090-2029-4/16/$31.00 ©2016 IEEE



platform & computing tools to handle Big data and their limitations, the smart city and its challenges, and a review of related work on smart traffic monitoring. In Section III, we present the requirements for smart traffic management in smart cities. The challenges likely to be encountered while applying Big data for smart traffic applications are discussed in Section IV. In Section V, we discuss the role of context-based Big data in smart traffic management and propose a few strategies to handle Big data for smart traffic applications. The application of SWIFT architecture for utilizing context-based Big data for smart traffic is presented in Section VI.

II. BACKGROUND LITERATURE

A city may be called smart when investments in human & social capital and modern information & communication infrastructure fuel sustainable economic growth for better quality of life, through smart management of natural resources and participatory governance [4]. The ICT revolution, which fueled the growth of smart cities, allows unprecedented generation of data, while at the same time providing ubiquitous public access to information. The global smart city market is poised to grow at more than 14%, and is expected to reach over $1.3 trillion by 2019 [5].

As digitization has become an integral part of social life, it is estimated that about 90% of the world's digitized data was captured over just the past two years [6]. This has driven many governments and R&D organizations to utilize Big data technology to support the development and sustainability of smart cities [7].

Big Data is not only large but also largely unstructured, complex and heterogeneous, composed of various data types and streaming data, and may even be ambiguous, noisy or erroneous, which may have a negative impact on statistical and data analysis techniques and may degrade the accuracy of the mining results. Conventional data analytic algorithms can be applied for analysis of Big data based on the specific application domain (e.g., clustering algorithms such as BIRCH [8], classification algorithms such as k-nearest neighbor, linear and quadratic discriminant analysis, Naive Bayes, support vector machine and artificial neural network).

Big data is an essential component of smart city applications, as the massive data generated from smart sensors, Internet of Things (IoT) devices and crowdsourcing can be used by organizations and government to provide smart solutions to smart city dwellers. Big data can be characterized by three key features (3-Vs): volume, velocity and variety [9], [10]. Volume refers to the size of data that has been generated from multiple sources. Velocity means the speed at which data is generated, stored, analyzed and processed. Variety refers to the different types of data being generated. As most data generated is unstructured, it cannot be easily categorized or tabulated. In addition, features like variability and value are needed to present a complete definition [11], [12]. Variability defines how the structure and meaning of data changes constantly, while Value refers to the core information that needs to be retrieved from big data.

Some key challenges that any government and city planners might encounter while implementing smart city strategies are: integration/collaboration between different organizations and application services, the openness of organizations for sharing information resources to ensure data accuracy, security, support for good ICT infrastructure, different smart city solutions, multiple technologies, participation of people, etc. With advanced Big data solutions and ICT infrastructure, such challenges could be transformed into opportunities (better resource utilization, better quality of life, information transparency and usage).

Data can be acquired from multiple data generating sources like the digitized city and its citizens through use of smartphones, smart devices, computers, internet, ubiquitous sensing, cameras, RFID devices, WSNs, GIS maps, crowdsourcing, etc. However, the size, complexity, quality and diversity of the collected data impose complexities in storage, retrieval and analysis of data, since the existing standard statistical techniques and mining tools have limited capability to process Big data. Reports from the business world reveal that effective analysis and utilization of big data can help organizations out-perform their competitors [13], [14]. To harness information from such large volumes of unstructured data, Big data solutions have emerged that significantly lower the cost of mining customized information from unstructured data.

The SWIFT architecture proposed by us in [3] provides an architectural framework that supports integration of heterogeneous devices, IoT components and technologies to facilitate seamless flow of information, and can readily be tuned for Big data applications. In this work, we discuss the application of SWIFT architecture for handling Big data for smart traffic management.

SWIFT is a three-tiered architecture, with the Smart Wireless Sensor Network (S-WSN) layer forming the base. The second layer, Smart Wireless-based Pervasive Edifice (SWIPE), resides on the S-WSN layer. The third is the apex layer, Smart Decision & Control Enabler (SDCE). A schematic of SWIFT architecture is shown in Fig. 1.

The S-WSN layer acts as the sensory organ of the SWIFT architecture. The layer consists of several hundreds of physically dispersed wireless sensor nodes that sense a phenomenon of interest and report the data for further analysis. Smart Cluster Heads (SCHs) are deployed at various locations in the city to collect and aggregate data from nearby sensor nodes. Such deployment facilitates acquisition of real-time data related to various physical and social phenomena and paves the way to create contextual information. In addition to sensed data, the nodes transmit their node ID and battery status to the SCH. The SCHs are capable of taking low-end, but nevertheless important, decisions like raising an alarm, generating emergency actuation signals, etc. After aggregating and processing the data, the SCHs transmit the information to nearby Smart Fusion Nodes (SFN) in the SWIPE layer.
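
The role of an SCH described above (collecting sensed values together with node ID and battery status, aggregating them, and taking a low-end decision such as raising an alarm) can be sketched as follows. This is a minimal illustration and not the SWIFT implementation: the `NodeReading` record, the `aggregate_at_sch` function, the use of a simple mean as the aggregate, and both thresholds are assumptions introduced for this example.

```python
from dataclasses import dataclass
from statistics import mean


@dataclass
class NodeReading:
    """One report from a sensor node (hypothetical record layout)."""
    node_id: str    # node ID transmitted along with the sensed data
    battery: float  # battery status as a fraction in [0, 1]
    value: float    # sensed value; units depend on the sensor


def aggregate_at_sch(readings, alarm_threshold, low_battery=0.2):
    """Aggregate node reports at a cluster head and take a low-end decision.

    The mean aggregate and both thresholds are illustrative assumptions.
    """
    if not readings:
        raise ValueError("no readings to aggregate")
    avg = mean(r.value for r in readings)
    return {
        "count": len(readings),
        "mean_value": avg,
        # Low-end decision taken locally, before forwarding to an SFN.
        "raise_alarm": avg > alarm_threshold,
        # Nodes whose battery status suggests they need attention.
        "low_battery_nodes": sorted(
            r.node_id for r in readings if r.battery < low_battery
        ),
    }


readings = [
    NodeReading("n1", 0.9, 40.0),
    NodeReading("n2", 0.1, 60.0),
    NodeReading("n3", 0.5, 80.0),
]
summary = aggregate_at_sch(readings, alarm_threshold=50.0)
```

In this sketch the summary dictionary is what the SCH would forward to a nearby SFN; a real deployment would likely use a richer aggregate than a mean.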


Fig. 1. SWIFT Architecture.

SWIPE is the heart of SWIFT architecture. It comprises several Smart Fusion Nodes (SFN) that act as the edifice for SWIFT architecture. SFNs act as data classifiers and perform data fusion to draw meaningful interpretation of the sensed data for query processing and other related services. They collect information from the S-WSN layer to facilitate ubiquitous computing. Smart Decision & Control Enabler (SDCE) is the apex layer that provides a host of services (cloud) to all smart objects in the city based on data provided by SWIPE. The architecture is modeled to provide IT infrastructure that is pervasive, ubiquitous, reliable, flexible, scalable, adaptable and versatile.

III. REQUIREMENTS FOR SMART TRAFFIC MANAGEMENT

The most essential requirements for a smart traffic management system are:

• Effective traffic monitoring and control system
• Data collection agents
• Robust ICT infrastructure and ubiquitous computing platforms
• Big data sources

A. Traffic Monitoring & Control Devices and Systems

Foremost amongst the requirements for smart traffic management is the infrastructure necessary for effective and robust traffic monitoring. Meaningful information obtained from various sensors deployed across the city for traffic monitoring helps in devising systems that i) reduce travel time, ii) avoid traffic congestion for smooth traffic flow, iii) prevent accidents, iv) guide travelers to routes which are less congested, v) identify parking slots, vi) clear routes for emergency vehicles, and vii) control traffic signal cycles.

B. Data Collection Agents

Collection of information is vital for implementing an effective traffic management and control system. The sensors and actuators necessary to sense and control traffic must be robust enough to continue to perform well under widely varying weather & climatic conditions all through the year. The information collecting devices can be either static or mobile. A few such collection devices are listed below:

a) Wireless Sensor Network (WSN): A WSN consists of a large number of sensors and/or actuators deployed in a region of interest (ROI) to collect data depending on the application requirement. The design of the network must take into account the nature of the environment where the nodes are deployed and the objectives of the specific application. Smart sensors perceive ambient physical parameters, process information and transmit the data signal to the nearby access points. Heterogeneous sensors can be integrated in a single device or deployed on both sides of the road for sensing road conditions and traffic.

b) Internet of Things (IoT): Besides sensors, IoT can also be utilized to record events. It is an information network that connects devices, smart devices and RFID readers to the internet via Wi-Fi and other advanced wireless technologies. IoT allows ubiquitous access to information collected on different technology platforms (e.g. sensors and sensor networks, cloud, data modeling, storing, processing, reasoning, ICT) [15].

c) Video Surveillance: It enables the capture of real-time images/videos of incidents and passersby on the road.

d) Satellite system: It provides GIS maps/aerial photographs of the road network in a city. Vehicles equipped with GPS can be guided to less congested routes.

e) Meteorological sensors: Meteorological information helps vehicle owners plan their visits to various places during the day. They can avoid venturing out during rain or when it is likely to snow.

f) Traffic control centers: The control centers collect information on vehicular movement in different zones in the city and broadcast alert messages to their subscribers through various media like FM radio, SMS, mail and social media.

g) Mobile Infrastructure: It includes probe vehicles (GPS-enabled vehicles) and portable devices (such as smartphones) that use localization systems for sensing their position (timestamp, latitude, longitude) and speed, and wireless terminals to allow sharing of the sensed data. Smartphones and GPS-enabled vehicles (crowdsourcing) can provide location and velocity estimates using the GPS sensor.

C. ICT Infrastructure and Ubiquitous Computing Platform

To handle Big data and the complexities involved in its processing, a suitable computing platform is required. The computing platform needs to consider data heterogeneity, time to produce the results, processing load and training and


implementation of different models (traffic model, road model, weather model, etc.). The ICT infrastructure must ensure accurate, timely and reliable delivery of data from road traffic monitoring systems. To allow direct communication between the monitoring infrastructure (sensors, etc.) and traffic-related services, radio communication based on advanced wireless technology (WiFi, 3G, 4G, WLAN, Bluetooth, EDGE, LTE, WiMAX, etc.) can be used.

D. Big data sources

Big data analytics and context tagging are among the recent technologies that have huge potential to enhance smart city services. The primary role of big data in cities is to facilitate assimilation of information from multiple flows. For understanding the traffic flow trend/pattern, travel time and congestion, Big data appears to be a major solution for a smart traffic monitoring system. Table I presents various sensors/data sources that collect Big data, the type of data that they generate and their applications. Present day governments are adopting the smart city concept in their cities and implementing big data applications that support smart city services to reach the required level of sustainability and improve the living standards & mechanisms of learning, and coordination by heterogeneous individuals [14].

TABLE I. DIFFERENT DATA SOURCES AND THEIR PURPOSE

Sensors/data sources | Data generated | Application
--- | --- | ---
Magnetic loops | Act as counters, counting vehicles passing over them. Some variants are used to classify vehicles. | Detect vehicle
Inductive loops | Detect vehicle passage and presence. | Detect vehicle
Laser-based radars | Provide vehicle presence at traffic signals, volume, speed, length assessment, queue measurement, and classification. |
Infra-red detectors | Detect vehicle passage or presence. |
Ultrasonic detectors | By measuring the time taken for the sound echo to return, the distance of an object can be found. Ultrasonic Doppler detectors also measure vehicle speed. |
Microwave radar detectors | Traffic volume, speed, and vehicle length. | Calculation of traffic data
Anisotropic Magneto-resistive (AMR)/Magnetic sensors | Count stationary or moving vehicles in a particular lane. Classify vehicles. | Count and detect vehicles
Acoustic sensors | Measure vehicle passage, presence and speed by detecting sounds produced by vehicular traffic. | Vehicle count
Satellite systems | GIS map of road network. |
GPS-based systems (smartphone, vehicle) | GPS-derived location information is used to track movement of vehicles (speed) and report this information to a server. | Localization information
Camera-based systems | Detect, count and classify vehicles. | Vehicle passage, presence and speed can be extracted by image processing
Temperature and humidity sensors | Temperature and relative humidity. | Weather
Accelerometers | Sensing structural damage of bridges and pavement. |
Sound level meters/microphones | Sound level (dB) in a traffic area. |
Pollutant detectors | CO, PM2.5, CO2, NO2. | AQI

IV. BIG DATA APPLICATION CHALLENGES

In the context of smart city traffic monitoring, we discuss below the challenges in real-time traffic monitoring and in implementing data analytics.

Traffic behavior varies from location to location, and the trend is non-linear and is affected by several factors such as traffic volume, weather, traffic signals, road occupancy, vehicle size, etc. Traffic flows in urban road networks are highly variable across different network links and different times of day. Therefore, area-wise traffic data collection is necessary. Some of the challenges posed by existing methods are briefly discussed below:

• Tracking of GPS Data: One of the popular methods to track a moving object is the linear interpolation method, which is effective when the velocity of the vehicle between sampling points is constant. However, if the vehicles accelerate and decelerate frequently, which is quite normal, deciphering the Big data generated by the GPS at various sampling units is a major challenge.

• Dispersion method: It cannot model road geometry, road occupancy and complex traffic flows. Under such traffic flows, neural nets and other learning algorithms that rely on Big data could succeed in determining solutions for complex non-linear problems.

• Crowd sensing using GPS-enabled navigation systems: Data from GPS-equipped mobile phones/vehicles can be affected by many factors, for example, errors in transmitting the GPS signal, electromagnetic interference, etc. One way to deal with this problem is to combine the positional information of GPS data and the road information based on the GIS map, and to overlap them to retrieve information related to traffic congestion.

• Non-experimental automated vehicle location (AVL): Probe data collected by AVL systems that use GPS tend to be sparse compared to other systems that typically record samples once every 20 or 30 seconds. Therefore, techniques developed for higher sampling rates are not directly applicable. Such sparse data can be modeled using a Hidden Markov Model for analysis of travel time distributions.


TABLE II. CONTEXTUAL ENTITY AND THEIR ATTRIBUTES FOR SMART TRAFFIC MONITORING SYSTEM

Entity | Features | Classifier | Inference/Reasoning
--- | --- | --- | ---
Road state | Number of lanes, maximum speed limit, road type, one/two-way, road segment geometry, intersection points, type of road surface. | S | Possibility of congestion and propagation of congestion.
Traffic state | Speed, flow and vehicle counts (occupancy) in short time intervals. | T | Traffic state (traffic density) is mapped to congestion level.
Weather data | Temperature (Temp), relative humidity (RH), air pressure and density, wind speed and direction, sunny, rain. | S | Temp = °C, RH; high (air pressure + temp + density) = cloudy, low (air pressure + temp + density) = clear sky; wind direction; sunshine and UV intensity level; Rain: Tr = trace (< 0.1 mm), Sl = slight rain (0.1 - 10.0 mm), M = moderate rain (10.1 - 35.0 mm), H = heavy rain (35.1 - 90.0 mm), VH = very heavy rain (> 90.1 mm).
Daytime | TW1, TW2, ..., TW24 | T | The 24 hours can be divided into time windows of 1 hour each.
Vehicle mobility | Stationary; moving: low-speed, normal speed. | T | Driving condition; free-flowing traffic causes less acceleration and deceleration and consequently impacts the noise emitted by the engine.
Human mobility | Latitude, longitude, speed. | T | Infer user's location based on moving cell phone.
Street lights | Foggy-F, Cloudy-C. | T | Intelligent lighting in foggy or cloudy weather.
Traffic signal | Color of signal: green, orange, red; direction of flow: left, right, forward; average waiting time. | T | As queue length is proportional to traffic volume, with road width as the proportionality constant, tuning traffic signals in proportion to waiting traffic volumes and queue lengths can help to improve traffic flow.
Air | Concentration of CO, PM2.5, CO2, NO2. | S | AQI of the area.
Noise | Sound level at different hours: daily equivalent sound levels (LeqD), LA10 and LA90, LAeq, Lden, Lday, Lnight. | S | 70 dB: busy traffic.
Crowdsourcing | Location information is used to estimate speed of vehicle and helps in identifying anomalous traffic situations. | S and T; speed distribution | Establish correlation between traffic congestion and vehicles halting at a traffic signal.

S: Spatial Classifier and T: Temporal Classifier.
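
The rain categories given in the Weather data row of Table II can be expressed as a small lookup. The sketch below is only an illustration: the function name `rain_category` is hypothetical, and because the table leaves small gaps between bands (for example between 10.0 and 10.1 mm), the code assumes that each band extends up to its stated upper limit and that anything above 90.0 mm is very heavy rain.

```python
def rain_category(rainfall_mm: float) -> str:
    """Map a rainfall amount (mm) to the Table II rain codes.

    Assumption: bands are treated as contiguous, so each band runs up to
    and including its stated upper limit.
    """
    if rainfall_mm < 0:
        raise ValueError("rainfall cannot be negative")
    if rainfall_mm < 0.1:
        return "Tr"   # trace
    if rainfall_mm <= 10.0:
        return "Sl"   # slight rain
    if rainfall_mm <= 35.0:
        return "M"    # moderate rain
    if rainfall_mm <= 90.0:
        return "H"    # heavy rain
    return "VH"       # very heavy rain


codes = [rain_category(mm) for mm in (0.05, 5.0, 20.0, 50.0, 120.0)]
```

A categorical code of this kind is easier to use as a contextual feature than the raw rainfall measurement.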

V. CHALLENGES TO HANDLE BIG DATA AND STRATEGIES TO PROVIDE BIG DATA SOLUTION FOR SMART TRAFFIC

We can infer real-time and fine-grained traffic behavior information for the entire city using Big data generated from heterogeneous sources. Big data analytics can help understand the traffic pattern and thereby identify the root cause of congestion on roads. The analytics can also be used in forecasting congestion, giving alerts on roads and selecting routes based on congestion data. With an estimate of traffic density and travel time, real-time traffic information along with historical traffic data can alleviate the congestion problem. In this section, we briefly outline the difficulties in handling heterogeneous data, propose strategies to handle the Big data generated from traffic monitoring data generators, and use context-based machine learning and reasoning to perform real-time data analytics on the Big data. Some of the challenges in handling Big data are listed below:

• Identification of features from each kind of data source.
• Integrating multiple heterogeneous data sources into a learning model (semi-supervised learning models such as co-training and tri-training models [16]-[19]).
• Spatially-related data: location of interest, road network, weather (cloudy, rainy, sunny).
• Temporally-related data: traffic, human mobility, traffic signal, day/night.
• Data sparseness: many locations to infer; limited number of traffic monitoring stations on road segments.

To get over these challenges, context-driven Big data analytics need to be adopted for smart traffic. They are briefly enumerated below. A semi-supervised learning model that makes use of the analytics is also discussed in this section.

A. Role of Context-based Analytics on Real-Time Traffic Data

Context is a critical component of the analytic decision process. Context is any information derived from data observations about entities (e.g., person, place, or object), depending on the interaction between an entity and an application. Schilit et al. [20] define three aspects of context: where, who, and what. The definition of context constantly changes with the change in execution environment, i.e., computing environment (available processors, devices accessible for user input and display, network capacity, connectivity, and costs of computing); user environment (location, collection of nearby people, and social situation); and


physical environment (such as lighting and noise level). By using context analytics with big data, high quality data models can be developed and implemented to derive trends, patterns, and relationships from unstructured data and related structured road traffic data. Contextual data can be used to support better decisions. Integrating context with Big data analytics could assist in detecting relationships between related entities across large, sparse, and unrelated collections of historic and current data. Table II lists the contextual entities and their attributes for a smart traffic monitoring system. These contextual entities can be used to develop different data models for training and reasoning.
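
As one concrete instance of context tagging, the Daytime entity of Table II divides the day into 24 one-hour windows, TW1 to TW24. The sketch below attaches such a temporal tag and a location to a raw reading. The function names, the record layout, and the convention that TW1 covers 00:00 to 00:59 are assumptions made for illustration; the paper does not fix them.

```python
from datetime import datetime


def time_window(ts: datetime) -> str:
    """Return the one-hour window label for a timestamp.

    Assumption: TW1 covers 00:00-00:59 and TW24 covers 23:00-23:59.
    """
    return f"TW{ts.hour + 1}"


def tag_reading(entity: str, value: float, ts: datetime,
                lat: float, lon: float) -> dict:
    """Attach temporal and spatial context to a raw reading (hypothetical layout)."""
    return {
        "entity": entity,
        "value": value,
        "time_window": time_window(ts),  # temporal context
        "location": (lat, lon),          # spatial context
    }


# Illustrative values only: a speed reading taken at 08:15.
tagged = tag_reading("traffic_state", 42.0,
                     datetime(2016, 9, 21, 8, 15), 17.42, 78.54)
```

Readings tagged in this way can be grouped by time window and location before being passed to a temporal or spatial model.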
B. Strategies to handle Big data

• Divide the city into disjoint road segments (grid).
• Extract features of each road segment: The road segment features include road map geometry, number of lanes, one way or two way, entry and exit points on road segments, traffic signal points, length of route on either side of road segment, number of intersection points that affect the location, vehicle density/occupancy, average speed of the traffic on the lane, etc.
• Extract traffic features: The traffic features include traffic flow, normal speed, low speed; direction of movement of traffic (left, right, forward), distribution of vehicle speed with time, expected speed to cover the lane, standard deviation from the expected speed, traffic signaling time, average waiting time, and number of signal cycles to allow a vehicle to pass.
• Extract vehicle features: Type of vehicle; average velocity of vehicles of the same type.
• Extract meteorological features: Weather: rainy, cloudy, sunny, foggy.
• Extract human mobility features: ID of the cellphone tower where the calls are being handled (cellular phone as mobile sensor), rough estimate of user's location (latitude, longitude) and traffic information.
• Extract time features: Daytime: morning, afternoon, evening, night; peak time and off-time; divide time into 24 one-hour windows.
• Apply the co-training semi-supervised learning model [17], [18] for each factor affecting the traffic behavior: Predict traffic congestion with sparse data sets, using time and spatially related features.

C. Supervised Learning Model

The supervised learning model comprises the following:

• State of traffic pattern: temporal correlation in a route, geo-correlation between two routes.
• Predict traffic congestion of a route.
• Possibility of congestion propagation to other routes.
• Extract temporal features (traffic flow, weather, day, human mobility, vehicle mobility) and spatially related features (road networks, location of interest).

One of the main steps involved in tuning the learning behavior of the model is co-training. Co-training is a machine learning algorithm used when there are only small amounts of labeled data and large amounts of unlabeled data (audio/acoustic, videos) available [17], [18]. It is a semi-supervised learning technique that requires two views of the data, i.e., two different conditionally independent feature sets that provide different, complementary information about the instance, where each view is sufficient on its own to accurately classify the instance. Co-training provides a better inference if one of the classifiers correctly labels a piece of data that the other classifier previously misclassified. The choice of an algorithm is based on the specific application domain.

a) Temporal classifier (T): It models the traffic pattern of a location/route using temporally related features based on a Chain Conditional Random Field (CRF) [17].

b) Spatial classifier (S): It models spatial correlation of the traffic pattern between two congested routes of different locations using a Back Propagation network or RBF (Radial Basis Function).

First, the two classifiers (S and T) are trained independently with two separate sets of features on labeled data sets, and then they train each other on unlabeled data. The trained sets are then used to infer unlabeled road segments iteratively. The trained data defines the inference rules that can serve as a knowledge base for performing data fusion operations and drawing inference from the output, as shown in Fig. 2.

Fig. 2. Training process of classifier using Big data.

During the inference stage, both the trained sets are applied to the corresponding features separately, to determine the traffic pattern of a route/road segment by taking into account the probability scores of both classifiers. We can use the route selection function (decision-making) to assist drivers for smooth travel services. Similarly, based on the number of vehicles and pedestrians, traffic signaling can be controlled to organize traffic flow.
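
The co-training procedure described above (two classifiers trained on separate feature views, each pseudo-labeling unlabeled road segments for the other, and inference combining both probability scores) can be sketched in a few lines. This is a minimal sketch under stated assumptions, not the authors' model: a tiny nearest-centroid classifier stands in for both the chain CRF and the back-propagation/RBF network, and the feature values, class names, `co_train` and `infer` are hypothetical.

```python
import math


class CentroidClassifier:
    """Nearest-centroid classifier; a stand-in for the CRF / neural-network models."""

    def fit(self, X, y):
        sums, counts = {}, {}
        for x, label in zip(X, y):
            acc = sums.setdefault(label, [0.0] * len(x))
            for i, v in enumerate(x):
                acc[i] += v
            counts[label] = counts.get(label, 0) + 1
        self.centroids = {c: [s / counts[c] for s in sums[c]] for c in sums}
        return self

    def predict_proba(self, x):
        # Normalised exp(-distance) to each class centroid.
        w = {c: math.exp(-math.dist(x, m)) for c, m in self.centroids.items()}
        total = sum(w.values())
        return {c: v / total for c, v in w.items()}


def co_train(temporal, spatial, labels, max_rounds=10):
    """labels[i] is a class name or None (unlabeled). Returns (T, S, labels)."""
    labels = list(labels)
    T, S = CentroidClassifier(), CentroidClassifier()

    def refit():
        known = [i for i, lab in enumerate(labels) if lab is not None]
        T.fit([temporal[i] for i in known], [labels[i] for i in known])
        S.fit([spatial[i] for i in known], [labels[i] for i in known])

    refit()
    for _ in range(max_rounds):
        if all(lab is not None for lab in labels):
            break
        for clf, view in ((T, temporal), (S, spatial)):
            unknown = [i for i, lab in enumerate(labels) if lab is None]
            if not unknown:
                break
            # This classifier pseudo-labels the segment it is most confident
            # about; the label then also trains the other classifier.
            scored = []
            for i in unknown:
                proba = clf.predict_proba(view[i])
                best = max(proba, key=proba.get)
                scored.append((proba[best], i, best))
            _, idx, label = max(scored)
            labels[idx] = label
        refit()
    return T, S, labels


def infer(T, S, t_feat, s_feat):
    """Combine the probability scores of both classifiers (simple average)."""
    pt, ps = T.predict_proba(t_feat), S.predict_proba(s_feat)
    return max(pt, key=lambda c: (pt[c] + ps[c]) / 2)


# Hypothetical road segments: temporal view = [mean speed, flow],
# spatial view = [lanes, intersections]; only two segments are labeled.
temporal = [[10, 90], [55, 20], [12, 85], [50, 25], [15, 80], [58, 15]]
spatial = [[1, 6], [4, 1], [1, 5], [3, 1], [2, 6], [4, 2]]
labels = ["congested", "free", None, None, None, None]
T, S, completed = co_train(temporal, spatial, labels)
```

Averaging the two probability scores is one simple way to "take into account the probability scores of both classifiers"; other fusion rules, such as a product or a weighted sum, would fit the description equally well.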


VI. APPLICATION OF SWIFT ARCHITECTURE FOR HANDLING BIG DATA FOR SMART TRAFFIC

This section discusses the implementation of the SWIFT architecture in providing a Big data solution for smart traffic management in smart cities. The entire process of the Big data solution for smart traffic can be divided into the following steps:

• Deployment of event monitoring/sensing devices
• Data collection from different data sources/generators, as illustrated in Fig. 3
• Preprocessing: data interpretation and context analysis
• Processing of data stream and execution of data analytics (online)
• Data storage
• Data analytics (offline)

We briefly describe the data flow and data handling strategies in each layer of SWIFT.

A. Smart WSN Layer

For freeway environments, static infrastructure is suitable, but for urban road networks, where the traffic is highly variable across different network segments and different times of day, area-wise traffic data collection is required. For collecting location-specific, temporal traffic data from different sources, the entire road network of a city can be divided into disjoint segments. Each segment of the road is assumed to have structured lanes, with lane discipline for the vehicles. The S-WSN is the sensory layer formed by SCHs, acting as data collecting platforms or points of access, and traffic monitoring sensors. The sensors are deployed on both sides of the road segments and report data at regular intervals. Each road segment has one or more SCHs, mounted at overhead positions such as traffic posts or lampposts, depending on the length of the road and whether the road is one-way or two-way. The location- and application-specific SCHs aggregate data from all traffic monitoring devices and send the traffic state information to the nearby SFN.

An important function of the SCH is to perform context tagging on the raw data (data transformation) collected from the sensors, conduct statistical analysis to determine the sensors' status and lifetime, and generate alerts to take action against faulty nodes. The SCH stores the incoming spatio-temporal traffic data, generated at regular intervals, in a time window (time duration) and performs the following operations. The SCH and the basic operations performed on tagged contextual data are illustrated in Fig. 4(a) and Fig. 4(b), respectively.

• Interpreter: Every received location-tagged data signal is passed through the Contextual Interpreter to retrieve the node component status and its corresponding observed readings.
• Node Context Analyzer: It evaluates node component status data to determine the working status and lifetime of the data generator. The Node Context Analyzer uses the strength of the signal (RSSI), sensor age, battery state relative to the minimum operating power, voltage output from the transducer present in the calibration field, and delay in response (sensor response hysteresis affecting data quality) to check the fitness of the node. It generates alerts through web services for node replacement and reconfiguration of the network if there is a random unwanted variation. If there is no unwanted variation, the contextual readings of a place (the traffic situation sensed by the sensors) are geo-tagged with the SCH ID and timestamp (WSN ID, its location, and the time window during which the data are collected), and then forwarded to the SFNs.
• Contextual Data Generator: The modular structure of the SCH maintains spatial and temporal consistency of data via context information. As soon as the time window reaches its maximum, the data stream is forwarded via the ICT infrastructure to the SFNs without any delay.

The SFNs deployed at traffic intersection points need to ensure the reception of data from nearby predesignated SCHs. The SFN conditions the data received from the SCHs: it filters out inaccuracies caused by hardware and communication faults by removing outliers, and validates the context of the data.
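
The SCH operations described above (node fitness check, alerting, and geo-tagging of a time window) might look roughly as follows. This is only a sketch: the field names, the threshold values in `LIMITS`, the identifier `SCH-17`, and the alert dictionary standing in for the web-service call are assumptions made for illustration and do not come from the SWIFT specification.

```python
from dataclasses import dataclass
from statistics import mean


@dataclass
class NodeStatus:
    rssi_dbm: float      # received signal strength (RSSI)
    age_days: int        # sensor age
    battery_v: float     # battery voltage
    response_ms: float   # delay in response


# Hypothetical fitness limits; real values depend on the deployed hardware.
LIMITS = {"min_rssi_dbm": -90.0, "max_age_days": 1825,
          "min_battery_v": 3.0, "max_response_ms": 500.0}


def node_faults(status, limits=LIMITS):
    """Return the failed fitness checks; an empty list means the node is fit."""
    faults = []
    if status.rssi_dbm < limits["min_rssi_dbm"]:
        faults.append("weak signal")
    if status.age_days > limits["max_age_days"]:
        faults.append("sensor too old")
    if status.battery_v < limits["min_battery_v"]:
        faults.append("low battery")
    if status.response_ms > limits["max_response_ms"]:
        faults.append("slow response")
    return faults


def process_window(sch_id, location, window, status, readings):
    """Raise an alert for an unfit node, else geo-tag the window for the SFN."""
    faults = node_faults(status)
    if faults:
        # Stand-in for the web-service alert mentioned in the text.
        return {"alert": {"sch_id": sch_id, "faults": faults}}
    return {"forward": {
        "sch_id": sch_id,
        "location": location,
        "window": window,  # (start, end) of the collection time window
        "vehicle_count": sum(r["vehicles"] for r in readings),
        "mean_speed_kmh": mean([r["speed_kmh"] for r in readings]),
    }}


readings = [{"vehicles": 12, "speed_kmh": 40.0}, {"vehicles": 18, "speed_kmh": 50.0}]
healthy = NodeStatus(rssi_dbm=-70.0, age_days=400, battery_v=3.6, response_ms=120.0)
weak = NodeStatus(rssi_dbm=-95.0, age_days=400, battery_v=2.7, response_ms=120.0)
window = ("2016-09-21T08:00", "2016-09-21T08:05")

print(process_window("SCH-17", (17.42, 78.54), window, healthy, readings))
print(process_window("SCH-17", (17.42, 78.54), window, weak, readings))
```

Returning either an alert or a tagged record keeps the two outcomes of the Node Context Analyzer explicit, so a faulty node never contributes readings to the stream forwarded to the SFN.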

Fig. 3. Big data sources for road traffic monitoring.
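
The text does not say how the SFN removes outliers from the data it receives. One common choice, assumed here purely for illustration, is a filter based on the median absolute deviation (MAD); the function name `remove_outliers`, the cut-off `k`, and the sample speeds are hypothetical.

```python
from statistics import median


def remove_outliers(values, k=3.0):
    """Keep values within k scaled median absolute deviations of the median."""
    med = median(values)
    mad = median([abs(v - med) for v in values])
    if mad == 0:
        # Degenerate spread: no basis for rejecting anything, keep the window as is.
        return list(values)
    # 1.4826 scales the MAD to be comparable to a standard deviation
    # for normally distributed data.
    limit = k * 1.4826 * mad
    return [v for v in values if abs(v - med) <= limit]


# Speeds (km/h) reported for one road segment in one time window;
# 250 is taken to be a faulty reading.
speeds = [42, 45, 44, 43, 250, 46, 41]
print(remove_outliers(speeds))
```

A median-based rule is used in this sketch because a single faulty reading barely shifts the median, whereas it can distort a mean-and-standard-deviation threshold.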

Fig. 4. Diagrammatic representation of (a) SCH module, and (b) basic operations performed on tagged contextual data.

Fig. 5. Diagrammatic representation of (a) SFN module and (b) operations performed by the components to estimate traffic situation.

B. Smart Fusion Nodes

The SFNs keep receiving Big data from various heterogeneous sources including SCHs, crowd-sourced data via participatory sensing (GPS-equipped mobile phones, cars), video surveillance data, etc. The volume of data grows with the frequency at which data are collected. To visualize the effect of traffic state on a road and manage the traffic flow, SFNs provide an open platform that can implement machine learning with parallel computing for executing real-time data analytics. The flexible nature of the SFN enables it to intelligently fuse and analyze the complex data stream, creating meaningful information from data generated by the physical and computing environment. The operations handled by the various components of the SFN, viz. data classifier, intelligent router, etc., as depicted in Fig. 5(a) and Fig. 5(b), are discussed below.

• Primary Context Data Classifier and Intelligent Router: Based on the frequency of data collection, the classifier extracts features/attributes, along with their corresponding measurements, for each road segment (context) from the data received in a given time window. It categorizes the features into spatial and temporal data sets for training. It uses the Knowledge Base to select the training/learning methods to be applied. The Intelligent Router dynamically routes the incoming traffic data across multiple PEs to balance the load (distribute the load in terms of data volume) and achieve better fault tolerance.
• Data Fusion Engine: Fuses the data from different sources and executes the spatial and temporal classifiers based on inference rules in the Knowledge Base to generate the temporal traffic pattern of an area and to correlate the traffic flow at two connecting routes. The final output is high-level context information that can be used to perform
other calculations: for example, based on the traffic pattern (congestion level), a travel plan can be made using vehicle speed and distance to the destination, and traffic signal timings can be controlled to smooth the traffic flow.
• Decision Logic: Whenever a threshold condition defined by the decision logic (the rule engine applies probabilistic logic or fuzzy logic) is reached, an alert message is sent to the nearby traffic control points on the route to divert the incoming traffic to other routes. Similarly, when the traffic density falls below the threshold, traffic can again be allowed to move through the same route.
• Smart Decision & Control Enabler: The generated data stream is then stored in the cloud, which acts as a platform for everything-as-a-service. The distributed archived data can be analyzed statistically to understand the trend of traffic under a range of conditions (normal, extreme) at different times of day. Based on this data, the behavior of traffic during periods of extreme congestion can be determined.

VII. CONCLUSION

With the deployment of several hundred sensors across smart cities, a huge amount of data is generated every second. The Big data so generated can be effectively used for providing a host of solutions to smart city dwellers. The paper discusses a few strategies to handle Big data for smart traffic in smart cities. However, for effective implementation of these strategies, a ubiquitous computational platform that facilitates seamless flow of information is necessary. The SWIFT architecture, developed by us in one of our earlier works, provides the necessary ICT infrastructure to handle Big data. In this paper, we discuss the application of the SWIFT architecture to facilitate the collection and processing of Big data related to all kinds of real-time traffic information through sensors, IoT devices, crowdsourced data, etc., to provide an adaptive solution for real-time traffic monitoring and control, traffic trend prediction, etc. In addition, the framework supports decision-makers in controlling the traffic signaling cycle in order to reduce the waiting time at traffic signals and to provide better alternative route selection to travelers. Further, based on trends of traffic flow at different periods of the day, the platform can be utilized to facilitate the mining of Big data to estimate total emissions and noise level, and to initiate actions to divert the incoming traffic to less congested roads.

REFERENCES

[1] R. Dobbs, S. Smit, J. Remes, J. Manyika, C. Roxburgh, and A. Restrepo, "Urban world: Mapping the economic power of cities," McKinsey Global Institute Rep., Mar. 2011.
[2] N. Abbu, L. Bhagavatula, A. R. Ghorpade, N. Kolsepatil, E. Kumar, R. Parvathapuram, V. Saini, F. Schraffl, J. Dash, P. D. Dey, S. Kumar, U. P. Raghupathi, J. Shah, S. Balasubramania, A. Canan, and S. Sharma, "Urban green growth strategies for Indian cities," vol. 1, published by Local Governments for Sustainability, South Asia, 2015.
[3] Satyanarayana V. Nandury and Beneyaz A. Begum, "Smart WSN-based ubiquitous architecture for smart cities," in Proc. of IEEE Int. Conf. on Advances in Comp., Commun. & Informatics (ICACCI), Kochi, Kerala, 2015, pp. 2356-2363. doi: 10.1109/ICACCI.2015.7275972
[4] A. Caragliu, B. Del, and C. Nijkamp, "Smart cities in Europe," Serie Research Memoranda 0048, VU University Amsterdam, Faculty of Economics, Business Administration and Econometrics, 2009.
[5] "Global smart cities market - Industry analysis, size, share, growth, trends and forecast, 2013-2019," by Transparency Market Research, May 2014.
[6] SINTEF, "Big Data, for better or worse: 90% of world's data generated over last two years," ScienceDaily, May 2013. Available: https://www.sciencedaily.com/releases/2013/05/13052085217.htm
[7] E. A. Nuaimi, H. A. Neyadi, N. Mohamed, and J. Al-Jaroodi, "Applications of big data to smart cities," J. of Internet Services and Appl., vol. 6, no. 25, pp. 1-15, 2015.
[8] T. Zhang, R. Ramakrishnan, and M. Livny, "BIRCH: an efficient data clustering method for very large databases," in Proc. ACM SIGMOD Int. Conf. on Manage. of Data, 1996, pp. 103-114.
[9] D. Laney, "3D data management: Controlling data volume, velocity and variety," Gartner, Tech. Rep., February 2001.
[10] F. Ribeiro, F. Ferraz, Maria Silva, and G. Alexandre, "Big data solutions for urban environments: a systematic review," ALLDATA 2015, 1st Int. Conf. on Big Data, Small Data, Linked Data and Open Data, pp. 22-28.
[11] W. Fan and A. Bifet, "Mining big data: current status, and forecast to the future," ACM SIGKDD Explor. Newslett., 2013, vol. 14, no. 2, pp. 1-5.
[12] M. Khan, M. F. Uddin, and N. Gupta, "Seven Vs of Big Data understanding Big Data to extract value," in ASEE Zone 1, Conf. of the IEEE, 2014, pp. 1-5.
[13] F. Provost and T. Fawcett, "Data Science and its Relationship to Big Data and Data-Driven Decision Making," Big Data, vol. 1, no. 1, pp. 51-59, Mar. 2013.
[14] L. M. A. Bettencourt, "The Uses of Big Data in Cities," Big Data, vol. 2, no. 1, pp. 12-22, Mar. 2013.
[15] C. Perera, A. Zaslavsky, P. Christen, and D. Georgakopoulos, "Context aware computing for The Internet of Things: A Survey," IEEE Commun. Surveys Tuts., vol. 16, no. 1, pp. 414-454, Jan. 1, 2014.
[16] G. S. Mann and A. McCallum, "Generalized expectation criteria for semi-supervised learning with weakly labeled data," J. of Mach. Learning Res., vol. 11, pp. 955-984, Feb. 2010.
[17] A. Blum and T. Mitchell, "Combining labeled and unlabeled data with co-training," in Proc. 11th Ann. Conf. on Comput. Learning Theory, pp. 92-100, 1998.
[18] K. Nigam and R. Ghani, "Analyzing the effectiveness and applicability of co-training," in Proc. of CIKM, 2000.
[19] Z. H. Zhou and M. Li, "Tri-training: exploiting unlabeled data using three classifiers," Trans. Knowl. and Data Eng., vol. 17, no. 11, pp. 1529-1541, 2005.
[20] B. Schilit, N. Adams, and R. Want, "Context-Aware Computing Applications," 1st Int. Workshop on Mobile Comput. Syst. and Appl., 1994, pp. 85-90.
