Computing Functions Over Wireless Networks

June 26, 2009 , P. R.
Kumar
This work is licensed under a Creative Commons Attribution-Noncommercial-No Derivative Works 3.0 Unported License. Based on a work at decision.csl.illinois.edu See last page and http://creativecommons.org/licenses/by-nc-nd/3.0/
Computing functions over wireless networks
P. R. Kumar
Dept. of Electrical and Computer Engineering, and Coordinated Science Lab University of Illinois, Urbana-Champaign Email: prkumar@illinois.edu Web: http://decision.csl.illinois.edu/~prkumar
1/37
June 26, 2009 , P. R. Kumar
How to process information in the network in sensor networks? (Or how to do data fusion over a sensor network? Or how to compute a function of data over a sensor network? Or how to perform in-network information processing in a sensor network?)
2/37
Outline

Difference between sensor networks and data networks Model of problem: Protocol model, Non-information theoretic model Sample of results: Average vs. Max More details of results Some information theoretic results for sensor networks References
4 5 9 12 27 34
3/37
Sensor networks
Examples of Tasks
Environment monitoring
Determine the Average temperature: (x1 + x2 + + xn)/n
Alarm networks
Determine the Max temperature: Max xi
Sensor networks are not just data networks with sensor measurements replacing les
They are application specic Nodes need not just relay packets
They can discard, combine, process packets Combination of computing and communication
More generally: Consider a symmetric function F(x1, x2, , xn)

E.g., Average, Mode, Median, Percentile, Max Determined by Histogram or Types
How should information be processed in the network to compute such functions?

This can also be regarded as network coding for sensor networks
4/37
Model of problem: Protocol model, Non-information theoretic (Giridhar & K 05)
5/37
Model of problem
Multi-hop model for wireless communication (Giridhar & K 05)

Collocated network
All nodes within range of all
Multi-hop random network (Penrose 1997, Gupta & K 98)

Critical common range for connectivity of random graph
2 log n ) n At time t, sensor i takes a measurement xi(t) {1, 2,,D}

where
cn +. (Take r(n) =
Fusion node needs to calculate F(x1,, xn) exactly Non-information theoretic formulation
6/37
Model of problem
Protocol Model for wireless communication

Receiver should be outside other transmitters interference footprints
r1 (1+) r1 r2 (1+)r2
Two types of networks Collocated network

Large range so all nodes can hear each other Random network
Communicate at rate W bits/sec (Take
W =1 bit/sec wlog)
At time t, sensor i takes a measurement xi(t) {1, 2,,D}

No probability distribution on xi(t) s
n nodes randomly distributed Need range at no less than
Fusion node needs to calculate F(x1,, xn) exactly

for network to be connected
Non-information theoretic formulation
7/37
Denition of Computational Rate Rmax(n)
Block coding allowed

N measurements of node 1: x1 N measurements of node 2 : x2
N measurements of node n : xn
If all N functions computed in time T Then Computational Rate Best Rate over all Strategies S and block lengths N:
(Giridhar & K 05)
8/37
Sample of results: Average vs. Max
9/37
The Average versus Max
Theorem (Giridhar & K 05): The rate at which the Average can be harvested is Strategy
Tessellate Fuse locally Compute along a rooted tree of cells
Theorem (Giridhar & K 05): The rate at which the Max can be harvested is
1 log log n Strategy: Take advantage of Block Coding
First node announces times of max values: ( Second node announces times of additional max values: ( 1 Third node announces of yet more max values: (
1 1 1
) ) )
10/37
Summary: Order of difculty of computations

(1/n)
Collocated network: Average, Mode, Type Data downloading
(1/log n)
Random Multi-hop network: Average, Mode, Type
Collocated network: Max
(1/loglog n)
Random Multi-hop network Max
(Giridhar & K 05)
11/37
More details of results (Giridhar & K 05)
12/37
Results: A classication of functions
Divisible functions
Amenable to divide and conquer if deg(Gn) = O(log |(Fn)|)
Symmetric functions
Data centric paradigm: Identity of node is not important, only its value
Type-sensitive functions
Hard to compute in collocated case, and random case
Type-threshold functions
collocated case, and random case
(Giridhar & K 05) 13/37
Examples (Giridhar & K 05)
Data download problem: Fn(x1,, xn) = (x1,, xn):

In collocated or random networks:
Histogram of frequencies or Type: Fn(x1,, xn) = (z1, z2,, zD)

Collocated case: Random networks:
Special case: Any symmetric function
Mean, Mode, Median, Majority:

Collocated case: Random case:
Max, Min, Range, Occurrence of a value:

Collocated case: Random case:
14/37
Denition of Rate Rmax(n)
Block coding allowed

N measurements of node 1: x1 N measurements of node 2 : x2
N measurements of node n : xn
Compute all N functions If computed in time T Then Rate Best Rate over all Strategies S and block lengths N: Bound on Rmax:
(Giridhar & K 05) 15/37
Divisible functions
Divisible functions:
There exists FS(xi: i S) for every subset S {1,2,,n} With |(FS)| |(Fn)|
FS2 FS1
FS3 FS5 FS4
FS
FS6
for partition {S1,, Sm} of S
Theorem: Special cases
if deg(Gn) = O(log |(Fn)|)
(Giridhar & K 05)
Data Downloading: deg(Gn) O(log |(Fn)|) = O(log Dn) = O(n) So Histogram: So Hence for Random networks for Random networks (Giridhar & K 05) 16/37
Proof of Divisible Functions

for
Tessellate into square cells of area r2/2 Neighboring occupied cells can communicate with each other Form a tree rooted at fusion center out of occupied connected cells Choose a relay node in each occupied cell and a parent in the next cell towards the root Locally compute and pass on along tree to root
Collect data from deg(Gn) nodes within cell Collect functional value of log|(Fn)| bits from bounded number of child cells Pass on functional value of log|(Fn)| bits to parent cell
All operations can be performed in So Constructive strategy
time
17/37
Symmetric functions
Symmetric functions depend only on type

where
Type-sensitive functions
There is a 0 < c < 1 such that a fraction c of values is never enough to pin down the value of the function Fn Examples: Mean, Median, Mode, Majority
Only want to know whether each zi exceeds a threshold zi* There is a threshold vector such that Examples: Max, Min, Range, Occurrence of value
(Giridhar & K 05) 18/37
Collision-free strategies in collocated case

Every node knows when to transmit based on what it hears on channel The content of the packet it transmits depends on what it heard, as well as its own information
Node g1 transmits packet P1(xg1) Node g2(P1(xg1)) transmits packet P2(P1(xg1), xg2) Node g3(P1(xg1), P2(P1(xg1), xg2)) transmits packet P3(P1(xg1), P2 (xg2), xg3) Note: We are not allowing information transmission to occur through collisions
(Giridhar & K 05) 19/37
for Type-sensitive functions in collocated case

Wlog suppose D=2 Initially, xg1 is in the set S0g1 with cardinality |S0g1| = 2N After rst transmission, xg1 can be in one of two sets depending on whether it transmits 0 or 1
Let the transmission correspond to the larger set, call it be S1g1 |S1g1| 1/2 |S0g1|
After t-th transmission of node k, let xk lie in Stk with | Stk | 1/2 | St-1k | So at the end, uncertainty set is: Thus at least nN-T places in the nN values (x1, x2 ,, xn) are undetermined However to compute Fn (x(1),x(2),,x(N)), at least cnN values are needed So nN-T (1-c)nN So T cnN Hence Thus for collocated case
(Giridhar & K 05) 20/37
for Type-threshold functions in collocated case

Consider Max function (argument can be generalized) Threshold vector = (1,1,,1) Lower Bound
Take block length Node 1 transmits its locations of the N1 1s in Node 2 transmits the N2 new 1s in its list Node 3 transmits the N3 new 1s in its list
To describe Ni takes log N bits To describe the locations of Ni 1s requires So Maximized when Ni = N/n. Use Thus:
(Giridhar & K 05) 21/37
Upper bound in collocated case .
Take N > 2n. Consider

Exactly N/2n 1s in x1 Exactly N/2n 1s in x2
Exactly N/2n 1s in xn
At most one 1 ... most one 1 At
Claim: Each such x produces a unique set of transmissions P1,P2,,PT

Suppose not. Then there are two: x and y which produce same transmissions They differ in some xk yk Then also produces same transmissions since node k hears the same under xk or yk and so reacts the same But this has different Max values from x Thus Max functions are not determined from transmissions
(Giridhar & K 05) 22/37
Finishing the proof for the Max function
Number of such vectors x =
So 2T >
So T > N log(n-1)
So
Hence This proves
(Giridhar & K 05) 23/37
Generalizing to any Type-threshold function for collocated case
Feasibility of
Node i sends only list of 1s for values which threshold has not been attained
Upper bound of
There exist a threshold vector such that Now consider an x which has
z1-1 vectors of the form (1,1,,1) z2 vectors of the form (2,2,,2) z3 vectors of the form (2,2,,2) (Giridhar & K 05) 24/37
Remaining have 1s or 2s only Now problem is reduced to a Max
Random networks: Type sensitive networks
Theorem (Giridhar & K 05)

Type-sensitive functions:
Proof
Tessellate unit area domain into squares of area A = (r)2/2 Transmissions are local within square Assume Genie communicates all messages instantaneously to all nodes We know at least cnN transmissions are needed At least one square has greater than cnNA receptions However only one node can receive at a time in a square So T cnNA = cnN(r)2/2 But So T cN log n for connectivity
25/37

Theorem (Giridhar & K 05)

Type-threshold functions:
Proof
Consider Max for simplicity Tessellate unit area domain into squares of area A = (r)2/2 Some square has greater than nA nodes Suppose all nodes outside this square have value 0 Then we need to compute Max of nA nodes We need cN log (nA) transmissions Only one node can receive at any given time So T cN log (nA) But is needed for connectivity So So T N (log log n) So Achievability can be proved by using tree gathering
26/37
Some information theoretic results for sensor networks
27/37
Complexities of function computation over wireless networks
Slepian-Wolf Theorem (73)

Total information fusion over wires from correlated sources
R1=H(X)
Several other complexities in sensor networks Wireless nodes

There are no independent links: Sources share channel Multiple access problem
R2=H(Y|X)
Also, sensors can communicate with each other and thus cooperate Source-channel separation does not hold Only a function needs to be computed, not all information
Little is known in a pure information theoretic setting

28/37
Multiple relay network (Xie and K 05)
Consider a network p(y2 (t), y3 (t),..., yn (t) | x1 (t), x2 (t),..., xn 1 (t))
A feasible rate
Generalization of Cover and El Gamal 79
29/37
Multiple access channel (Ahlswede 71, Liao 72)
Consider the multiple access channel
The capacity region
for any p(x1)p(x2)

30/37
A multiple source multiple relay network (Xie and K 07)
Consider the network
A feasible rate vector is
31/37
A feasible rate for a sensor network with a sink

Consider the network Feasible rate region for acyclic choice of routes
Maximize over Based on combination of backward decoding for relay channel (Xie and K 07) 32/37 and multiple access channel
Exact capacity region for some Gaussian sensor networks with phase fading
Kramer, Gastpar and Gupta 03 have determined exact capacity region for relay channel with phase fading for some geometries
The capacity region is
Theorem (Xie and Kumar 07)

Phase fading unknown to transmitter Node 5 far away from other nodes
33/37
References-1

M.D. Penrose, The longest edge of the random minimal spanning tree, The Annals of Probability, vol. 7, no. 2, pp. 340-361, 1997. Piyush Gupta and P. R. Kumar, Critical Power for Asymptotic Connectivity in Wireless Networks, in Stochastic Analysis, Control, Optimization and Applications: A Volume in Honor of W. H. Fleming. Edited by W. M. McEneany, G. Yin, and Q. Zhang, Birkhauser, Boston, MA, pp. 547566, 1998. ISBN 0-8176-4078-9 Arvind Giridhar and P. R. Kumar, Computing and Communicating Functions Over Sensor Networks, IEEE Journal on Selected Areas in Communications, pp. 755764, vol. 23, no. 4, April 2005. Arvind Giridhar and P. R. Kumar, Towards a Theory of In-Network Computation in Wireless Sensor Networks IEEE Communications Magazine, vol. 44, no. 4, pp. 98107, April 2006. Arvind Giridhar and P. R. Kumar, In-Network Information Processing in Wireless Sensor Networks. In Wireless Sensor Networks: Signal Processing and Communications Perspectives, A. Swami, Q. Zhao, Y.W. Hong and L. Tong, Editors. pp. 4367, John Wiley, Chichester, England, 2007.
34/37
References-2
Slepian D and Wolf J 1973 Noiseless coding of correlated information sources. IEEE Transactions on Information Theory, 19(4), 471480. T. Cover and A. El Gamal, Capacity theorems for the relay channel, IEEE Trans. Inform. Theory, vol. 25, pp. 572-584, 1979. R. Ahlswede, Multi-way communication channels, in Proceedings of the 2nd Int. Symp. Inform. Theory (Tsahkadsor, Armenian S.S.R.), (Prague), pp. 23-52, Publishing House of the Hungarian Academy of Sciences, 1971. H. Liao, Multiple access channels. PhD thesis, Department of Electrical Engineering, University of Hawaii, Honolulu, HA, 1972. G. Kramer, M. Gastpar, and P. Gupta, Cooperative strategies and capacity theorems for relay networks, IEEE Trans. Inform. Theory, vol. 51, pp. 3037-3063, September 2005.
35/37
References-3
Liang-Liang Xie and P. R. Kumar, An Achievable Rate for the MultipleLevel Relay Channel, IEEE Transactions on Information Theory, vol. 51, no. 4, pp. 13481358, April 2005. Liang-Liang Xie and P. R. Kumar, Multisource, multidestination, multirelay wireless networks, IEEE Transactions on Information Theory, Special issue on Models, Theory and Codes for Relaying and Cooperation in Communication Networks, vol. 53, no. 10, pp. 3586 3595, October 2007. Liang-Liang Xie and P. R. Kumar, Information-Theoretic Studies of Wireless Sensor Networks. In Handbook on Array Processing and Sensor Networks, Simon Haykin and K. J. Ray Liu, Editors. IEEEWiley, 2009.
36/37
http://decision.csl.illinois.edu/~prkumar/html_les/talks.html
37/37

Computing Functions Over Wireless Networks

Uploaded by

Document Information

Original Description:

Copyright

Available Formats

Share this document

Share or Embed Document

Sharing Options

Did you find this document useful?

Is this content inappropriate?

Copyright:

Available Formats

Computing Functions Over Wireless Networks

Uploaded by

Copyright:

Available Formats

June 26, 2009 , P. R.

Computing functions over wireless networks

June 26, 2009 , P. R. Kumar

June 26, 2009 , P. R. Kumar

June 26, 2009 , P. R. Kumar

More generally: Consider a symmetric function F(x1, x2, , xn)

How should information be processed in the network to compute such functions?

June 26, 2009 , P. R. Kumar

Model of problem: Protocol model, Non-information theoretic (Giridhar & K 05)

June 26, 2009 , P. R. Kumar

Multi-hop model for wireless communication (Giridhar & K 05)

Multi-hop random network (Penrose 1997, Gupta & K 98)

2 log n ) n At time t, sensor i takes a measurement xi(t) {1, 2,,D}

June 26, 2009 , P. R. Kumar

Protocol Model for wireless communication

Two types of networks Collocated network

Communicate at rate W bits/sec (Take

At time t, sensor i takes a measurement xi(t) {1, 2,,D}

n nodes randomly distributed Need range at no less than

Fusion node needs to calculate F(x1,, xn) exactly

Non-information theoretic formulation

June 26, 2009 , P. R. Kumar

Denition of Computational Rate Rmax(n)

Block coding allowed

June 26, 2009 , P. R. Kumar

Sample of results: Average vs. Max

June 26, 2009 , P. R. Kumar

The Average versus Max

June 26, 2009 , P. R. Kumar

Summary: Order of difculty of computations

Random Multi-hop network: Average, Mode, Type

Collocated network: Max

Random Multi-hop network Max

(Giridhar & K 05)

June 26, 2009 , P. R. Kumar

More details of results (Giridhar & K 05)

June 26, 2009 , P. R. Kumar

Results: A classication of functions

June 26, 2009 , P. R. Kumar

Examples (Giridhar & K 05)

Data download problem: Fn(x1,, xn) = (x1,, xn):

Histogram of frequencies or Type: Fn(x1,, xn) = (z1, z2,, zD)

Special case: Any symmetric function

Mean, Mode, Median, Majority:

Max, Min, Range, Occurrence of a value:

June 26, 2009 , P. R. Kumar

Denition of Rate Rmax(n)

Block coding allowed

June 26, 2009 , P. R. Kumar

FS3 FS5 FS4

for partition {S1,, Sm} of S

Theorem: Special cases

if deg(Gn) = O(log |(Fn)|)

(Giridhar & K 05)

June 26, 2009 , P. R. Kumar

Proof of Divisible Functions

All operations can be performed in So Constructive strategy

June 26, 2009 , P. R. Kumar

Symmetric functions depend only on type

June 26, 2009 , P. R. Kumar

Collision-free strategies in collocated case

(Giridhar & K 05) 19/37

June 26, 2009 , P. R. Kumar