Data fusion, communication complexity, and decentralized detection


Decentralized detection

This is similar to the classical detection and hypothesis testing problem, except that each of several sensors receives a different set of measurements and they are allowed to communicate only a few bits. These bits have to be chosen so that they are "most informative" summaries of the measurements of each sensor.

A 1993 survey with a unified treatment of the subject can be found in:

If the measurements at each sensor are conditionally independent (conditional on any hypothesis), there is a lot to say:

But without conditional independence, the problem becomes NP-complete:


Communication complexity

Agent 1 knows x, agent 2 knows y. They want to compute a function f(x,y). How much do they need to communicate? This formulation captures much of the essence of the data fusion problem:

Unfortunately, the problem is NP-complete, in general:

But for specific types of functions f, progress is possible. When x and y are continuous variables, calculating the communication complexity may involve tools from algebraic or differential geometry:

In another variant, agent 1 knows a convex function g(.), agent 2 knows a convex function h(.) and they want optimize f(.)+g(.) within some accuracy epsilon: