Fingerprints by Polynomials
Good for fingerprinting “composable” data objects.
- check if $P(x)Q(x)=R(x)$
- $P$ and $Q$ of degree $n$ (means $R$ of degree at most $2n$)
- mult in $O(n\log n)$ using FFT
- evaluation at fixed point in $O(n)$ time
- Random test:
- $S \subseteq F$
- pick random $r \in S$
- evaluate $P(r)Q(r)-R(r)$
- suppose this poly not 0
- then degree $2n$, so at most $2n$ roots
- thus, prob (we pick a root, fooling the test) $\le 2n/|S|$
- so, e.g., picking a random int in $[0,4n]$ gives error prob. $\le 1/2$
- Note: no prime needed (but needed for $Z_p$ sometimes)
- Again, major benefit if polynomial implicitly specified.
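The random test above can be sketched in a few lines (a sketch only; `poly_eval` and `check_product` are names I chose, and coefficients are stored lowest-degree first):

```python
import random

def poly_eval(coeffs, x):
    """Horner's rule; coeffs[i] is the coefficient of x^i."""
    acc = 0
    for c in reversed(coeffs):
        acc = acc * x + c
    return acc

def check_product(P, Q, R, trials=1):
    """Randomized check that P*Q == R for degree-n polynomials P, Q.
    If P*Q != R, a single trial is fooled with prob <= 2n/(4n+1) < 1/2."""
    n = max(len(P) - 1, len(Q) - 1)
    for _ in range(trials):
        r = random.randint(0, 4 * n)
        if poly_eval(P, r) * poly_eval(Q, r) != poly_eval(R, r):
            return False   # caught a disagreement: definitely P*Q != R
    return True            # probably P*Q == R
```

Repeating with independent choices of $r$ drives the error probability down geometrically.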
Small problem:
- degree $n$ poly can generate huge values from small inputs.
- Solution 1:
- If poly is over $Z_p$, can do all math mod $p$
- Need $p$ exceeding coefficients, degree
- doesn't change number of roots
- $p$ need not be random---pick once in advance
- Solution 2:
- Work in $Z$, deduce nonzero evaluation because few roots
- e.g. evaluate at a random $0 \le x \le n^2$: nonzero with high probability
- deduce nonzero mod random $q$ (as in string matching)
- so do all computation mod $q$
- random prime $q$ fools us only if $q$ divides the (nonzero) evaluated value
- a value $V$ has at most $\log V$ prime factors
- if max coefficient is $a$ then value is at most $an(n^2)^n$
- so drawing $q$ from primes of $O(\log a + n \log n)$ bits suffices.
- Again, major benefit if polynomial implicitly specified.
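Solution 2 can be sketched as follows (assumptions: the sieve bound of $10^5$ is an arbitrary choice, and all function names are mine):

```python
import random

def primes_up_to(bound):
    """Sieve of Eratosthenes."""
    sieve = [True] * (bound + 1)
    sieve[0] = sieve[1] = False
    for i in range(2, int(bound ** 0.5) + 1):
        if sieve[i]:
            for j in range(i * i, bound + 1, i):
                sieve[j] = False
    return [i for i, ok in enumerate(sieve) if ok]

def eval_mod(coeffs, x, q):
    """Horner's rule with all arithmetic mod q, so values stay small."""
    acc = 0
    for c in reversed(coeffs):
        acc = (acc * x + c) % q
    return acc

def check_product_mod(P, Q, R, prime_bound=10**5, trials=10):
    """Test P*Q == R using a random evaluation point AND a random prime
    modulus: a nonzero evaluated value V has at most log2(V) prime
    factors, so a random prime below prime_bound rarely divides it."""
    n = max(len(P) - 1, len(Q) - 1)
    primes = primes_up_to(prime_bound)
    for _ in range(trials):
        x = random.randint(0, n * n)
        q = random.choice(primes)
        lhs = eval_mod(P, x, q) * eval_mod(Q, x, q) % q
        if lhs != eval_mod(R, x, q):
            return False
    return True
```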
String checksum:
- treat as degree $n$ polynomial
- eval at a random $O(\log n)$-bit input,
- prob. of getting 0 is small
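A sketch of such a checksum (the Mersenne prime $2^{61}-1$ is my arbitrary choice of fixed modulus, and `checksum` is a name I chose):

```python
import random

p = 2**61 - 1   # fixed prime modulus (arbitrary choice for this sketch)

def checksum(s, r):
    """Treat the string's characters as polynomial coefficients and
    evaluate at r mod p, by Horner's rule."""
    acc = 0
    for ch in s:
        acc = (acc * r + ord(ch)) % p
    return acc

# both parties share one random evaluation point
r = random.randrange(1, p)
```

Two distinct length-$n$ strings collide only if $r$ hits a root of their (degree $<n$) difference polynomial, so the collision prob. is at most $(n-1)/(p-1)$.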
Multivariate:
- $n$ variables
- degree of term: sum of vars' degrees
- total degree $d$: max degree of term.
- Schwartz-Zippel: fix $S \subseteq F$ and let each $r_i$ be random in $S$ \[ \Pr[Q(r_1,\ldots,r_n)=0 \mid Q \ne 0] \le d/|S| \] Note: no dependence on number of vars!
- induction on number of variables. Base case (univariate): degree-$d$ poly has at most $d$ roots, done.
- $Q \ne 0$. So pick some (say) $x_1$ that affects $Q$
- write $Q=\sum_{i \le k} x_1^iQ_i(x_2,\ldots,x_n)$ with $Q_k() \ne 0$ by choice of $k$
- $Q_k$ has total degree at most $d-k$
- By induction, prob $Q_k$ evals to 0 is at most $(d-k)/|S|$
- suppose it didn't, i.e. $Q_k(r_2,\ldots,r_n) \ne 0$. Then $q(x_1)=\sum_i x_1^i Q_i(r_2,\ldots,r_n)$ is a nonzero univariate poly of degree $k$.
- by base case, prob. it evals to 0 is at most $k/|S|$
- add: get $d/|S|$
- why can we add? $$ \begin{eqnarray*} \Pr[E_1] &= &\Pr[E_1 \cap \overline{E_2}]+\Pr[E_1 \cap E_2]\\ &\le & \Pr[E_1 \mid \overline{E_2}] + \Pr[E_2] \end{eqnarray*} $$
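The lemma is easy to exercise numerically; a sketch (the prime $10^9+7$, the trial count, and the name `sz_test` are my choices):

```python
import random

def sz_test(f, g, nvars, p=10**9 + 7, trials=20):
    """Schwartz-Zippel identity test: do f and g agree as polynomials?
    If not, a single trial is fooled with prob <= d/p, where d is the
    total degree of f - g."""
    for _ in range(trials):
        r = [random.randrange(p) for _ in range(nvars)]
        if f(*r) % p != g(*r) % p:
            return False   # definitely different polynomials
    return True            # probably identical
```

For example, `sz_test` accepts the identity $(x+y)(x-y)=x^2-y^2$ and rejects the false claim $(x+y)^2=x^2+y^2$.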
Perfect matching
- Define: a perfect matching pairs each $u_i$ with a distinct $v_j$ along edges
- by max-flow techniques, can find one in $O(m\sqrt{n}) \le O(n^{2.5})$ time
- Edmonds matrix: variable $x_{ij}$ if edge $(u_i,v_j)$
- determinant nonzero iff PM exists
- each permutation contributes a distinct monomial, so terms don't cancel: poly nonzero symbolically.
- so apply Schwartz-Zippel
- Degree is $n$
- So random values $r_{ij}\in\{1,\ldots,n^2\}$ yield 0 with prob. at most $1/n$
- Determinant can be computed with $n^{2.376...}$ ops.
Wait, det may be huge!
- Schwartz-Zippel applies in any field
- So work mod a prime $p$, with $p>n^2$ to keep failure prob. small
- Note all coefficients of determinant polynomial are $\pm 1$
- So determinant polynomial is still (symbolically) nonzero mod $p$
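Putting the pieces together, a sketch of the randomized perfect-matching test (helper names are mine; trial-division primality testing is fine at this scale):

```python
import random

def next_prime_above(m):
    """Smallest prime > m, by trial division (fine for small m)."""
    def is_prime(k):
        return k >= 2 and all(k % d for d in range(2, int(k**0.5) + 1))
    k = m + 1
    while not is_prime(k):
        k += 1
    return k

def det_mod(A, p):
    """Determinant of A mod prime p, by Gaussian elimination."""
    n = len(A)
    A = [row[:] for row in A]
    det = 1
    for col in range(n):
        piv = next((r for r in range(col, n) if A[r][col]), None)
        if piv is None:
            return 0
        if piv != col:
            A[col], A[piv] = A[piv], A[col]
            det = -det
        det = det * A[col][col] % p
        inv = pow(A[col][col], p - 2, p)   # Fermat inverse
        for r in range(col + 1, n):
            f = A[r][col] * inv % p
            for c in range(col, n):
                A[r][c] = (A[r][c] - f * A[col][c]) % p
    return det % p

def has_perfect_matching(adj, n, trials=3):
    """adj: set of pairs (i, j) meaning edge (u_i, v_j), 0-indexed.
    Fill the Edmonds matrix with random values mod a prime p > n^2;
    a nonzero determinant certifies a PM, and if a PM exists we miss
    it with prob <= (1/n)^trials."""
    p = next_prime_above(max(n * n, 2))
    for _ in range(trials):
        A = [[random.randrange(1, p) if (i, j) in adj else 0
              for j in range(n)] for i in range(n)]
        if det_mod(A, p) != 0:
            return True
    return False
```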
Finding the Matching
We now have a way to decide if a matching exists.
- But can we find it?
Self reducibility
- General technique for solving construction/optimization via decision
- Use repeated calls to decision to “steer towards” output solution
For perfect matching
- Remove an edge
- Check if still have perfect matching.
- If not, put it back
- When done, all that's left is perfect matching.
- time is $m$ determinants
- not great time, but really simple algorithm!
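The peeling loop, written against an abstract decision oracle (names are mine; the brute-force oracle below stands in for the determinant test):

```python
from itertools import permutations

def find_perfect_matching(edges, has_pm):
    """Self-reducibility: has_pm(edge_set) decides PM existence.
    Remove each edge if a PM survives without it; what remains is a PM."""
    edges = set(edges)
    assert has_pm(edges), "no perfect matching to find"
    for e in sorted(edges):          # any order works
        if has_pm(edges - {e}):
            edges.remove(e)
    return edges

def brute_has_pm(edges, n):
    """Stand-in decision oracle: try all n! assignments."""
    return any(all((i, perm[i]) in edges for i in range(n))
               for perm in permutations(range(n)))
```

With the randomized determinant test as the oracle, this is exactly the $m$ determinant computations mentioned above.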
Network Coding
Motivation
- In traditional network protocols a node receives a message and forwards it to a “next hop”
- can model this with a graph where each edge can carry one message per time step
- disjoint paths can carry distinct messages at the same time
- achievable $s$-$t$ throughput is just number of disjoint $s$-$t$ paths, i.e. $s$-$t$ max-flow
- which we know equals $s$-$t$ min-cut
- different length paths might have different latency
- but can start a new message each time step, so doesn't affect throughput
Problems
- have to compute max-flow
- graph might change, forcing recomputation
Multicast
- what if multiple recipients want same messages?
- e.g. streaming video (Netflix)
- do we need to use twice as many paths to send to two recipients?
- seems wasteful. could perhaps send message out on a “multicast tree” which uses same path for most of the way?
- the dream: maybe if each recipient has enough throughput separately, can transmit to all of them at the same time?
- for broadcast (every node a recipient), Edmonds proved this is possible (disjoint trees)
- for multicast (only some nodes want it), counterexample: the butterfly
Network coding
- Ahlswede et al.
- realized routers are not dumb---they can compute over the incoming packets!
- each outgoing edge can carry some function of incoming packets
- solves butterfly problem
- solves multicast: can do multicast using network coding if and only if each sink has enough capacity to receive separately
- no polytime construction
- Koetter and Medard showed that for multicast linear network coding suffices
- each outgoing edge sends some linear combination of incoming messages
- by induction, each sink receives some linear combination of original messages
- if determinant of matrix is nonzero, can invert to recover!
- proved that if each sink is separately feasible, can construct multicast solution with linear network codes
- polynomial time, but complicated! Hilbert's Nullstellensatz, Groebner bases
Special case to consider:
- $2n$-vertex bipartite graph with source and sink vertices
- source $s$ has $n$ messages to send to sink $t$
- each edge can carry one message in one time step
- one solution: send along perfect matching
- requires computing perfect matching
Coding solution
- think of messages as integers mod $p$
- $s$ sends one message to each LHS
- each LHS sends its message to all RHS neighbors
- each RHS neighbor sends $t$ a random linear combination of what it got, mod $p$
- can $t$ “decode” the message?
- yes (assuming it knows the random coefficients).
- What is the linear operator (called “transfer matrix”) mapping $n$ inputs (from $s$) to $n$ outputs (at $t$)?
- it's the Edmonds matrix of the bipartite graph, with random coefficients in it
- we proved this has nonzero determinant whp
- which means it is invertible
- so $t$ can invert the linear function to reconstruct input.
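A sketch of one round on the bipartite core ($p$ and the helper names are my choices; $t$ is assumed to know the random coefficients):

```python
import random

p = 1_000_003   # prime modulus for messages (arbitrary choice)

def solve_mod(M, y):
    """Solve M x = y mod p by Gauss-Jordan; None if M is singular."""
    n = len(M)
    A = [row[:] + [y[i]] for i, row in enumerate(M)]
    for col in range(n):
        piv = next((r for r in range(col, n) if A[r][col]), None)
        if piv is None:
            return None
        A[col], A[piv] = A[piv], A[col]
        inv = pow(A[col][col], p - 2, p)   # Fermat inverse
        A[col] = [v * inv % p for v in A[col]]
        for r in range(n):
            if r != col and A[r][col]:
                f = A[r][col]
                A[r] = [(A[r][c] - f * A[col][c]) % p for c in range(n + 1)]
    return [A[i][n] for i in range(n)]

def one_round(adj, n, messages):
    """adj: (i, j) in adj iff LHS node i reaches RHS node j.
    RHS node j sends t a random linear combination of what it heard,
    so the transfer matrix is the Edmonds matrix with random entries."""
    M = [[random.randrange(1, p) if (i, j) in adj else 0
          for i in range(n)] for j in range(n)]
    received = [sum(M[j][i] * messages[i] for i in range(n)) % p
                for j in range(n)]
    return solve_mod(M, received)   # t inverts the transfer matrix
```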
How can source know transfer matrix?
- for first $n$ rounds, have $s$ send $n$ distinct unit vectors
- lets $t$ read off columns of transfer matrix
- then use it for all future rounds
- more sophisticated schemes can adapt over time
- e.g., imagine that each “super-round” message contains many ($\gg n$) numbers
- then you can make the first $n$ rounds of each message encode the $n$ unit vectors without wasting a lot of space
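A sketch of the first-$n$-rounds trick, modeling the (fixed-coefficient) network as a black-box linear map (all names and the toy matrix are mine):

```python
p = 101   # small prime modulus, arbitrary choice for the sketch

def learn_transfer_matrix(channel, n):
    """Round k: send the k-th unit vector; the output is column k of M."""
    cols = [channel([1 if i == k else 0 for i in range(n)])
            for k in range(n)]
    # reassemble the columns into a row-major matrix
    return [[cols[k][j] for k in range(n)] for j in range(n)]

# a toy fixed channel y = Mx mod p standing in for the network
M = [[3, 5], [2, 7]]
channel = lambda x: [sum(M[j][i] * x[i] for i in range(2)) % p
                     for j in range(2)]
```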
Multiple receivers
- same bipartite core
- $s$ has $k< n$ messages
- multiple $t_i$ each connected to some $k$ RHS vertices
- can send to any one $t_i$ over the perfect matching to that $t_i$
- but the coding argument above says random linear combinations also reach $t_i$ (via a $k\times k$ transfer matrix)
- but this is true for all receivers simultaneously
- since each can receive, all can receive.
This is the key idea of random linear network coding. It generalizes to arbitrary communication graphs.
Delay
- we focused on one “round” of messages for simplicity
- if nodes want to transmit new messages in each time step, and paths have different lengths, then messages at different times can get combined with each other
- this can be handled
- intuitively, you learn about early rounds and subtract them from later rounds