TFOCS: proximity operator of negative entropy

nv1 · June 7, 2013, 6:24pm

In TFOCS I need to compute the proximity operator of the negative entropy

$$ \sum_i [ x_i \log x_i + (1-x_i) \log (1-x_i) ]$$

An analytic solution seems impossible, so I guess one approach would be to linearly approximate the function (e.g., by Taylor expansion about x=1/2). Is there a better way?

mcg · June 7, 2013, 6:45pm

Why proximity? That’s smooth, though its gradient is not Lipschitz continuous.

nv1 · June 8, 2013, 4:31pm

Because I’d like to use it as first argument in tfocs_SCD.

mcg · June 8, 2013, 9:50pm

Then you will have to construct a projector an your own, which is going to be a lot more difficult than just computing the derivative.

Stephen_Becker · June 12, 2013, 11:54am

Interesting question. Hopefully I’ll get back to you with a good solution.

Stephen_Becker · June 12, 2013, 1:48pm

A partial solution. First, we are separable, thankfully, so wlog let x be a scalar. The prox of x \log x is known, and it can be done using the Lambert W function (e.g. see Pesquet and Combettes’ review article which has a nice table of prox functions). I tried to use the Lambert W function for x \log(x) + (1-x)\log(1-x) using but without success. However, I think we can evaluate the function implicitly following the techniques used for Lambert (e.g., see Lambert W fcn evaluation ). This is probably much better than a Taylor series.

If this works well, we can incorporate it into a standard TFOCS atom.