Left to right: standard gradient descent, accelerated descent, accelerated descent with steering. Energy functional contains a quadratic term and entropy.
Here's our implementation on KL divergence (with Cornell logo) and heart
This algorithm can also easily be extended to 3D graphics