The standard ImageNet benchmark has 1,000 categories. A typical network maps each image to a feature vector. Take 512 dimensions, as in ResNet-18.

Each category is not a single point in that space. It is a low-dimensional subspace. Golden retrievers vary along perhaps 5 directions, among them angle, lighting, and fur shade. Sports cars vary along perhaps 8, among them color, angle, model, and background. Each class occupies a thin slab.
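
A toy sketch of what "thin slab" means, in NumPy. Everything here is an assumption made for illustration: the features are synthetic Gaussian draws, not real network outputs, and `sample_class_features` and `effective_dim` are hypothetical helper names. Each class is sampled inside a random 5- or 8-dimensional subspace of a 512-dimensional space, and the singular values confirm how few directions the points actually use.

```python
import numpy as np

def sample_class_features(n_samples, ambient_dim, intrinsic_dim, rng):
    """Draw synthetic features that lie in a random low-dimensional subspace."""
    # QR of a Gaussian matrix gives an orthonormal basis for a random subspace.
    basis, _ = np.linalg.qr(rng.standard_normal((ambient_dim, intrinsic_dim)))
    # Random coordinates along the class's few directions of variation.
    coeffs = rng.standard_normal((n_samples, intrinsic_dim))
    return coeffs @ basis.T  # shape: (n_samples, ambient_dim)

def effective_dim(features, tol=1e-8):
    """Count singular values that are non-negligible relative to the largest."""
    s = np.linalg.svd(features, compute_uv=False)
    return int(np.sum(s > tol * s[0]))

rng = np.random.default_rng(0)
retrievers = sample_class_features(200, 512, 5, rng)   # 5 directions (toy)
sports_cars = sample_class_features(200, 512, 8, rng)  # 8 directions (toy)

print(retrievers.shape, effective_dim(retrievers))    # (200, 512) 5
print(sports_cars.shape, effective_dim(sports_cars))  # (200, 512) 8
```

Real features are noisier than this. The singular values decay gradually instead of dropping to zero, so the intrinsic dimension is an estimate, not an exact count.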

The rate reduction objective must arrange 1000 subspaces of different shapes and dimensions in 512-dimensional space so each is internally compact and all are mutually spread apart.
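
A minimal sketch of such an objective, assuming the post means the maximal coding rate reduction (MCR²) formulation: the coding rate of all features together, minus the size-weighted coding rates of each class on its own. The distortion `eps=0.5`, the toy sizes, and the helper names are illustrative assumptions, not values from the post.

```python
import numpy as np

def coding_rate(Z, eps=0.5):
    """R(Z) = 1/2 * logdet(I + d / (n * eps^2) * Z^T Z), with Z of shape (n, d)."""
    n, d = Z.shape
    _, logdet = np.linalg.slogdet(np.eye(d) + (d / (n * eps**2)) * (Z.T @ Z))
    return 0.5 * logdet

def rate_reduction(Z, labels, eps=0.5):
    """Rate of the whole set minus the size-weighted rates of each class."""
    n = Z.shape[0]
    per_class = sum(
        (np.sum(labels == c) / n) * coding_rate(Z[labels == c], eps)
        for c in np.unique(labels)
    )
    return coding_rate(Z, eps) - per_class

def unit_rows(X):
    # This sketch normalizes features to the unit sphere so scale cannot inflate the rate.
    return X / np.linalg.norm(X, axis=1, keepdims=True)

rng = np.random.default_rng(0)
d, k, n = 32, 3, 100  # toy sizes: ambient dim, subspace dim, samples per class
basis, _ = np.linalg.qr(rng.standard_normal((d, 2 * k)))
labels = np.repeat([0, 1], n)

class_a = rng.standard_normal((n, k)) @ basis[:, :k].T
b_spread = rng.standard_normal((n, k)) @ basis[:, k:].T   # orthogonal subspace
b_overlap = rng.standard_normal((n, k)) @ basis[:, :k].T  # same subspace as class_a

spread = rate_reduction(unit_rows(np.vstack([class_a, b_spread])), labels)
overlap = rate_reduction(unit_rows(np.vstack([class_a, b_overlap])), labels)
print(f"separate subspaces: {spread:.3f}   shared subspace: {overlap:.3f}")
```

The score is larger when the two classes sit in separate subspaces than when they share one. Maximizing it rewards exactly the arrangement described above: each class compact, all classes spread apart.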

This is not a problem intuition can solve. It is the kind of problem an objective function is built for. Arranging two clusters is straightforward. Arranging a thousand requires principled optimization.

Intelligence Is Compression, Part 4: The Information Game
Apr 30 at 11:17 AM