Understanding Neural Network Expressivity via Polyhedral Geometry
Presenter
April 26, 2023
Abstract
Neural networks with rectified linear unit (ReLU) activations are one of the standard models in modern machine learning. Despite their practical importance, fundamental theoretical questions about ReLU networks remain open to this day. For instance, what is the precise set of (piecewise linear) functions representable by ReLU networks of a given depth? Even the special case of determining how many layers are needed to compute a function as simple as max{0, x1, x2, x3, x4} remains unsolved. In this talk we will explore the background needed to understand this question and report on recent progress using polyhedral geometry, as well as a computer-aided approach based on mixed-integer programming.
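As an illustration of the kind of question at stake, the analogous two-variable function max{0, x1, x2} is known to be computable exactly by a shallow ReLU network. The following NumPy sketch (function names are illustrative, not taken from the talk) realizes it with one hidden layer of three ReLU neurons followed by a ReLU output neuron, using the identity relu(t) - relu(-t) = t:

```python
import numpy as np

def relu(z):
    """Rectified linear unit, applied elementwise."""
    return np.maximum(z, 0.0)

def max0(x1, x2):
    """Compute max{0, x1, x2} with two ReLU layers.

    Hidden layer (3 neurons): h1 = relu(x1), h2 = relu(-x1), h3 = relu(x2 - x1).
    The affine combination h1 - h2 + h3 equals x1 + max(x2 - x1, 0) = max(x1, x2),
    since relu(x1) - relu(-x1) recovers x1.
    Output neuron: relu(max(x1, x2)) = max{0, x1, x2}.
    """
    h1 = relu(x1)
    h2 = relu(-x1)
    h3 = relu(x2 - x1)
    return relu(h1 - h2 + h3)
```

Whether a comparably shallow construction exists for max{0, x1, x2, x3, x4}, or whether strictly more layers are required, is exactly the type of open question discussed above.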