Parallel Software for Million-scale Exact Kernel Regression

Black line drawing of a stack of papers on a white circle inside a dark green square.

Citation

Chen, Yu, Lucca Skon, James R. McCombs, Zhenming Liu, and Andreas Stathopoulos. "Parallel Software for Million-scale Exact Kernel Regression." ICS '23: Proceedings of the 37th International Conference on Supercomputing June 2023: 313–323.

Description

We present the design and the implementation of a kernel principal component regression software that handles training datasets with a million or more observations. Kernel regressions are nonlinear and interpretable models that have wide downstream applications, and are shown to have a close connection to deep learning. Nevertheless, the exact regression of large-scale kernel models using currently available software has been notoriously difficult because it is both compute and memory intensive and it requires extensive tuning of hyperparameters.

URL

Date

Jun 2023

Type

Journal Article