Abstract
Convolutional neural networks, like most artificial neural networks, are frequently viewed as methods different in essence from kernel-based methods. In this work we translate several classical convolutional neural networks into kernel-based counterparts. Each kernel-based counterpart is a statistical model called a convolutional kernel network, with parameters that can be learned from data. We provide an alternating minimization algorithm with mini-batch sampling and implicit partial differentiation to learn the parameters of each convolutional kernel network from data. We also show how to obtain inexact derivatives with respect to the parameters using an algorithm based on two intertwined Newton iterations. The models and algorithms are illustrated on benchmark datasets in image classification. We find that the convolutional neural networks and their kernel counterparts often perform similarly. Supplemental appendices and code for the article are available online.
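To make the alternating scheme concrete, the following is a minimal sketch of alternating minimization with mini-batch sampling, assuming a toy differentiable feature map in place of a CKN layer; the function names, the least-squares classifier update, and all hyperparameters are illustrative assumptions, not the article's implementation.

```python
import numpy as np

def features(X, W):
    """Toy differentiable feature map standing in for a CKN layer."""
    return np.tanh(X @ W)

def alternating_minimization(X, y, W, n_epochs=10, batch_size=32, lr=1e-2, seed=0):
    """Alternate between a classifier update and mini-batch updates of W."""
    rng = np.random.default_rng(seed)
    n = X.shape[0]
    for _ in range(n_epochs):
        # Step 1: with W fixed, fit a linear classifier c by least squares.
        c, *_ = np.linalg.lstsq(features(X, W), y, rcond=None)
        # Step 2: with c fixed, take mini-batch gradient steps on W for the
        # squared-error loss 0.5 * mean((features(X, W) @ c - y) ** 2).
        for idx in np.array_split(rng.permutation(n), max(n // batch_size, 1)):
            Phi = features(X[idx], W)
            r = Phi @ c - y[idx]                           # residuals
            dPhi = r[:, None] * c[None, :] / len(idx)      # d(loss)/d(Phi)
            grad_W = X[idx].T @ (dPhi * (1.0 - Phi ** 2))  # chain rule: tanh'
            W = W - lr * grad_W
    return W, c
```

In the article the updates of the weight matrices instead rely on inexact derivatives obtained by implicit partial differentiation; the explicit chain rule above applies only to the toy feature map used in this sketch.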
Supplementary Materials
The supplementary materials are provided in a single zip archive, supplement.zip, which contains:
Technical appendices: Mathematical descriptions of the ConvNets and CKNs used in the article, the derivation of the gradient of a CKN with respect to the weight matrices, additional details on the training methods, and additional results. (appendix.pdf, PDF file)
Python code: Python code that can be used to reproduce the results in the article. (code.zip, zip archive)
Acknowledgments
The authors would like to thank the referees and the associate editor for their valuable comments.
Disclosure Statement
The authors report there are no competing interests to declare.
Notes
1. A dot product kernel is a kernel of the form $k(x, y) = \kappa(\langle x, y \rangle)$ for a function $\kappa\colon \mathbb{R} \to \mathbb{R}$. For notational convenience, for a dot product kernel $k$ we will write $k(\langle x, y \rangle)$ rather than $\kappa(\langle x, y \rangle)$, where $\kappa$ is the scalar function defining $k$. For a matrix $A$, the element-wise application of $k$ to $A$ results in the matrix with entries $[k(A)]_{ij} = k(A_{ij})$.
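As a small illustration of the element-wise convention above, the snippet below applies a dot product kernel to a Gram matrix; the particular choice $\kappa(u) = \exp(\alpha(u - 1))$ and the variable names are assumptions for the example, not prescribed by the article.

```python
import numpy as np

# Element-wise application of a dot product kernel k to a matrix A,
# i.e., [k(A)]_ij = k(A_ij). The kernel kappa(u) = exp(alpha * (u - 1))
# is an assumed example of a dot product kernel on unit-norm inputs.
alpha = 1.0
k = lambda u: np.exp(alpha * (u - 1.0))

rng = np.random.default_rng(0)
X = rng.standard_normal((5, 3))
X /= np.linalg.norm(X, axis=1, keepdims=True)  # unit-normalize rows
A = X @ X.T                                    # A_ij = <x_i, x_j>
K = k(A)                                       # element-wise k(A)
```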