47
Views
0
CrossRef citations to date
0
Altmetric
Research Articles

Domain Generalization by Functional Regression

, &
Pages 259-281 | Received 17 May 2023, Accepted 14 Feb 2024, Published online: 04 Mar 2024
 

Abstract

The problem of domain generalization is to learn, given data from different source distributions, a model that can be expected to generalize well on new target distributions which are only seen through unlabeled samples. In this paper, we study domain generalization as a problem of functional regression. Our concept leads to a new algorithm for learning a linear operator from marginal distributions of inputs to the corresponding conditional distributions of outputs given inputs. Our algorithm allows a source distribution-dependent construction of reproducing kernel Hilbert spaces for prediction, and, satisfies finite sample error bounds for the idealized risk. Numerical implementations and source code are availableFootnote1.

Acknowledgments

We acknowledge the ELLIS Unit Linz, the LIT AI Lab, and the Institute for Machine Learning at the University of Linz. In addition, the research reported in this paper has been partly funded by the Federal Ministry for Climate Action, Environment, Energy, Mobility, Innovation and Technology (BMK), the Federal Ministry for Digital and Economic Affairs (BMDW), and the Province of Upper Austria in the frame of the COMET–Competence Centers for Excellent Technologies Programme and the COMET Module S3AI managed by the Austrian Research Promotion Agency FFG.

Notes

2 If we equip M1+(X×Y) with τw(X×Y), the weakest topology on M1+(X×Y) such that the mapping Lh:(M1+(X×Y),τw(X×Y))R with Lh(P)=X×Yh(x,y)dP(x,y) is continuous for all bounded and continuous functions h:X×YR and denote by B(τw(X×Y)) the associated Borel σ- algebra, then (M1+(X×Y),B(τw(X×Y))) becomes a itself measurable space, cf. [5, 10].

3 It holds that mPXHkL2(X), the space of square-integrable functions on X. The mapping m:PXmPX is well-defined if the kernel k is bounded and it is injective if k is universal [11, 12].

4 The regression function fP is well-defined since X and Y are Polish spaces (as compact subsets of Rd1,Rd2) and, therefore, every PM1+(X×Y) can be factorized P(x,y)=P(y|x)PX(x) in a conditional probability measure P(y|x) and a marginal (w.r.t. X) probability measure PX(x), see [13, Theorem 10.2.1].

Reprints and Corporate Permissions

Please note: Selecting permissions does not provide access to the full text of the article, please see our help page How do I view content?

To request a reprint or corporate permissions for this article, please click on the relevant link below:

Academic Permissions

Please note: Selecting permissions does not provide access to the full text of the article, please see our help page How do I view content?

Obtain permissions instantly via Rightslink by clicking on the button below:

If you are unable to obtain permissions via Rightslink, please complete and submit this Permissions form. For more information, please visit our Permissions help page.