Search in:

Optimization

A Journal of Mathematical Programming and Operations Research

Volume 73, 2024 - Issue 4

Submit an article Journal homepage

Open access

184

Views

CrossRef citations to date

Altmetric

Research Article

Generalized left-localized Cayley parametrization for optimization with orthogonality constraints

Keita KumeDepartment of Information and Communications Engineering, Tokyo Institute of Technology, Tokyo, JapanCorrespondence[email protected]
View further author information

Isao YamadaDepartment of Information and Communications Engineering, Tokyo Institute of Technology, Tokyo, JapanCorrespondence[email protected]
View further author information

Pages 1113-1159 | Received 14 Oct 2021, Accepted 18 Oct 2022, Published online: 15 Nov 2022

Cite this article
https://doi.org/10.1080/02331934.2022.2142471
CrossMark

Sample our Engineering & Technology journals, sign in here to start your access, latest two full volumes FREE to you for 14 days

Full Article
Figures & data
References
Citations
Metrics
Licensing
Reprints & Permissions
View PDF PDF View EPUB EPUB

Abstract

We present a reformulation of optimization problems over the Stiefel manifold by using a Cayley-type transform, named the generalized left-localized Cayley transform, for the Stiefel manifold. The reformulated optimization problem is defined over a vector space, whereby we can apply directly powerful computational arts designed for optimization over a vector space. The proposed Cayley-type transform enjoys several key properties which are useful to (i) study relations between the original problem and the proposed problem; (ii) check the conditions to guarantee the global convergence of optimization algorithms. Numerical experiments demonstrate that the proposed algorithm outperforms the standard algorithms designed with a retraction on the Stiefel manifold.

Keywords:

Stiefel manifold
Cayley transform
Cayley parametrization
orthogonality constraint
non-convex optimization

Disclosure statement

No potential conflict of interest was reported by the author(s).

Notes

1 $φ^{- 1}$ is well-defined over $Q_{N, N}$ because all eigenvalues of $V \in Q_{N, N}$ are pure imaginary. For the second expression in (Equation4(4) $φ^{- 1} : Q_{N, N} \to SO (N) ∖ E_{N, N} : V \mapsto (I - V) (I + V)^{- 1} = 2 (I + V)^{- 1} - I$ (4) ), see the beginning of Appendix 3.

2 The closure of $SO (N) ∖ E_{N, N}$ is equal to $SO (N)$ . For every $U \in SO (N)$ , we can approximate it by some sequence $(U_{n})_{n = 1}^{\infty}$ of $SO (N) ∖ E_{N, N}$ with any accuracy, i.e. $lim_{n \to \infty} U_{n} = U$ .

3 The domain of $φ_{S}$ with $S \in SO (N)$ is a subset $O (N) ∖ E_{N, N} (S) = SO (N) ∖ E_{N, N} (S)$ of $SO (N)$ .

4 As in (Equation9(9) $Q_{N, p} (S) := Q_{N, p} := {[\begin{matrix} A & - B^{T} \\ B & 0 \end{matrix}] | \begin{matrix} - A^{T} = A \in R^{p \times p}, \\ B \in R^{(N - p) \times p} \end{matrix}} \subset Q_{N, N} .$ (9) ), $Q_{N, p} (S)$ is the common set $Q_{N, p}$ for every $S \in O (N)$ . However, we distinguish $Q_{N, p} (S)$ for each $S \in O (N)$ as a parametrization of the particular subset $St (p, N) ∖ E_{N, p} (S)$ of $St (p, N)$ (see also Remark 1.3(b)).

5 Algorithm 1 can serve as a central building block in our further advanced Cayley parametrization strategies, reported partially in [Citation38–40].

6 The local diffeomorphism of $R_{U}$ around $0 \in T_{U} St (p, N)$ can be verified with the inverse function theorem and the condition (ii) in Definition B.1.

7 Let $I_{p} + [[V]]_{21}^{T} [[V]]_{21} = Q (I_{p} + Σ) Q^{T}$ be the eigenvalue decomposition with $Q \in O (N)$ and a nonnegative-valued diagonal matrix $Σ \in R^{p \times p}$ . From (I2) in Appendix 9, we have $‖ M^{- 1} ‖_{2} \leq ‖ (I_{p} + Σ)^{- 1} ‖_{F} = (1 + σ_{min}^{2} ([[V]]_{21}))^{- 1} \leq 1$ . Thus, we have $κ (M) \leq ‖ M ‖_{2} \leq 1 + ‖ [[V]]_{11} ‖_{2} + ‖ [[V]]_{21} ‖_{2}^{2}$ .

8 From the relation $min_{U \in St (p, N)} f (U) = inf_{V \in Q_{N, p} (S)} f \circ Φ_{S}^{- 1} (V)$ in Lemma 2.6, $Φ_{S}^{- 1} (V^{⋆}) \in St (p, N)$ is also a global minimizer of f over $St (p, N)$ .

9 We note that this early stopping of GDM+CP-retraction can be caused by the instability [Citation22] of the Sherman-Morrison-Woodbury formula used in $R_{U_{0}}^{Cay}$ and $\nabla (f \circ R_{U_{0}}^{Cay})$ .

10 The subspace $W_{1} := {U Ω \in R^{N \times p} ∣ Ω^{T} = - Ω \in R^{p \times p}} \subset R^{N \times p}$ is an orthogonal complement to the subspace $W_{2} := {U_{⊥} K \in R^{N \times p} ∣ K \in R^{(N - p) \times p}} \subset R^{N \times p}$ with the inner product $⟨ X, Y ⟩ = Tr (X^{T} Y) (X, Y \in R^{N \times p})$ . The tangent space $T_{U} St (p, N)$ can be decomposed as $W_{1} \oplus W_{2}$ with the direct sum ⊕. In view of the orthogonal decomposition, the first term and the second term in the right-hand side of (EquationA1(A1) $\begin{aligned} (X \in R^{N \times p}) P_{T_{U} St (p, N)} (X) & := \underset{Z \in T_{U} St (p, N)}{argmin} ‖ X - Z ‖_{F} \\ = \frac{1}{2} U (U^{T} X - X^{T} U) + (I - U U^{T}) X . \end{aligned}$ (A1) ) can be regarded respectively as the orthogonal projection of $X$ onto $W_{1}$ and $W_{2}$ .

11 The exponential mapping ${Exp}_{U} : T_{U} St (p, N) \to St (p, N)$ at $U \in St (p, N)$ is defined as a mapping that assigns a given direction $D \in T_{U} St (p, N)$ to a point on the geodesic of $St (p, N)$ with the initial velocity $D$ . The exponential mapping is also a special instance of retractions of $St (p, N)$ . However, due to its high computational complexity, computationally simpler retractions have been used extensively for Problem 1.1 [Citation1].

Wen Z, Yin W. A feasible method for optimization with orthogonality constraints. Math Program. 2013;142(1–2):397–434.

Web of Science ®Google Scholar

Absil PA, Mahony R, Sepulchre R. Optimization algorithms on matrix manifolds. Princeton (NJ): Princeton University Press; 2008.

Google Scholar

Additional information

Funding

This work was supported by JSPS [grants-in-aid19H04134] partially, by JSPS [grants-in-aid 21J21353] and by JST SICORP [grant number JPMJSC20C6].

Download PDF

Share icon
Back to Top

Related research

People also read lists articles that other readers of this article have read.

Recommended articles lists articles that we recommend and is powered by our AI driven recommendation engine.

Cited by lists all citing articles based on Crossref citations.
Articles with the Crossref icon will open in a new tab.

People also read
Recommended articles
Cited by

To cite this article:

Reference style: APA Chicago Harvard

Citation copied to clipboard

Reference styles above use APA (6th edition), Chicago (16th edition) & Harvard (10th edition)

Download citation

Download a citation file in RIS format that can be imported by citation management software including EndNote, ProCite, RefWorks and Reference Manager.

Choose format: RIS BibTex RefWorks Direct Export

Choose options: Citation Citation & abstract Citation & references

Generalized left-localized Cayley parametrization for optimization with orthogonality constraints

Information for

Open access

Opportunities

Help and information

Generalized left-localized Cayley parametrization for optimization with orthogonality constraints

Abstract

Disclosure statement

Notes

Additional information

Funding

Related research

To cite this article:

Download citation

Information for

Open access

Opportunities

Help and information

Keep up to date

Your download is now in progress and you may close this window

Login or register to access this feature