55
Views
0
CrossRef citations to date
0
Altmetric
Research Article

Nonparametric High-Dimensional Multi-Sample Tests based on Graph Theory

ORCID Icon
Received 06 Jul 2022, Accepted 15 May 2024, Published online: 17 Jun 2024
 

Abstract

High-dimensional data pose unique challenges for data processing in an era of ever-increasing amounts of data availability. Graph theory can provide a structure of high-dimensional data. We introduce two key properties desirable for graphs in testing homogeneity. Roughly speaking, these properties may be described as: unboundedness of edge counts under the same distribution and boundedness of edge counts under different distributions. It turns out that the minimum spanning tree violates these properties but the shortest Hamiltonian path posses them. Based on the shortest Hamiltonian path, we propose two combinations of edge counts in multiple samples to test for homogeneity. We give the permutation null distributions of proposed statistics when sample sizes go to infinity. The power is analyzed by assuming both sample sizes and dimensionality tend to infinity. Simulations show that our new tests behave very well overall in comparison with various competitors. Real data analysis of tumors and images further convince the value of our proposed tests. Software implementing the test is available in the R package GRelevance. Supplemental materials for this article are available online.

Supplementary Materials

Supplementary materials include: Appendix A for proof of Theorem 1, Appendix B for proof of Theorem 2, Appendix C for verification of A4, Appendix D for verification of A5, Appendix E for presenting Lemma 1 and its proof, Appendix F for proof of Theorem 3, and a zip file containing R code for simulation and data analysis.

Acknowledgments

We thank Dr. Won Beom Jung for sharing fMRI data. We also thank Dr. Nancy Reid, Dr. Augustine Wong, Octavia Wong, and Jessica Collins for their helpful comments on the first draft. We would like to thank the two referees for suggesting some notable improvements in the presentation of the article.

Disclosure statement

The author reports there are no competing interests to declare.

Additional information

Funding

This work was supported by the Natural Sciences and Engineering Research Council of Canada under Grant RGPIN-2022-03264, the Interior Universities Research Coalition and the BC Ministry of Health, the NSERC Alliance International Catalyst Grant ALLRP 590341-23, and the University of British Columbia Okanagan (UBC-O) Vice Principal Research in collaboration with UBC-O Irving K. Barber Faculty of Science.

Log in via your institution

Log in to Taylor & Francis Online

PDF download + Online access

  • 48 hours access to article PDF & online version
  • Article PDF can be downloaded
  • Article PDF can be printed
USD 61.00 Add to cart

Issue Purchase

  • 30 days online access to complete issue
  • Article PDFs can be downloaded
  • Article PDFs can be printed
USD 180.00 Add to cart

* Local tax will be added as applicable

Related Research

People also read lists articles that other readers of this article have read.

Recommended articles lists articles that we recommend and is powered by our AI driven recommendation engine.

Cited by lists all citing articles based on Crossref citations.
Articles with the Crossref icon will open in a new tab.