Tropical principal component analysis on the space of phylogenetic trees.

Author
Abstract
:

Due to new technology for efficiently generating genome data, machine learning methods are urgently needed to analyze large sets of gene trees over the space of phylogenetic trees. However, the space of phylogenetic trees is not Euclidean, so ordinary machine learning methods cannot be directly applied. In 2019, Yoshida et al. introduced the notion of tropical principal component analysis (PCA), a statistical method for visualization and dimensionality reduction using a tropical polytope with a fixed number of vertices that minimizes the sum of tropical distances between each data point and its tropical projection. However, their work focused on the tropical projective space rather than the space of phylogenetic trees. We focus here on tropical PCA for dimension reduction and visualization over the space of phylogenetic trees.
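The two ingredients named in the abstract, the tropical distance and the tropical projection onto a tropical polytope, can be illustrated with a minimal sketch. It assumes the max-plus convention, under which the tropical distance is d_tr(u, v) = max_i(u_i - v_i) - min_i(u_i - v_i) and the projection of x onto the polytope generated by vertices v_1, ..., v_s has coordinates max_k(lambda_k + v_k,i) with lambda_k = min_i(x_i - v_k,i). The function names and the toy vectors are hypothetical, chosen only for illustration. This is not the authors' implementation, and it does not include the search for the vertices that minimize the objective; it only evaluates the objective for a given set of vertices.

```python
# Illustrative sketch (max-plus convention assumed); names are hypothetical.

def tropical_distance(u, v):
    """Tropical distance: max_i(u_i - v_i) - min_i(u_i - v_i)."""
    diffs = [a - b for a, b in zip(u, v)]
    return max(diffs) - min(diffs)

def tropical_projection(x, vertices):
    """Project x onto the tropical polytope spanned by `vertices`."""
    # lambda_k is the largest shift such that lambda_k + v_k <= x coordinatewise.
    lambdas = [min(xi - vi for xi, vi in zip(x, v)) for v in vertices]
    # The projection is the coordinatewise maximum of the shifted vertices.
    return [
        max(lam + v[i] for lam, v in zip(lambdas, vertices))
        for i in range(len(x))
    ]

def sum_of_tropical_residuals(points, vertices):
    """Objective from the abstract: total distance from points to projections."""
    return sum(
        tropical_distance(p, tropical_projection(p, vertices)) for p in points
    )

if __name__ == "__main__":
    # Toy data: three vertices of a candidate tropical triangle, two data points.
    vertices = [(0, 0, 0), (0, 3, 1), (0, 2, 5)]
    points = [(0, 1, 4), (0, 5, 0)]
    for p in points:
        print(p, "->", tropical_projection(p, vertices))
    print("objective:", sum_of_tropical_residuals(points, vertices))
```

With three vertices, the polytope is a tropical triangle, which is what makes a two-dimensional visualization of the projected points possible. A point that already lies in the polytope projects to itself and contributes zero to the objective.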

Year of Publication
:
2020
Journal
:
Bioinformatics (Oxford, England)
Volume
:
36
Issue
:
17
Pages
:
4590-4598
Date Published
:
2020
ISSN Number
:
1367-4803
URL
:
https://academic.oup.com/bioinformatics/article-lookup/doi/10.1093/bioinformatics/btaa564
DOI
:
10.1093/bioinformatics/btaa564
Short Title
:
Bioinformatics