Dimensionality reduction for algorithms learning Mahalanobis matrix M #167

Open
wdevazelhes opened this issue Feb 4, 2019 · 3 comments

wdevazelhes commented Feb 4, 2019

For all Mahalanobis metric learners, we should be able to reduce the dimension. For those that optimize the transformation matrix L, this can be done explicitly by setting the matrix at init to have shape (num_dims, n_features). For the others (those that optimize the metric M), we could provide the user with a num_dims argument, which could be set to:

  • an integer k < n_features: in this case we would do the eigendecomposition of M and keep only the k components with the highest eigenvalues (see the sketch after this list)
  • similar to scikit-learn's PCA, it could also be a value between 0 and 1 (say, a threshold on the eigenvalues), or even a string for some custom strategy (for instance the elbow rule)
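A minimal sketch of the first option, assuming a learned PSD matrix M (the helper name `components_from_metric` and its signature are hypothetical, not part of the package API):

```python
import numpy as np

def components_from_metric(M, num_dims):
    """Hypothetical helper: derive a (num_dims, n_features) transformation
    L from a learned Mahalanobis matrix M, keeping only the num_dims
    components with the largest eigenvalues, so that L.T @ L approximates M."""
    # M is symmetric PSD, so eigh applies; it returns eigenvalues in
    # ascending order.
    eigvals, eigvecs = np.linalg.eigh(M)
    # Indices of the num_dims largest eigenvalues.
    idx = np.argsort(eigvals)[::-1][:num_dims]
    # Clip tiny negative values due to numerical error before the sqrt.
    vals = np.clip(eigvals[idx], 0, None)
    # Each row of L is an eigenvector scaled by sqrt(eigenvalue).
    return np.sqrt(vals)[:, None] * eigvecs[:, idx].T
```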

This is the current state in the package:

  • All metric learners that use transformer_from_metric (Covariance, LSML, MMC, and SDML) do not have a num_dims argument
  • All the others (LFDA, MLKR, NCA, RCA) optimize L explicitly and have a num_dims argument, except LMNN, which could have one

Also, should we replace num_dims with n_components, as is the case for scikit-learn linear transformers? This is also what we did for NCA in scikit-learn in scikit-learn/scikit-learn#10058

This is also related to #124: when a custom matrix is used to initialize the explicit transformer, we should check that it is consistent with the desired dimension

@wdevazelhes added this to the v0.5.0 milestone Feb 4, 2019
bellet commented Feb 14, 2019

We should definitely do this for all algorithms learning the transformation matrix L.

For M I am not sure, because it would not change the learning algorithm: it would only post-process the solution, and the impact on quality can be very large and hard for the user to understand. A better way is to add trace regularization to encourage the learned M to be low-rank (in which case one can safely ignore the eigenvectors corresponding to zero eigenvalues). In this case one cannot choose num_dims explicitly, only indirectly by varying the strength of the regularization.
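A rough sketch of what this could look like, assuming a generic projected-gradient solver (the function `trace_regularized_step` and its parameters are illustrative, not an existing API):

```python
import numpy as np

def trace_regularized_step(M, grad_loss, lr=0.01, lam=0.1):
    """Illustrative projected-gradient step for min_M loss(M) + lam * tr(M)
    over PSD matrices. On the PSD cone the trace equals the nuclear norm,
    so the penalty drives small eigenvalues to exactly zero and the learned
    M becomes low-rank without fixing num_dims up front."""
    # The gradient of lam * tr(M) is lam * I.
    M = M - lr * (grad_loss + lam * np.eye(M.shape[0]))
    # Project back onto the PSD cone by zeroing negative eigenvalues.
    eigvals, eigvecs = np.linalg.eigh(M)
    return (eigvecs * np.clip(eigvals, 0, None)) @ eigvecs.T
```

Varying lam would then control the resulting rank indirectly, as described above.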

@bellet closed this as completed Feb 14, 2019
@bellet reopened this Feb 14, 2019
wdevazelhes (author) commented

Let's just add this for LMNN, and add trace regularization in a future release.

@bellet changed the title from "Dimensionality reduction for all algorithms" to "Dimensionality reduction for algorithms learning Mahalanobis matrix M" Jun 7, 2019
bellet commented Jun 7, 2019

Done for LMNN in #193, so I renamed this issue and flagged it for v0.6.0.

@bellet modified the milestones: v0.5.0 → v0.6.0 Jun 7, 2019
@bellet modified the milestones: v0.6.0 → v0.7.0 Jul 31, 2020