Utils & Metrics¶

_1d¶

pyldl.algorithms.utils._1d(func)[source]¶

_clip¶

pyldl.algorithms.utils._clip(func)[source]¶

_reduction¶

pyldl.algorithms.utils._reduction(func)[source]¶

binaryzation¶

pyldl.algorithms.utils.binaryzation(D: ndarray, method='threshold', param: any = None) → ndarray[source]¶

Transform label distribution matrix to logical label matrix.

Parameters:

D (np.ndarray) – Label distribution matrix (shape: \([n,\, l]\)).
method ({'threshold', 'topk'}, optional) –
Type of binaryzation method, defaults to ‘threshold’. The options are ‘threshold’ and ‘topk’, which can refer to:

[BIN-KWT+24]
Zhiqiang Kou, Jing Wang, Jiawei Tang, Yuheng Jia, Boyu Shi, and Xin Geng. Exploiting multi-label correlation in label distribution learning. In Proceedings of the International Joint Conference on Artificial Intelligence, 4326–4334. 2024. URL: https://doi.org/10.24963/ijcai.2024/478.
param (any, optional) – Parameter of binaryzation method, defaults to None. If None, the default value is .5 for ‘threshold’ and \(\lfloor l / 2 \rfloor\) for ‘topk’.

Returns:

Logical label matrix (shape: \([n,\, l]\)).

Return type:

np.ndarray

csr2sparse¶

pyldl.algorithms.utils.csr2sparse(A: csr_matrix) → SparseTensor[source]¶

kernel¶

pyldl.algorithms.utils.kernel(X: Tensor, Y: Tensor | None = None, gamma: float | None = None) → Tensor[source]¶

kl_divergence\(\downarrow\)¶

pyldl.algorithms.utils.kl_divergence(D, D_pred)[source]¶: Kullback-Leibler divergence. It is defined as:

\[\text{KLD}(\boldsymbol{u}, \, \boldsymbol{v}) = \sum^l_{j=1}u_j \ln \frac{u_j}{v_j}\text{.}\]

pairwise_cosine¶

pyldl.algorithms.utils.pairwise_cosine(X: ndarray | Tensor, Y: ndarray | Tensor | None = None, mode: str = 'similarity') → ndarray | Tensor[source]¶

Pairwise cosine distance/similarity.

Parameters:

X (tf.Tensor) – Matrix \(\boldsymbol{X}\) (shape: \([m_{\boldsymbol{X}},\, n]\)).
Y (tf.Tensor) – Matrix \(\boldsymbol{Y}\) (shape: \([m_{\boldsymbol{Y}},\, n]\)).
mode (str) – Defaults to ‘similarity’. The options are ‘similarity’ and ‘distance’.

Returns:

Pairwise cosine similarity (shape: \([m_{\boldsymbol{X}},\, m_{\boldsymbol{Y}}]\)).

Return type:

tf.Tensor

pairwise_euclidean¶

pyldl.algorithms.utils.pairwise_euclidean(X: ndarray | Tensor, Y: ndarray | Tensor | None = None) → ndarray | Tensor[source]¶

Pairwise Euclidean distance.

Parameters:

X (Union[np.ndarray, tf.Tensor]) – Matrix \(\boldsymbol{X}\) (shape: \([m_{\boldsymbol{X}},\, n]\)).
Y (Union[np.ndarray, tf.Tensor], optional) – Matrix \(\boldsymbol{Y}\) (shape: \([m_{\boldsymbol{Y}},\, n]\)), if None, \(\boldsymbol{Y} = \boldsymbol{X}\), defaults to None.

Returns:

Pairwise Euclidean distance (shape: \([m_{\boldsymbol{X}},\, m_Y]\)).

Return type:

Union[np.ndarray, tf.Tensor]

pairwise_pearsonr¶

pyldl.algorithms.utils.pairwise_pearsonr(X: ndarray | Tensor, Y: ndarray | Tensor | None = None) → ndarray | Tensor[source]¶

proj¶

pyldl.algorithms.utils.proj(D: ndarray) → ndarray[source]¶

This approach is proposed in paper [Con16].

Parameters:: D (np.ndarray) – Matrix \(\boldsymbol{D}\).
Returns:: The projection onto the probability simplex.
Return type:: np.ndarray

soft_thresholding¶

pyldl.algorithms.utils.soft_thresholding(A: ndarray, tau: float) → ndarray[source]¶

Soft thresholding operation. It is defined as \(\text{soft}(\boldsymbol{A}, \, \tau) = \text{sgn}(\boldsymbol{A}) \odot \max\lbrace \lvert \boldsymbol{A} \rvert - \tau, 0 \rbrace\), where \(\odot\) denotes element-wise multiplication.

Parameters:

A (np.ndarray) – Matrix \(\boldsymbol{A}\).
tau (float) – \(\tau\).

Returns:

The result of soft thresholding operation.

Return type:

np.ndarray

solvel21¶

pyldl.algorithms.utils.solvel21(A: ndarray, tau: float) → ndarray[source]¶

This approach is proposed in paper [CY14].

The solution to the optimization problem \(\mathop{\arg\min}_{\boldsymbol{X}} \Vert \boldsymbol{X} - \boldsymbol{A} \Vert_\text{F}^2 + \tau \Vert \boldsymbol{X} \Vert_{2,\,1}\) is given by the following formula:

\[\begin{split}\vec{x}_{\bullet j}^{\ast} = \left\{ \begin{aligned} & \frac{\Vert \vec{a}_{\bullet j} \Vert - \tau}{\Vert \vec{a}_{\bullet j} \Vert} \vec{a}_{\bullet j}, & \tau \le \Vert \vec{a}_{\bullet j} \Vert \\ & 0, & \text{otherwise} \end{aligned} \right.\text{.}\end{split}\]

where \(\vec{x}_{\bullet j}\) is the \(j\)-th column of matrix \(\boldsymbol{X}\), and \(\vec{a}_{\bullet j}\) is the \(j\)-th column of matrix \(\boldsymbol{A}\).

Parameters:

A (np.ndarray) – Matrix \(\boldsymbol{A}\).
tau (float) – \(\tau\).

Returns:

The solution to the optimization problem.

Return type:

np.ndarray

svt¶

pyldl.algorithms.utils.svt(A: ndarray, tau: float) → ndarray[source]¶

Singular value thresholding (SVT) is proposed in paper [CCS10].

The solution to the optimization problem \(\mathop{\arg\min}_{\boldsymbol{X}} \Vert \boldsymbol{X} - \boldsymbol{A} \Vert_\text{F}^2 + \tau \Vert \boldsymbol{X} \Vert_{\ast}\) is given by \(\boldsymbol{U} \max \lbrace \boldsymbol{\Sigma} - \tau, 0 \rbrace \boldsymbol{V}^\top\), where \(\boldsymbol{A} = \boldsymbol{U} \boldsymbol{\Sigma} \boldsymbol{V}^\top\) is the singular value decomposition of matrix \(\boldsymbol{A}\).

Parameters:

A (np.ndarray) – Matrix \(\boldsymbol{A}\).
tau (float) – \(\tau\).

Returns:

The solution to the optimization problem.

Return type:

np.ndarray

artificial¶

pyldl.utils.artificial(X, a=1.0, b=0.5, c=0.2, d=1.0, w1=array([[4., 2., 1.]]), w2=array([[1., 2., 4.]]), w3=array([[1., 4., 2.]]), lambda1=0.01, lambda2=0.01)[source]¶

download_dataset¶

pyldl.utils.download_dataset(name, dataset_path)[source]¶

emphasize¶

pyldl.utils.emphasize(D, rate=0.5, **kwargs)[source]¶

gaussian_noise¶

pyldl.utils.gaussian_noise(D: ndarray, mean: float = 0.0, std: float = 0.1)[source]¶

load_dataset¶

pyldl.utils.load_dataset(name, dir='dataset')[source]¶

make_ldl¶

pyldl.utils.make_ldl(n_samples=200, **kwargs)[source]¶

plot_artificial¶

pyldl.utils.plot_artificial(n_samples=50, model=None, file_name=None, *, noise=False, noise_func_args=None, **kwargs)[source]¶

random_missing¶

pyldl.utils.random_missing(D, missing_rate=0.9, weighted=False, return_mask=True)[source]¶

_D2L¶

pyldl.metrics._D2L(f=None, *, keep_pred=False)[source]¶

accuracy\(\uparrow\)¶

pyldl.metrics.accuracy(y, y_pred)[source]¶

canberra\(\downarrow\)¶

pyldl.metrics.canberra(D, D_pred)[source]¶: Canberra distance. It is defined as:

\[\text{Can.}(\boldsymbol{u}, \, \boldsymbol{v}) = \sum^l_{j=1}\frac{\left\vert u_j - v_j \right\vert}{u_j + v_j}\text{.}\]

chebyshev\(\downarrow\)¶

pyldl.metrics.chebyshev(D, D_pred)[source]¶: Chebyshev distance. It is defined as:

\[\text{Cheby.}(\boldsymbol{u}, \, \boldsymbol{v}) = \max_j \left\vert u_j - v_j \right\vert\text{.}\]

chi2\(\downarrow\)¶

pyldl.metrics.chi2(D, D_pred)[source]¶: Chi-squared distance. It is defined as:

\[\chi^2(\boldsymbol{u}, \, \boldsymbol{v}) = \sum^l_{j=1}\frac{\left( u_j - v_j \right)^2}{u_j + v_j}\text{.}\]

clark\(\downarrow\)¶

pyldl.metrics.clark(D, D_pred)[source]¶: Clark distance. It is defined as:

\[\text{Clark}(\boldsymbol{u}, \, \boldsymbol{v}) = \sqrt{\sum^l_{j=1}\frac{\left( u_j - v_j \right)^2}{\left( u_j + v_j \right)^2}}\text{.}\]

cosine\(\uparrow\)¶

pyldl.metrics.cosine(D, D_pred)[source]¶: Cosine similarity. It is defined as:

\[\text{Cosine}(\boldsymbol{u}, \, \boldsymbol{v}) = \frac{\sum^l_{j=1}u_j v_j}{\sqrt{\sum^l_{j=1}u_j^2}\sqrt{\sum^l_{j=1}v_j^2}}\text{.}\]

dpa\(\uparrow\)¶

pyldl.metrics.dpa(D, D_pred)[source]¶

Degree percentile average (DPA) is proposed in paper [LDL-JQLL24]. It is defined as:

\[\text{DPA}(\boldsymbol{u}, \, \boldsymbol{v}) = \frac{1}{l} \sum_{j=1}^{l} u_j \rho(v_j)\text{,}\]

where \(\rho(\cdot)\) is the rank of the element in the vector.

error_probability\(\downarrow\)¶

pyldl.metrics.error_probability(D, D_pred)[source]¶: Error probability. It is defined as:

\[\text{Err. prob.}(\boldsymbol{u}, \, \boldsymbol{v}) = 1 - u_{\arg\max(\boldsymbol{v})}\text{.}\]

euclidean\(\downarrow\)¶

pyldl.metrics.euclidean(D, D_pred)[source]¶: Euclidean distance. It is defined as:

\[\text{Eucl.}(\boldsymbol{u}, \, \boldsymbol{v}) = \sqrt{\sum^l_{j=1}\left( u_j - v_j \right)^2}\text{.}\]

fidelity\(\uparrow\)¶

pyldl.metrics.fidelity(D, D_pred)[source]¶: Fidelity similarity. It is defined as:

\[\text{Fid.}(\boldsymbol{u}, \, \boldsymbol{v}) = \sum^l_{j=1} \sqrt{u_j v_j}\text{.}\]

hamming\(\downarrow\)¶

pyldl.metrics.hamming(L, L_pred)[source]¶

intersection\(\uparrow\)¶

pyldl.metrics.intersection(D, D_pred)[source]¶: Intersection similarity. It is defined as:

\[\text{Int.}(\boldsymbol{u}, \, \boldsymbol{v}) = \sum^l_{j=1} \min\left(u_j, \, v_j\right)\text{.}\]

jaccard\(\uparrow\)¶

pyldl.metrics.jaccard(L, L_pred)[source]¶

kendall\(\uparrow\)¶

pyldl.metrics.kendall(D, D_pred)[source]¶: Kendall’s rank correlation coefficient. It is defined as:

\[\text{Ken.}(\boldsymbol{u}, \, \boldsymbol{v}) = \frac{2 \sum_{j < k} \text{sgn}(u_j - u_k) \text{sgn}(v_j - v_k) }{l (l-1)}\text{.}\]

match_m\(\uparrow\)¶

pyldl.metrics.match_m(D, D_pred, m=None)[source]¶

max_roc_auc\(\uparrow\)¶

pyldl.metrics.max_roc_auc(D, D_pred)[source]¶

mean_absolute_error\(\downarrow\)¶

pyldl.metrics.mean_absolute_error(D, D_pred, mode='macro')[source]¶

mean_squared_error\(\downarrow\)¶

pyldl.metrics.mean_squared_error(D, D_pred, mode='macro')[source]¶

mu\(\uparrow\)¶

pyldl.metrics.mu(D, D_pred, metrics=<function kl_divergence>)[source]¶

one_error\(\downarrow\)¶

pyldl.metrics.one_error(L, D_pred)[source]¶

precision\(\uparrow\)¶

pyldl.metrics.precision(y, y_pred)[source]¶

score¶

pyldl.metrics.score(target: ndarray, pred: ndarray | None = None, metrics: list | None = None, return_dict: bool = False)[source]¶

sensitivity\(\uparrow\)¶

pyldl.metrics.sensitivity(y, y_pred)[source]¶

sorensen\(\downarrow\)¶

pyldl.metrics.sorensen(D, D_pred)[source]¶: Sørensen's distance. It is defined as:
\[\text{S}\phi\text{ren.}(\boldsymbol{u}, \, \boldsymbol{v}) = \frac{\sum^l_{j=1}\left\vert u_j - v_j \right\vert}{\sum^l_{j=1}\left( u_j + v_j \right)}\text{.}\]

spearman\(\uparrow\)¶

pyldl.metrics.spearman(D, D_pred)[source]¶

Spearman’s rank correlation coefficient. It is defined as:

\[\text{Spear.}(\boldsymbol{u}, \, \boldsymbol{v}) = 1 - \frac{6 \sum_{j=1}^{l} (\rho(u_j) - \rho(v_j))^2 }{l(l^2 - 1)}\text{,}\]

where \(\rho(\cdot)\) is the rank of the element in the vector.

specificity\(\uparrow\)¶

pyldl.metrics.specificity(y, y_pred)[source]¶

top_k\(\uparrow\)¶

pyldl.metrics.top_k(D, D_pred, k=None, mode='f1_score')[source]¶

wave_hedges\(\downarrow\)¶

pyldl.metrics.wave_hedges(D, D_pred)[source]¶: Wave-Hedges distance. It is defined as:

\[\text{WHD}(\boldsymbol{u}, \, \boldsymbol{v}) = \sum^l_{j=1}\frac{\left| u_j - v_j \right|}{\max (u_j, \, v_j)}\text{.}\]

worst_kl_divergence¶

pyldl.metrics.worst_kl_divergence(D: ndarray)[source]¶

youden_index\(\uparrow\)¶

pyldl.metrics.youden_index(y, y_pred)[source]¶

zero_one_loss\(\downarrow\)¶

pyldl.metrics.zero_one_loss(D, D_pred)[source]¶

0/1 loss. It is defined as:

\[\text{0/1 loss}(\boldsymbol{u}, \, \boldsymbol{v}) = \delta(\arg\max(\boldsymbol{u}), \, \arg\max(\boldsymbol{v}))\text{,}\]

where \(\delta(\cdot, \, \cdot)\) is the Kronecker delta function.

References¶

[Con16]

Laurent Condat. Fast projection onto the simplex and the l1 ball. Mathematical Programming, 158(1):575–585, 2016. URL: https://doi.org/10.1007/s10107-015-0946-6.

[CY14]

Jinhui Chen and Jian Yang. Robust subspace segmentation via low-rank representation. IEEE Transactions on Cybernetics, 44(8):1432–1445, 2014. URL: https://doi.org/10.1109/TCYB.2013.2286106.

[CCS10]

Jian-Feng Cai, Emmanuel J Candès, and Zuowei Shen. A singular value thresholding algorithm for matrix completion. SIAM Journal on optimization, 20(4):1956–1982, 2010. URL: https://doi.org/10.1137/080738970.