-
Notifications
You must be signed in to change notification settings - Fork 11
/
NEWS
180 lines (107 loc) · 5.23 KB
/
NEWS
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
82
83
84
85
86
87
88
89
90
91
92
93
94
95
96
97
98
99
100
101
102
103
104
105
106
107
108
109
110
111
112
113
114
115
116
117
118
119
120
121
122
123
124
125
126
127
128
129
130
131
132
133
134
135
136
137
138
139
140
141
142
143
144
145
146
147
148
149
150
151
152
153
154
155
156
157
158
159
160
161
162
163
164
165
166
167
168
169
170
171
172
173
174
175
176
177
178
179
180
# Changelog
## 1.1.6 (2024-08-22)
* [PYTHON] The package now works with *numpy* 2.0.
## 1.1.5 (2023-10-18)
* [BACKWARD INCOMPATIBILITY] [Python and R] Inequality measures
are no longer referred to as inequity measures.
* [BACKWARD INCOMPATIBILITY] [Python and R]
Some external cluster validity measures were renamed:
`adjusted_asymmetric_accuracy` -> `normalized_clustering_accuracy`,
`normalized_accuracy` -> `normalized_pivoted_accuracy`.
* [BACKWARD INCOMPATIBILITY] [Python] `compare_partitions2` has been removed,
as `compare_partitions` and other partition similarity scores
now support both pairs of label vectors `(x, y)` and confusion matrices
`(x=C, y=None)`.
* [Python and R] New parameter to `pair_sets_index`: `clipped`.
* In `normalizing_permutation` and external cluster validity measures,
the input matrices can now be of the type `double`.
* [BUGFIX] [Python] #80: Fixed adjustment for `nmslib_n_neighbors`
in small samples.
* [BUGFIX] [Python] #82: `cluster_validity` submodule not imported.
* [BUGFIX] Some external cluster validity measures
now handle NaNs better and are slightly less prone to round-off errors.
## 1.1.4 (2023-03-31)
* [Python] The GIc algorithm is no longer marked as experimental;
its description is provided in
<https://doi.org/10.1007/s00357-024-09483-1>.
## 1.1.3 (2023-01-17)
* [R] `mst.default` now throws an error if any element in the input matrix
is missing/infinite.
* [Python] The call to `mlpack.emst` that stopped working
with the new version of `mlpack` has been fixed.
## 1.1.2 (2022-09-17)
* [Python and R] `adjusted_asymmetric_accuracy`
now accepts confusion matrices with fewer columns than rows.
Such "missing" columns are now treated as if they were filled with 0s.
* [Python and R] `pair_sets_index`, and `normalized_accuracy` return
the same results for non-symmetric confusion matrices and transposes thereof.
## 1.1.1 (2022-09-15)
* [Python] #75: `nmslib` is now optional.
* [BUILD TIME]: The use of `ssize_t` was not portable.
## 1.1.0 (2022-09-05)
* [Python and R] New function: `adjusted_asymmetric_accuracy`.
* [Python and R] Implementations of the so-called internal cluster
validity measures discussed in
DOI: [10.1016/j.ins.2021.10.004](https://doi.org/10.1016/j.ins.2021.10.004);
see our (GitHub-only) [CVI](https://github.com/gagolews/optim_cvi) package
for R. In particular, the generalised Dunn indices are based on the code
originally authored by Maciej Bartoszuk. Thanks.
Functions added (`cluster_validity` module):
`calinski_harabasz_index`,
`dunnowa_index`,
`generalised_dunn_index`,
`negated_ball_hall_index`,
`negated_davies_bouldin_index`,
`negated_wcss_index`,
`silhouette_index`,
`silhouette_w_index`,
`wcnn_index`.
These cluster validity measures are discussed
in more detail at <https://clustering-benchmarks.gagolewski.com/>.
* [BACKWARD INCOMPATIBILITY] `normalized_confusion_matrix`
now solves the maximal assignment problem instead of applying
the somewhat primitive partial pivoting.
* [Python and R] New function: `normalizing_permutation`
* [R] New function: `normalized_confusion_matrix`.
* [Python and R] New parameter to `pair_sets_index`: `simplified`.
* [Python] New parameters to `plots.plot_scatter`:
`axis`, `title`, `xlabel`, `ylabel`, `xlim`, `ylim`.
## 1.0.1 (2022-08-08)
* [GENERAL] A paper on the `genieclust` package is now available:
M. Gagolewski, genieclust: Fast and robust hierarchical clustering,
SoftwareX 15, 100722, 2021, DOI:
[10.1016/j.softx.2021.100722](https://doi.org/10.1016/j.softx.2021.100722).
* [Python] `plots.plot_scatter` now uses a more accessible default palette
(from R 4.0.0).
* [Python and R] New function: `devergottini_index`.
## 1.0.0 (2021-04-22)
* [R] Use `mlpack` instead of `RcppMLPACK` (#72).
This package is merely suggested, not dependent upon.
## 0.9.8 (2021-01-08)
* [Python] Require Python >= 3.7 (implied by `numpy`).
* [Python] Require `nmslib`.
* [R] Use `RcppMLPACK` directly; remove dependency on `emstreeR`.
* [R] Use `tinytest` for unit testing instead of `testthat`.
## 0.9.4 (2020-07-31)
* [BUGFIX] [R] Fix build errors on Solaris.
## 0.9.3 (2020-07-25)
* [BUGFIX] [Python] Add code coverage CI. Fix some minor inconsistencies.
Automate the `bdist` build chain.
* [R] Update DESCRIPTION to meet the CRAN policies.
## 0.9.2 (2020-07-22)
* [BUGFIX] [Python] Fix broken build script for OS X with no OpenMP.
## 0.9.1 (2020-07-18)
* [GENERAL] The package has been completely rewritten.
The core functionality is now implemented in C++ (with OpenMP).
* [GENERAL] Clustering with respect to HDBSCAN*-like
mutual reachability distances is supported.
* [GENERAL] The parallelised Jarnik-Prim algorithm now supports on-the-fly
distance computations. Euclidean minimum spanning tree can be
determined with `mlpack`, which is much faster in low-dimensional spaces.
* [R] R version is now available.
* [Python] [Experimental] The GIc algorithm proposed by Anna Cena
in her 2018 PhD thesis is added.
* [Python] Approximate version based on nearest neighbour graphs produced
by `nmslib` is added.
## 0.1a2 (2018-05-23)
* [Python] Initial PyPI release.