Skip to content

Commit 1c999c2

Browse files
committed
v-measure
1 parent b433b41 commit 1c999c2

File tree

1 file changed

+39
-0
lines changed

1 file changed

+39
-0
lines changed

book/4-clustering.tex

Lines changed: 39 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -318,6 +318,45 @@ \subsection{Homogeneity Score}
318318
\section{V Measure}
319319
\subsection{V Measure}
320320

321+
V-Measure is an external clustering evaluation metric that combines the strengths of two complementary criteria: Homogeneity
322+
(how pure clusters are with respect to ground-truth classes) and Completeness (how well all members of a class are grouped into
323+
the same cluster). It is computed as the harmonic mean of these two scores, similar to how precision and recall are combined into
324+
the F-score.
325+
326+
\begin{center}
327+
FORMULA GOES HERE
328+
\end{center}
329+
330+
By design, V-Measure is bounded between 0.0 and 1.0, with higher scores indicating better clustering quality. A score of 1.0
331+
means that both homogeneity and completeness are perfectly satisfied.
332+
333+
\textbf{When to use the V-measure?}
334+
335+
V-Measure is ideal for evaluating clustering solutions when you need a balanced assessment of both purity within clusters and the
336+
completeness of class representation.
337+
338+
\coloredboxes{
339+
\item Combines homogeneity and completeness into a single score, providing a holistic evaluation of clustering quality.
340+
\item The $\beta$ parameter allows customization to prioritize either homogeneity or completeness based on specific use cases.
341+
\item Clustering algorithm agnostic.
342+
}
343+
{
344+
\item Requires ground-truth labels.
345+
\item V-Measure inherits the trade-off between homogeneity and completeness. Adjusting one often impacts the other, making it
346+
challenging to optimize both in some scenarios.
347+
\item Not normalized with regards to random labeling.
348+
}
349+
350+
\clearpage
351+
352+
\thispagestyle{customstyle}
353+
354+
\textbf{Other related metrics}
355+
356+
V-Measure excels as a comprehensive metric for clustering evaluation. However, for specific insights, it can be supplemented with
357+
individual Homogeneity and Completeness scores. Additionally, pair-based measures like Adjusted Rand Index (ARI) or
358+
information-theoretic measures like Variation of Information (VI) may provide complementary perspectives in specific use cases​.
359+
321360
% ---------- Davis Bouldin Score ----------
322361
\clearpage
323362
\thispagestyle{clusteringstyle}

0 commit comments

Comments
 (0)