Skip to content

Commit bccb39d

Browse files
committed
diversity, novelty, serendipity and coverage
1 parent c231fb5 commit bccb39d

File tree

1 file changed

+142
-3
lines changed

1 file changed

+142
-3
lines changed

book/5-ranking.tex

Lines changed: 142 additions & 3 deletions
Original file line numberDiff line numberDiff line change
@@ -301,10 +301,149 @@ \subsection{Fraction of Concordant Pairs}
301301
% great references: https://aman.ai/recsys/metrics/#fraction-of-concordant-pairs-fcp
302302
% https://www.ijcai.org/Proceedings/13/Papers/449.pdf
303303

304-
% ---------- Behavioral Metrics (Novelty, Serendipity, Diversity/Intra-List Diversity, Coverage) ----------
304+
% ---------- Diversity ----------
305305
\clearpage
306306
\thispagestyle{rankingstyle}
307-
\section{Behavioral Metrics}
308-
\subsection{Behavioral Metrics}
307+
\section{Diversity}
308+
\subsection{Diversity}
309309

310+
Diversity is a ranking metric used in recommender systems to measure how varied the recommended items are within a given list.
311+
A diverse recommendation set ensures that users are exposed to different categories, genres, or types of content, rather than
312+
receiving highly similar items. This helps prevent redundancy and enhances user discovery. Diversity is often calculated using
313+
pairwise dissimilarity between recommended items.
310314

315+
\begin{center}
316+
FORMULA GOES HERE
317+
\end{center}
318+
319+
A higher Diversity score indicates that the recommended items are more distinct from one another, whereas a lower score
320+
suggests redundancy.
321+
322+
\textbf{When to use Diversity?}
323+
324+
Diversity is particularly important when you want to improve user engagement by introducing varied recommendations. Or when
325+
you want to avoid excessive similarity in recommendations.
326+
327+
\coloredboxes{
328+
\item Encourages exploration. Users are exposed to a broader range of content, which can increase
329+
engagement and retention.
330+
\item Supports long-tail recommendations. Helps surface less popular items that may still be relevant,
331+
avoiding over-recommendation of mainstream content.
332+
}
333+
{
334+
\item Potential trade-off with relevance. Increasing diversity may sometimes lead to less relevant recommendations
335+
for the user.
336+
\item Hard to define optimal diversity. Too much diversity can lead to recommendations that feel random or disconnected.
337+
}
338+
339+
% ---------- Novelty ----------
340+
\clearpage
341+
\thispagestyle{rankingstyle}
342+
\section{Novelty}
343+
\subsection{Novelty}
344+
345+
Novelty is a ranking metric in recommender systems that measures how unfamiliar or unexpected the recommended items are to
346+
the user. A high Novelty score indicates that the system is suggesting items that the user has not encountered before,
347+
rather than repeating well-known or frequently recommended content. One way to measure Novelty is the following.
348+
349+
\begin{center}
350+
\[
351+
Novelty(i) = 1 - \frac{count(\text{users who got recommended} \: i)}{count(\text{users who have not interacted with} \: i)}
352+
\]
353+
\end{center}
354+
355+
In the literature we can find other ways to compute Novelty such as: $Novelty(i) = -log_2 \left( \frac{count(\text{users who got recommended } \: i)}{count(\text{users who have not interacted with} \: i)} \right)$
356+
or $Novelty = \frac{1}{|S|} \sum_{i \in S} -log P(i)$ where $P(i)$ represents the popularity of item \( i \)
357+
A higher Novelty score means the recommendations contain less mainstream content, encouraging discovery of new items rather
358+
than reinforcing existing preferences.
359+
360+
\textbf{When to use Novelty?}
361+
362+
Novelty is especially useful in scenarios where exploration and discovery are important. Such as content, e-commerce and
363+
retail platforms where recommending new or niche products rather than just trending or best-selling ones can make a difference.
364+
365+
\coloredboxes{
366+
\item Encourages exploration. Users are exposed to a broader range of content, which can increase
367+
engagement and retention.
368+
\item Supports discovery of niche content. Helps mitigate the popularity bias by promoting lesser-known items.
369+
}
370+
{
371+
\item Potential Trade-off with relevance. High novelty items might be less relevant if the user has no prior
372+
interest in them.
373+
\item Potentially overwhelming for users. If novelty is too high, recommendations may feel random or disconnected.
374+
}
375+
376+
377+
\clearpage
378+
% FOR SECOND PAGE
379+
\textbf{Novelty vs Diversity}
380+
381+
Novelty focuses on how new or unexpected the recommendations are for the user whereas diversity focuses on how different the
382+
recommended items are from each other. Both metrics contribute to exploration but in different ways — novelty ensures
383+
fresh discoveries, while diversity prevents redundancy.
384+
385+
% ---------- Serendipity ----------
386+
\clearpage
387+
\thispagestyle{rankingstyle}
388+
\section{Serendipity}
389+
\subsection{Serendipity}
390+
391+
Serendipity is a ranking and recommendation system metric that measures the extent to which a recommendation is both
392+
relevant and surprising to a user. Unlike conventional accuracy-based metrics, which focus on predicting user preferences
393+
based on past behavior, serendipity evaluates how well a system introduces users to novel and unexpected items they would
394+
not have easily discovered themselves.
395+
396+
\begin{center}
397+
% there are many formulas, we need to figure which one to show. potentially comment on the others.
398+
FORMULA GOES HERE
399+
\end{center}
400+
401+
A high serendipity score means the system provides relevant and pleasantly surprising recommendations, while a low score
402+
indicates predictable suggestions.
403+
404+
\textbf{When to use Serendipity?}
405+
406+
Use serendipity when designing recommender systems that aim to provide diverse and engaging recommendations beyond what
407+
users already know. It is particularly useful in domains like music streaming and movie recommendations, where user
408+
satisfaction improves when they encounter unexpected but enjoyable suggestions.
409+
410+
\coloredboxes{
411+
\item Encourages discovery: Helps users explore new items they wouldn't typically encounter.
412+
\item Improves long-term engagement. By avoiding repetitive recommendations, it keeps users engaged with the platform.
413+
}
414+
{
415+
\item Hard to quantify "unexpectedness".
416+
\item Trade-off with accuracy. Maximizing serendipity may reduce traditional accuracy metrics.
417+
}
418+
419+
420+
% ---------- Coverage ----------
421+
\clearpage
422+
\thispagestyle{rankingstyle}
423+
\section{Coverage}
424+
\subsection{Coverage}
425+
426+
Coverage measures how well a recommender system utilizes the full range of available items. It reflects the proportion of
427+
the item catalog that is being recommended to users, ensuring that recommendations are not overly concentrated on a small
428+
subset of popular items.
429+
430+
\begin{center}
431+
FORMULA GOES HERE
432+
\end{center}
433+
434+
Higher coverage indicates a broader, more diverse recommendation set, while lower coverage suggests a system that primarily
435+
focuses on frequently chosen or trending items.
436+
437+
\textbf{When to use Coverage?}
438+
439+
Use coverage when evaluating how well a recommender system distributes recommendations across the entire catalog.
440+
It is particularly useful for platforms where content discovery is a priority. If a system recommends only a small fraction
441+
of the available items, users may miss out on relevant but less popular choices.
442+
443+
\coloredboxes{
444+
\item Reduces popularity bias. Encourages recommendations beyond just the most popular or frequently interacted items.
445+
\item Enhances user discovery. Helps users explore content they may not have found otherwise.
446+
}
447+
{
448+
\item Can reduce recommendation accuracy. Expanding recommendations too broadly may lead to less relevant suggestions.
449+
}

0 commit comments

Comments
 (0)