v0.0.36
HNMFk graph post-processing & root node naming
-
Added the ability to post-process HNMFk graphs based on the number of documents in leaf nodes.
- New functions:
model.traverse_tiny_leaf_topics(threshold: int)
: Identifies outlier clusters where the number of documents is below the given threshold.model.get_tiny_leaf_topics()
: Retrieves tiny leaf nodes (processed separately).model.process_tiny_leaf_topics(threshold: int)
: Processes the graph to separate tiny nodes based on the given threshold.- Resetting the graph by setting
threshold=None
restores the tiny nodes.
- New functions:
-
Added option to specify a root node name in HNMFk using
root_node_name="Root"
.- Default is now
"Root"
instead of"*"
to resolve Windows compatibility issues.
- Default is now
Bug(s)
- Fixed a bug in Beaver where mismatched indexes caused incorrect highlighting.