Skip to content

Conversation

ghafek
Copy link

@ghafek ghafek commented Jun 29, 2025

No description provided.

@ghafek ghafek force-pushed the feature/ssb-benchmark branch from bcc4671 to e7ae6b3 Compare July 11, 2025 22:04
jessicapriebe and others added 4 commits July 14, 2025 14:07
This patch refines the current union operation to an internal LOP
operation. Currently, two subsequent operations -- rbind() and unique()
are used to perform the union operation. We rewrite the operation with
an internal LOP that uses a HashSet to compute the unique entries and
returns them in a matrix. This improves the efficiency of the
operation, as it avoids unique(). The order of the input entries is
preserved in the output.

Closes apache#2286.
This patch introduces a basic integration of the out-of-core backend.

For reading, we use a dedicated reblock instruction which creates
a queue of blocks, spawns a thread for reading and immediately returns.
In addition, we extended the acquireRead functionality to collect such
streams of blocks whenever an operations requires the full matrix.
Based on these foundations, we can now add other OCC operations that
directly work with the input stream of blocks and produce either results
or created modified output streams.
The test failure of not finding the mtd-file did not show up in local
tests, because locally we do not clean the test directories, and from
development there were both Xmtd and X.mtd existing.
@ghafek ghafek changed the title Feature/ssb benchmark [SYSTEMDS-3862] SSB Benchmark Implementation Jul 14, 2025
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
Status: In Progress
Development

Successfully merging this pull request may close these issues.

5 participants