Skip to content

Commit 262b918

Browse files
refactor: update instructions and configuration settings for webgraph
processing. Adapt to settings used to build cc-main-2025-jan-feb-mar on Ubuntu 24.04 ARM using Java 21
1 parent b3b8525 commit 262b918

File tree

2 files changed

+21
-27
lines changed

2 files changed

+21
-27
lines changed

src/script/webgraph_ranking/run_webgraph.sh

Lines changed: 1 addition & 2 deletions
Original file line numberDiff line numberDiff line change
@@ -35,8 +35,7 @@ case "$1" in
3535
# Java options for HyperBall, recommended in
3636
# https://webgraph.di.unimi.it/docs/it/unimi/dsi/webgraph/algo/HyperBall.html
3737
JAVA_OPTS="$JAVA_OPTS -server -Xss256K -XX:PretenureSizeThreshold=512M -XX:MaxNewSize=$(($MEMMB/3))m \
38-
-XX:+UseNUMA -XX:+UseTLAB -XX:+ResizeTLAB \
39-
-XX:+UseConcMarkSweepGC -XX:CMSInitiatingOccupancyFraction=99 -XX:+UseCMSInitiatingOccupancyOnly"
38+
-XX:+UseNUMA -XX:+UseTLAB -XX:+ResizeTLAB"
4039
;;
4140
esac
4241

src/script/webgraph_ranking/webgraph_config.sh

Lines changed: 20 additions & 25 deletions
Original file line numberDiff line numberDiff line change
@@ -11,34 +11,29 @@ USE_WEBGRAPH_BIG=${USE_WEBGRAPH_BIG:-false}
1111
JOIN_RANKS_IN_MEMORY=${JOIN_RANKS_IN_MEMORY:-true}
1212

1313

14-
# number of threads and Hyperball registers
15-
# depend on the size of the machine (here EC2 instance)
16-
# ... and of the graph to be processed
17-
# => it's only an empirical value and possibly needs to be adjusted
18-
THREADS=${THREADS:-2}
19-
HYP_REG=4
20-
## on r4.8xlarge (244 GB)
21-
#THREADS=32
22-
#HYP_REG=4 # 4-5 for hostgraph, 10 for domain graph
23-
## on r4.16xlarge (488 GB)
24-
#THREADS=64
25-
#HYP_REG=5 # 4-6 for hostgraph, 10 for domain graph
26-
## on r5.12xlarge (384 GB)
27-
#THREADS=48
28-
#HYP_REG=5 # 4-6 for hostgraph, 10 for domain graph
29-
## on x1.16xlarge (976 GB)
30-
#THREADS=64
31-
#HYP_REG=9
32-
## on x1.32xlarge (1952 GB)
33-
#THREADS=128
34-
#HYP_REG=10
35-
36-
# determine automatically, using java.lang.Runtime.availableProcessors()
37-
# THREADS=0
38-
3914
# number of registers used for Hyperball / harmonic centrality calculation
15+
#
16+
# The number of Hyperball registers depend on
17+
# - the size of the machine (here EC2 instance)
18+
# - and of the graph to be processed
19+
# => it's an empirically determined value and
20+
# possibly needs to be adjusted
21+
# It can be overridden by the environment variable
22+
# HYPERBALL_REGISTERS, see below.
23+
HYP_REG=12
24+
## on r8.24.xlarge (768 GB, 96 CPUs)
25+
#HYP_REG=10 (host-level graph, 300M nodes)
26+
#HYP_REG=12 (domain-level graph, 130M nodes)
27+
4028
HYPERBALL_REGISTERS=${HYPERBALL_REGISTERS:-$HYP_REG}
4129

30+
# number of threads
31+
# THREAD=0 : let the webgraph tools decide how many threads,
32+
# given the available CPU cores, using
33+
# java.lang.Runtime.availableProcessors()
34+
THREADS=${THREADS:-0}
35+
36+
4237

4338
# number of fields in vertices file(s)
4439
# (default: 2)

0 commit comments

Comments
 (0)