Commit b3b8525

chore: update config for upcoming Jan/Feb/Mar 2025 webgraph
1 parent 91d3f1b commit b3b8525

1 file changed: +2 -2 lines changed

src/script/hostgraph/hostgraph_config.sh

Lines changed: 2 additions & 2 deletions
@@ -12,7 +12,7 @@
 ### saved as tuples <from_host, to_host>

 # crawls to be processed
-CRAWLS=("CC-MAIN-2024-51" "CC-MAIN-2025-05" "CC-MAIN-2025-08")
+CRAWLS=("CC-MAIN-2025-05" "CC-MAIN-2025-08" "CC-MAIN-2025-13")

 INPUT_BASE_URL="s3://commoncrawl/"

@@ -42,7 +42,7 @@ S3A_OUTPUT_PREFIX=s3a://commoncrawl-webgraph
 ################################################################################
 # construct a merged graph of multiple monthly crawls

-MERGE_NAME=cc-main-2024-25-dec-jan-feb
+MERGE_NAME=cc-main-2025-jan-feb-mar

 # Naming convention should be the three months' crawls that are
 # used to generate this graph release. In the event of multiple months
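
Because both edits in this commit are made by hand, a quick format check on the crawl IDs can catch a typo before a graph build starts. The sketch below is only an illustration: `check_crawl_ids` is a hypothetical helper that is not part of `hostgraph_config.sh`, and the `CC-MAIN-YYYY-WW` pattern is an assumption inferred from the IDs visible in this diff.

```shell
# Hypothetical helper, not part of hostgraph_config.sh.
# Succeeds only if at least one crawl ID is given and every ID
# matches the assumed pattern CC-MAIN-YYYY-WW (four digits, two digits).
check_crawl_ids() {
    if [ "$#" -eq 0 ]; then
        echo "no crawl IDs given" >&2
        return 1
    fi
    for crawl in "$@"; do
        case "$crawl" in
            CC-MAIN-[0-9][0-9][0-9][0-9]-[0-9][0-9]) ;;  # well-formed, keep going
            *)
                echo "unexpected crawl ID: $crawl" >&2
                return 1
                ;;
        esac
    done
    return 0
}

# Usage with the values introduced by this commit
check_crawl_ids "CC-MAIN-2025-05" "CC-MAIN-2025-08" "CC-MAIN-2025-13" \
    && echo "crawl IDs look well-formed"
```

With the bash array from the config, the same check could be called as `check_crawl_ids "${CRAWLS[@]}"`. It validates only the shape of each ID, not whether the crawl exists or whether `MERGE_NAME` matches the months covered.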
