We read every piece of feedback, and take your input very seriously.
To see all available qualifiers, see our documentation.
There was an error while loading. Please reload this page.
1 parent aa81ab8 commit 939bdd4Copy full SHA for 939bdd4
src/script/hostgraph/hostgraph_config.sh
@@ -12,7 +12,7 @@
12
### saved as tuples <from_host, to_host>
13
14
# crawls to be processed
15
-CRAWLS=("CC-MAIN-2025-18" "CC-MAIN-2025-21" "CC-MAIN-2025-26")
+CRAWLS=("CC-MAIN-2025-21" "CC-MAIN-2025-26" "CC-MAIN-2025-30")
16
17
INPUT_BASE_URL="s3://commoncrawl/"
18
@@ -42,7 +42,7 @@ S3A_OUTPUT_PREFIX=s3a://commoncrawl-webgraph
42
################################################################################
43
# construct a merged graph of multiple monthly crawls
44
45
-MERGE_NAME=cc-main-2025-apr-may-jun
+MERGE_NAME=cc-main-2025-may-jun-jul
46
47
# Naming convention should be the three months' crawls that are
48
# used to generate this graph release. In the event of multiple months
0 commit comments