We read every piece of feedback, and take your input very seriously.
To see all available qualifiers, see our documentation.
There was an error while loading. Please reload this page.
1 parent 23a4aed commit 6f0bca5Copy full SHA for 6f0bca5
src/script/hostgraph/hostgraph_config.sh
@@ -12,7 +12,7 @@
12
### saved as tuples <from_host, to_host>
13
14
# crawls to be processed
15
-CRAWLS=("CC-MAIN-2025-08" "CC-MAIN-2025-13" "CC-MAIN-2025-18")
+CRAWLS=("CC-MAIN-2025-18" "CC-MAIN-2025-21" "CC-MAIN-2025-26")
16
17
INPUT_BASE_URL="s3://commoncrawl/"
18
@@ -42,7 +42,7 @@ S3A_OUTPUT_PREFIX=s3a://commoncrawl-webgraph
42
################################################################################
43
# construct a merged graph of multiple monthly crawls
44
45
-MERGE_NAME=cc-main-2025-feb-mar-apr
+MERGE_NAME=cc-main-2025-apr-may-jun
46
47
# Naming convention should be the three months' crawls that are
48
# used to generate this graph release. In the event of multiple months
0 commit comments