1 parent 15d1daf commit 90b8f08
src/script/hostgraph/hostgraph_config.sh
@@ -12,7 +12,7 @@
 ### saved as tuples <from_host, to_host>

 # crawls to be processed
-CRAWLS=("CC-MAIN-2024-10" "CC-MAIN-2024-18" "CC-MAIN-2024-22")
+CRAWLS=("CC-MAIN-2024-22" "CC-MAIN-2024-26" "CC-MAIN-2024-30")

 INPUT_BASE_URL="s3://commoncrawl/"

@@ -42,7 +42,7 @@ S3A_OUTPUT_PREFIX=s3a://commoncrawl-webgraph
 ################################################################################
 # construct a merged graph of multiple monthly crawls

-MERGE_NAME=cc-main-2024-feb-apr-may
+MERGE_NAME=cc-main-2024-may-jun-jul

 # Naming convention should be the three months' crawls that are
 # used to generate this graph release. In the event of multiple months
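
Because CRAWLS and MERGE_NAME have to be updated together on every release, a small consistency check could catch a half-edited config. The sketch below is not part of hostgraph_config.sh. The function names `valid_merge_name`, `valid_crawl_ids`, and `check_config` are hypothetical, and the pattern assumes the simple form seen in this diff (`cc-main-<year>-` followed by three lowercase month abbreviations). The comment describing the multi-month case is cut off in the diff context, so that case is not handled here.

```shell
#!/usr/bin/env bash
# Hypothetical sanity check for the values changed in this commit.
# Assumption: MERGE_NAME is "cc-main-<year>-<mon>-<mon>-<mon>".

CRAWLS=("CC-MAIN-2024-22" "CC-MAIN-2024-26" "CC-MAIN-2024-30")
MERGE_NAME=cc-main-2024-may-jun-jul

# Succeeds if the name has a four-digit year followed by three
# lowercase three-letter month abbreviations.
valid_merge_name() {
    local months='(jan|feb|mar|apr|may|jun|jul|aug|sep|oct|nov|dec)'
    local re="^cc-main-[0-9]{4}-${months}-${months}-${months}\$"
    [[ "$1" =~ $re ]]
}

# Succeeds if every argument looks like CC-MAIN-<year>-<two-digit number>.
valid_crawl_ids() {
    local crawl
    local re='^CC-MAIN-[0-9]{4}-[0-9]{2}$'
    for crawl in "$@"; do
        [[ "$crawl" =~ $re ]] || return 1
    done
    return 0
}

# Checks the crawl count, the crawl ID format, and the merge name format.
check_config() {
    if [ "${#CRAWLS[@]}" -ne 3 ]; then
        echo "expected 3 crawls, got ${#CRAWLS[@]}" >&2
        return 1
    fi
    if ! valid_crawl_ids "${CRAWLS[@]}"; then
        echo "malformed crawl ID in CRAWLS" >&2
        return 1
    fi
    if ! valid_merge_name "$MERGE_NAME"; then
        echo "MERGE_NAME does not match the assumed pattern: $MERGE_NAME" >&2
        return 1
    fi
    return 0
}
```

Running `check_config && echo "config looks consistent"` after editing the two values would flag a wrong crawl count or a misspelled month. It does not verify that the months in MERGE_NAME actually correspond to the listed crawl IDs, since that mapping is not stated in the diff.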