We read every piece of feedback, and take your input very seriously.
To see all available qualifiers, see our documentation.
There was an error while loading. Please reload this page.
1 parent 7509a1c commit a7cd4e4Copy full SHA for a7cd4e4
src/script/hostgraph/hostgraph_config.sh
@@ -12,7 +12,7 @@
12
### saved as tuples <from_host, to_host>
13
14
# crawls to be processed
15
-CRAWLS=("CC-MAIN-2024-30" "CC-MAIN-2024-33" "CC-MAIN-2024-38")
+CRAWLS=("CC-MAIN-2024-33" "CC-MAIN-2024-38" "CC-MAIN-2024-42")
16
17
INPUT_BASE_URL="s3://commoncrawl/"
18
@@ -42,7 +42,7 @@ S3A_OUTPUT_PREFIX=s3a://commoncrawl-webgraph
42
################################################################################
43
# construct a merged graph of multiple monthly crawls
44
45
-MERGE_NAME=cc-main-2024-jul-aug-sep
+MERGE_NAME=cc-main-2024-aug-sep-oct
46
47
# Naming convention should be the three months' crawls that are
48
# used to generate this graph release. In the event of multiple months
0 commit comments