Skip to content

Test crawler performance #9

@SebastianZimmeck

Description

@SebastianZimmeck

Before we start the crawl, we need to test the crawler's performance. So, we need to compare the manually observed groundtruth with the analysis results. We probably need a 100-site test set.

  • How do we select the test set sites given the different locations and states (issue Create Manually Curated List of Sites to Crawl #7) so that we have good test coverage?
  • One issue is that different loads of a site may lead to different trackers etc. detected. So, we need to look for the groundtruth and analysis results at exactly the same site load. So, maybe, just load one site, get both groundtruth and analysis results and check?
  • We need to document all of that

(@JoeChampeau and @jjeancharles feel free to participate here as well.)

Metadata

Metadata

Labels

omnibusAn issue that covers multiple connected (smaller) sub-issuestestingAn issue related to testing

Type

No type

Projects

No projects

Milestone

No milestone

Relationships

None yet

Development

No branches or pull requests

Issue actions