Skip to content

[Draft] Filter images before clustering #858

New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Draft
wants to merge 17 commits into
base: deployments/ood.antenna.insectai.org
Choose a base branch
from

Conversation

mihow
Copy link
Collaborator

@mihow mihow commented May 25, 2025

Work by @Yuyan-C

Copy link

netlify bot commented May 25, 2025

Deploy Preview for antenna-preview failed. Why did it fail? →

Name Link
🔨 Latest commit f1a52ef
🔍 Latest deploy log https://app.netlify.com/projects/antenna-preview/deploys/68a6b7334aa3900008012a35

Copy link

netlify bot commented May 25, 2025

Deploy Preview for antenna-ood failed. Why did it fail? →

Name Link
🔨 Latest commit f1a52ef
🔍 Latest deploy log https://app.netlify.com/projects/antenna-ood/deploys/68a6b731138e830008ace878

@mihow mihow changed the title [Draft] Incorporate size information into clustering [Draft] Filter images before clustering Jul 7, 2025
@mihow
Copy link
Collaborator Author

mihow commented Jul 7, 2025

I suggest we open a new branch from main and pull in several of these filters for the regular classification task (and keep clustering it's own experimental branch)

Filters:

  • Filter crops that are too small
  • Filter crops that are too blurry
  • Filter crops on the edge

These can be first implemented in a post-processing job, which can be run on existing predictions. It will add a prediction to existing occurrences and update their determination. If the occurrence does not pass the filters, a new prediction should be added "Not identifiable".

Then we can consider when & how it is run in the existing pipeline. I suggest this pattern for all post-processing steps (regional class masking, rank predictions, etc)

After the post-processing implementation, look into adding these filters to the prediction service itself (the AMI Data Companion). And adjust the confidence score or reject the prediction at the source, before sending to Antenna. Do users want to see blurry & small images with low confidence, or do they want them hidden all together? (no classification & not included in response). Or return the unidentifiable image with sore of 0 and taxon "Not identifiable".

@mihow mihow assigned vanessavmac and mihow and unassigned vanessavmac Jul 16, 2025
@mihow mihow added this to the Post Panama feature integration milestone Jul 16, 2025
@mihow mihow force-pushed the deployments/ood.antenna.insectai.org branch from a011a5f to 3528f27 Compare August 21, 2025 02:22
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

3 participants