-
Notifications
You must be signed in to change notification settings - Fork 89
Feat: add recent activity as ML feature #1173
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
base: staging
Are you sure you want to change the base?
Conversation
…or all properties for ML feature engineering
The latest updates on your projects. Learn more about Vercel for Git ↗︎
|
This PR has been marked as stale because it has been open for 7 days with no activity. |
|
||
|
||
def fetch_recent_activity(query: str) -> pd.DataFrame: | ||
response = requests.get("https://phl.carto.com/api/v2/sql", params={"q": query}) |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
We already should have a class that is capable of grabbing a carto query and converting the results to a df. Is there are reason it didn't work for this case?
I'd like to keep with using that class rather than duplicating the functionality.
Generally, this looks ready to go. I just had one thing I think we should change regarding the carto API call. |
This PR has been marked as stale because it has been open for 7 days with no activity. |
Add recent activity for ML features
Description
This PR adds permits, business licenses, and appeals data for each property in the dataset, tracking whether there has ever been such a record associated with a given
opa_id
, as well as the date of the most recent record and the days since that date (to the date of the pipeline run). My hunch is that some combination of these features will significantly improve our ability to identify vacant properties with ML.