Skip to content
#

cleansing

Here are 15 public repositories matching this topic...

Raw data of real analytical use cases in a number of industries and companies are frequently provided in an Excel-based form. These files usually cannot be processed directly in machine learning models, but must first be cleaned and preprocessed. In this process, many different types of pitfalls may occur. This makes data preprocessing an essent…

  • Updated Apr 20, 2020
  • Jupyter Notebook

A collection of data analysis projects using SQL and Python in Databricks and SQL server management studio. Each project include the automation SQL code and brief description of the data analysis that I performed and the value it created.

  • Updated Aug 12, 2025

Improve this page

Add a description, image, and links to the cleansing topic page so that developers can more easily learn about it.

Curate this topic

Add this topic to your repo

To associate your repository with the cleansing topic, visit your repo's landing page and select "manage topics."

Learn more