A real-time data processing pipeline that ingests streaming sales data from Apache Kafka, parses and aggregates the data using Apache Spark Structured Streaming, and writes the results to a PostgreSQL database. The system supports scalable, fault-tolerant micro-batch processing and enables near real-time analytics on product sales.
-
Notifications
You must be signed in to change notification settings - Fork 0
A real-time data processing pipeline that ingests streaming sales data from Apache Kafka, parses and aggregates the data using Apache Spark Structured Streaming, and writes the results to a PostgreSQL database. The system supports scalable, fault-tolerant micro-batch processing and enables near real-time analytics on product sales.
License
tayostats/Real-Time-Sales-Data-Processing
Folders and files
Name | Name | Last commit message | Last commit date | |
---|---|---|---|---|
Repository files navigation
About
A real-time data processing pipeline that ingests streaming sales data from Apache Kafka, parses and aggregates the data using Apache Spark Structured Streaming, and writes the results to a PostgreSQL database. The system supports scalable, fault-tolerant micro-batch processing and enables near real-time analytics on product sales.
Resources
License
Stars
Watchers
Forks
Releases
No releases published
Packages 0
No packages published