Skip to content

A real-time data processing pipeline that ingests streaming sales data from Apache Kafka, parses and aggregates the data using Apache Spark Structured Streaming, and writes the results to a PostgreSQL database. The system supports scalable, fault-tolerant micro-batch processing and enables near real-time analytics on product sales.

License

Notifications You must be signed in to change notification settings

tayostats/Real-Time-Sales-Data-Processing

Repository files navigation

Real-Time-Sales-Data-Processing

A real-time data processing pipeline that ingests streaming sales data from Apache Kafka, parses and aggregates the data using Apache Spark Structured Streaming, and writes the results to a PostgreSQL database. The system supports scalable, fault-tolerant micro-batch processing and enables near real-time analytics on product sales.

About

A real-time data processing pipeline that ingests streaming sales data from Apache Kafka, parses and aggregates the data using Apache Spark Structured Streaming, and writes the results to a PostgreSQL database. The system supports scalable, fault-tolerant micro-batch processing and enables near real-time analytics on product sales.

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages