PySpark-Boilerplate
A boilerplate for writing PySpark Jobs
pythonboilerplateapache-sparkpyspark
FreeRepo
Overview
PySpark-Boilerplate is a template for building production-ready PySpark jobs with structured code organization and best practices. It provides a foundation for data processing workflows, including configuration management, logging and testing patterns, making it suitable for teams building scalable batch processing and ETL pipelines on Apache Spark.
Features
pyspark-jobsproduction-grade-setupbest-practices
Feature Flags
blogjobsQueue
Recommended Use Cases
data-processingbig-data-analyticsspark-jobs
Frontend
None
Backend
apache-sparkpyspark
Auth Providers
None
Deployment Targets
None
Payment Providers
None
Quick Facts
โญ Stars
394
๐ด Forks
154
๐ Active
Unknown
๐ Last Commit
2024-01-21T06:57:52.000Z
GitHub
Stack
Framework
apache-spark
Language
python
Data Layer
UI Stack
Developer Experience
Docker
No
Tests
No
Quickstart
No
env.example
No
Pricing
Classification
free
Selected
โ
Notes
No clear pricing signals