Skip to content

Latest commit

 

History

History
45 lines (38 loc) · 2.52 KB

batch-processing.md

File metadata and controls

45 lines (38 loc) · 2.52 KB

Bookmarks tagged [batch-processing]

https://beam.apache.org/

Apache Beam is an open source, unified model and set of language-specific SDKs for defining and executing data processing workflows, and also data ingestion and integration flows, supporting Enterpris...


https://pypi.python.org/pypi/pyspark/

Apache Spark Python API.


https://github.com/dask/dask

A flexible parallel computing library for analytic computing.


https://github.com/spotify/luigi

A module that helps you build complex pipelines of batch jobs.


https://github.com/Yelp/mrjob

Run MapReduce jobs on Hadoop or Amazon Web Services.


https://github.com/ray-project/ray/

A system for parallel and distributed Python that unifies the machine learning ecosystem.