State of the Art Natural Language Processing
A unified analytics engine for large-scale data processing
A Spark library for Amazon SageMaker
Apache Spark to Apache Cassandra connector
Deequ is a library built on top of Apache Spark
Apache Kyuubi is a distributed and multi-tenant gateway
Simple and distributed Machine Learning
A Scala API for Apache Beam and Google Cloud Dataflow
A Scala kernel for Jupyter
A scalable, unified data and AI engineering platform for enterprise
Memory optimized analytics database, based on Apache Spark
Spark Cool Play: Spark source code analysis, Spark class library, etc.
REST job server for Apache Spark
Reading OpenStreetMap Pbf files.
SZT‑bigdata is an open source project
TransmogrifAI (pronounced trăns-mŏgˈrə-fī) is an AutoML library
osDQ dedicated to create apache spark based data pipeline using JSON
Apache Spark Connector for Azure Cosmos DB
Machine learning server for building predictive applications
Machine learning server for developers and ML engineers