Alternatives to DataStax

Compare DataStax alternatives for your business or organization using the curated list below. SourceForge ranks the best alternatives to DataStax in 2025. Compare features, ratings, user reviews, pricing, and more from DataStax competitors and alternatives in order to make an informed decision for your business.

  • 1
    MongoDB Atlas
    The most innovative cloud database service on the market, with unmatched data distribution and mobility across AWS, Azure, and Google Cloud, built-in automation for resource and workload optimization, and so much more. MongoDB Atlas is the global cloud database service for modern applications. Deploy fully managed MongoDB across AWS, Google Cloud, and Azure with best-in-class automation and proven practices that guarantee availability, scalability, and compliance with the most demanding data security and privacy standards. The best way to deploy, run, and scale MongoDB in the cloud. MongoDB Atlas offers built-in security controls for all your data. Enable enterprise-grade features to integrate with your existing security protocols and compliance standards. With MongoDB Atlas, your data is protected with preconfigured security features for authentication, authorization, encryption, and more.
    Compare vs. DataStax View Software
    Visit Website
  • 2
    StarTree

    StarTree

    StarTree

    StarTree, powered by Apache Pinot™, is a fully managed real-time analytics platform built for customer-facing applications that demand instant insights on the freshest data. Unlike traditional data warehouses or OLTP databases—optimized for back-office reporting or transactions—StarTree is engineered for real-time OLAP at true scale, meaning: - Data Volume: query performance sustained at petabyte scale - Ingest Rates: millions of events per second, continuously indexed for freshness - Concurrency: thousands to millions of simultaneous users served with sub-second latency With StarTree, businesses deliver always-fresh insights at interactive speed, enabling applications that personalize, monitor, and act in real time.
    Starting Price: Free
  • 3
    ScaleGrid

    ScaleGrid

    ScaleGrid

    ScaleGrid is a fully managed Database-as-a-Service (DBaaS) platform that helps you automate your time-consuming database administration tasks both in the cloud and on-premises. Easily provision, monitor, backup and scale your open source databases with high availability, advanced security, full superuser and SSH access, query analysis, and troubleshooting support to improve the performance of your deployments. Supported databases include: - MySQL - PostgreSQL - Redis™ - MongoDB® database - Greenplum™ (coming soon) The ScaleGrid platform supports both public and private clouds, including AWS, Azure, Google Cloud Platform (GCP), DigitalOcean, Linode, Oracle Cloud Infrastructure (OCI), VMware and OpenStack. Used by thousands of developers, startups, and enterprise customers including Atlassian, Meteor, and Accenture, ScaleGrid handles all your database operations at any scale so you can focus on your application performance.
    Starting Price: $8 per month
  • 4
    Redis

    Redis

    Redis Labs

    Redis Labs: home of Redis. Redis Enterprise is the best version of Redis. Go beyond cache; try Redis Enterprise free in the cloud using NoSQL & data caching with the world’s fastest in-memory database. Run Redis at scale, enterprise grade resiliency, massive scalability, ease of management, and operational simplicity. DevOps love Redis in the Cloud. Developers can access enhanced data structures, a variety of modules, and rapid innovation with faster time to market. CIOs love the confidence of working with 99.999% uptime best in class security and expert support from the creators of Redis. Implement relational databases, active-active, geo-distribution, built in conflict distribution for simple and complex data types, & reads/writes in multiple geo regions to the same data set. Redis Enterprise offers flexible deployment options, cloud on-prem, & hybrid. Redis Labs: home of Redis. Redis JSON, Redis Java, Python Redis, Redis on Kubernetes & Redis gui best practices.
  • 5
    ScyllaDB

    ScyllaDB

    ScyllaDB

    ScyllaDB is the database for data-intensive apps that require high performance and low latency. It enables teams to harness the ever-increasing computing power of modern infrastructures – eliminating barriers to scale as data grows. Unlike any other database, ScyllaDB is a distributed NoSQL database fully compatible with Apache Cassandra and Amazon DynamoDB, yet is built with deep architectural advancements that enable exceptional end-user experiences at radically lower costs. Over 400 game-changing companies like Disney+ Hotstar, Expedia, FireEye, Discord, Zillow, Starbucks, Comcast, and Samsung use ScyllaDB for their toughest database challenges. ScyllaDB is available as free open source software, a fully-supported enterprise product, and a fully managed database-as-a-service (DBaaS) on multiple cloud providers.
  • 6
    Apache Cassandra

    Apache Cassandra

    Apache Software Foundation

    The Apache Cassandra database is the right choice when you need scalability and high availability without compromising performance. Linear scalability and proven fault-tolerance on commodity hardware or cloud infrastructure make it the perfect platform for mission-critical data. Cassandra's support for replicating across multiple datacenters is best-in-class, providing lower latency for your users and the peace of mind of knowing that you can survive regional outages.
  • 7
    Aerospike

    Aerospike

    Aerospike

    Aerospike is the global leader in next-generation, real-time NoSQL data solutions for any scale. Aerospike enterprises overcome seemingly impossible data bottlenecks to compete and win with a fraction of the infrastructure complexity and cost of legacy NoSQL databases. Aerospike’s patented Hybrid Memory Architecture™ delivers an unbreakable competitive advantage by unlocking the full potential of modern hardware, delivering previously unimaginable value from vast amounts of data at the edge, to the core and in the cloud. Aerospike empowers customers to instantly fight fraud; dramatically increase shopping cart size; deploy global digital payment networks; and deliver instant, one-to-one personalization for millions of customers. Aerospike customers include Airtel, Banca d’Italia, Nielsen, PayPal, Snap, Verizon Media and Wayfair. The company is headquartered in Mountain View, Calif., with additional locations in London; Bengaluru, India; and Tel Aviv, Israel.
  • 8
    Instaclustr

    Instaclustr

    Instaclustr

    Instaclustr is the Open Source-as-a-Service company, delivering reliability at scale. We operate an automated, proven, and trusted managed environment, providing database, analytics, search, and messaging. We enable companies to focus internal development and operational resources on building cutting edge customer-facing applications. Instaclustr works with cloud providers including AWS, Heroku, Azure, IBM Cloud, and Google Cloud Platform. The company has SOC 2 certification and provides 24/7 customer support.
    Starting Price: $20 per node per month
  • 9
    Fauna

    Fauna

    Fauna

    Fauna is a data API for modern applications that facilitates rich clients with serverless backends by providing a web-native interface with support for GraphQL and custom business logic, frictionless integration with the serverless ecosystem, a no compromise multi-cloud architecture you can trust and grow with and total freedom from database operations. Instantly create multiple databases in one account leveraging multi-tenancy for development or customer-facing use case. Create a distributed database across one geography or the globe in just three clicks and easily import existing data. Scale seamlessly without ever managing servers, clusters, data partitioning, or replication. Track usage and consumption-based billing in near real time via a dashboard.
    Starting Price: Free
  • 10
    Luna for Apache Cassandra
    Luna is a subscription to the Apache Cassandra support and expertise at DataStax. It allows you to enjoy all the benefits of open-source Cassandra, with the peace of mind knowing you have direct access to the team that authored the majority of the code and supported some of the largest deployments in the world. Best practices, advice, and SLA-based support to keep your Cassandra deployment in top shape. Scale without compromising on performance or latency to seamlessly manage the most demanding real-time workloads. Create real-time and highly-interactive customer experiences with blisteringly fast read and writes. Luna provides assistance with resolving issues and following best practices with Cassandra clusters. Services provide help through the full application life cycle, with a deeper integration in your team working together on implementation.
  • 11
    Astra DB

    Astra DB

    DataStax

    Astra DB from DataStax is vector database for developers that need to get accurate Generative AI applications into production, quickly and efficiently. Built on Apache Cassandra, Astra DB is the only vector database that can make vector updates immediately available to applications and scale to the largest real-time data and streaming workloads, securely on any cloud. Astra DB offers unprecedented serverless, pay as you go pricing and the flexibility of multi-cloud and open-source. You can store up to 80GB and/or perform 20 million operations per month. Securely connect to VPC peering and private links. Manage your encryption keys with your own key management and SAML SSO secure account accessibility. You can deploy on AWS, GCP, or Azure while still maintaining open-source Cassandra compatibility.
  • 12
    Astra Streaming
    Responsive applications keep users engaged and developers inspired. Rise to meet these ever-increasing expectations with the DataStax Astra Streaming service platform. DataStax Astra Streaming is a cloud-native messaging and event streaming platform powered by Apache Pulsar. Astra Streaming allows you to build streaming applications on top of an elastically scalable, multi-cloud messaging and event streaming platform. Astra Streaming is powered by Apache Pulsar, the next-generation event streaming platform which provides a unified solution for streaming, queuing, pub/sub, and stream processing. Astra Streaming is a natural complement to Astra DB. Using Astra Streaming, existing Astra DB users can easily build real-time data pipelines into and out of their Astra DB instances. With Astra Streaming, avoid vendor lock-in and deploy on any of the major public clouds (AWS, GCP, Azure) compatible with open-source Apache Pulsar.
  • 13
    Amazon Keyspaces
    Amazon Keyspaces (for Apache Cassandra) is a scalable, highly available, and managed Apache Cassandra–compatible database service. With Amazon Keyspaces, you can run your Cassandra workloads on AWS using the same Cassandra application code and developer tools that you use today. You don’t have to provision, patch, or manage servers, and you don’t have to install, maintain, or operate software. Amazon Keyspaces is serverless, so you pay for only the resources you use and the service can automatically scale tables up and down in response to application traffic. You can build applications that serve thousands of requests per second with virtually unlimited throughput and storage. Amazon Keyspaces gives you the performance, elasticity, and enterprise features you need to operate business-critical Cassandra workloads at scale. Process data at high speeds for applications that require single-digit-millisecond latency, such as industrial equipment maintenance, trade monitoring, etc.
  • 14
    Couchbase Capella
    Couchbase Capella is a cloud database platform that optimizes data storage and processing for both JSON-native operational and analytical data. A faster, better DBaaS for transaction, search, AI, and edge applications, with real-time analytics. Start easily with Capella by using SQL to query your JSON data, and get natural language coding assistance from Capella iQ. Capella is a multipurpose database supporting operational and analytic workloads and use cases, including JSON analytics, vector search, and mobile synchronization. It delivers millisecond response at scale, with the best price-performance of any document Database-as-a-Service. Try for free today and be up and running in minutes. Whether you're tackling development, architectural, or deployment challenges, there are many great reasons to future-proof with Capella.
  • 15
    Hawkular Metrics

    Hawkular Metrics

    Hawkular Metrics

    Hawkular Metrics is a scalable, asynchronous, multi tenant, long term metrics storage engine that uses Cassandra as the data store and REST as the primary interface. This section provides an overview of some of the key features of Hawkular Metrics. The following sections provide in-depth discussions on these as well as other features. Hawkular Metrics is all about scalability. You can run a single instance backed by a single Cassandra node. You can also scale out Cassandra to multiple nodes to handle increasing loads. The Hawkular Metrics server employs a stateless architecture, which makes it easy to scale out as well. This diagram illustrates the various deployment options made possible with Hawkular Metrics' scalable architecture. The upper left shows the simplest deployment with a single Cassandra node and single Hawkular Metrics node. The bottom right picture shows that it is possible to run more Hawkular Metrics nodes than Cassandra nodes.
  • 16
    HugeGraph

    HugeGraph

    HugeGraph

    HugeGraph is a fast-speed and highly-scalable graph database. Billions of vertices and edges can be easily stored into and queried from HugeGraph due to its excellent OLTP ability. As compliance to Apache TinkerPop 3 framework, various complicated graph queries can be accomplished through Gremlin (a powerful graph traversal language). Among its features, it provides compliance to Apache TinkerPop 3, supporting Gremlin. Schema Metadata Management, including VertexLabel, EdgeLabel, PropertyKey and IndexLabel. Multi-type Indexes, supporting exact query, range query and complex conditions combination query. Plug-in Backend Store Driver Framework, supporting RocksDB, Cassandra, ScyllaDB, HBase and MySQL now and easy to add other backend store driver if needed. Integration with Hadoop/Spark. HugeGraph relies on the TinkerPop framework, we refer to the storage structure of Titan and the schema definition of DataStax.
  • 17
    Azure Cosmos DB
    Azure Cosmos DB is a fully managed NoSQL database service for modern app development with guaranteed single-digit millisecond response times and 99.999-percent availability backed by SLAs, automatic and instant scalability, and open source APIs for MongoDB and Cassandra. Enjoy fast writes and reads anywhere in the world with turnkey multi-master global distribution. Reduce time to insight by running near-real time analytics and AI on the operational data within your Azure Cosmos DB NoSQL database. Azure Synapse Link for Azure Cosmos DB seamlessly integrates with Azure Synapse Analytics without data movement or diminishing the performance of your operational data store.
  • 18
    CrateDB

    CrateDB

    CrateDB

    The enterprise database for time series, documents, and vectors. Store any type of data and combine the simplicity of SQL with the scalability of NoSQL. CrateDB is an open source distributed database running queries in milliseconds, whatever the complexity, volume and velocity of data.
  • 19
    Apache Druid
    Apache Druid is an open source distributed data store. Druid’s core design combines ideas from data warehouses, timeseries databases, and search systems to create a high performance real-time analytics database for a broad range of use cases. Druid merges key characteristics of each of the 3 systems into its ingestion layer, storage format, querying layer, and core architecture. Druid stores and compresses each column individually, and only needs to read the ones needed for a particular query, which supports fast scans, rankings, and groupBys. Druid creates inverted indexes for string values for fast search and filter. Out-of-the-box connectors for Apache Kafka, HDFS, AWS S3, stream processors, and more. Druid intelligently partitions data based on time and time-based queries are significantly faster than traditional databases. Scale up or down by just adding or removing servers, and Druid automatically rebalances. Fault-tolerant architecture routes around server failures.
  • 20
    Apache Giraph

    Apache Giraph

    Apache Software Foundation

    Apache Giraph is an iterative graph processing system built for high scalability. For example, it is currently used at Facebook to analyze the social graph formed by users and their connections. Giraph originated as the open-source counterpart to Pregel, the graph processing architecture developed at Google and described in a 2010 paper. Both systems are inspired by the Bulk Synchronous Parallel model of distributed computation introduced by Leslie Valiant. Giraph adds several features beyond the basic Pregel model, including master computation, sharded aggregators, edge-oriented input, out-of-core computation, and more. With a steady development cycle and a growing community of users worldwide, Giraph is a natural choice for unleashing the potential of structured datasets at a massive scale. Apache Giraph is an iterative graph processing framework, built on top of Apache Hadoop.
  • 21
    ArangoDB

    ArangoDB

    ArangoDB

    Natively store data for graph, document and search needs. Utilize feature-rich access with one query language. Map data natively to the database and access it with the best patterns for the job – traversals, joins, search, ranking, geospatial, aggregations – you name it. Polyglot persistence without the costs. Easily design, scale and adapt your architectures to changing needs and with much less effort. Combine the flexibility of JSON with semantic search and graph technology for next generation feature extraction even for large datasets.
  • 22
    Apache Spark

    Apache Spark

    Apache Software Foundation

    Apache Spark™ is a unified analytics engine for large-scale data processing. Apache Spark achieves high performance for both batch and streaming data, using a state-of-the-art DAG scheduler, a query optimizer, and a physical execution engine. Spark offers over 80 high-level operators that make it easy to build parallel apps. And you can use it interactively from the Scala, Python, R, and SQL shells. Spark powers a stack of libraries including SQL and DataFrames, MLlib for machine learning, GraphX, and Spark Streaming. You can combine these libraries seamlessly in the same application. Spark runs on Hadoop, Apache Mesos, Kubernetes, standalone, or in the cloud. It can access diverse data sources. You can run Spark using its standalone cluster mode, on EC2, on Hadoop YARN, on Mesos, or on Kubernetes. Access data in HDFS, Alluxio, Apache Cassandra, Apache HBase, Apache Hive, and hundreds of other data sources.
  • 23
    JanusGraph

    JanusGraph

    JanusGraph

    JanusGraph is a scalable graph database optimized for storing and querying graphs containing hundreds of billions of vertices and edges distributed across a multi-machine cluster. JanusGraph is a project under The Linux Foundation, and includes participants from Expero, Google, GRAKN.AI, Hortonworks, IBM and Amazon. Elastic and linear scalability for a growing data and user base. Data distribution and replication for performance and fault tolerance. Multi-datacenter high availability and hot backups. All functionality is totally free. No need to buy commercial licenses. JanusGraph is fully open source under the Apache 2 license. JanusGraph is a transactional database that can support thousands of concurrent users executing complex graph traversals in real time. Support for ACID and eventual consistency. In addition to online transactional processing (OLTP), JanusGraph supports global graph analytics (OLAP) with its Apache Spark integration.
  • 24
    SkySQL

    SkySQL

    SkySQL

    SkySQL is a multi-cloud, serverless database-as-a-service (DBaaS) platform designed to help businesses streamline their MySQL and MariaDB cloud database management with zero waste, instant auto-scaling capabilities, and 99.995% uptime. SkySQL offers serverless databases that save 80% of traditional database costs with pay-as-you-go pricing that scales according to demand, ensuring businesses only pay for what they use. With advanced features like SkyAI agents for building natural language interfaces to your database into your apps and providing AI-assisted performance optimization, businesses can easily handle complex workloads, minimize security risks, and accelerate AI application development. SkySQL offers both cost savings and high performance, on-demand scalability, and reliability, making it ideal for modern applications and growing businesses.
    Starting Price: $28/month
  • 25
    Amazon MSK
    Amazon Managed Streaming for Apache Kafka (Amazon MSK) is a fully managed service that makes it easy for you to build and run applications that use Apache Kafka to process streaming data. Apache Kafka is an open-source platform for building real-time streaming data pipelines and applications. With Amazon MSK, you can use native Apache Kafka APIs to populate data lakes, stream changes to and from databases, and power machine learning and analytics applications. Apache Kafka clusters are challenging to setup, scale, and manage in production. When you run Apache Kafka on your own, you need to provision servers, configure Apache Kafka manually, replace servers when they fail, orchestrate server patches and upgrades, architect the cluster for high availability, ensure data is durably stored and secured, setup monitoring and alarms, and carefully plan scaling events to support load changes.
    Starting Price: $0.0543 per hour
  • 26
    Apache Kafka

    Apache Kafka

    The Apache Software Foundation

    Apache Kafka® is an open-source, distributed streaming platform. Scale production clusters up to a thousand brokers, trillions of messages per day, petabytes of data, hundreds of thousands of partitions. Elastically expand and contract storage and processing. Stretch clusters efficiently over availability zones or connect separate clusters across geographic regions. Process streams of events with joins, aggregations, filters, transformations, and more, using event-time and exactly-once processing. Kafka’s out-of-the-box Connect interface integrates with hundreds of event sources and event sinks including Postgres, JMS, Elasticsearch, AWS S3, and more. Read, write, and process streams of events in a vast array of programming languages.
  • 27
    ClusterControl

    ClusterControl

    Severalnines

    ClusterControl is a hybrid, multi-cloud database ops orchestration platform for MongoDB, Elasticsearch, Redis, TimescaleDB, SQL Server on Linux, Galera Cluster, PostgreSQL, and MySQL in on-premises, cloud, and hybrid environments. It handles full-lifecycle operations, from deployment to failover, backup and more. With its full suite of databases, ops features and ability to be deployed in any environment, it enables organizations to implement the Sovereign DBaaS concept. ClusterControl is perfect for organizations that need to reliably run large-scale, open-source database operations but don't want to be limited by traditional DBaaS providers in environment choice, open-source license stability, and DB access.
    Starting Price: €250/node/month
  • 28
    Hazelcast

    Hazelcast

    Hazelcast

    In-Memory Computing Platform. The digital world is different. Microseconds matter. That's why the world's largest organizations rely on us to power their most time-sensitive applications at scale. New data-enabled applications can deliver transformative business power – if they meet today’s requirement of immediacy. Hazelcast solutions complement virtually any database to deliver results that are significantly faster than a traditional system of record. Hazelcast’s distributed architecture provides redundancy for continuous cluster up-time and always available data to serve the most demanding applications. Capacity grows elastically with demand, without compromising performance or availability. The fastest in-memory data grid, combined with third-generation high-speed event processing, delivered through the cloud.
  • 29
    Google Cloud Bigtable
    Google Cloud Bigtable is a fully managed, scalable NoSQL database service for large analytical and operational workloads. Fast and performant: Use Cloud Bigtable as the storage engine that grows with you from your first gigabyte to petabyte-scale for low-latency applications as well as high-throughput data processing and analytics. Seamless scaling and replication: Start with a single node per cluster, and seamlessly scale to hundreds of nodes dynamically supporting peak demand. Replication also adds high availability and workload isolation for live serving apps. Simple and integrated: Fully managed service that integrates easily with big data tools like Hadoop, Dataflow, and Dataproc. Plus, support for the open source HBase API standard makes it easy for development teams to get started.
  • 30
    Azure Managed Instance for Apache Cassandra
    Cost-effectively run mission-critical workloads at scale with Azure Managed Instance for Apache Cassandra. Easily manage changing demands with multiple resource and data replication options. Ensure business continuity with zero downtime scalability for hybrid and cloud deployments. Develop applications faster using familiar and fully compatible Cassandra tools and languages. Free yourself from infrastructure management without compromising security. Run your workloads on a managed, secure service to streamline operations with automated repairs, patches, and updates. Make your database more durable and resilient with automatic backups and disaster recovery. Retain the flexibility and control of your hardware configuration with turnkey scaling services and hybrid deployment options. An instance-based pricing model enables you to define the number of CPU cores, virtual machines SKU, and memory/disk space needs.
    Starting Price: $0.911 per hour
  • 31
    Greenplum

    Greenplum

    Greenplum Database

    Greenplum Database® is an advanced, fully featured, open source data warehouse. It provides powerful and rapid analytics on petabyte scale data volumes. Uniquely geared toward big data analytics, Greenplum Database is powered by the world’s most advanced cost-based query optimizer delivering high analytical query performance on large data volumes. Greenplum Database® project is released under the Apache 2 license. We want to thank all our current community contributors and are interested in all new potential contributions. For the Greenplum Database community no contribution is too small, we encourage all types of contributions. An open-source massively parallel data platform for analytics, machine learning and AI. Rapidly create and deploy models for complex applications in cybersecurity, predictive maintenance, risk management, fraud detection, and many other areas. Experience the fully featured, integrated, open source analytics platform.
  • 32
    IBM Event Streams
    IBM Event Streams is a fully managed event streaming platform built on Apache Kafka, designed to help enterprises process and respond to real-time data streams. With capabilities for machine learning integration, high availability, and secure cloud deployment, it enables organizations to create intelligent applications that react to events as they happen. The platform supports multi-cloud environments, disaster recovery, and geo-replication, making it ideal for mission-critical workloads. IBM Event Streams simplifies building and scaling real-time, event-driven solutions, ensuring data is processed quickly and efficiently.
  • 33
    Red Hat OpenShift Streams
    Red Hat® OpenShift® Streams for Apache Kafka is a managed cloud service that provides a streamlined developer experience for building, deploying, and scaling new cloud-native applications or modernizing existing systems. Red Hat OpenShift Streams for Apache Kafka makes it easy to create, discover, and connect to real-time data streams no matter where they are deployed. Streams are a key component for delivering event-driven and data analytics applications. The combination of seamless operations across distributed microservices, large data transfer volumes, and managed operations allows teams to focus on team strengths, speed up time to value, and lower operational costs. OpenShift Streams for Apache Kafka includes a Kafka ecosystem and is part of a family of cloud services—and the Red Hat OpenShift product family—which helps you build a wide range of data-driven solutions.
  • 34
    ArcadeDB

    ArcadeDB

    ArcadeDB

    Manage complex models using ArcadeDB without any compromise. Forget about Polyglot Persistence. no need for multiple databases. You can store graphs, documents, key values and time series all in one ArcadeDB Multi-Model database. Since each model is native to the database engine, you don't have to worry about translations slowing you down. ArcadeDB's engine was built with Alien Technology. It's able to crunch millions of records per second. With ArcadeDB, the traversing speed is not affected by the database size. It is always constant, whether your database has a few records or billions. ArcadeDB can work as an embedded database, on a single server and can scale up using multiple servers with Kubernetes. Flexible enough to run on any platform with a small footprint. Your data is secure. Our unbreakable fully transactional engine assures durability for mission-critical production databases. ArcadeDB uses a Raft Consensus Algorithm to maintain consistency across multiple servers.
    Starting Price: Free
  • 35
    OrientDB
    OrientDB is the world’s fastest graph database. Period. An independent benchmark study by IBM and the Tokyo Institute of Technology showed that OrientDB is 10x faster than Neo4j on graph operations among all the workloads. Drive competitive advantage and accelerate innovation with new revenue streams.
  • 36
    Riak TS
    Riak® TS is the only enterprise-grade NoSQL time series database optimized specifically for IoT and Time Series data. It ingests, transforms, stores, and analyzes massive amounts of time series data. Riak TS is engineered to be faster than Cassandra. The Riak TS masterless architecture is designed to read and write data even in the event of hardware failures or network partitions. Data is evenly distributed across the Riak ring and, by default, there are three replicas of your data. This ensures at least one copy of your data is available for read operations. Riak TS is a distributed system with no central coordinator. It is easy to set up and operate. The masterless architecture makes it easy to add and remove nodes from a cluster. The masterless architecture of Riak TS makes it easy to add and remove nodes from your cluster. You can achieve predictable and near-linear scale by adding nodes using commodity hardware.
  • 37
    KairosDB

    KairosDB

    KairosDB

    Data can be pushed in KairosDB via multiple protocols like Telnet, Rest and Graphite. Other mechanisms such as plugins can also be used. KairosDB stores time series in Cassandra, the popular and performant NoSQL datastore. The schema consists of 3 column families. This API provides operations to list existing metric names, list tag names and values, store metric data points, and query for metric data points. With a default install, KairosDB serve up a query page whereby you can query data within the data store. It's designed primarily for development purposes. Aggregators perform an operation on data points and down samples. Standard functions like min, max, sum, count, mean and more are available. Import and export is available on the KairosDB server from the command line. Internal metrics to the data store can monitor the server’s performance.
  • 38
    Apache TinkerPop

    Apache TinkerPop

    Apache Software Foundation

    Apache TinkerPop™ is a graph computing framework for both graph databases (OLTP) and graph analytic systems (OLAP). Gremlin is the graph traversal language of Apache TinkerPop. Gremlin is a functional, data-flow language that enables users to succinctly express complex traversals on (or queries of) their application's property graph. Every Gremlin traversal is composed of a sequence of (potentially nested) steps. A graph is a structure composed of vertices and edges. Both vertices and edges can have an arbitrary number of key/value pairs called properties. Vertices denote discrete objects such as a person, a place, or an event. Edges denote relationships between vertices. For instance, a person may know another person, have been involved in an event, and/or have recently been at a particular place. If a user's domain is composed of a heterogeneous set of objects (vertices) that can be related to one another in a multitude of ways (edges).
    Starting Price: Free
  • 39
    Stratoscale

    Stratoscale

    Stratoscale

    Stratoscale enables IT to operate as an internal cloud service provider, so that IT can offer single-click / on-demand consumption of resources and managed open source technologies, and respond at the speed of the public cloud. Stratoscale’s multi-cloud hybrid platform delivers true infrastructure-as-a-service, ideal for DevOps and Infrastructure-as-Code automation. With single-panel monitoring and scaling of all resources and services as well as automation of the entire life cycle of open source technologies, IT can now shift focus to business specific innovation, governance and compliance. APIs are the best way to consume open-source technologies. But different cloud environments offer different sets of APIs which adds a significant layer of complexity and prevents DevOps professional from maximizing their potential. Stratoscale delivers Infrastructure-as-a-Service (IaaS),Database-as- a-Service (DBaaS).
  • 40
    BangDB

    BangDB

    BangDB

    BangDB natively integrates AI, streaming, graph, analytics within the DB itself to enable users to deal with complex data of different kinds, such as text, images, videos, objects etc. for real time data processing and analysis Ingest or stream any data, process it, train models, do prediction, find patterns, take action and automate all these to enable use cases such as IOT monitoring, fraud or disruption prevention, log analysis, lead generation, 1-on-1 personalisation and many more. Today’s use cases require different kinds of data to be ingested, processed, and queried at the same time for a given problem. BangDB supports most of the useful data formats to allow user to solve the problem in a simple manner. Rise of real time data pushes for real time streaming and predictive data analytics for advanced and optimized business operations.
    Starting Price: $2,499 per year
  • 41
    Amazon EMR
    Amazon EMR is the industry-leading cloud big data platform for processing vast amounts of data using open-source tools such as Apache Spark, Apache Hive, Apache HBase, Apache Flink, Apache Hudi, and Presto. With EMR you can run Petabyte-scale analysis at less than half of the cost of traditional on-premises solutions and over 3x faster than standard Apache Spark. For short-running jobs, you can spin up and spin down clusters and pay per second for the instances used. For long-running workloads, you can create highly available clusters that automatically scale to meet demand. If you have existing on-premises deployments of open-source tools such as Apache Spark and Apache Hive, you can also run EMR clusters on AWS Outposts. Analyze data using open-source ML frameworks such as Apache Spark MLlib, TensorFlow, and Apache MXNet. Connect to Amazon SageMaker Studio for large-scale model training, analysis, and reporting.
  • 42
    BigLake

    BigLake

    Google

    BigLake is a storage engine that unifies data warehouses and lakes by enabling BigQuery and open-source frameworks like Spark to access data with fine-grained access control. BigLake provides accelerated query performance across multi-cloud storage and open formats such as Apache Iceberg. Store a single copy of data with uniform features across data warehouses & lakes. Fine-grained access control and multi-cloud governance over distributed data. Seamless integration with open-source analytics tools and open data formats. Unlock analytics on distributed data regardless of where and how it’s stored, while choosing the best analytics tools, open source or cloud-native over a single copy of data. Fine-grained access control across open source engines like Apache Spark, Presto, and Trino, and open formats such as Parquet. Performant queries over data lakes powered by BigQuery. Integrates with Dataplex to provide management at scale, including logical data organization.
    Starting Price: $5 per TB
  • 43
    Conduktor

    Conduktor

    Conduktor

    We created Conduktor, the all-in-one friendly interface to work with the Apache Kafka ecosystem. Develop and manage Apache Kafka with confidence. With Conduktor DevTools, the all-in-one Apache Kafka desktop client. Develop and manage Apache Kafka with confidence, and save time for your entire team. Apache Kafka is hard to learn and to use. Made by Kafka lovers, Conduktor best-in-class user experience is loved by developers. Conduktor offers more than just an interface over Apache Kafka. It provides you and your teams the control of your whole data pipeline, thanks to our integration with most technologies around Apache Kafka. Provide you and your teams the most complete tool on top of Apache Kafka.
  • 44
    OrigoDB

    OrigoDB

    Origo

    OrigoDB enables you to build high quality, mission critical systems with real-time performance at a fraction of the time and cost. This is not marketing gibberish! Please read on for a no nonsense description of our features. Get in touch if you have questions or download and try it out today! In-memory operations are orders of magnitude faster than disk operations. A single OrigoDB engine can execute millions of read transactions per second and thousands of write transactions per second with synchronous command journaling to a local SSD. This is the #1 reason we built OrigoDB. A single object oriented domain model is far simpler than the full stack including a relational model, object/relational mapping, data access code, views and stored procedures. That's a lot of waste that can be eliminated! The OrigoDB engine is 100% ACID out of the box. Commands execute one at a time, transitioning the in-memory model from one consistent state to the next.
    Starting Price: €200 per GB RAM per server
  • 45
    Apache HBase

    Apache HBase

    The Apache Software Foundation

    Use Apache HBase™ when you need random, realtime read/write access to your Big Data. This project's goal is the hosting of very large tables -- billions of rows X millions of columns -- atop clusters of commodity hardware. Automatic failover support between RegionServers. Easy to use Java API for client access. Thrift gateway and a REST-ful Web service that supports XML, Protobuf, and binary data encoding options. Support for exporting metrics via the Hadoop metrics subsystem to files or Ganglia; or via JMX.
  • 46
    CelerData Cloud
    CelerData is a high-performance SQL engine built to power analytics directly on data lakehouses, eliminating the need for traditional data‐warehouse ingestion pipelines. It delivers sub-second query performance at scale, supports on-the‐fly JOINs without costly denormalization, and simplifies architecture by allowing users to run demanding workloads on open format tables. Built on the open source engine StarRocks, the platform outperforms legacy query engines like Trino, ClickHouse, and Apache Druid in latency, concurrency, and cost-efficiency. With a cloud-managed service that runs in your own VPC, you retain infrastructure control and data ownership while CelerData handles maintenance and optimization. The platform is positioned to power real-time OLAP, business intelligence, and customer-facing analytics use cases and is trusted by enterprise customers (including names such as Pinterest, Coinbase, and Fanatics) who have achieved significant latency reductions and cost savings.
  • 47
    Hackolade

    Hackolade

    Hackolade

    Hackolade Studio is a powerful data modeling platform that supports a wide range of technologies including relational SQL and NoSQL databases, cloud data warehouses, APIs, streaming platforms, and data exchange formats. Designed for modern data architecture, it enables users to visually design, document, and evolve schemas across systems like Oracle, PostgreSQL, Databricks, Snowflake, MongoDB, Cassandra, DynamoDB, Neo4j, Kafka (with Confluent Schema Registry), OpenAPI, GraphQL, and more. Hackolade Studio offers forward and reverse engineering, schema versioning, model validation, and integration with metadata catalogs such as Unity Catalog and Collibra. It empowers data architects, engineers, and governance teams to collaborate on consistent, governed, and scalable data models. Whether building data products, managing API contracts, or ensuring regulatory compliance, Hackolade Studio streamlines the process in one unified interface.
    Starting Price: €175 per month
  • 48
    Keen

    Keen

    Keen.io

    Keen is the fully managed event streaming platform. Built upon trusted Apache Kafka, we make it easier than ever for you to collect massive volumes of event data with our real-time data pipeline. Use Keen’s powerful REST API and SDKs to collect event data from anything connected to the internet. Our platform allows you to store your data securely decreasing your operational and delivery risk with Keen. With storage infrastructure powered by Apache Cassandra, data is totally secure through transfer through HTTPS and TLS, then stored with multi-layer AES encryption. Once data is securely stored, utilize our Access Keys to be able to present data in arbitrary ways without having to re-architect your security or data model. Or, take advantage of Role-based Access Control (RBAC), allowing for completely customizable permission tiers, down to specific data points or queries.
    Starting Price: $149 per month
  • 49
    Back4app

    Back4app

    Back4app

    Store and query relational data on the cloud. Make it accessible over GraphQL and REST with a scalable, open-source backend. Store and query relational data through GraphQL, REST, or Parse SDKs. Add business logic using cloud functions. Make apps faster without managing infrastructure. Build relational data models in minutes, and perform complex relational queries in a truly serverless platform. Our engineer team is ready to guide you on each step of the process. From database to cloud functions we help you to migrate in the smoothest way. Keep costs in check even when you scale your apps with affordable, predictable pricing. Scaling happens automatically and, we guide the process to ensure that nothing goes wrong. Build your app exactly the way you'd like, without getting bogged down in the backend. Of course, you do. Keeping data safe is more important than ever.
    Starting Price: $4.99 per month
  • 50
    Navisite DBaaS
    Deploy, monitor, provision and scale your favorite databases on AWS and Azure using Navisite’s fully managed database as a service (DBaaS) platform. With Navisite’s DBaaS, you get more than a powerful, high-speed service delivery platform. You get the expertise, services and scale you need to get ahead and stay ahead. Our fully managed DBaaS combines cloud management and automation with turnkey DBA services to reduce operating costs, speed time to market and simplify your journey to the cloud. Benefit from on-demand provisioning in the cloud, with seamless scaling and high availability via auto-failover on production-ready plans. Enable an additional layer of security with SSL-based access, and encrypt your data and backups "at-rest." Provision a database instantly, anywhere in the world, any time. Get scheduled and on-demand backups or snapshots with various restoration options for your data.
    Starting Price: $45 per month