The document outlines Airbnb's approach to building data products using Spark, highlighting their data infrastructure that integrates both streaming and batch processing. It describes the architecture, data sources, computation flows, and technology used, including HBase, Kafka, and various query engines. Key features include a unified API for processing and a shared global state store, enabling efficient data handling across multiple use cases.
Related topics: