Apache DataFusion’s cover photo
Apache DataFusion

Apache DataFusion

Software Development

Apache DataFusion is a fast, feature rich and extensible query engine built on the Apache Arrow memory model.

About us

Apache DataFusion is a fast, feature rich and extensible query engine built on the Apache Arrow memory model. “Out of the box,” DataFusion offers SQL and Dataframe APIs, excellent performance, built-in support for CSV, Parquet, JSON, and Avro, extensive customization, and a great community. Python Bindings are also available. DataFusion features a full query planner, a columnar, streaming, multi-threaded, vectorized execution engine, and partitioned data sources. You can customize DataFusion at almost all points including additional data sources, query languages, functions, custom operators and more. See the Architecture section for more details.

Industry
Software Development
Company size
51-200 employees
Type
Nonprofit
Founded
2020

Employees at Apache DataFusion

Updates

Similar pages