In-process SQL OLAP database for analytical queries with CLI, language clients, and direct CSV and Parquet reads
MIT
- C++
- Python
- C

About DuckDB
DuckDB is a high-performance, in-process SQL database for analytical workloads. It runs inside your application rather than as a separate server, and is designed to be fast, reliable, portable, and easy to use across many environments.
Its SQL dialect goes well beyond the basics, with arbitrary and nested correlated subqueries, window functions, collations, and complex types such as arrays, structs, and maps. CSV and Parquet files can be queried directly by referencing them in the FROM clause, so file analysis stays inside SQL.
DuckDB ships as a standalone CLI and as clients for Python, R, Java, and Wasm, with deep integrations into tools like pandas and dplyr. It is open source under the MIT license.
Key features
- In-process SQL OLAP engine, no separate server
- Standalone CLI plus Python, R, Java, and Wasm clients
- Window functions and nested correlated subqueries
- Complex types including arrays, structs, and maps
- Query CSV and Parquet directly from the FROM clause
Details
- Type: In-process SQL OLAP database
- Query language: SQL
- Clients: CLI · Python · R · Java · Wasm
- Integrations: pandas · dplyr
- License: MIT
