Open-source Python library for loading data into structured datasets
- Stars5.5k
- Forks526
- Open Issues347
Apache-2.0
- Python
- Makefile
- Dockerfile

About dlt
dlt is an open-source Python library for loading data from messy sources into structured datasets. It fits into notebooks, AWS Lambda, Airflow DAGs, local laptops, and other Python environments, so you can build data pipelines where you already work.
It extracts from REST APIs, SQL databases, cloud storage, and Python data structures. dlt infers schemas and data types, normalizes nested data, supports popular destinations and custom destinations, and adds incremental loading, schema evolution, schema and data contracts, Python and SQL data access, transformations, and pipeline inspection.
dlt runs wherever Python runs, including Airflow and serverless functions. It is open source under the Apache 2.0 License and supports Python 3.9 through Python 3.14, with Python 3.14 marked experimental for some optional extras.
Key features
- Extracts from REST APIs, SQL databases, cloud storage, and Python data
- Infers schemas and data types, normalizes nested data
- Supports popular destinations and custom destinations
- Incremental loading, schema evolution, and schema and data contracts
- Python and SQL data access, transformations, and pipeline inspection
Details
- First released
- 2022
- Platforms
- Web · Linux · macOS · Windows
- Deployment
- Local · cloud · serverless
- Language
- Python
- License
- Apache 2.0
- Python
- 3.9 - 3.14
