dlt logo

dlt

Open-source Python library for loading data into structured datasets

Open Source Alternative to
Repository activity
  • Stars5.5k
  • Forks526
  • Open Issues347
License

Apache-2.0

Languages
  • Python
  • Makefile
  • Dockerfile
dlt screenshot

About dlt

dlt is an open-source Python library for loading data from messy sources into structured datasets. It fits into notebooks, AWS Lambda, Airflow DAGs, local laptops, and other Python environments, so you can build data pipelines where you already work.

It extracts from REST APIs, SQL databases, cloud storage, and Python data structures. dlt infers schemas and data types, normalizes nested data, supports popular destinations and custom destinations, and adds incremental loading, schema evolution, schema and data contracts, Python and SQL data access, transformations, and pipeline inspection.

dlt runs wherever Python runs, including Airflow and serverless functions. It is open source under the Apache 2.0 License and supports Python 3.9 through Python 3.14, with Python 3.14 marked experimental for some optional extras.

Key features

  • Extracts from REST APIs, SQL databases, cloud storage, and Python data
  • Infers schemas and data types, normalizes nested data
  • Supports popular destinations and custom destinations
  • Incremental loading, schema evolution, and schema and data contracts
  • Python and SQL data access, transformations, and pipeline inspection

Details

First released
2022
Platforms
Web · Linux · macOS · Windows
Deployment
Local · cloud · serverless
Language
Python
License
Apache 2.0
Python
3.9 - 3.14