OpenMetadata

Open-source metadata platform for data cataloging, discovery, governance, lineage, and quality

Repository activity
  • Stars: 14.2k
  • Forks: 2.2k
  • Open issues: 895
License

Apache-2.0

Languages
  • TypeScript
  • Java
  • Python

About OpenMetadata

OpenMetadata is an open-source platform for cataloging, discovering, and governing data across an organization. It connects technical metadata, lineage, ownership, usage, policies, glossaries, classifications, and data contracts into one metadata graph, giving data teams a single trusted place to find and understand their assets.

Metadata is collected through 130+ connectors plus ingestion APIs, events, and SDKs, then normalized with open schemas so assets, relationships, and lineage stay consistent. The graph powers semantic search, column-level lineage and impact analysis, data quality tests, profiling and freshness checks, governance with roles and policies, and an MCP server that lets AI assistants query metadata in natural language.
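To make the idea of impact analysis concrete, here is a minimal sketch that treats column-level lineage as a directed graph and walks it breadth-first to find everything downstream of a changed column. It illustrates the general technique only and is not OpenMetadata's implementation or API: the `LINEAGE` mapping, the column names, and the `downstream_impact` function are all hypothetical.

```python
from collections import deque

# Hypothetical column-level lineage (assumption, not real OpenMetadata data):
# each key is a source column, each value lists the columns derived from it.
LINEAGE = {
    "raw.orders.amount": ["staging.orders.amount_usd"],
    "staging.orders.amount_usd": [
        "marts.revenue.daily_total",
        "marts.finance.order_value",
    ],
    "marts.revenue.daily_total": ["dashboard.revenue.total"],
    "raw.orders.customer_id": ["staging.orders.customer_id"],
}


def downstream_impact(graph, start):
    """Return every column reachable from `start`, in breadth-first order."""
    seen = {start}          # guards against revisiting columns (and cycles)
    impacted = []           # downstream columns, nearest first
    queue = deque([start])
    while queue:
        column = queue.popleft()
        for child in graph.get(column, []):
            if child not in seen:
                seen.add(child)
                impacted.append(child)
                queue.append(child)
    return impacted


if __name__ == "__main__":
    for column in downstream_impact(LINEAGE, "raw.orders.amount"):
        print(column)
```

Under these assumptions, changing `raw.orders.amount` flags the staging column, both mart columns, and the dashboard metric, while the unrelated `customer_id` branch is left untouched. A real catalog would read these edges from its lineage store rather than from a hard-coded dictionary.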

For self-hosting, it is typically installed from a Docker-based quickstart, and a hosted sandbox is available for trying it out first. It suits data engineering and governance teams that want an open catalog they control, and it is licensed under Apache-2.0.

Key features

  • 130+ connectors for warehouses, BI tools, and pipelines
  • Table and column-level lineage with impact analysis
  • Data quality tests, profiling, and freshness checks
  • Glossaries, classifications, policies, and data contracts
  • Semantic search and an MCP server for AI assistants

Details

  • On GitHub since: 2021
  • Language: TypeScript, Java, Python
  • License: Apache-2.0
  • Connectors: 130+
  • API: REST, SDKs, MCP server
  • Self-hosted: Docker quickstart