Apache Airflow - A platform to programmatically author, schedule, and monitor workflows
- Updated
May 11, 2025 - Python
Apache Airflow - A platform to programmatically author, schedule, and monitor workflows
The leading data integration platform for ETL / ELT data pipelines from APIs, databases & files to data warehouses, data lakes & data lakehouses. Both self-hosted and Cloud-hosted.
Turns Data and AI algorithms into production-ready web applications in no time.
An orchestration platform for the development, production, and observation of data assets.
SeaTunnel is a next-generation super high-performance, distributed, massive data integration tool.
🧙 Build, run, and manage data pipelines for integrating and transforming data.
The developer first cloud governance platform
Flink CDC is a data integration tool
Upserts, Deletes And Incremental Processing on Big Data.
🦀 event stream processing for developers to collect and transform data in motion to power responsive data intensive applications.
Jitsu is an open-source Segment alternative. Fully-scriptable data ingestion engine for modern data teams. Set-up a real-time data pipeline in minutes, not days
Privacy and Security focused Segment-alternative, in Golang and React
Community-curated list of software packages and data resources for single-cell, including RNA-seq, ATAC-seq, etc.
ingestr is a CLI tool to copy data between any databases with a single command seamlessly.
Apache DevLake is an open-source dev data platform to ingest, analyze, and visualize the fragmented data from DevOps tools, extracting insights for engineering excellence, developer experience, and community growth.
A lightweight opinionated ETL framework, halfway between plain scripts and Apache Airflow
BitSail is a distributed high-performance data integration engine which supports batch, and incremental scenarios. BitSail is widely used to synchronize hundreds of trillions of data every day.
Hop Orchestration Platform
Kuwala is the no-code data platform for BI analysts and engineers enabling you to build powerful analytics workflows. We are set out to bring state-of-the-art data engineering tools you love, such as Airbyte, dbt, or Great Expectations together in one intuitive interface built with React Flow. In addition we provide third-party data into data sc…
Add a description, image, and links to the data-integration topic page so that developers can more easily learn about it.
To associate your repository with the data-integration topic, visit your repo's landing page and select "manage topics."