Data Pipelines
A data pipeline, sometimes referred to as an ETL pipeline, is a sequence of ETL jobs that work together to transforms data and information to be consumable by one or more data products.
Generally, each ETL in a data pipeline will extract data from one or more data sources, transform it for some particular purpose, then load it to a new data store. Subsequent ETLs will consume data from that store, then transform and load it to their own data store, and so on.
Deeper Knowledge on Data Pipelines
data:image/s3,"s3://crabby-images/7c1c1/7c1c19db6365f8ca0fb68a406c8d67a4e6f52c54" alt="Apache Kafka"
Apache Kafka
A distributed event streaming platform for data-pipelines and analytics
data:image/s3,"s3://crabby-images/7c1c1/7c1c19db6365f8ca0fb68a406c8d67a4e6f52c54" alt="Data Products"
Data Products
Ways of making data available
data:image/s3,"s3://crabby-images/7c1c1/7c1c19db6365f8ca0fb68a406c8d67a4e6f52c54" alt="Extract Transform Load (ETL)"
Extract Transform Load (ETL)
Ways to extract, transform, and load data
data:image/s3,"s3://crabby-images/7c1c1/7c1c19db6365f8ca0fb68a406c8d67a4e6f52c54" alt="Apache Spark"
Apache Spark
A data processing engine for batch processing, stream processing, and machine learning
Broader Topics Related to Data Pipelines
data:image/s3,"s3://crabby-images/7c1c1/7c1c19db6365f8ca0fb68a406c8d67a4e6f52c54" alt="Data Products"
Data Products
Ways of making data available
data:image/s3,"s3://crabby-images/7c1c1/7c1c19db6365f8ca0fb68a406c8d67a4e6f52c54" alt="Data Engineering"
Data Engineering
Engineering approaches to data management