
Transform

Transforms are the core data processing components in Ascend. They apply your business logic to data: cleansing, shaping, aggregating, and preparing it for analytics or operational use. You can create Transforms in either SQL or Python, choosing whichever fits the task.

Key Features of Transforms

  1. Flexibility: Supports both SQL and Python, letting you choose the language that best suits your data transformation needs.
  2. Integration with Flows: Integrates seamlessly into Ascend Flows, enabling the automated orchestration of complex data pipelines.
  3. Efficiency: Provides features like partitioning and incremental processing to optimize resource usage and processing time.
  4. Testing and Validation: Includes built-in support for testing, ensuring data quality and integrity throughout the transformation process.

How Transforms Work

Transforms take data from one or more input components, such as Read Components or other Transforms, apply specified transformations, and produce output for other components in the pipeline, including Write Components. Transforms can perform a wide range of operations, from simple data cleaning to complex aggregations and calculations.
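
For example, a SQL Transform can combine multiple inputs. The following is a minimal sketch, assuming hypothetical upstream components named users and events:

user-event-counts.sql.jinja
-- A sketch only: "users" and "events" are hypothetical upstream components.
SELECT
    u.user_id,
    COUNT(e.event_id) AS event_count
FROM {{ ref("users") }} AS u
JOIN {{ ref("events") }} AS e
    ON u.user_id = e.user_id
GROUP BY u.user_id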

Ascend supports both SQL and Python Transforms. You can choose the language based on the transformation logic's complexity and your familiarity with the languages. Consider the following when selecting between SQL and Python Transforms:

  • SQL Transforms: Best for set-based operations and working directly with relational data. Ideal for tasks like aggregating, filtering, and joining data.
  • Python Transforms: More flexible and suitable for complex logic that SQL cannot easily express, such as text processing, custom calculations, or calling external services.

Ascend infers the transform configuration from the file suffix. For example, .sql.jinja files are treated as SQL Transforms, and .py files are treated as Python Transforms.

Types of Transforms in Ascend

Info: See the Transform Reference for more information on the syntax and capabilities of all transform types.

Ascend categorizes Transforms based on the language used (SQL or Python) and the operational strategy (basic, incremental, or smart):

  • Basic Transforms: The simplest form of Transform component. These perform a full refresh of the output table on each run; a minimal sketch follows this list.
  • Incremental Transforms: Process transformations incrementally, focusing on new or changed data since the last run. This approach reduces processing time and resource usage, making it ideal for growing datasets.
  • Smart Transforms: Designed for large datasets, these process data in chunks based on a partition key, improving efficiency and scalability. You can maintain the partitions upstream in your Data Flow or repartition the data, enabling fine-grained control over data processing.
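
As a point of reference, a basic SQL Transform needs no special configuration. A minimal sketch, assuming the default behavior when no materialization is configured and an upstream component named users:

basic-transform.sql.jinja
-- A sketch of a basic Transform: the output table is fully rebuilt on each run.
SELECT
    user_id,
    address
FROM {{ ref("users") }}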

Incremental Transforms

Incremental transforms are components that incrementally transform upstream data. The upstream data typically has a column that changes monotonically over time, such as an updated_at, modified_at, or last_modified timestamp column. During each run of an incremental transform, data that has already been processed is filtered out, and only new data is processed through the transform logic. This approach is often used to optimize performance by reducing the volume of data processed on each run, especially in data stores that support indexing or partitioning by a column.

Incremental transforms, whether in SQL or Python, belong to a broader class of materialized components that includes partitioned transforms, read connectors, and write connectors. These are called materialized components because the Ascend platform manages reading data from input components and writing to output tables, while user code focuses strictly on business logic.

What distinguishes incremental transforms from other types of materialized components, such as partitioned components, is that they leverage the ordered nature of the source data and only store metadata relevant to the column that defines the order of records.

Structure of an Incremental Transform

An incremental transform is defined by setting materialized="incremental".

You can express the transformation logic in SQL or Python. The Ascend framework wraps this logic in an upsert or merge query to ensure that newly transformed records are written to the output table. The specifics of this upsert or merge logic depend on the configured incremental_strategy. For example, the merge strategy requires that updated columns are reflected in the output table.
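
Conceptually, the generated query behaves like a SQL MERGE keyed on unique_key. The following hand-written sketch illustrates the idea, assuming a unique_key of user_id and updated columns address and ts; it is not the exact query Ascend generates, and the table names are placeholders:

-- Illustration only: output_table and new_records are placeholder names.
MERGE INTO output_table AS target
USING new_records AS source
    ON target.user_id = source.user_id
WHEN MATCHED THEN
    UPDATE SET address = source.address, ts = source.ts
WHEN NOT MATCHED THEN
    INSERT (user_id, address, ts)
    VALUES (source.user_id, source.address, source.ts);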

The is_incremental flag is used within the transform logic to determine when to execute backfill logic, which is typically executed the first time a transform is run or when full-refresh is set to true.

The output table includes an additional metadata column called _ascend_batch_id, which is a unique identifier that tracks the run during which each output record was inserted or updated.
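
You can use this column to audit which run produced each record. For example (the table name my_transform is a placeholder):

-- Hypothetical audit query: count the records written by each run.
SELECT
    _ascend_batch_id,
    COUNT(*) AS records_written
FROM my_transform
GROUP BY _ascend_batch_id;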

Configuration Options

  • materialized = incremental: Specifies that this is an incremental transform.
  • incremental_strategy: Defines how newly processed records are written to the output table. The only currently supported strategy is merge.
  • unique_key: A comma-separated list of columns that together uniquely identify each record. This parameter is required by the merge strategy (and by the delete-insert strategy, where supported).
  • merge_update_columns and merge_exclude_columns: Specify which columns are updated in the output table for changed rows.
  • incremental_predicates: Specific to the merge strategy. These additional predicates are included in the join expression (expressed in SQL syntax) to limit which records in the output table are merged. A combined configuration sketch follows this list.
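
As a combined illustration, the sketch below shows how these options might appear together in a SQL Transform config. The predicate string is an assumption for illustration; consult the Transform Reference for the exact incremental_predicates syntax:

incremental-config-example.sql.jinja
{#- Sketch only: the predicate format below is an assumption, not confirmed syntax. -#}
{{
    config(
        materialized="incremental",
        incremental_strategy="merge",
        unique_key="user_id",
        merge_update_columns=["address", "ts"],
        incremental_predicates=["ts > current_date - 7"],
    )
}}

SELECT * FROM {{ ref("users") }}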

Examples

Incremental SQL Transform
incremental-transform.sql.jinja
{{
    config(
        materialized="incremental",
        incremental_strategy="merge",
        unique_key="user_id",
        merge_exclude_columns=["name", "created_at"],
    )
}}

SELECT * FROM {{ ref("users") }}

{% if is_incremental() %}
WHERE ts > (SELECT ts FROM {{ this }} ORDER BY ts DESC LIMIT 1)
{% endif %}

In this example, the incremental merge strategy reads from a table called users, appends newly inserted rows, and merges updated rows into the output table. For updated rows, all columns except name and created_at are updated in the output table.

The is_incremental() block filters out data that has already been processed. This is achieved by selecting the latest timestamp from {{ this }}, which is a reference to the output table. If is_incremental() is true, the output table already exists and is not empty.

Incremental Python Transform
incremental-transform.py
from ascend.resources import Asset, ref, transform

@transform(
    inputs=[ref("users")],
    materialized="incremental",
    incremental_strategy="merge",
    unique_key="user_id",
    merge_update_columns=["address", "ts"],
)
def incremental_transform_python(source_data, context) -> Asset:
    output = source_data
    if context.is_incremental:
        # Keep only rows newer than the latest timestamp
        # already present in the output table.
        current_data = context.current_data()
        output = output[output["ts"] > current_data["ts"].max()]

    return output

The Python example is similar to the SQL transform above. Here, the source data is provided as an Ibis table expression. The is_incremental flag in the context indicates whether the transform is incremental, and the output table (if it exists) can be referenced as an Ibis table expression using context.current_data(). In this example, for record updates, only the address and ts columns are updated.

Smart Transforms

Smart Transforms are designed for large datasets, processing data in chunks based on a partition key. They utilize additional metadata to track which records have been processed, and process only new or changed partitions on each run. They are also code and data aware: if the data changes, only the affected partitions are reprocessed; if the code changes, all partitions are reprocessed.

Smart Transforms are similar to Partitioned Reads, but with the following differences:

  • Smart Transforms are used for transforming data, while Partitioned Reads are used for reading data.
  • Smart Transforms are not only data aware but also code aware: a change to the code reprocesses all partitions, while a change to the data reprocesses only the affected partitions.

For every run of a Smart Transform, all partitions are eligible for processing. The transform logic is responsible for determining which partitions have changed and processing only those partitions. The key benefits of this approach are:

  • Guaranteed Consistency: Because Smart Transforms are fully data aware, any data change, including late-arriving data, triggers a re-run that processes only the changed partitions.
  • Reduced Data Volume: By only processing changed partitions, Smart Transforms can significantly reduce the amount of data processed, improving performance and reducing costs.
  • Faster Data Updates: Smart Transforms can quickly update data in Ascend by only processing changed partitions, without needing to reprocess the entire dataset.
  • Increased Parallelism: Smart Transforms can be parallelized, allowing for more efficient use of resources and faster data processing.

Smart Transforms are configured by setting materialized="smart", or by setting reshape in the ref call. This provides Ascend with the information needed to determine which partitions have changed.

Examples

Smart SQL Transform
smart-transform.sql.jinja
SELECT
    *
FROM
    {{
        ref(
            "users",
            reshape={"time": {"column": "event_ts", "granularity": "day"}}
        )
    }}
Smart Python Transform
smart-transform.py
from ascend.resources import Asset, ref, transform

@transform(
    inputs=[
        ref(
            "users",
            reshape={"time": {"column": "event_ts", "granularity": "day"}},
        )
    ],
    materialized="smart",
)
def smart_transform_python(source_data, context) -> Asset:
    # Pass-through body: the reshape declared on the ref drives
    # partition-by-partition (daily, by event_ts) processing.
    return source_data

Best Practices for Using Transforms

  • Choosing the Right Type: Select the Transform type based on data volume, transformation complexity, and performance needs.
  • Optimization: Use partitioning and incremental processing to optimize performance, especially for large datasets.
  • Testing: Include Tests within your Transforms to ensure data integrity and the accuracy of the transformation logic.
  • Monitoring and Tuning: Monitor Transform performance and resource usage, tuning as needed to maintain optimal pipeline performance.

Conclusion

Transforms are crucial in Ascend's data engineering platform, providing tools for effective data transformation. By understanding the different types of Transforms and following best practices, you can process data efficiently, ensuring it meets analytical or operational requirements. With support for SQL and Python, Ascend offers a robust and flexible environment for data transformation, catering to various use cases and complexity levels.