Postgres Read Component
A component that reads data from a PostgreSQL table.
Examples
postgres_read_component_config.yaml

```yaml
component:
  read:
    connection: my-postgres-connection
    postgres:
      table:
        name: my_table
        schema: my_schema
```

postgres_read_incremental.yaml

```yaml
component:
  read:
    connection: my-postgres-connection
    replication:
      incremental:
        column_name: updated_at
    postgres:
      tables:
        - name: table1
          schema: public
        - name: table2
          schema: public
```

postgres_merge_materialization.yaml

```yaml
component:
  read:
    postgres:
      table:
        name: my_table
        schema: public
    connection: my-postgres-connection
    materialization:
      merge:
        mode: latest
        unique_key: id
        deletion_col: deleted_at
        on_schema_change: append_new_columns
```
PostgresReadComponent
PostgresReadComponent is defined beneath the component and read nodes in the YAML structure, as in the examples above.
Below are the properties for the PostgresReadComponent. Each property links to the specific details section further down in this page.
Property | Default | Type | Required | Description |
---|---|---|---|---|
materialization | | MergeMaterialization | No | Strategy to use for data materialization during the read process. |
replication | | One of: any of string ("cdc"), CdcReplication; or any of string ("incremental"), IncrementalReplication | No | Replication strategy to use for data synchronization. |
event_time | | string | No | Timestamp column in the component output used to represent event time. |
connection | | string | No | The name of the connection to use for reading data. |
columns | | array[ComponentColumn] | No | A list specifying the columns to read from the source and transformations to make during read. |
normalize | | boolean | No | Whether the output column names should be normalized to a standard naming convention after reading. |
preserve_case | | boolean | No | Whether the case of the column names should be preserved after reading. |
uppercase | | boolean | No | Whether the column names should be transformed to uppercase after reading. |
chunk_size | | integer | No | Size of chunks to read from the table at a time. |
postgres | | Postgres (any of: SingleTableWithSchema, MultipleTablesWithSchema, SingleQuery, MultipleQueries) | No | Postgres read options. |
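As a rough illustration of how a few of the optional properties above might sit alongside a single-table read, consider the sketch below. The connection, table, and column names are placeholders, and the values for chunk_size, normalize, and event_time are chosen only for illustration; they are not recommended settings.

```yaml
component:
  read:
    connection: my-postgres-connection   # placeholder connection name
    chunk_size: 10000                    # illustrative value; the unit is not stated in the table above
    normalize: true                      # normalize output column names after reading
    event_time: updated_at               # assumes the output contains an updated_at timestamp column
    postgres:
      table:
        name: my_table                   # placeholder table
        schema: public
```

All four of these properties are marked as not required, so any of them can be dropped from the sketch without affecting the others.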
Property Details
Component
A component is a fundamental building block of a data flow. Types of components that are supported include: read, transform, task, test, and more.
Property | Default | Type | Required | Description |
---|---|---|---|---|
component | | One of: ReadComponent, TransformComponent, TaskComponent, SingularTestComponent, CustomPythonReadComponent, WriteComponent, CompoundComponent, AliasedTableComponent, ExternalTableComponent | Yes | Configuration options for the component. |
ReadComponent
A component that reads data from a data system.
Property | Default | Type | Required | Description |
---|---|---|---|---|
data_plane | | One of: SnowflakeDataPlane, BigQueryDataPlane, DuckdbDataPlane, SynapseDataPlane | No | Data Plane-specific configuration options for a component. |
name | | string | No | The name of the model. |
description | | string | No | A brief description of what the model does. |
metadata | | ResourceMetadata | No | Meta information of a resource. In most cases it doesn't affect the system behavior but may be helpful to analyze project resources. |
flow_name | | string | No | The name of the flow that the component belongs to. |
skip | | boolean | No | Whether to skip processing for the component. |
data_maintenance | | DataMaintenance | No | The data maintenance configuration options for the component. |
skip_for_time_series_runs | | boolean | No | Whether to skip processing for this component in time-series runs. |
tests | | ComponentTestColumn | No | Defines tests to run on the data of this component. |
read | | One of: GenericFileReadComponent, LocalFileReadComponent, S3ReadComponent, GcsReadComponent, AbfsReadComponent, HttpReadComponent, MSSQLReadComponent, MySQLReadComponent, OracleReadComponent, PostgresReadComponent, SnowflakeReadComponent, BigQueryReadComponent | Yes | The read component that reads data from a data system. |
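Because the ReadComponent properties sit one level above read, they would presumably be written as siblings of the read key. The sketch below shows one way this might look; the name and description values are hypothetical, and the placement is inferred from the table above, not taken from a published example.

```yaml
component:
  name: my_postgres_read                 # hypothetical model name
  description: Reads my_table from the Postgres source.
  skip: false                            # set to true to skip processing for this component
  read:
    connection: my-postgres-connection   # placeholder connection name
    postgres:
      table:
        name: my_table
        schema: public
```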
CdcReplication
Specifies if Change Data Capture (CDC) is the replication strategy.
Property | Default | Type | Required | Description |
---|---|---|---|---|
cdc | | CdcOptions | No | Resource for Change Data Capture (CDC), enabling incremental data capture based on changes. |
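The replication property also accepts the plain string "cdc" in place of a CdcReplication object, so a minimal CDC read might be sketched as follows. The connection and table names are placeholders, and the fields of CdcOptions are not covered in this section, so the object form is deliberately left out.

```yaml
component:
  read:
    connection: my-postgres-connection   # placeholder connection name
    replication: cdc                     # string shorthand listed in the replication property type
    postgres:
      table:
        name: my_table                   # placeholder table
        schema: public
```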