HTTP Read

The HTTP Read Component ingests data from an HTTP endpoint.

Examples

component:
  read:
    http:
      url: "http://example.com/data.csv"
      parser: "csv"
    uppercase: true

HttpReadComponent

HttpReadComponent is defined beneath the following ancestor nodes in the YAML structure: Component, ReadComponent.

Below are the properties for the HttpReadComponent. Each property links to the specific details section further down in this page.

| Property | Default | Type | Required | Description |
|---|---|---|---|---|
| dependencies | | array[None] | No | List of dependencies that must complete before this Component runs. |
| event_time | | string | No | Timestamp column in the component output used to represent event time. |
| connection | | string | No | Name of the Connection to use for reading data. |
| columns | | array[None] | No | List specifying the columns to read from the source and transformations to make during read. |
| normalize | | boolean | No | Boolean flag indicating whether the output column names should be normalized to a standard naming convention after reading. |
| preserve_case | | boolean | No | Boolean flag indicating whether the case of the column names should be preserved after reading. |
| uppercase | | boolean | No | Boolean flag indicating whether the column names should be transformed to uppercase after reading. |
| http | | | Yes | |
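
As a rough sketch of how the optional properties in the table above might sit alongside the required `http` block, the fragment below extends the example at the top of this page. The column name `created_at` is hypothetical, and placing `event_time` and `normalize` as siblings of `http` is an assumption drawn from the property table rather than confirmed syntax.

```yaml
component:
  read:
    http:
      url: "http://example.com/data.csv"
      parser: "csv"
    # Assumed placement: optional HttpReadComponent properties beside `http`.
    event_time: created_at   # hypothetical timestamp column in the output
    normalize: true          # normalize output column names after reading
```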

Property Details

Component

A Component is a fundamental building block of a data Flow. Supported Component types include: Read, Transform, Task, Test, and more.

| Property | Default | Type | Required | Description |
|---|---|---|---|---|
| component | | One of: CustomPythonReadComponent, ApplicationComponent, AliasedTableComponent, ExternalTableComponent | Yes | Component configuration options. |

ReadComponent

Component that reads data from a system.

| Property | Default | Type | Required | Description |
|---|---|---|---|---|
| data_plane | | One of: SnowflakeDataPlane, BigQueryDataPlane, DatabricksDataPlane | No | Data Plane-specific configuration options for a component. |
| skip | | boolean | No | Boolean flag indicating whether to skip processing for the Component or not. |
| retry_strategy | | | No | Retry strategy configuration options for the Component if any exceptions are encountered. |
| description | | string | No | A brief description of what the model does. |
| metadata | | | No | Meta information of a resource. In most cases it doesn't affect the system behavior but may be helpful to analyze project resources. |
| name | | string | Yes | The name of the model. |
| flow_name | | string | No | Name of the Flow that the Component belongs to. |
| data_maintenance | | | No | The data maintenance configuration options for the Component. |
| tests | | | No | Defines tests to run on this Component's data. |
| read | | One of: GenericFileReadComponent, LocalFileReadComponent, SFTPReadComponent, S3ReadComponent, GcsReadComponent, AbfsReadComponent, HttpReadComponent, MSSQLReadComponent, MySQLReadComponent, OracleReadComponent, PostgresReadComponent, SnowflakeReadComponent, BigQueryReadComponent, DatabricksReadComponent | Yes | Read component that reads data from a system. |
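
The fragment below is a minimal sketch of how the ReadComponent properties listed above could be combined with an HTTP read. The values `http_orders` and `orders_flow` are hypothetical, and placing these properties directly under `component`, as siblings of `read`, is an assumption based on the property tables on this page.

```yaml
component:
  name: http_orders            # hypothetical Component name
  description: "Reads order data over HTTP."
  flow_name: orders_flow       # hypothetical Flow name
  skip: false                  # set to true to skip processing this Component
  read:
    http:
      url: "http://example.com/data.csv"
      parser: "csv"
```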

HttpReadComponentOptions

Options for reading data from an HTTP endpoint.

| Property | Default | Type | Required | Description |
|---|---|---|---|---|
| parser | | string ("json", "csv") | Yes | Parser to use for HTTP-based data. Can be one of 'json' or 'csv'. |
| url | | string | Yes | URL to retrieve the data from. |
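
For comparison with the CSV example at the top of this page, here is a sketch of the same component using the `json` parser. The URL is a placeholder, not a real endpoint.

```yaml
component:
  read:
    http:
      url: "http://example.com/data.json"   # placeholder URL
      parser: "json"                         # one of "json" or "csv"
```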

ComponentColumn

Component column expression definition.

No properties defined.

InputComponent

Specification for input Components defining how partitioning behaviors should be handled. This metadata is required when a Component serves as an input to other Components within a Flow.

| Property | Default | Type | Required | Description |
|---|---|---|---|---|
| flow | | string | Yes | Name of the parent Flow that the input Component belongs to. |
| name | | string | Yes | Name of the input Component. |
| alias | | string | No | Alias to use for the input Component. |
| partition_spec | | Any of: string ("full_reduction", "map") | No | The type of partitioning to apply to the component's input data before processing the component's logic. Input partitioning is applied before the component's logic is executed. |
| where | | string | No | Optional filter condition to apply to the input Component's data. |
| partition_binding | | Any of: string | No | Optional partition binding specification to apply to the Component on a per-output-partition basis against other inputs' partitions. |
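
To illustrate the fields above, the following is a hypothetical InputComponent entry that refers to an HTTP Read Component from a downstream Component. The names `orders_flow`, `http_orders`, and `orders` are invented for illustration, and where this mapping is nested in the downstream Component's configuration is not specified on this page.

```yaml
# Hypothetical InputComponent entry; its enclosing key is not shown here.
flow: orders_flow                  # parent Flow of the input Component
name: http_orders                  # name of the input Component
alias: orders                      # optional alias
partition_spec: "full_reduction"   # one of "full_reduction" or "map"
```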

PartitionBinding

| Property | Default | Type | Required | Description |
|---|---|---|---|---|
| logical_operator | | string ("AND", "OR") | No | Logical operator to use to combine the partition binding predicates provided. |
| predicates | | array[string] | No | List of partition binding predicates to apply to the input Component's data. |

RepartitionSpec

Specification for repartitioning operations on an input Component's data.

| Property | Default | Type | Required | Description |
|---|---|---|---|---|
| repartition | | | No | Options for repartitioning the input Component's data. |

RepartitionOptions

Options for repartitioning the input Component's data.

| Property | Default | Type | Required | Description |
|---|---|---|---|---|
| partition_by | | string | Yes | Column to partition by. |
| granularity | | string | Yes | Granularity to use for the partitioning. |