Version: 3.0.0

HTTP Read

For ingesting data from an HTTP endpoint.

Examples

component:
  read:
    http:
      url: "http://example.com/data.csv"
      parser: "csv"
    uppercase: true

HttpReadComponent

HttpReadComponent is defined beneath the following ancestor nodes in the YAML structure: Component, ReadComponent.

Below are the properties for the HttpReadComponent. Each property links to the specific details section further down in this page.

| Property | Default | Type | Required | Description |
| --- | --- | --- | --- | --- |
| connection | | string | No | The name of the connection to use for reading data. |
| columns | | array[ComponentColumn] | No | A list specifying the columns to read from the source and transformations to make during read. |
| normalize | | boolean | No | A boolean flag indicating if the output column names should be normalized to a standard naming convention after reading. |
| preserve_case | | boolean | No | A boolean flag indicating if the case of the column names should be preserved after reading. |
| uppercase | | boolean | No | A boolean flag indicating if the column names should be transformed to uppercase after reading. |
| http | | HttpReadComponentOptions | Yes | |
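
As a rough sketch of how the component-level properties sit next to the http block, the fragment below sets connection and normalize alongside http. The nesting is inferred from the tables on this page rather than stated by them, and the connection name my_http_connection is a hypothetical placeholder, not a value the product defines.

```yaml
component:
  read:
    # Hypothetical connection name; replace with a connection defined in your project.
    connection: my_http_connection
    # Normalize output column names to a standard naming convention after reading.
    normalize: true
    http:
      url: "http://example.com/data.csv"
      parser: "csv"
```

This page does not say how normalize, preserve_case, and uppercase interact, so the sketch sets only one of them.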

Property Details

Component

A component is a fundamental building block of a data flow. Supported component types include read, transform, task, test, and more.

| Property | Default | Type | Required | Description |
| --- | --- | --- | --- | --- |
| component | | One of: ReadComponent, TransformComponent, TaskComponent, SingularTestComponent, CustomPythonReadComponent, WriteComponent, CompoundComponent, AliasedTableComponent, ExternalTableComponent | Yes | Configuration options for the component. |

ReadComponent

A component that reads data from a data system.

| Property | Default | Type | Required | Description |
| --- | --- | --- | --- | --- |
| data_plane | | One of: SnowflakeDataPlane, BigQueryDataPlane, DuckdbDataPlane, SynapseDataPlane | No | Data Plane-specific configuration options for a component. |
| name | | string | No | The name of the model. |
| description | | string | No | A brief description of what the model does. |
| metadata | | ResourceMetadata | No | Metadata about the resource. In most cases it does not affect system behavior, but it can help when analyzing project resources. |
| flow_name | | string | No | The name of the flow that the component belongs to. |
| skip | | boolean | No | A boolean flag indicating whether to skip processing for the component. |
| data_maintenance | | DataMaintenance | No | The data maintenance configuration options for the component. |
| tests | | ComponentTestColumn | No | Defines tests to run on the data of this component. |
| read | | One of: GenericFileReadComponent, LocalFileReadComponent, S3ReadComponent, GcsReadComponent, AbfsReadComponent, HttpReadComponent, MSSQLReadComponent, MySQLReadComponent, OracleReadComponent, PostgresReadComponent, SnowflakeReadComponent, BigQueryReadComponent | Yes | The read component that reads data from a data system. |
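
To illustrate where the ReadComponent properties might sit relative to the read block, here is a minimal sketch. It assumes that name, description, and skip are written directly under component, as the table above suggests. The name and description values are hypothetical.

```yaml
component:
  # Hypothetical name and description, for illustration only.
  name: example_http_read
  description: "Reads a CSV file over HTTP."
  # Set to true to skip processing for this component.
  skip: false
  read:
    http:
      url: "http://example.com/data.csv"
      parser: "csv"
```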

HttpReadComponentOptions

Options for reading data from an HTTP endpoint.

| Property | Default | Type | Required | Description |
| --- | --- | --- | --- | --- |
| parser | | string ("json", "csv") | Yes | Parser to use for HTTP-based data. Can be one of 'json' or 'csv'. |
| url | | string | Yes | URL to retrieve the data from. |
| retry | | Retry | No | Retry settings for HTTP requests. |

Retry

Options for specifying retry behavior.

| Property | Default | Type | Required | Description |
| --- | --- | --- | --- | --- |
| max_tries | | integer | No | Maximum number of retries to attempt. |
| max_time | | integer | No | Maximum time to retry in seconds. |
| jitter | | string ("random_jitter", "full_jitter") | No | Type of jitter to apply to retry intervals. |
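
A retry block nested under http might look like the sketch below. The nesting follows the HttpReadComponentOptions table. The endpoint URL and the numeric values are illustrative assumptions, not defaults or recommendations from the product.

```yaml
component:
  read:
    http:
      url: "http://example.com/data.json"   # hypothetical endpoint
      parser: "json"
      retry:
        max_tries: 5            # illustrative value
        max_time: 60            # in seconds; illustrative value
        jitter: "full_jitter"   # the other documented option is "random_jitter"
```

This page does not describe how max_tries and max_time interact when both are set, so check the behavior before relying on a particular combination.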

ComponentColumn

Component column expression definition.

No properties defined.