Onramps¶
Specify how Tremor connects to the outside world in order to receive from external systems.
All Onramps support circuit breakers as in that no new events are read from it in the case of a circuit breaker triggering.
For example, the Kafka onramp receives data from a Kafka cluster by creating a local record consumer, connecting to a set of topics and ingesting Kafka record data.
All onramps are of the form:
onramp:
- id: <unique onramp id>
type: <onramp name>
preprocessors: # can be omitted
- <preprocessor 1>
- <preprocessor 2>
- ...
postprocessors: # only for linked transport, can be omitted
- <postprocessor 1>
- <postprocessor 2>
- ...
linked: <true or false> # enable linked transport, default: false
codec: <codec of the data>
codec_map:
"<mime-type>": <coded handling events of this mime-type>
config:
<key>: <value>
The codec
field is optional and if not provided, will use onramps default codec.
The err_required
field can be set to true
if the onramp should not start unless both out
and err
ports are connected to at least one pipeline.
The config
contains a map (key-value pairs) specific to the onramp type.
Delivery Properties¶
Onramps are able to act upon both circuit breaker and guaranteed delivery events from the downstream pipelines. Those are triggered when event delivery is acknowledged or when event delivery fails. Also when some part (offramps, operators) signals itself being broken, the circuit breaker opens, or when the downstream system heals, the circuit breaker closes again, signaling it is safe to send further events. How each onramp reacts is described in the table below:
The column Delivery Acknowledgements
describes when the onramp considers and reports the event delivered to the upstream it is connected to.
Onramp | Delivery Acknowledgements |
---|---|
amqp | not supported |
blaster | not supported |
cb | not supported |
crononome | not supported |
discord | not supported |
env | not supported |
file | not supported |
gsub | always |
kafka | always, only on ack event if enable.auto.commit is set to false |
metronome | not supported |
nats | not supported |
otel | not supported |
PostgreSQL | not supported |
rest | not supported |
stdin | not supported |
tcp | not supported |
udp | not supported |
ws | not supported |
Supported Onramps¶
amqp¶
The amqp
onramp allows consuming events from an AMQP broker. It uses lapin for an AMQP 0.9.1 protocol implementation.
Example:
onramp:
- id: amqp
type: amqp
config:
amqp_addr: "amqp://guest:guest@127.0.0.1:5672/"
queue_name: "my_queue"
queue_options:
passive: false
durable: false
exclusive: false
auto_delete: false
nowait: false
routing_key: "#"
exchange: ""
Supported configuration options are:
amqp_addr
- an AMQP URI. Format: String, required. For more details see AMQP 0.9.1 URI spec.exchange
- Specifies the exchange to bind the configured queue to. Format: String, optional, Default: the empty string, the default exchange.routing_key
- Specifies a routing key used when binding the configured queue to an exchange. Format: String, optional, Default: the empty string.queue_name
- The name of the queue to use/create for consuming messages. It will be bound to the configuredexchange
with the givenrouting_key
. Format: String, required.queue_options
- Required Options to use when declaring the queue.passive
- Declare the configured queue aspassive
, iftrue
do not auto-create the queue. Format: bool, Default:false
.durable
- Declare the configured queue asdurable
, so it survives AMQP server restarts. Format: bool, Default:false
.exclusive
- Declare the configured queue asexclusive
to this connection. Format: bool, Default:false
.auto_delete
- Declare the configured queue asauto-delete
, deleting it if there are no consumers left. Format: bool, Default:false
.nowait
- Declare the configured queue withnowait
, do not wait for a reply from the server when declaring the queue. Format: bool, Default:false
.
Upon onramp initialization, the specified queue_name
is declared
using queue_options
. It will be created if it doesn't exist yet. The queue is bound
to the named exchange
(emtpy string means the default exchange) with the given routing_key
(AMQP routing). If the queue was not able to bind, the onramp will error upon initialization.
The current implementation uses default queue bind options, i.e. nowait = False
, meaning the server reply is awaited before continuing.
Received messages are immediately acknowledged to the protocol stack. This Onramp does not wait for Guaranteed Delivery acknowledgements or fails.
blaster¶
Note
This onramp is for benchmarking use, it should not be deployed in a live production system.
The blaster onramp is built for performance testing, but it can be used for spaced-out replays of events as well. Files to replay can be xz
compressed. It will keep looping over the file.
The default codec is json
.
The event origin URI set by the onramp is of the form:
tremor-blaster://<tremor-host.local>/<config_source_file>
Supported configuration options are:
source
- The file to read from.interval
- The interval in which events are sent in nanoseconds.iters
- Number of times the file will be repeated.
Example:
onramp:
- id: blaster
type: blaster
codec: json
config:
source: ./demo/data/data.json.xz
cb¶
The cb
onramp is for testing how downstream pipeline and offramps issue circuit breaker events. It expects a circuit breaker event for each event it sent out, and then, the latest after the configured timeout
is exceeded, it exits the tremor process. If some events didn't receive circuit breaker events, it exits with status code 1
; if everything is fine, it exits with 0
.
Supported configuration options are:
source
- The file to read from, expecting 1 event payload per line.timeout
- The time to wait for circuit breaker events in milliseconds. If this timeout is exceeded, the tremor process is terminated. (Default: 10000 ms)
Example:
onramp:
- id: cb_test
type: cb
codec: json
config:
source: in.json
timeout: 1000
crononome¶
This sends a scheduled tick down the offramp. Schedules can be one-off or repeating and use a cron-like format.
Multiple cron entries can be configured, each with a symbolic name and an optional JSON payload in addition to the cron expression.
The event origin URI set by the onramp is of the form:
tremor-crononome://<tremor-host.local>
Supported configuration options are:
entries
- A sequence of entries.
Example
onramp:
- id: crononome
type: crononome
codec: json
config:
entries:
## every second
- name: 1s
expr: "* * * * * *"
## every 5 seconds
- name: 5s
expr: "0/5 * * * * *"
## every minute
- name: 1m
expr: "0 * * * * *"
payload:
snot: badger
Cron entries that are historic or in the past ( relative to the current UTC time ) will be ignored.
Cron entries beyond 2038 will not work due to underlying libraries ( rust
, chrono
, cron.rs
) suffering
from the year 2038 problem.
The data looks like this:
{
"onramp": "crononome",
"ingest_ns": 12345, // the time
"id": 42,
"name": "the name of the trigger",
"payload": ["the", "configured", "payload", "perhaps"],
"trigger": {"the": "trigger"}
}
discord¶
This onramp can be linked
The discord
onramp allows consuming events from the Discord API. It uses the event structure as provided by serenity wrapped in event-named records.
Replies sent to this onramp can perform multiple operations:
Guild related¶
{
"guild": {
"id": 1234, // guild id, required
// member section required
"member": {
"id": 2345, // member id, required
// Roles to remove, optional
"remove_roles": [
3456 // ... role ids
],
// Roles to add, optional
"add_roles": [
4567 // ... role ids
],
"deafen": true, // (un)deafen the member, optional
"mute": true // (un)deafen the member, optional
}
}
}
Message related¶
{"message": {
"channel_id": 1234, // channel id, required
// Update message section, optional
"update": {
"message_id": 2345, // message id to update, required
// Reactions to add
"add_reactions": [
"😀", // emoji reaction
{ // custom reaction
"id": 3456, // emoji id, required
"name": "seal-of-approval" // emoji name, optional
"animated": true, // animated, optional
}
// ...
],
},
// Send section, optional
"send": {
"content": "hello!", // message content, optional,
"reference_message": 4567, // Reference to other message, optional
"reference_channel": 5678, // reference channel, optional, default is `channel_id` (ignored w/o `reference_message`)
"tts": false, // use text to speech, optional
// Embed section, optional
"embed": {
// Author section, optional
"author": {
"icon_url": "https://...", // url of the author icon, optional
"name": "Snottus Badgerus", // name of the author, optional
"url": "https://...", // url of the author profile, optional
},
"colour": 0, // color (as number) of the embed, optional (hint: use hex in tremor script it makes it easier)\
"description": "This is an embed", // A description for the embed, optional
// Embedded fields, optional
"fields": [
{
"name": "field 1", // name of the field, required
"value": "explenation", // 'body' of the field, required
"inline": true, // if the field should be inlined, optional, default: false
}
// ...
],
"footer": "look at my feet!", // simple footer, optional
// Footer section, optional, alternative to simple footer
"footer": {
"text": "look at my feet!", // footer text, optional
"icon_url": "https://...", // footer icon, optional
}
},
// Reactions to add
"reactions": [
"😀", // emoji reaction
{ // custom reaction
"id": 3456, // emoji id, required
"name": "seal-of-approval" // emoji name, optional
"animated": true, // animated, optional
}
// ...
],
}
}}
env¶
The env
onramp reads the environment variables from the shell starting tremor and sends them as a message on startup. This message is send only once, right when the source is connected.
There are no configuration variables and no codecs since data is provided in a structured format.
The output has the form of a record with the key env
that holds a record where the key is the name of the environment variable and the value is it's value as a string.
{
"env": {
"SHELL": "/usr/bin/zsh"
// ...
}
}
file¶
The file onramp reads the content of a file, line by line, and sends each line as an event. It has the ability to shut down the system upon completion. Files can be xz
compressed.
The default codec is json
.
The event origin URI set by the onramp is of the form:
tremor-file://<tremor-host.local>/<config_source_file>
Supported configuration options are:
source
- The file to read from.close_on_done
- Terminates tremor once the file is processed.sleep_on_done
- Waits for the given number of milliseconds before terminating tremor. Intended to be used withclose_on_done
.
Example:
onramp:
- id: in
type: file
config:
source: /my/path/to/a/file.json
close_on_done: true
sleep_on_done: 1000 # wait for a second before terminating
gsub¶
Google Cloud Pubsub - Subscriber
This onramp can issue basic operation of receiving messages from a subscription.
Note
The onramp is experimental.
This onramp assumes that the environment variable GOOGLE_APPLICATION_CREDENTIALS
has been exported to the execution environment and it has been configured to point to a valid non-expired service account token json file.
Supported configuration options are:
pem
- The pem file from GCP for authentication.subscription
- The subscription name which is linked to a topic to receive messages.
Example:
onramp:
- id: gsub
type: gsub
codec: json
preprocessors:
- gzip
config:
pem: gcp.pem
subscription: "tremor-sub"
We get the meta data as response that includes the message id and the acknowledgement id of the message. Configure the inbound trickle file to display the metadata received.
Response:
{
"data": {
"hello": "folks!!"
},
"meta": {
"message_id": "<message-id>",
"acknowledgement_id": "<acknowledgement_id>"
}
}
Where
<data>
- The data received as message.<message-id>
- The message id assigned by the Google Cloud Pubsub api.<acknowledgement_id>
- The acknowedgement id assigned by the Google Cloud Pubsub api.
Kafka¶
The Kafka onramp connects to one or more Kafka topics. It uses librdkafka
to handle connections and can use the full set of librdkafka 1.5.0 configuration options.
The default codec is json
.
The event origin URI set by the onramp is of the form:
tremor-kafka://<config_first_broker_host>[:<config_first_broker_port>]/<topic>/<partition>/<offset>
Supported configuration options are:
group_id
- The Kafka consumer group id to use.topics
- A list of topics to subscribe to.brokers
- Broker servers to connect to. (Kafka nodes)rdkafka_options
- An optional map of an option to value, where both sides need to be strings.retry_failed_events
- If set tofalse
, the source will not seek back the consumer offset upon failed events, and thus not retry those whenenable.auto.commit
is set tofalse
inrdkafka_options
. (defaulttrue
)poll_interval
- Duration in milliseconds to wait until we poll again if no message is in the kafka queue. (default:100
)
Set metadata variables are:
$kafka
- Record consisting of two optional keys: -headers
: A record denoting the headers for the message (if any). -key
: The key used for this message in bytes (if any). -topic
: The topic the message was on (if any). -offset
: The offset in the partition the message was on (if any). -partition
: The partition the message was on (if any). -timestamp
: The timestamp provided bykafka
in milliseconds (if any). Example:
onramp:
- id: kafka-in
type: kafka
codec: json
config:
brokers:
- kafka:9092
topics:
- demo
- snotbadger
group_id: demo
A more involved example, only committing on successful circuit breaker event and not retrying failed events, while also decreasing the poll interval to 10ms to get notified of new messages faster:
onramp:
- id: involved-kafka
type: kafka
codec: msgpack
preprocessors:
- lines
config:
brokers:
- kafka01:9092
- kafka02:9092
topics:
- my_topic
group_id: my_group_id
retry_failed_events: false
poll_interval: 10
rdkafka_options:
"enable.auto.commit": false
Semantics with enable.auto.commit
¶
If enable.auto.commit: false
is set in rdkafka_options
, the consumer offset in kafka will only be committed when the event has successfully reached the other end of the pipeline (typically some offramp ).
If an event failed during processing within the pipeline or at a downstream offramp, the consumer offset will be reset to the offset of the failed event, so it will be retried. This has some consequences worth mentioning:
- Already processed
kafka
messages (that have succeeded before the failed message failed) might be seen again multiple times. - If the message is persistently failing (e.g. due to an malformed payload or similar), tremor will retry those messages infinitely.
If persistent failures are to be expected (e.g. due to incorrect event payloads), or if repeating messages in general are a problem for the application, avoiding retries with retry_failed_events: false
is advised.
If enable.auto.commit: true
is set in rdkafka_options
, which is the default behaviour if nothing is specified, the offset is immediately committed upon event reception in tremor, regardless of success or failure of processing the kafka
message as event in tremor.
metronome¶
This sends a periodic tick downstream. It is an excellent tool to generate some test traffic to validate pipelines.
The default codec is pass
(since we already output decoded JSON).
The event origin URI set by the onramp is of the form:
tremor-metronome://<tremor-host.local>/<config_interval>
Supported configuration options are:
interval
- The interval in which events are sent in milliseconds.
Example:
onramp:
- id: metronome
type: metronome
config:
interval: 10000
The data looks like this:
{
"onramp": "metronome",
"ingest_ns": 12345, // time
"id": 42
}
nats¶
The nats
onramp connects to Nats server(s) and subscribes to a specified subject.
The default codec is json
.
The event origin URI set by the onramp is of the form:
tremor-nats://<config_first_host_host_addr>[:<config_first_host_port>]/<subject>
Supported configuration operations are:
hosts
- List of hosts to connect to.subject
- Subject to subscribe to for listening to messages.queue
- Optional queue to subscribe to.options
- Optional struct, which can be used to customize the connection to the server (seenats.rs
configuration options for more info):token
: String; authenticate using a token.username
: String; authenticate using a username and password.password
: String; authenticate using a username and password.credentials_path
: String; path to a.creds
file for authentication.cert_path
: String; path to the client certificate file.key_path
: String; path to private key file.name
: String; name this configuration.echo
: Boolean; if true, published messages will not be delivered.max_reconnects
: Integer; max number of reconnection attempts.reconnect_buffer_size
: Integer; max amount of bytes to buffer when accepting outgoing traffic in disconnected mode.tls
: Boolean; if true, sets tls for all server connections.root_cert
: String; path to a root certificate.
Set metadata variables are:
-
$nats
: Record consisting of the following metadata: -
$reply
: Reply associated with the message (if any). $headers
: Record denoting the headers for the message (if any).
Example:
onramp:
- id: nats-in
type: nats
config:
hosts:
- "127.0.0.1:4444"
subject: demo
queue: stack
options:
name: nats-demo
reconnect_buffer_size: 1
PostgreSQL¶
PostgreSQL onramp.
Supported configuration options are:
host
- PostgreSQL database hostname.port
- PostgresSQL database port.user
- Username for authentication.password
- Password for authentication.dbname
- Database name.query
- Query run to retrieve data.interval_ms
- Query interval in milliseconds.cache
- Location (path
) and size (size
) for caching of latest successful query interval.
query
must include two arguments to be filled with start and end interval timestamps.
Data will come out of onramp in objects representing columns. If schema
specifies there are two fields, username
(VARCHAR
) and created_at
(TIMESTAMPTZ
), then the actual JSON coming out of onramp looks like this:
"username": {
"fieldType": "VARCHAR",
"name": "username",
"value": "actual\_username"
},
"created\_at": {
"fieldType": "TIMESTAMPTZ",
"name": "created\_at",
"value": "2020-04-04 00:00:00.000000 +00:00"
}
Example:
id: db
type: postgres
codec: json
config:
host: localhost
port: 5432
user: postgres
password: example
dbname: sales
query: "SELECT id, name from events WHERE produced_at <= $1 AND produced_at > $2"
interval_ms: 1000
cache:
path: "/path/to/cache.json"
size: 4096
rest¶
This onramp can be linked
The rest onramp listens on a specified port for inbound RESTful ( http ) data, treating the decoded and preprocessed http body as event data (and attaching other request attributes as event metadata).
The event origin URI set by the onramp is of the form:
tremor-rest://<tremor-rest-client-host.remote>
Supported configuration options are:
host
- The host to advertise as.port
- The TCP port to listen on.
The rest onramp respects the HTTP Content-Type header and will use it to decode the request body when it's present (otherwise, it defaults to using the codec specified in the onramp config).
Tremor supports a limited set of built-in codecs used for well known MIME types (e.g. application/json
, application/yaml
, text/plain
). In order to customize how certain Content-Type
s are handled, provide a codec_map
providing a mapping from MIME type to Tremor codec in the top level artefact config (where the codec
is set).
Set metadata variables:
$request
- A record capturing the HTTP request attributes. Available fields within:url
- A record with the following standard URL fields (optional fields might not be present):scheme
- String, typicallyhttp
username
- String, optionalpassword
- String, optionalhost
- Stringport
- number, optional, absence means80
path
- Stringquery
- String, optionalfragment
- String, optional
method
- HTTP method used by the incoming request.headers
- A record that maps header name (lowercase string) to values (array of strings).
Used metadata variables:
These variables can be used to dynamically change how responses are handled when using this onramp as linked transport:
$response
- A record capturing the HTTP response attributes. Available fields within:status
- Numeric HTTP status code. (optional. status code defaults to200
when not set).headers
- A record that maps header name (string) to value (string or array of strings) (optional).
When not used as a linked onramp, the status code returned with the response is 202
.
Example:
onramp:
- id: rest
type: rest
preprocessors:
- lines
codec: json
codec_map:
"text/html": "string"
config:
host: "localhost"
port: 9000
Known limitations:
It is currently not possible to configure rest onramps via swagger, RAML or OpenAPI configuration files.
sse¶
The SSE (Server Sent Events) onramp subscribes to an sse endpoint treating sse-event-data
as the event data.
The event origin URI set by the onramp is of the form:
tremor-sse://<tremor-host.local>
Supported configuration options are:
url
- The sse endpoint to subscribe.headers
- A map of headers to set for the requests, where both sides are strings. (optional)
Example:
onramp:
- id: sse
type: sse
config:
url: "http://localhost:8000/sse"
header:
"Client": "Tremor"
stdin¶
An onramp that takes input from stdin
.
The default codec is string
.
The event origin URI set by the onramp is of the form:
tremor-stdin://<tremor-host.local>
tcp¶
This listens on a specified port for inbound tcp data. TLS is supported.
The onramp can leverage preprocessors to segment data before codecs are applied and events are forwarded to pipelines.
The default codec is json
.
The event origin URI set by the onramp is of the form:
tremor-tcp://<client_ip>:<client_port>/<config_server_port>
Supported configuration options are:
host
- The IP to listen on.port
- The Port to listen on.tls
- The TLS config for receiving messages via TCP/TLS. If provided, this onramp expects TLS traffic.cert
- The server certificate (or certificate chain) PEM file (X.509 certificate). Required for TLS.key
- The private Key PEM file (RSA or PKCS8 format). Required for TLS.
Example:
onramp:
- id: tcp
type: tcp
preprocessors:
- base64
- lines
codec: json
config:
host: "127.0.0.1"
port: 9000
tcp onramp example for GELF¶
onramp:
- id: gelf-tcp
type: tcp
preprocessors:
- lines-null
codec: json
config:
host: "127.0.0.1"
port: 12201
tcp example for TLS¶
If the tls
config is provided, this onramp acts as a TCP/TLS server and expects SSL/TLS traffic from clients:
onramp:
- id: tls
type: tcp
preprocessors:
- lines
codec: string
config:
host: "127.0.0.1"
port: 65535
tls:
cert: "path/to/cert.pem"
key: "path/to/key.pem"
udp¶
The UDP onramp allows receiving data via UDP datagrams.
The default codec is string
.
The event origin URI set by the onramp is of the form:
tremor-udp://<sender_ip>:<sender_port>/<config_receive_port>
Supported configuration options are:
host
- The IP to listen on.port
- The Port to listen on.
Example:
onramp:
- id: udp
type: udp
codec: json
config:
host: "127.0.0.1"
port: 9000
udp onramp example for GELF¶
onramp:
- id: gelf-udp
type: udp
preprocessors:
- decompress
- gelf-chunking
- decompress
codec: json
config:
host: "127.0.0.1"
port: 12201
udp onramp example for syslog¶
onramp:
- id: syslog-udp
type: udp
codec: syslog
config:
host: "127.0.0.1"
port: 12201
ws¶
This onramp can be linked
WebSocket onramp. Receiving either binary or text packages from a WebSocket connection. the url is: ws://<host>:<port>/
.
The event origin URI set by the onramp is of the form:
tremor-ws://<tremor-ws-client-host.remote>
Supported configuration options are:
host
- The IP to listen on.port
- The Port to listen on.
Set metadata variables:
$binary
-true
if the incoming WebSocket message came as binary (false
otherwise).
Used metadata variables (for reply with linked transports):
$binary
- If reply data should be sent as binary instead of text (optional. data format defaults to text when not set).
Example:
onramp:
- id: ws
type: ws
codec: json
config:
port: 12201
host: "127.0.0.1"
otel¶
CNCF OpenTelemetry onramp. Listens on TCP port 4317
for gRPC traffic conforming to the CNCF OpenTelemetry protocol specification.
Forwards tremor value variants of logs
, trace
and metrics
messages.
The onramp is experimental.
Supported configuration options are:
host
- String - The host or IP to listen on.port
- integer - The TCP port to listen on.- 'logs' - boolean - Is logging enabled for this instance? Defaults to
true
. Receivedlogs
events are dropped whenfalse
. - 'metrics' - boolean - Is metrics enabled for this instance? Defaults to
true
. Receivedmetrics
events are dropped whenfalse
. - 'trace' - boolean - Is trace enabled for this instance? Defaults to
true
. Receivedtrace
events are dropped whenfalse
.
Pipelines that leverage the OpenTelemetry integration can use utility modules in the cncf::otel
module to
simplify working with the tremor value mapping of the event data. The connector translates the wire level
data from protocol buffers to tremor values automatically.
Example:
onramp:
- id: otlp
type: otel
codec: json
config:
port: 4317
host: 127.0.0.1