64f189cc8a
Summary: First version of the feature spec for importing streams of data using kafka in memgraph. Reviewers: buda, teon.banek, dgleich, ipaljak Reviewed By: buda Subscribers: lion, mculinovic Differential Revision: https://phabricator.memgraph.io/D1415
1.3 KiB
1.3 KiB
Kafka - openCypher clause
One must be able to specify the following when importing data from Kafka:
- Kafka URI
- Transform script URI
Kafka endpoint is the URI of the leader broker and it is required for data extractor.
Minimum required syntax looks like:
CREATE STREAM kafka_stream AS LOAD DATA KAFKA '127.0.0.1/topic' WITH TRANSFORM
'127.0.0.1/transform.py';
The CREATE STREAM
clause happens in a transaction.
The full openCypher clause for creating a stream is:
CREATE STREAM stream_name AS
LOAD DATA KAFKA 'URI'
WITH TRANSFORM 'URI'
[BATCH INTERVAL milliseconds]
The WITH TRANSFORM
parameter should contain a URI of the transform script.
The BATCH_INTERVAL
parameter defines the time interval in milliseconds
that defines the time between two successive stream importing operations.
The DROP
clause deletes a stream:
DROP STREAM stream_name;
The SHOW
clause enables you to see all configured streams:
SHOW STREAMS;
You can also start/stop streams with the START
and STOP
clauses:
START STREAM stream_name [LIMIT count BATCHES];
STOP STREAM stream_name;
There are also convenience clauses to start and stop all streams:
START ALL STREAMS;
STOP ALL STREAMS;