memgraph

Author	SHA1	Message	Date
Antonio Filipovic	bfc756c092	HA: Polish flow for replicas from coordinator (#1711 )	2024-02-16 10:58:01 +01:00
Gareth Andrew Lloyd	f48151576b	System replication experimental flag (#1702 ) - Remove the compile time control - Introduce the runtime control flag New flag `--experimental-enabled=system-replication`	2024-02-13 12:57:18 +00:00
Antonio Filipovic	c15b62a88d	HA: Disable replication from old main (#1674 )	2024-02-07 11:20:47 +01:00
andrejtonev	7ead00f23e	Adding authentication data replication (#1666 ) * Add AUTH system tx deltas * Add auth data RPC and handlers * Support multiple system deltas in a single transaction * Added e2e test * Bugfix: KVStore segfault after move --------- Co-authored-by: Gareth Lloyd <gareth.lloyd@memgraph.io>	2024-02-05 10:37:00 +00:00
Andi	78a88737f8	HA: Add automatic failover (#1646 ) Co-authored-by: antoniofilipovic <filipovicantonio1998@gmail.com>	2024-01-29 15:34:00 +01:00
andrejtonev	ff44d68843	Simplify auth::Auth (#1663 ) Moved various auth flags under a single config Moved all regex logic under auth::Auth	2024-01-29 12:52:32 +00:00
Andi	38ade99652	HA: Add coordinator to replication cluster (#1608 )	2024-01-24 13:07:51 +01:00
andrejtonev	071df2f439	Replication refactor part 7 (#1550 ) * Split queries into system and data queries * System queries are sequentially executed and generate separate transaction deltas * System transaction try locks for 100ms * last_commited_system_ts saved to DBMS durability * Replicating CREATE/DROP DATABASE * Sending a system snapshot if REPLICA behind * Passing a copy of the gatekeeper::access as std::any to all functions that could call an async execution * Removed delete_on_drop flag (we now always delete on drop) * Using UUID as the directory name for databases * DBMS durability update (added versioning and salient information) * Automatic migration from previous version * Interpreter can run some queries without a target database * SHOW REPLICA returns the status of the currently active DB * Returning UUID instead of db name in the RPC responses * Using UUIDs for database specification in RPC (not name) * FrequentCheck forces update on reconnect * TimestampRpc will detect if a replica is behind, and will update client's state * Safer SLK reads * Split SHOW DATABASES in two SHOW DATABASES (list of current databases) and SHOW DATABASE a single string naming the current database --------- Co-authored-by: Gareth Lloyd <gareth.lloyd@memgraph.io>	2024-01-23 12:06:10 +01:00
Gareth Andrew Lloyd	0fb8e4116f	Fix REPLICA timestamps (#1615 ) * Fix up REPLICA GetInfo and CreateSnapshot Subtle bug where these actions were using the incorrect transactional access while in REPLICA role. This caused timestamp to be incorrectly bumped, breaking REPLICA from doing replication. * Delay DNS resolution Rather than resolve at endpoint creation, we will instead resolve only on Socket connect. This allows k8s deployments to change their IP during pod restarts. * Minor sonarsource fixes --------- Co-authored-by: Andreja <andreja.tonev@memgraph.io> Co-authored-by: DavIvek <david.ivekovic@memgraph.io>	2024-01-05 16:42:54 +00:00
andrejtonev	8b9e1fa08b	Replication refactor part 6 (#1484 ) Single (instance level) connection to a replica (messages from all databases get multiplexed through it) ReplicationClient split in two: ReplicationClient and ReplicationStorageClient New ReplicationClient, moved under replication, handles the raw connection, owned by MainRoleData ReplicationStorageClient handles the storage <-> replica state machine and holds to a stream Removed epoch and storage from *Clients rpc::Stream proactively aborts on error and sets itself to a defunct state Removed HandleRpcFailure, instead we simply log the error and let the FrequentCheck handle re-connection replica_state is now a synced variable ReplicaStorageClient state machine bugfixes Single FrequentCheck that goes through DBMS Moved ReplicationState under DbmsHandler Moved some replication startup logic under the DbmsHandler's constructor Removed InMemoryReplicationClient CreateReplicationClient has been removed from Storage Simplified GetRecoverySteps and made safer --------- Co-authored-by: Gareth Lloyd <gareth.lloyd@memgraph.io>	2023-11-23 11:02:35 +01:00
andrejtonev	dbc6054689	Replication refactor (part 5) (#1378 )	2023-11-06 11:50:49 +00:00
Gareth Andrew Lloyd	d278a33f31	Decouple pure replication state from storage [part 1] (#1325 ) A major refactor to decouple replication state from storage. ATM it is still owned by storage but a following part should fix that.	2023-10-10 11:44:19 +01:00
Gareth Andrew Lloyd	3cc2bc2791	Refactor interpreter to support multiple distributed clocks (Part 1) (#1281 ) * Interpreter transaction ID decoupled from storage transaction ID * Transactional scope for indices, statistics and constraints * Storage::Accessor now has 2 modes (unique and shared) * Introduced ResourceLock to fix pthread mutex problems * Split InfoQuery in two: non-transactional SystemInfoQuery and transactional DatabaseInfoQuery * Replicable and durable statistics * Bumped WAL/Snapshot versions * Initial implementation of the Lamport clock --------- Co-authored-by: Andreja Tonev <andreja.tonev@memgraph.io>	2023-10-05 16:58:39 +02:00
Gareth Andrew Lloyd	d71b6a5007	Refactor replication client/server (#1311 )	2023-09-29 11:21:42 +01:00
Josipmrden	07dea328d8	[master < T1110] Add merge optimization to expand dynamically during runtime (#1110 )	2023-09-08 17:12:25 +02:00
Gareth Andrew Lloyd	4bc5d749b2	Refactor replication, part 3 (#1177 ) Changes to make replication code agnostic of the storage kind being used. Co-authored-by: Andreja Tonev <andreja.tonev@memgraph.io>	2023-08-25 10:52:07 +01:00
andrejtonev	9355e58e73	Decoupling replication logic from InMemoryStorage (#1169 )	2023-08-22 13:29:25 +02:00
Josh Soref	57fe3463f2	Fix a bunch of spelling mistakes (1/n) (#1112 )	2023-07-30 14:05:05 +02:00
Marko Budiselić	9d056e7649	Add experimental/v1 of ON_DISK_TRANSACTIONAL storage (#850 ) Co-authored-by: Andi Skrgat <andi8647@gmail.com> Co-authored-by: Aidar Samerkhanov <aidar.samerkhanov@memgraph.io>	2023-06-29 11:44:55 +02:00
Josipmrden	b875649270	Add restoring of replication roles upon database startup (#791 ) Fix replica node restoration on startup so it is restored as replica and not as main.	2023-06-21 19:08:58 +02:00
Jeremy B	d4f0bb0e38	Correct inconsistencies w.r.t. sync replication (#435 ) Add a report for the case where a sync replica does not confirm within a timeout: -Add a new exception: ReplicationException to be returned when one sync replica does not confirm the reception of messages (new data, new constraint/index, or for triggers) -Update the logic to throw the ReplicationException when needed for insertion of new data, triggers, or creation of new constraint/index -Add end-to-end tests to cover the loss of connection with sync/async replicas when adding new data, adding new constraint/indexes, and triggers Add end-to-end tests to cover the creation and drop of indexes, existence constraints, and uniqueness constraints Improved tooling function mg_sleep_and_assert to also show the last result when duration is exceeded	2022-08-09 11:29:55 +02:00
Jeremy B	f629de7e60	Save replication settings (#415 ) * Storage takes care of the saving of setting when a new replica is added * Restore replicas at startup * Modify interactive_mg_runner + memgraph to support that data-directory can be configured in CONTEXT * Extend e2e test * Correct typo * Add flag to config to specify when replication should be stored (true by default when starting Memgraph) * Remove un-necessary "--" in yaml file * Make sure Memgraph stops if a replica can't be restored. * Add UT covering the parsing of ReplicaStatus to/from json * Add assert in e2e script to check that a port is free before using it * Add test covering crash on Jepsen * Make sure application crashes if it starts on corrupted replications' info Starting with a non-responsive replica is allowed. * Add temporary startup flag: this is needed so Jepsen does not automatically restore replica on startup of main. This will be removed in T0835	2022-07-07 13:30:28 +02:00
Jeremy B	b737e53456	Remove sync with timeout (#423 ) * Remove timeout when registering a sync replica * Simplify jepsen configuration file * Remove timeout from jepsen configuration * Add unit test * Remove TimeoutDispatcher	2022-07-05 09:40:50 +02:00
Jeremy B	589e0e098b	Forbid two replicas to point to the same ip port (#406 )	2022-06-20 17:10:20 +03:00
János Benjamin Antal	537855a0b2	Fix usages of constexpr (#367 ) * Fix usages of constexpr	2022-03-31 13:52:43 +02:00
jbajic	12b4ec1589	Add memgraph namespace	2022-03-14 15:47:41 +01:00
Antonio Andelic	bd21bc82b7	Add license to cpp/hpp/py test files (#283 )	2021-10-26 08:53:56 +02:00
antonio2368	3f3c55a4aa	Format all the memgraph and test source files (#97 )	2021-02-18 15:32:43 +01:00
antonio2368	200ce5f45e	Add configs and support for semi-sync and SSL (#55 ) * Add config for replication client/server * Add SSL to replication * Add semi-sync replication * Expose necessary information about replication * Thread pool fix * Set BasicResult value type to void	2021-01-21 15:49:32 +01:00
antonio2368	a0705746cb	Add epoch id and refactor replication client/server (#51 )	2021-01-21 15:49:32 +01:00
antonio2368	7e9175052a	Define communication process (#49 ) * Add basic communication process using commit timestamp * Add file number to req * Add proper recovery handling * Allow loading of WALs with same seq num * Allow always desired commit timestamp * Set replica timestamp for operation * Mark non-transactional timestamp as finished	2021-01-21 15:49:32 +01:00
antonio2368	03cc568e39	Add support for async replication (#41 ) * Add thread pool * Define async replication * Expose replication state * Rename TransactionHandler to ReplicaStream	2021-01-21 15:49:32 +01:00
antonio2368	bc0c944910	Add replica recovery process (#40 ) * Add file transfer over RPC * Snapshot transfer implementation * Allow snapshot creation only for MAIN instances * Replica and main can have replication clients * Use only snapshots and WALs that are from the Main storage * Add flush lock and expose buffer * Add fstat for file size and TryFlushing method * Use lseek for size Co-authored-by: Antonio Andelic <antonio.andelic@memgraph.io>	2021-01-21 15:49:32 +01:00
antonio2368	b10255a12f	Add initial support for multiple clients (#31 ) * Add tests for multiple clients * Use variant for RPC server and clients * Using synchronized list for replication clients, extracted variant access to a function * Set MAIN as default, add unregister function, add a name for replication clients * Use the regular list for clients * Use test fixture so storage directory is cleaned * Use seq_cst for replication_state Co-authored-by: Antonio Andelic <antonio.andelic@memgraph.io>	2021-01-21 15:49:32 +01:00
Marko Budiselić	c68ed8d94e	Add implementation of synchronous replication (#7 ) This implements the initial version of synchronous replication. Currently, only one replica is supported and that isn't configurable. To run the main instance use the following command: ``` ./memgraph \ --main \ --data-directory main-data \ --storage-properties-on-edges \ --storage-wal-enabled \ --storage-snapshot-interval-sec 300 ``` To run the replica instance use the following command: ``` ./memgraph \ --replica \ --data-directory replica-data \ --storage-properties-on-edges \ --bolt-port 7688 ``` You can then write/read data to Bolt port 7687 (the main instance) and also you can read the data from the replica instance using Bolt port 7688. NOTE: The main instance must be started without any data and the replica must be started before any data is added to the main instance. * Add basic synchronous replication test * Using RWLock for replication stuff Co-authored-by: Matej Ferencevic <matej.ferencevic@memgraph.io> Co-authored-by: Antonio Andelic <antonio.andelic@memgraph.io>	2021-01-21 15:49:32 +01:00

35 Commits