memgraph

Author	SHA1	Message	Date
Matija Santl	f85095c203	Fix Raft shutdown Summary: During the following scenario: - start a HA cluster with 3 machines - find the leader and start sending queries - SIGTERM the leader but leave other 2 machines untouched The leader would be stuck in the shutdown phase. This was happening because during the shutdown phase of the Bolt server, a `graph_db_accessor` would try to commit a transaction after we've already shut down Raft server. Raft, although not running, is still thinking it's in the Leader mode. Tx Engine calls the `SafeToCommit` method to Commit transactions, and ends up in an infinite loop. Since Raft was shut down it won't handle any of the incoming RPCs and won't change it's mode. The fix here is to shut down the Bolt server before Raft, so we don't have any pending commits once Raft is shut down. Reviewers: ipaljak Reviewed By: ipaljak Subscribers: pullbot Differential Revision: https://phabricator.memgraph.io/D1853	2019-02-12 15:12:39 +01:00
Matija Santl	145c81376f	Add log compaction for Raft, pt. 2 Summary: Implemented snapshot replication and log compaction. Reviewers: ipaljak Reviewed By: ipaljak Subscribers: pullbot Differential Revision: https://phabricator.memgraph.io/D1840	2019-02-04 15:32:07 +01:00
Matija Santl	da95cbf4ec	Add log compaction for Raft, pt. 1 Summary: In this part of log compaction for raft, I've implemented snapshooting and snapshot recovery. I've also refactored the code a bit, so `RaftServer` now has a pointer to the `GraphDb` and it can do some things by itself. Log compaction requires some further work. Since snapshooting isn't synchronous between peers, and each peer can work at their own pace, once we've compacted the log so that the next log to be sent to peer `x` isn't available anymore, we need to send the snapshot over the wire. This means that the next part will contain the `InstallSnapshotRPC` and then maybe one more that will implement the logic of sending `LogEntry` or the whole snapshot. Reviewers: ipaljak Reviewed By: ipaljak Subscribers: pullbot Differential Revision: https://phabricator.memgraph.io/D1834	2019-01-29 14:58:57 +01:00
Matija Santl	62e06d4b70	Fix re-election in Raft Summary: Once a leader loses it's leadership, in order to handle hanging transactions, we reset the storage and the transaction engine. This requires to re-apply all the commited entries from the log. Once we add snapshot (log compaction) we would need to do that also. One thing to have in mind is the `election_timeout_min` parameter. If it's set too low it could trigger leader re-election too often. Reviewers: ipaljak Reviewed By: ipaljak Subscribers: pullbot Differential Revision: https://phabricator.memgraph.io/D1822	2019-01-22 14:51:24 +01:00
Ivan Paljak	276672c310	Fix flaky ha feature benchmark Reviewers: msantl Reviewed By: msantl Subscribers: pullbot Differential Revision: https://phabricator.memgraph.io/D1820	2019-01-21 14:37:38 +01:00
Matija Santl	aba360968c	Change HA benchmark to run for a fixed amount of time Summary: In order to get more consistent results, give the benchmark a certain amount of time it is supposed to run and not the number of queries. The resluts on my machine are as following: ``` duration 10.0004 executed_writes 25190 write_per_second 2518.91 duration 10.0005 executed_writes 25096 write_per_second 2509.48 duration 10.0004 executed_writes 23068 write_per_second 2306.7 duration 10.0006 executed_writes 26390 write_per_second 2638.84 duration 10.0008 executed_writes 26246 write_per_second 2624.38 duration 10.0006 executed_writes 24752 write_per_second 2475.06 duration 10.0027 executed_writes 24818 write_per_second 2481.14 duration 10.0032 executed_writes 25148 write_per_second 2513.99 duration 10.0009 executed_writes 25075 write_per_second 2507.28 duration 10.0008 executed_writes 25846 write_per_second 2584.4 duration 10.0006 executed_writes 25671 write_per_second 2566.96 duration 10.0025 executed_writes 25983 write_per_second 2597.65 ``` Reviewers: ipaljak Reviewed By: ipaljak Subscribers: pullbot Differential Revision: https://phabricator.memgraph.io/D1812	2019-01-17 10:07:44 +01:00
Ivan Paljak	16752af614	Force issuing heartbeats when appending to Raft log Summary: Locally run HA feature benchmark: ``` duration: 20.66 executed_writes: 150007 write_per_second: 7527.89 ``` Reviewers: msantl Reviewed By: msantl Subscribers: pullbot Differential Revision: https://phabricator.memgraph.io/D1809	2019-01-16 16:38:45 +01:00
Matija Santl	ac5c6bf0e8	Add multi-threaded benchmark client for HA Summary: There are some serious speedups when doing parallel writes. Results on my machine (4 cores): ``` duration 6.73173 executed_writes 15003 write_per_second 2228.7 ``` Reviewers: ipaljak Reviewed By: ipaljak Subscribers: pullbot Differential Revision: https://phabricator.memgraph.io/D1807	2019-01-16 10:19:33 +01:00
Ivan Paljak	f09c1254f4	Optimize Raft log persistent storage Summary: Each `raft::LogEntry` is now persisted under its own key in our `KVStore`. Locally running our HA feature benchmark yields the following results: ``` duration 23.7 executed_writes: 15000 write_per_second: 632.888 ``` This represents about 5x increase in throughput. Reviewers: msantl Reviewed By: msantl Subscribers: pullbot Differential Revision: https://phabricator.memgraph.io/D1799	2019-01-15 16:06:08 +01:00
Matija Santl	c39a2278ae	Add HA benchmark Summary: A simple benchmark that starts a HA cluster with 3 machines. The benchmark issues only `CREATE (:Node)` queries. Local results (debug build), for this raft config, are: ``` duration 4.26899 executed_writes 300 write_per_second 70.2743 ``` Reviewers: ipaljak, mferencevic Reviewed By: ipaljak Subscribers: pullbot Differential Revision: https://phabricator.memgraph.io/D1798	2019-01-15 13:32:04 +01:00

10 Commits