memgraph/docs/dev/cpp-code-conventions.md
Teon Banek 0bb23df27b Update our code conventions and add required reading
Reviewers: mtomic, buda, ipaljak, mpetricevic, mferencevic, vkasljevic, mculinovic, msantl

Reviewed By: buda

Differential Revision: https://phabricator.memgraph.io/D1525
2018-08-02 16:07:39 +02:00

9.2 KiB

C++ Code Conventions

This chapter describes code conventions which should be followed when writing C++ code.

Code Style

Memgraph uses the Google Style Guide for C++ in most of its code. You should follow them whenever writing new code.

Often Overlooked Style Conventions

Pointers & References

References provide a shorter syntax for accessing members and better declare the intent that a pointer should not be nullptr. They do not prevent accessing a nullptr and obfuscate the client/calling code because the reference argument is passed just like a value. Errors with such code have been very difficult to debug. Therefore, pointers are always used. They will not prevent bugs but will make some of them more obvious when reading code.

The only time a reference can be used is if it is const. Note that this kind of reference is not allowed if it is stored somewhere, i.e. in a class. You should use a pointer to const then. The primary reason being is that references obscure the semantics of moving an object, thus making bugs with references pointing to invalid memory harder to track down.

Style guide reference

Constructors & RAII

RAII (Resource Acquisition is Initialization) is a nice mechanism for managing resources. It is especially useful when exceptions are used, such as in our code. Unfortunately, they do have 2 major downsides.

  • Only exceptions can be used for to signal failure.
  • Calls to virtual methods are not resolved as expected.

For those reasons the style guide recommends minimal work that cannot fail. Using virtual methods or doing a lot more should be delegated to some form of Init method, possibly coupled with factory functions. Similar rules apply to destructors, which are not allowed to even throw exceptions.

Style guide reference

Additional Style Conventions

Old code may have broken Google C++ Style accidentally, but the new code should adhere to it as close as possible. We do have some exceptions to Google style as well as additions for unspecified conventions.

Using C++ Exceptions

Unlike Google, we do not forbid using exceptions.

But, you should be very careful when using them and introducing new ones. They are indeed handy, but cause problems with understanding the control flow since exceptions are another form of goto. It also becomes very hard to determine that the program is in correct state after the stack is unwound and the thrown exception handled. Other than those issues, throwing exceptions in destructors will terminate the program. The same will happen if a thread doesn't handle an exception even though it is not the main thread.

Style guide reference

Assertions

We use CHECK and DCHECK macros from glog library. You are encouraged to use them as often as possible to both document and validate various pre and post conditions of a function.

CHECK remains even in release build and should be preferred over it's cousin DCHECK which only exists in debug builds. The primary reason is that you want to trigger assertions in release builds in case the tests didn't completely validate all code paths. It is better to fail fast and crash the program, than to leave it in undefined state and potentially corrupt end user's work. In cases when profiling shows that CHECK is causing visible slowdown you should switch to DCHECK.

Template Parameter Naming

Template parameter names should start with capital letter 'T' followed by a short descriptive name. For example:

template <typename TKey, typename TValue>
class KeyValueStore

Code Formatting

You should install clang-format and run it on code you change or add. The root of Memgraph's project contains the .clang-format file, which specifies how formatting should behave. Running clang-format -style=file in the project's root will read the file and behave as expected. For ease of use, you should integrate formatting with your favourite editor.

The code formatting isn't enforced, because sometimes manual formatting may produce better results. Though, running clang-format is strongly encouraged.

Documentation

Besides following the comment guidelines from Google Style Guide, your documentation of the public API should be Doxygen compatible. For private parts of the code or for comments accompanying the implementation, you are free to break doxygen compatibility. In both cases, you should write your documentation as full sentences, correctly written in English.

Doxygen

To start a Doxygen compatible documentation string, you should open your comment with either a JavaDoc style block comment (/**) or a line comment containing 3 slashes (///). Take a look at the 2 examples below.

Block Comment

/**
 * One sentence, brief description.
 *
 * Long form description.
 */

Line Comment

/// One sentence, brief description.
///
/// Long form description.

If you only have a brief description, you may collapse the documentation into a single line.

Block Comment

/** Brief description. */

Line Comment

/// Brief description.

Whichever style you choose, keep it consistent across the whole file.

Doxygen supports various commands in comments, such as @file and @param. These help Doxygen to render specified things differently or to track them for cross referencing. If you want to learn more, take a look at these two links:

Examples

Below are a few examples of documentation from the codebase.

Function

/**
 * Removes whitespace characters from the start and from the end of a string.
 *
 * @param s String that is going to be trimmed.
 *
 * @return Trimmed string.
 */
inline std::string Trim(const std::string &s);

Class

/** Base class for logical operators.
 *
 *  Each operator describes an operation, which is to be performed on the
 *  database. Operators are iterated over using a @c Cursor. Various operators
 *  can serve as inputs to others and thus a sequence of operations is formed.
 */
class LogicalOperator
    : public ::utils::Visitable<HierarchicalLogicalOperatorVisitor> {
 public:
  /** Constructs a @c Cursor which is used to run this operator.
   *
   *  @param GraphDbAccessor Used to perform operations on the database.
   */
  virtual std::unique_ptr<Cursor> MakeCursor(GraphDbAccessor &db) const = 0;

  /** Return @c Symbol vector where the results will be stored.
   *
   *  Currently, outputs symbols are only generated in @c Produce operator.
   *  @c Skip, @c Limit and @c OrderBy propagate the symbols from @c Produce (if
   *  it exists as input operator). In the future, we may want this method to
   *  return the symbols that will be set in this operator.
   *
   *  @param SymbolTable used to find symbols for expressions.
   *  @return std::vector<Symbol> used for results.
   */
  virtual std::vector<Symbol> OutputSymbols(const SymbolTable &) const {
    return std::vector<Symbol>();
  }

  virtual ~LogicalOperator() {}
};

File Header

/// @file visitor.hpp
///
/// This file contains the generic implementation of visitor pattern.
///
/// There are 2 approaches to the pattern:
///
///   * classic visitor pattern using @c Accept and @c Visit methods, and
///   * hierarchical visitor which also uses @c PreVisit and @c PostVisit
///   methods.
///
/// Classic Visitor
/// ===============
///
/// Explanation on the classic visitor pattern can be found from many
/// sources, but here is the link to hopefully most easily accessible
/// information: https://en.wikipedia.org/wiki/Visitor_pattern
///
/// The idea behind the generic implementation of classic visitor pattern is to
/// allow returning any type via @c Accept and @c Visit methods. Traversing the
/// class hierarchy is relegated to the visitor classes. Therefore, visitor
/// should call @c Accept on children when visiting their parents. To implement
/// such a visitor refer to @c Visitor and @c Visitable classes.
///
/// Hierarchical Visitor
/// ====================
///
/// Unlike the classic visitor, the intent of this design is to allow the
/// visited structure itself to control the traversal. This way the internal
/// children structure of classes can remain private. On the other hand,
/// visitors may want to differentiate visiting composite types from leaf types.
/// Composite types are those which contain visitable children, unlike the leaf
/// nodes. Differentiation is accomplished by providing @c PreVisit and @c
/// PostVisit methods, which should be called inside @c Accept of composite
/// types. Regular @c Visit is only called inside @c Accept of leaf types.
/// To implement such a visitor refer to @c CompositeVisitor, @c LeafVisitor and
/// @c Visitable classes.
///
/// Implementation of hierarchical visiting is modelled after:
/// http://wiki.c2.com/?HierarchicalVisitorPattern