Add section on iterations.

Also add some subheadings, and fix up the line lengths.

Fixes #194
This commit is contained in:
Dominic Hamon 2016-04-19 09:34:13 -07:00
parent 075a8a6fbf
commit d6f96ed639

View File

@ -1,5 +1,4 @@
benchmark # benchmark
=========
[![Build Status](https://travis-ci.org/google/benchmark.svg?branch=master)](https://travis-ci.org/google/benchmark) [![Build Status](https://travis-ci.org/google/benchmark.svg?branch=master)](https://travis-ci.org/google/benchmark)
[![Build status](https://ci.appveyor.com/api/projects/status/u0qsyp7t1tk7cpxs/branch/master?svg=true)](https://ci.appveyor.com/project/google/benchmark/branch/master) [![Build status](https://ci.appveyor.com/api/projects/status/u0qsyp7t1tk7cpxs/branch/master?svg=true)](https://ci.appveyor.com/project/google/benchmark/branch/master)
[![Coverage Status](https://coveralls.io/repos/google/benchmark/badge.svg)](https://coveralls.io/r/google/benchmark) [![Coverage Status](https://coveralls.io/repos/google/benchmark/badge.svg)](https://coveralls.io/r/google/benchmark)
@ -10,10 +9,9 @@ Discussion group: https://groups.google.com/d/forum/benchmark-discuss
IRC channel: https://freenode.net #googlebenchmark IRC channel: https://freenode.net #googlebenchmark
Example usage ## Example usage
------------- ### Basic usage
Define a function that executes the code to be measured a Define a function that executes the code to be measured.
specified number of times:
```c++ ```c++
static void BM_StringCreation(benchmark::State& state) { static void BM_StringCreation(benchmark::State& state) {
@ -34,15 +32,16 @@ BENCHMARK(BM_StringCopy);
BENCHMARK_MAIN(); BENCHMARK_MAIN();
``` ```
Sometimes a family of microbenchmarks can be implemented with ### Passing arguments
just one routine that takes an extra argument to specify which Sometimes a family of benchmarks can be implemented with just one routine that
one of the family of benchmarks to run. For example, the following takes an extra argument to specify which one of the family of benchmarks to
code defines a family of microbenchmarks for measuring the speed run. For example, the following code defines a family of benchmarks for
of `memcpy()` calls of different lengths: measuring the speed of `memcpy()` calls of different lengths:
```c++ ```c++
static void BM_memcpy(benchmark::State& state) { static void BM_memcpy(benchmark::State& state) {
char* src = new char[state.range_x()]; char* dst = new char[state.range_x()]; char* src = new char[state.range_x()];
char* dst = new char[state.range_x()];
memset(src, 'x', state.range_x()); memset(src, 'x', state.range_x());
while (state.KeepRunning()) while (state.KeepRunning())
memcpy(dst, src, state.range_x()); memcpy(dst, src, state.range_x());
@ -54,18 +53,17 @@ static void BM_memcpy(benchmark::State& state) {
BENCHMARK(BM_memcpy)->Arg(8)->Arg(64)->Arg(512)->Arg(1<<10)->Arg(8<<10); BENCHMARK(BM_memcpy)->Arg(8)->Arg(64)->Arg(512)->Arg(1<<10)->Arg(8<<10);
``` ```
The preceding code is quite repetitive, and can be replaced with the The preceding code is quite repetitive, and can be replaced with the following
following short-hand. The following invocation will pick a few short-hand. The following invocation will pick a few appropriate arguments in
appropriate arguments in the specified range and will generate a the specified range and will generate a benchmark for each such argument.
microbenchmark for each such argument.
```c++ ```c++
BENCHMARK(BM_memcpy)->Range(8, 8<<10); BENCHMARK(BM_memcpy)->Range(8, 8<<10);
``` ```
You might have a microbenchmark that depends on two inputs. For You might have a benchmark that depends on two inputs. For example, the
example, the following code defines a family of microbenchmarks for following code defines a family of benchmarks for measuring the speed of set
measuring the speed of set insertion. insertion.
```c++ ```c++
static void BM_SetInsert(benchmark::State& state) { static void BM_SetInsert(benchmark::State& state) {
@ -88,19 +86,18 @@ BENCHMARK(BM_SetInsert)
->ArgPair(8<<10, 512); ->ArgPair(8<<10, 512);
``` ```
The preceding code is quite repetitive, and can be replaced with The preceding code is quite repetitive, and can be replaced with the following
the following short-hand. The following macro will pick a few short-hand. The following macro will pick a few appropriate arguments in the
appropriate arguments in the product of the two specified ranges product of the two specified ranges and will generate a benchmark for each such
and will generate a microbenchmark for each such pair. pair.
```c++ ```c++
BENCHMARK(BM_SetInsert)->RangePair(1<<10, 8<<10, 1, 512); BENCHMARK(BM_SetInsert)->RangePair(1<<10, 8<<10, 1, 512);
``` ```
For more complex patterns of inputs, passing a custom function For more complex patterns of inputs, passing a custom function to `Apply` allows
to Apply allows programmatic specification of an programmatic specification of an arbitrary set of arguments on which to run the
arbitrary set of arguments to run the microbenchmark on. benchmark. The following example enumerates a dense range on one parameter,
The following example enumerates a dense range on one parameter,
and a sparse range on the second. and a sparse range on the second.
```c++ ```c++
@ -112,9 +109,10 @@ static void CustomArguments(benchmark::internal::Benchmark* b) {
BENCHMARK(BM_SetInsert)->Apply(CustomArguments); BENCHMARK(BM_SetInsert)->Apply(CustomArguments);
``` ```
Templated microbenchmarks work the same way: ### Templated benchmarks
Produce then consume 'size' messages 'iters' times Templated benchmarks work the same way: This example produces and consumes
Measures throughput in the absence of multiprogramming. messages of size `sizeof(v)` `range_x` times. It also outputs throughput in the
absence of multiprogramming.
```c++ ```c++
template <class Q> int BM_Sequential(benchmark::State& state) { template <class Q> int BM_Sequential(benchmark::State& state) {
@ -145,11 +143,12 @@ Three macros are provided for adding benchmark templates.
#define BENCHMARK_TEMPLATE2(func, arg1, arg2) #define BENCHMARK_TEMPLATE2(func, arg1, arg2)
``` ```
### Multithreaded benchmarks
In a multithreaded test (benchmark invoked by multiple threads simultaneously), In a multithreaded test (benchmark invoked by multiple threads simultaneously),
it is guaranteed that none of the threads will start until all have called it is guaranteed that none of the threads will start until all have called
KeepRunning, and all will have finished before KeepRunning returns false. As `KeepRunning`, and all will have finished before KeepRunning returns false. As
such, any global setup or teardown you want to do can be such, any global setup or teardown can be wrapped in a check against the thread
wrapped in a check against the thread index: index:
```c++ ```c++
static void BM_MultiThreaded(benchmark::State& state) { static void BM_MultiThreaded(benchmark::State& state) {
@ -176,6 +175,7 @@ BENCHMARK(BM_test)->Range(8, 8<<10)->UseRealTime();
Without `UseRealTime`, CPU time is used by default. Without `UseRealTime`, CPU time is used by default.
### Preventing optimisation
To prevent a value or expression from being optimized away by the compiler To prevent a value or expression from being optimized away by the compiler
the `benchmark::DoNotOptimize(...)` function can be used. the `benchmark::DoNotOptimize(...)` function can be used.
@ -190,8 +190,15 @@ static void BM_test(benchmark::State& state) {
} }
``` ```
Benchmark Fixtures ## Controlling number of iterations
------------------ In all cases, the number of iterations for which the benchmark is run is
governed by the amount of time the benchmark takes. Concretely, the number of
iterations is at least one, not more than 1e9, until CPU time is greater than
the minimum time, or the wallclock time is 5x minimum time. The minimum time is
set as a flag `--benchmark_min_time` or per benchmark by calling `MinTime` on
the registered benchmark object.
## Fixtures
Fixture tests are created by Fixture tests are created by
first defining a type that derives from ::benchmark::Fixture and then first defining a type that derives from ::benchmark::Fixture and then
creating/registering the tests using the following macros: creating/registering the tests using the following macros:
@ -221,8 +228,7 @@ BENCHMARK_REGISTER_F(MyFixture, BarTest)->Threads(2);
/* BarTest is now registered */ /* BarTest is now registered */
``` ```
Output Formats ## Output Formats
--------------
The library supports multiple output formats. Use the The library supports multiple output formats. Use the
`--benchmark_format=<tabular|json>` flag to set the format type. `tabular` is `--benchmark_format=<tabular|json>` flag to set the format type. `tabular` is
the default format. the default format.
@ -290,8 +296,7 @@ name,iterations,real_time,cpu_time,bytes_per_second,items_per_second,label
"BM_SetInsert/1024/10",106365,17238.4,8421.53,4.74973e+06,1.18743e+06, "BM_SetInsert/1024/10",106365,17238.4,8421.53,4.74973e+06,1.18743e+06,
``` ```
Debug vs Release ## Debug vs Release
----------------
By default, benchmark builds as a debug library. You will see a warning in the output when this is the case. To build it as a release library instead, use: By default, benchmark builds as a debug library. You will see a warning in the output when this is the case. To build it as a release library instead, use:
``` ```
@ -304,6 +309,5 @@ To enable link-time optimisation, use
cmake -DCMAKE_BUILD_TYPE=Release -DBENCHMARK_ENABLE_LTO=true cmake -DCMAKE_BUILD_TYPE=Release -DBENCHMARK_ENABLE_LTO=true
``` ```
Linking against the library ## Linking against the library
---------------------------
When using gcc, it is necessary to link against pthread to avoid runtime exceptions. This is due to how gcc implements std::thread. See [issue #67](https://github.com/google/benchmark/issues/67) for more details. When using gcc, it is necessary to link against pthread to avoid runtime exceptions. This is due to how gcc implements std::thread. See [issue #67](https://github.com/google/benchmark/issues/67) for more details.