mirror of
https://github.com/google/benchmark.git
synced 2025-01-13 21:30:14 +08:00
Add section on iterations.
Also add some subheadings, and fix up the line lengths. Fixes #194
parent
075a8a6fbf
commit
d6f96ed639
86
README.md
@@ -1,5 +1,4 @@
|
|||||||
benchmark
|
# benchmark
|
||||||
=========
|
|
||||||
[![Build Status](https://travis-ci.org/google/benchmark.svg?branch=master)](https://travis-ci.org/google/benchmark)
|
[![Build Status](https://travis-ci.org/google/benchmark.svg?branch=master)](https://travis-ci.org/google/benchmark)
|
||||||
[![Build status](https://ci.appveyor.com/api/projects/status/u0qsyp7t1tk7cpxs/branch/master?svg=true)](https://ci.appveyor.com/project/google/benchmark/branch/master)
|
[![Build status](https://ci.appveyor.com/api/projects/status/u0qsyp7t1tk7cpxs/branch/master?svg=true)](https://ci.appveyor.com/project/google/benchmark/branch/master)
|
||||||
[![Coverage Status](https://coveralls.io/repos/google/benchmark/badge.svg)](https://coveralls.io/r/google/benchmark)
|
[![Coverage Status](https://coveralls.io/repos/google/benchmark/badge.svg)](https://coveralls.io/r/google/benchmark)
|
||||||
@@ -10,10 +9,9 @@ Discussion group: https://groups.google.com/d/forum/benchmark-discuss
|
|||||||
|
|
||||||
IRC channel: https://freenode.net #googlebenchmark
|
IRC channel: https://freenode.net #googlebenchmark
|
||||||
|
|
||||||
Example usage
|
## Example usage
|
||||||
-------------
|
### Basic usage
|
||||||
Define a function that executes the code to be measured a
|
Define a function that executes the code to be measured.
|
||||||
specified number of times:
|
|
||||||
|
|
||||||
```c++
|
```c++
|
||||||
static void BM_StringCreation(benchmark::State& state) {
|
static void BM_StringCreation(benchmark::State& state) {
|
||||||
@@ -34,15 +32,16 @@ BENCHMARK(BM_StringCopy);
|
|||||||
BENCHMARK_MAIN();
|
BENCHMARK_MAIN();
|
||||||
```
|
```
|
||||||
|
|
||||||
Sometimes a family of microbenchmarks can be implemented with
|
### Passing arguments
|
||||||
just one routine that takes an extra argument to specify which
|
Sometimes a family of benchmarks can be implemented with just one routine that
|
||||||
one of the family of benchmarks to run. For example, the following
|
takes an extra argument to specify which one of the family of benchmarks to
|
||||||
code defines a family of microbenchmarks for measuring the speed
|
run. For example, the following code defines a family of benchmarks for
|
||||||
of `memcpy()` calls of different lengths:
|
measuring the speed of `memcpy()` calls of different lengths:
|
||||||
|
|
||||||
```c++
|
```c++
|
||||||
static void BM_memcpy(benchmark::State& state) {
|
static void BM_memcpy(benchmark::State& state) {
|
||||||
char* src = new char[state.range_x()]; char* dst = new char[state.range_x()];
|
char* src = new char[state.range_x()];
|
||||||
|
char* dst = new char[state.range_x()];
|
||||||
memset(src, 'x', state.range_x());
|
memset(src, 'x', state.range_x());
|
||||||
while (state.KeepRunning())
|
while (state.KeepRunning())
|
||||||
memcpy(dst, src, state.range_x());
|
memcpy(dst, src, state.range_x());
|
||||||
@@ -54,18 +53,17 @@ static void BM_memcpy(benchmark::State& state) {
|
|||||||
BENCHMARK(BM_memcpy)->Arg(8)->Arg(64)->Arg(512)->Arg(1<<10)->Arg(8<<10);
|
BENCHMARK(BM_memcpy)->Arg(8)->Arg(64)->Arg(512)->Arg(1<<10)->Arg(8<<10);
|
||||||
```
|
```
|
||||||
|
|
||||||
The preceding code is quite repetitive, and can be replaced with the
|
The preceding code is quite repetitive, and can be replaced with the following
|
||||||
following short-hand. The following invocation will pick a few
|
short-hand. The following invocation will pick a few appropriate arguments in
|
||||||
appropriate arguments in the specified range and will generate a
|
the specified range and will generate a benchmark for each such argument.
|
||||||
microbenchmark for each such argument.
|
|
||||||
|
|
||||||
```c++
|
```c++
|
||||||
BENCHMARK(BM_memcpy)->Range(8, 8<<10);
|
BENCHMARK(BM_memcpy)->Range(8, 8<<10);
|
||||||
```
|
```
|
||||||
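As a rough illustration of how the `Range(8, 8<<10)` shorthand could expand into individual arguments, here is a minimal sketch. It assumes a multiplier of 8 between successive arguments, which this README does not state, and `ExpandRange` is a hypothetical helper written for this example, not part of the library's API.

```cpp
#include <cstdint>
#include <vector>

// Hypothetical helper (not a library function): lists the arguments that a
// Range(lo, hi) call might generate, assuming 0 <= lo <= hi and mult >= 2.
std::vector<std::int64_t> ExpandRange(std::int64_t lo, std::int64_t hi,
                                      std::int64_t mult = 8) {
  std::vector<std::int64_t> args;
  args.push_back(lo);  // the lower bound is always included
  // Add every power of mult that lies strictly between lo and hi.
  for (std::int64_t p = 1; p < hi;) {
    if (p > lo) args.push_back(p);
    if (p > hi / mult) break;  // the next power would exceed hi (or overflow)
    p *= mult;
  }
  if (hi != lo) args.push_back(hi);  // the upper bound is always included
  return args;
}
```

Under that assumption, `ExpandRange(8, 8 << 10)` yields 8, 64, 512, 4096 and 8192, which is close to, but not identical with, the explicit `Arg` list shown earlier.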
|
|
||||||
You might have a microbenchmark that depends on two inputs. For
|
You might have a benchmark that depends on two inputs. For example, the
|
||||||
example, the following code defines a family of microbenchmarks for
|
following code defines a family of benchmarks for measuring the speed of set
|
||||||
measuring the speed of set insertion.
|
insertion.
|
||||||
|
|
||||||
```c++
|
```c++
|
||||||
static void BM_SetInsert(benchmark::State& state) {
|
static void BM_SetInsert(benchmark::State& state) {
|
||||||
@@ -88,19 +86,18 @@ BENCHMARK(BM_SetInsert)
|
|||||||
->ArgPair(8<<10, 512);
|
->ArgPair(8<<10, 512);
|
||||||
```
|
```
|
||||||
|
|
||||||
The preceding code is quite repetitive, and can be replaced with
|
The preceding code is quite repetitive, and can be replaced with the following
|
||||||
the following short-hand. The following macro will pick a few
|
short-hand. The following macro will pick a few appropriate arguments in the
|
||||||
appropriate arguments in the product of the two specified ranges
|
product of the two specified ranges and will generate a benchmark for each such
|
||||||
and will generate a microbenchmark for each such pair.
|
pair.
|
||||||
|
|
||||||
```c++
|
```c++
|
||||||
BENCHMARK(BM_SetInsert)->RangePair(1<<10, 8<<10, 1, 512);
|
BENCHMARK(BM_SetInsert)->RangePair(1<<10, 8<<10, 1, 512);
|
||||||
```
|
```
|
||||||
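The "product of the two specified ranges" can be pictured with the following minimal sketch. It assumes each range is expanded with a multiplier of 8 (an assumption, not something this README states) and then every value of the first range is paired with every value of the second. `ExpandRange` and `ExpandRangePair` are hypothetical helpers for illustration only.

```cpp
#include <cstdint>
#include <utility>
#include <vector>

// Hypothetical helper: lo, the powers of mult strictly between lo and hi,
// then hi. Assumes 0 <= lo <= hi and mult >= 2.
std::vector<std::int64_t> ExpandRange(std::int64_t lo, std::int64_t hi,
                                      std::int64_t mult = 8) {
  std::vector<std::int64_t> args;
  args.push_back(lo);
  for (std::int64_t p = 1; p < hi;) {
    if (p > lo) args.push_back(p);
    if (p > hi / mult) break;  // the next power would exceed hi (or overflow)
    p *= mult;
  }
  if (hi != lo) args.push_back(hi);
  return args;
}

// Hypothetical helper: the Cartesian product of the two expanded ranges,
// with the first argument varying slowest.
std::vector<std::pair<std::int64_t, std::int64_t>> ExpandRangePair(
    std::int64_t lo1, std::int64_t hi1, std::int64_t lo2, std::int64_t hi2) {
  std::vector<std::pair<std::int64_t, std::int64_t>> pairs;
  for (std::int64_t x : ExpandRange(lo1, hi1)) {
    for (std::int64_t y : ExpandRange(lo2, hi2)) {
      pairs.emplace_back(x, y);
    }
  }
  return pairs;
}
```

With these assumptions, `ExpandRangePair(1 << 10, 8 << 10, 1, 512)` produces 12 pairs: three values for the first argument (1024, 4096, 8192) combined with four for the second (1, 8, 64, 512).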
|
|
||||||
For more complex patterns of inputs, passing a custom function
|
For more complex patterns of inputs, passing a custom function to `Apply` allows
|
||||||
to Apply allows programmatic specification of an
|
programmatic specification of an arbitrary set of arguments on which to run the
|
||||||
arbitrary set of arguments to run the microbenchmark on.
|
benchmark. The following example enumerates a dense range on one parameter,
|
||||||
The following example enumerates a dense range on one parameter,
|
|
||||||
and a sparse range on the second.
|
and a sparse range on the second.
|
||||||
|
|
||||||
```c++
|
```c++
|
||||||
@@ -112,9 +109,10 @@ static void CustomArguments(benchmark::internal::Benchmark* b) {
|
|||||||
BENCHMARK(BM_SetInsert)->Apply(CustomArguments);
|
BENCHMARK(BM_SetInsert)->Apply(CustomArguments);
|
||||||
```
|
```
|
||||||
|
|
||||||
Templated microbenchmarks work the same way:
|
### Templated benchmarks
|
||||||
Produce then consume 'size' messages 'iters' times
|
Templated benchmarks work the same way: This example produces and consumes
|
||||||
Measures throughput in the absence of multiprogramming.
|
messages of size `sizeof(v)` `range_x` times. It also outputs throughput in the
|
||||||
|
absence of multiprogramming.
|
||||||
|
|
||||||
```c++
|
```c++
|
||||||
template <class Q> int BM_Sequential(benchmark::State& state) {
|
template <class Q> int BM_Sequential(benchmark::State& state) {
|
||||||
@@ -145,11 +143,12 @@ Three macros are provided for adding benchmark templates.
|
|||||||
#define BENCHMARK_TEMPLATE2(func, arg1, arg2)
|
#define BENCHMARK_TEMPLATE2(func, arg1, arg2)
|
||||||
```
|
```
|
||||||
|
|
||||||
|
### Multithreaded benchmarks
|
||||||
In a multithreaded test (benchmark invoked by multiple threads simultaneously),
|
In a multithreaded test (benchmark invoked by multiple threads simultaneously),
|
||||||
it is guaranteed that none of the threads will start until all have called
|
it is guaranteed that none of the threads will start until all have called
|
||||||
KeepRunning, and all will have finished before KeepRunning returns false. As
|
`KeepRunning`, and all will have finished before `KeepRunning` returns false. As
|
||||||
such, any global setup or teardown you want to do can be
|
such, any global setup or teardown can be wrapped in a check against the thread
|
||||||
wrapped in a check against the thread index:
|
index:
|
||||||
|
|
||||||
```c++
|
```c++
|
||||||
static void BM_MultiThreaded(benchmark::State& state) {
|
static void BM_MultiThreaded(benchmark::State& state) {
|
||||||
@@ -176,6 +175,7 @@ BENCHMARK(BM_test)->Range(8, 8<<10)->UseRealTime();
|
|||||||
|
|
||||||
Without `UseRealTime`, CPU time is used by default.
|
Without `UseRealTime`, CPU time is used by default.
|
||||||
|
|
||||||
|
### Preventing optimisation
|
||||||
To prevent a value or expression from being optimized away by the compiler
|
To prevent a value or expression from being optimized away by the compiler
|
||||||
the `benchmark::DoNotOptimize(...)` function can be used.
|
the `benchmark::DoNotOptimize(...)` function can be used.
|
||||||
|
|
||||||
@@ -190,8 +190,15 @@ static void BM_test(benchmark::State& state) {
|
|||||||
}
|
}
|
||||||
```
|
```
|
||||||
|
|
||||||
Benchmark Fixtures
|
## Controlling number of iterations
|
||||||
------------------
|
In all cases, the number of iterations for which the benchmark is run is
|
||||||
|
governed by the amount of time the benchmark takes. Concretely, the benchmark
|
||||||
|
runs for at least one and at most 1e9 iterations, stopping once the CPU time
|
||||||
|
exceeds the minimum time or the wallclock time reaches 5x the minimum time. The
|
||||||
|
minimum time is set with the `--benchmark_min_time` flag, or per benchmark by
|
||||||
|
calling `MinTime` on the registered benchmark object.
|
||||||
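The stopping rule described above can be written down as a small predicate. This is only a sketch of the rule as worded in this section, not the library's actual implementation; `RunSample`, `kMaxIterations` and `EnoughIterations` are hypothetical names introduced for the example.

```cpp
#include <cstdint>

// Hypothetical record of what has been measured so far for one benchmark.
struct RunSample {
  std::int64_t iterations;  // iterations executed so far
  double cpu_seconds;       // accumulated CPU time
  double real_seconds;      // accumulated wallclock time
};

// Upper bound on the iteration count (1e9, as stated in the text).
constexpr std::int64_t kMaxIterations = 1000000000;

// Returns true once the run may stop, following the rule in the text:
// at least one iteration, and then any of
//   - the iteration cap has been reached,
//   - CPU time is greater than the minimum time,
//   - wallclock time has reached 5x the minimum time.
bool EnoughIterations(const RunSample& s, double min_time) {
  if (s.iterations < 1) return false;
  return s.iterations >= kMaxIterations || s.cpu_seconds > min_time ||
         s.real_seconds >= 5 * min_time;
}
```

For example, with a minimum time of 0.5 seconds, a run that has used 0.1 s of CPU time but 2.5 s of wallclock time would be allowed to stop, which is what keeps benchmarks that mostly sleep or block from running indefinitely.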
|
|
||||||
|
## Fixtures
|
||||||
Fixture tests are created by
|
Fixture tests are created by
|
||||||
first defining a type that derives from ::benchmark::Fixture and then
|
first defining a type that derives from ::benchmark::Fixture and then
|
||||||
creating/registering the tests using the following macros:
|
creating/registering the tests using the following macros:
|
||||||
@@ -221,8 +228,7 @@ BENCHMARK_REGISTER_F(MyFixture, BarTest)->Threads(2);
|
|||||||
/* BarTest is now registered */
|
/* BarTest is now registered */
|
||||||
```
|
```
|
||||||
|
|
||||||
Output Formats
|
## Output Formats
|
||||||
--------------
|
|
||||||
The library supports multiple output formats. Use the
|
The library supports multiple output formats. Use the
|
||||||
`--benchmark_format=<tabular|json>` flag to set the format type. `tabular` is
|
`--benchmark_format=<tabular|json>` flag to set the format type. `tabular` is
|
||||||
the default format.
|
the default format.
|
||||||
@@ -290,8 +296,7 @@ name,iterations,real_time,cpu_time,bytes_per_second,items_per_second,label
|
|||||||
"BM_SetInsert/1024/10",106365,17238.4,8421.53,4.74973e+06,1.18743e+06,
|
"BM_SetInsert/1024/10",106365,17238.4,8421.53,4.74973e+06,1.18743e+06,
|
||||||
```
|
```
|
||||||
|
|
||||||
Debug vs Release
|
## Debug vs Release
|
||||||
----------------
|
|
||||||
By default, benchmark builds as a debug library. You will see a warning in the output when this is the case. To build it as a release library instead, use:
|
By default, benchmark builds as a debug library. You will see a warning in the output when this is the case. To build it as a release library instead, use:
|
||||||
|
|
||||||
```
|
```
|
||||||
@@ -304,6 +309,5 @@ To enable link-time optimisation, use
|
|||||||
cmake -DCMAKE_BUILD_TYPE=Release -DBENCHMARK_ENABLE_LTO=true
|
cmake -DCMAKE_BUILD_TYPE=Release -DBENCHMARK_ENABLE_LTO=true
|
||||||
```
|
```
|
||||||
|
|
||||||
Linking against the library
|
## Linking against the library
|
||||||
---------------------------
|
|
||||||
When using gcc, it is necessary to link against pthread to avoid runtime exceptions. This is due to how gcc implements std::thread. See [issue #67](https://github.com/google/benchmark/issues/67) for more details.
|
When using gcc, it is necessary to link against pthread to avoid runtime exceptions. This is due to how gcc implements std::thread. See [issue #67](https://github.com/google/benchmark/issues/67) for more details.
|
||||||
|