benchmark

mirror of https://github.com/google/benchmark.git synced 2025-03-15 11:40:12 +08:00

Author	SHA1	Message	Date
Dominic Hamon	4fbfa2f336	Some platforms and environments don't pass a valid argc/argv. (#607 ) Specifically some iOS targets.	2018-05-30 13:17:41 +01:00
Alex Strelnikov	e776aa0275	Add benchmark_main target. (#601 ) * Add benchmark_main library with support for Bazel. * fix newline at end of file * Add CMake support for benchmark_main. * Mention optionally using benchmark_main in README.	2018-05-25 11:18:58 +01:00
Nan Xiao	e90801ae47	Remove unnecessary memset functions. (#591 )	2018-05-09 10:31:24 +01:00
Sam Clegg	8986839e4a	Use __EMSCRIPTEN__ (rather than EMSCRIPTEN) to check for emscripten (#583 ) The old EMSCRIPTEN macro is deprecated and not enabled when EMCC_STRICT is set. Also fix a typo in EMSCRIPTN (not sure how this ever worked).	2018-05-03 09:34:26 +01:00
Nan Xiao	ea5551e7b3	Porting into OpenBSD (#582 )	2018-05-02 11:26:43 +01:00
Tim Bradgate	ed1bac8434	Issue 571: Allow support for negative regex filtering (#576 ) * Allow support for negative regex filtering This patch allows one to apply a negation to the entire regex filter by appending it with a '-' character, much in the same style as GoogleTest uses. * Address issues in PR * Add unit tests for negative filtering	2018-04-26 10:56:06 +01:00
Victor Costan	64d4805dd7	Fix precision loss warning in MSVC. (#574 )	2018-04-23 11:58:02 +01:00
Dominic Hamon	c4858d8012	Report the actual iterations run. (#572 ) Before this change, we would report the number of requested iterations passed to the state. After, we will report the actual number run. As a side-effect, instead of multiplying the expected iterations by the number of threads to get the total number, we can report the actual number of iterations across all threads, which takes into account the situation where some threads might run more iterations than others.	2018-04-19 18:40:08 +01:00
Dominic Hamon	64e5a13fa0	Ensure 64-bit truncation doesn't happen for complexity_n (#569 ) * Ensure 64-bit truncation doesn't happen for complexity results * One more complexity_n 64-bit fix * Missed another vector of int * Piping through the int64_t	2018-04-12 15:40:24 +01:00
Fred Tingaud	50ffc781b1	Optimize by using nth_element instead of partial_sort to find the median. (#565 )	2018-04-09 13:40:58 +01:00
Dominic Hamon	9913418d32	Allow AddRange to work with int64_t. (#548 ) * Allow AddRange to work with int64_t. Fixes #516 Also, tweak how we manage per-test build needs, and create a standard _gtest suffix for googletest to differentiate from non-googletest tests. I also ran clang-format on the files that I changed (but not the benchmark include or main src as they have too many clang-format issues). * Add benchmark_gtest to cmake * Set(Items\|Bytes)Processed now take int64_t	2018-04-03 23:12:47 +01:00
Dominic Hamon	df60aeb266	Rely on compiler intrinsics to identify regex engine. (#555 ) Having the copts set on a per-target level can lead to ODR violations in some cases. Avoid this by ensuring the regex engine is picked through compiler intrinsics in the header directly.	2018-03-23 11:45:15 +00:00
Eric Fiselier	e668e2a1ba	Fix #552 - GCC and Clang warn on possibly invalid offsetof usage. This patch disables the -Winvalid-offsetof warning for GCC and Clang when using it to check the cache lines of the State object. Technically this usage of offsetof is undefined behavior until C++17. However, all major compilers support this application as an extension, as demonstrated by the passing static assert (If a compiler encounters UB during evaluation of a constant expression, that UB must be diagnosed). Unfortunately, Clang and GCC also produce a warning about it. This patch temporarily suppresses the warning using #pragma's in the source file (instead of globally suppressing the warning in the build systems). This way the warning is ignored for both CMake and Bazel builds without having to modify either build system.	2018-03-21 13:47:25 -06:00
Dominic Hamon	674d0498b8	Move thread classes out to clean up monolithic code (#554 )	2018-03-16 10:14:38 +00:00
Wink Saville	61497236dd	Make string_util naming more consistent (#547 ) * Rename StringXxx to StrXxx in string_util.h and its users This makes the naming consistent within string_util and moves is the Abseil convention. * Style guide is 2 spaces before end of line "//" comments * Rename StrPrintF/StringPrintF to StrFormat for absl compatibility.	2018-03-07 11:20:06 +00:00
Wink Saville	f48a28d12a	Do not let StrCat be renamed to lstrcatA (#546 ) On Windows the Shlwapi.h file has a macro: #define StrCat lstrcatA And benchmark/src/string_util.h defines StrCat and it is renamed to lstrcatA if we don't undef the macro in Shlwapi.h. This is an innocuous bug if string_util.h is included after Shlwapi.h, but it is a compile error if string_util.h is included before Shlwapi.h. This fixes issue #545.	2018-03-06 18:15:03 +00:00
Wink Saville	69a52cff4f	Spelling fixes (#543 ) Upstream spelling fix changes from Pony, ec47ba8f565726414552f4bbf97d7, by ka7@la-evento.com that affected google/benchmark.	2018-03-06 11:44:25 +00:00
alekseyshl	47df49e573	Add Solaris support (#539 ) * Add Solaris support Define BENCHMARK_OS_SOLARIS for Solaris. Platform specific implementations added: * number of CPUs detection * CPU cycles per second detection * Thread CPU usage * Process CPU usage * Remove the special case for per process CPU time for Solaris, it's the same as the default.	2018-03-02 03:53:58 -08:00
Robert Guo	ff2c255af5	Use STCK to get the CPU clock on s390x (#540 )	2018-03-02 03:22:03 -08:00
Eric	56f52ee228	Print the executable name as part of the context. (#534 ) * Print the executable name as part of the context. A common use case of the library is to run two different versions of a benchmark to compare them. In my experience this often means compiling a benchmark twice, renaming one of the executables, and then running the executables back-to-back. In this case the name of the executable is important contextual information. Unfortunately the benchmark does not report this information. This patch adds the executable name to the context reported by the benchmark. * attempt to fix tests on Windows * attempt to fix tests on Windows	2018-02-21 08:43:57 -08:00
Ian McKellar	6ecf8a8e80	Don't include <sys/resource.h> on Fuchsia. (#531 ) * Don't include <sys/resource.h> on Fuchsia. It doesn't support POSIX resource measurement and timing APIs. Change-Id: Ifab4bac4296575f042c699db1ce5a4f7c2d82893 * Add BENCHMARK_OS_FUCHSIA for Fuchsia Change-Id: Ic536f9625e413270285fbfd08471dcb6753ddad1	2018-02-14 14:17:12 -07:00
Eric	207b9c7aec	Improve State packing: put important members on first cache line. (#527 ) * Improve State packing: put important members on first cache line. This patch does a few different things to ensure commonly accessed data is on the first cache line of the `State` object. First, it moves the `error_occurred_` member to reside after the `started_` and `finished_` bools, since there was internal padding there that was unused. Second, it moves `batch_leftover_` and `max_iterations` further up in the struct declaration. These variables are used in the calculation of `iterations()` which users might call within the loop. Therefore it's more important they exist on the first cache line. Finally, this patch turns the bool members into bitfields. Although this shouldn't have much of an effect currently, because padding is still needed between the last bool and the first size_t, it should help in future changes that require more "bool like" members. * Remove bitfield change for now * Move bools (and their padding) to end of "first cache line" vars. I think it makes the most sense to move the padding required following the group of bools to the end of the variables we want on the first cache line. This also means that the `total_iterations_` variable, which is the most accessed, has the same address as the State object. * Fix static assertion after moving bools	2018-02-14 13:44:41 -07:00
Samuel Panzer	296ec5693e	Support State::KeepRunningBatch(). (#521 ) * Support State::KeepRunningBatch(). State::KeepRunning() can take large amounts of time relative to quick operations (on the order of 1ns, depending on hardware). For such sensitive operations, it is recommended to run batches of repeated operations. This commit simplifies handling of total_iterations_. Rather than predecrementing such that total_iterations_ == 1 signals that KeepRunning() should exit, total_iterations_ == 0 now signals the intention for the benchmark to exit. * Create better fast path in State::KeepRunningBatch() * Replace int parameter with size_t to fix signed mismatch warnings * Ensure benchmark State has been started even on error. * Simplify KeepRunningBatch()	2018-02-09 21:57:04 -07:00
Dominic Hamon	df415adb2a	Some small clang-tidy fixes (#520 )	2018-01-29 08:38:47 -08:00
oskidan	4fe0206b65	Fixes compilation error caused by integer precision loss due to implicit (#518 ) conversion in sysinfo.cc	2018-01-19 09:17:01 -08:00
Dominic Hamon	9f5694ceb6	Wrap COMPILER macros. (#514 ) Some command line or build systems may already set these (eg, bazel) so make sure that takes priority. Fixes #513	2018-01-11 17:22:45 -08:00
Victor Costan	95a1435b81	Fix compilation error with GCC on OSX (issue #490 ). (#491 )	2017-11-30 08:05:38 -08:00
Roman Lebedev	ec5684ed75	Console reporter: properly account for the length of custom counter names (#484 ) Old output example: ``` Benchmark Time CPU Iterations CPUTime,s Pixels/s ThreadingFactor ------------------------------------------------------------------------------------------------------------------------------ 20170525_0036TEST.RAF/threads:8/real_time 45 ms 45 ms 16 0.718738 79.6277M/s 0.999978 2.41419GB/s 22.2613 items/s FileSize,MB=111.050781; MPix=57.231360 ``` New output example: ``` Benchmark Time CPU Iterations CPUTime,s Pixels/s ThreadingFactor ------------------------------------------------------------------------------------------------------------------------------ 20170525_0036TEST.RAF/threads:8/real_time 45 ms 45 ms 16 0.713575 80.1713M/s 0.999571 2.43067GB/s 22.4133 items/s FileSize,MB=111.050781; MPix=57.231360 ```	2017-11-27 09:01:01 -08:00
Eric Fiselier	2ec7399cf1	Improve BENCHMARK_UNREACHABLE() implementation. This patch primarily changes the BENCHMARK_UNREACHABLE() implementation under MSVC to use __assume(false) instead of being a NORETURN function, which ironically caused unreachable code warnings. Second, since the NOTHROW function attempt generated the warnings we meant to avoid, it has been replaced with a dummy null statement.	2017-11-26 13:58:24 -07:00
Eric	11dc36822b	Improve CPU Cache info reporting -- Add Windows support. (#486 ) * Improve CPU Cache info reporting -- Add Windows support. This patch does a couple of things regarding CPU Cache reporting. First, it adds an implementation on Windows. Second it fixes the JSONReporter to correctly (and actually) output the CPU configuration information. And finally, third, it detects and reports the number of physical CPU's that share the same cache.	2017-11-26 13:33:01 -07:00
Eric	27e0b439cf	Refactor System information collection -- Add CPU Cache Info (#483 ) * Refactor System information collection. This patch refactors the system information collection, and in particular information about the target CPU. The motivation is to make it easier to access CPU information, and easier to add new information as need be. This patch additionally adds information about the cache sizes of the CPU. * Address review comments: Clean up integer types. This commit cleans up the integer types used in ValueUnion to follow the Google style guide. Additionally it adds a BENCHMARK_UNREACHABLE macro to assist in documenting/catching unreachable code paths. * Rename ValueUnion accessors.	2017-11-22 08:33:52 -08:00
Kamil Rytarowski	aad6a5fa76	Add NetBSD support (#482 ) Define BENCHMARK_OS_NETBSD for NetBSD. Add detection of cpuinfo_cycles_per_second and cpuinfo_num_cpus. This code shared detection of these properties with FreeBSD.	2017-11-17 08:46:08 -08:00
Steinar H. Gunderson	0c3ec998c4	Add a pkg-config file, for the benefit of projects not using CMake. (#480 )	2017-11-15 11:51:22 -08:00
Eric	72a4581caf	Fix #382 - MinGW often reports negative CPU times. (#475 ) When stopping a timer, the current time is subtracted from the start time. However, when the times are identical, or sufficiently close together, the subtraction can result in a negative number. For some reason MinGW is the only platform where this problem manifests. I suspect it's due to MinGW specific behavior in either the CPU timing code, floating point model, or printf formatting. Either way, the fix for MinGW should be correct across all platforms.	2017-11-07 09:44:39 -08:00
Dominic Hamon	f65c6d9a2c	Remove deprecated headers (#473 )	2017-11-06 08:53:23 -08:00
Yangqing Jia	491360b833	Add option to install benchmark (#463 ) * Add option to install benchmark * Change to BENCHMARK_ENABLE_INSTALL per @dominichamon	2017-10-20 13:49:37 -07:00
Eric	a37fc0c48a	Improve KeepRunning loop performance to be similar to the range-based for. (#460 ) This patch improves the performance of the KeepRunning loop in two ways: (A) it removes the dependency on the max_iterations variable, preventing it from being loaded every iteration. (B) it loops to zero, instead of to an upper bound. This allows a single decrement instruction to be used instead of an arithmetic op followed by a comparison.	2017-10-17 08:40:44 -07:00
Raúl Marín	cacd321808	Avoid implicit float to double conversion (#457 ) Triggered by -Werror=double-promotion	2017-10-13 09:17:02 -07:00
Dominic Hamon	2409cb2eb1	Minor move of code to clean up namespace spaghetti a bit	2017-10-09 12:01:30 -07:00
Roman Lebedev	a271c36af9	Drop Stat1, refactor statistics to be user-providable, add median. (#428 ) * Drop Stat1, refactor statistics to be user-providable, add median. My main goal was to add median statistic. Since Stat1 calculated the stats incrementally, and did not store the values themselves, this was not possible. Thus, i have replaced Stat1 with simple std::vector<double>, containing all the values. Then, i have refactored current mean/stdev to be a function that is provided with values vector, and returns the statistic. While there, it seemed to make sense to deduplicate the code by storing all the statistics functions in a map, and then simply iterate over it. And the interface to add new statistics is intentionally exposed, so they may be added easily. The notable change is that Iterations are no longer displayed as 0 for stdev. It could be changed, but i'm not sure how to nicely fit that into the API. Similarly, this dance about sometimes (for some fields, for some statistics) dividing by run.iterations, and then multiplying the calculated statistic back is also dropped, and if you do the math, i fail to see why it was needed there in the first place. Since that was the only use of stat.h, it is removed. * complexity.h: attempt to fix MSVC build * Update README.md * Store statistics to compute in a vector, ensures ordering. * Add a bit more tests for repetitions. * Partially address review notes. * Fix gcc build: drop extra ';' clang, why didn't you warn me? * Address review comments. * double() -> 0.0 * early return	2017-08-23 16:44:29 -07:00
Dominic Hamon	d70417994a	Allow the definition of 1k to be flexible. (#438 ) When generating a human-readable number for user counters, we don't generally expect 1k to be 1024. This is the default due to the more general purpose string utility. Fixes #437	2017-08-21 16:05:24 -07:00
Roman Lebedev	b9be142d1e	Json reporter: don't cast floating-point to int; adjust tooling (#426 ) * Json reporter: passthrough fp, don't cast it to int; adjust tooling Json output format is generally meant for further processing using some automated tools. Thus, it makes sense not to intentionally limit the precision of the values contained in the report. As it can be seen, FormatKV() for doubles, used %.2f format, which was meant to preserve at least some of the precision. However, before that function is ever called, the doubles were already cast to the integer via RoundDouble()... This is also the case for console reporter, where it makes sense because the screen space is limited, and this reporter, however the CSV reporter does output some decimal digits. Thus i can only conclude that the loss of the precision was not really considered, so i have decided to adjust the code of the json reporter to output the full fp precision. There can be several reasons why that is the right thing to do, the bigger the time_unit used, the greater the precision loss, so i'd say any sort of further processing (like e.g. tools/compare_bench.py does) is best done on the values with most precision. Also, that cast skewed the data away from zero, which i think may or may not result in false positives/negatives in the output of tools/compare_bench.py * Json reporter: FormatKV(double): address review note * tools/gbench/report.py: skip benchmarks with different time units While it may be useful to teach it to operate on the measurements with different time units, which is now possible since floats are stored, and not the integers, but for now at least doing such a sanity-checking is better than providing misinformation.	2017-07-24 16:13:55 -07:00
Dominic Hamon	5b7683f49e	more clang tidy cleanups (#417 )	2017-07-15 00:21:20 +02:00
Dominic Hamon	e8fc2a2b8c	Google-style cleanups (#416 )	2017-07-13 18:33:43 +02:00
Tom Madams	ee3cfca651	Fix ThreadCPUUsage when running on RTEMS. (#414 ) Change ThreadCPUUsage to call ProcessCPUUsage if __rtems__ is defined. RTEMS real time OS doesn't support CLOCK_THREAD_CPUTIME_ID. See https://github.com/RTEMS/rtems/blob/master/cpukit/posix/src/clockgettime.c#L58-L59 Prior to this change, ThreadCPUUsage would fail when running on RTEMS with: ERROR: clock_gettime(CLOCK_THREAD_CPUTIME_ID, ...) failed	2017-07-06 15:59:13 -07:00
Eric	9d4b719dae	Make Benchmark a single header library (but not header-only) (#407 ) * Make Benchmark a single header library (but not header-only) This patch refactors benchmark into a single header, to allow for slightly easier usage. The initial reason for the header split was to keep C++ library components from being included by benchmark_api.h, making that part of the library STL agnostic. However this has since changed and there seems to be little reason to separate the reporters from the rest of the library. * Fix internal_macros.h * Remove more references to macros.h	2017-07-04 16:31:47 -06:00
Eric	b8a2206fb2	Add ClearRegisteredBenchmark() function. (#402 ) * Add ClearRegisteredBenchmark() function. Since benchmarks can be registered at runtime using the RegisterBenchmark(...) functions, it makes sense to have a ClearRegisteredBenchmarks() function too, that can be used at runtime to clear the currently registered benchmark and re-register an entirely new set. This allows users to run a set of registered benchmarks, get the output using a custom reporter, and then clear and re-register new benchmarks based on the previous results. This fixes issue #400, at least partially. * Remove unused change	2017-06-14 09:16:53 -07:00
Yixuan Qiu	f3b3dd99be	Use the sample version of standard deviation (#383 ) * remove unnecessary weights * use sample standard deviation * add contributor information * remove redundant code * initialize variable to eliminate compiler warning	2017-06-05 10:32:15 -07:00
David Kruger	15e9ebaf83	Associate the required include directory with the benchmark library (#393 ) Using target_include_directories CMake will implicitly add the necessary include paths to targets which link against the benchmark library. This is useful when the benchmark repo is included as a subdirectory in another CMake build.	2017-05-23 08:40:31 -07:00
Joao Paulo Magalhaes	ec6f03579e	Trying again to fix error caused by -Wunused-function. This thing with the pragma ignore was getting out of hand: now MinGW (and probably GCC) was erroring too. So I chose to move the definition of IsZero() out of the anonymous namespace into benchmark.cc.	2017-05-03 00:05:15 +01:00
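
Commit 50ffc781b1 above switches the median computation from partial_sort to nth_element. As a rough illustration of that technique only, here is a minimal sketch; the function name `Median` and its signature are hypothetical and are not taken from the library's source. std::nth_element places the middle element as it would sit in sorted order without fully sorting the rest, which is linear on average.

```cpp
#include <algorithm>
#include <cassert>
#include <cstddef>
#include <vector>

// Hypothetical helper for illustration; not the library's actual code.
// The vector is taken by value because std::nth_element reorders its input.
double Median(std::vector<double> values) {
  assert(!values.empty());  // Precondition assumed by this sketch.
  const std::size_t mid = values.size() / 2;
  // After this call, values[mid] holds the element that would be at index
  // `mid` in sorted order, and everything before it is no larger.
  std::nth_element(values.begin(), values.begin() + mid, values.end());
  const double upper = values[mid];
  if (values.size() % 2 == 1) return upper;
  // Even count: the lower middle is the largest element of the left part.
  const double lower = *std::max_element(values.begin(), values.begin() + mid);
  return (lower + upper) / 2.0;
}
```

For an even number of values the sketch averages the two middle elements, which costs one extra linear pass over the left partition.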


376 Commits
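
Commit f3b3dd99be in the log above moves the reported standard deviation to the sample version, which divides the summed squared deviations by n - 1 rather than n. The following is a minimal sketch of that formula under stated assumptions: the name `SampleStdDev` is hypothetical, and returning 0.0 for a single value is a choice made here, not something the commit message specifies.

```cpp
#include <cassert>
#include <cmath>
#include <vector>

// Hypothetical helper for illustration; not the library's actual code.
double SampleStdDev(const std::vector<double>& values) {
  assert(!values.empty());  // Precondition assumed by this sketch.
  // With one value the n - 1 divisor would be zero; this sketch returns 0.0.
  if (values.size() < 2) return 0.0;
  const double n = static_cast<double>(values.size());
  double sum = 0.0;
  for (double v : values) sum += v;
  const double mean = sum / n;
  double squared_deviations = 0.0;
  for (double v : values) squared_deviations += (v - mean) * (v - mean);
  // Bessel's correction: divide by n - 1 instead of n.
  return std::sqrt(squared_deviations / (n - 1.0));
}
```

The two-pass form is used here for clarity; it needs all values in memory, which matches the later move (a271c36af9) to storing results in a std::vector<double>.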