benchmark

mirror of https://github.com/google/benchmark.git synced 2025-03-17 04:30:07 +08:00

Author	SHA1	Message	Date
Eric	0526755944	Add C++11 Ranged For loop alternative to KeepRunning (#454 ) * Add C++11 Ranged For loop alternative to KeepRunning As pointed out by @astrelni and @dominichamon, the KeepRunning loop requires a bunch of memory loads and stores every iterations, which affects the measurements. The main reason for these additional loads and stores is that the State object is passed in by reference, making its contents externally visible memory, and the compiler doesn't know it hasn't been changed by non-visible code. It's also possible the large size of the State struct is hindering optimizations. This patch allows the `State` object to be iterated over using a range-based for loop. Example: void BM_Foo(benchmark::State& state) { for (auto _ : state) { [...] } } This formulation is much more efficient, because the variable counting the loop index is stored in the iterator produced by `State::begin()`, which itself is stored in function-local memory and therefore not accessible by code outside of the function. Therefore the compiler knows the iterator hasn't been changed every iteration. This initial patch and idea was from Alex Strelnikov. * Fix null pointer initialization in C++03	2017-10-10 08:56:42 -07:00
mwinterb	f3cd636f18	Always use inline asm DoNotOptimize with clang. (#452 ) * Always use inline asm DoNotOptimize with clang. clang-cl masquerades as MSVC but not GCC, so it was using the MSVC-compatible definitions of DoNotOptimize and ClobberMemory. Presumably, it's better in general to use the targeted assembly for this functionality (the codegen is different), but the specific issue is that clang-cl deprecates the usage of _ReadWriteBarrier, and this gets rid of that warning. * triggering another AppVeyor run	2017-10-10 00:19:01 +02:00
Anton Lashkov	819adb4cd1	Add macros for create benchmark with templated fixture (#451 ) * Add macros for create benchmark with templated fixture * Add info about templated fixtures to README.md * Add tests for templated fixtures	2017-10-09 21:10:37 +02:00
Dominic Hamon	2409cb2eb1	Minor move of code to cleanup up namespace spaghetti a bit	2017-10-09 12:01:30 -07:00
Dominic Hamon	a96ff121b3	Alphabets are hard. AUTHORS version. #448	2017-09-27 11:53:16 -07:00
Dominic Hamon	5d47e9878f	Alphabets are hard. CONTRIBUTORS version. #448	2017-09-27 11:52:47 -07:00
Dominic Hamon	8792dff1c9	Remove myself from AUTHORS Covered by Google Inc here and i'm in CONTRIBUTORS	2017-09-27 20:01:49 +02:00
Dominic Hamon	359120be78	Order CONTRIBUTORS Fixes #448	2017-09-27 20:01:10 +02:00
Dominic Hamon	84a54ae9f4	Organize AUTHORS Part of #448	2017-09-27 20:00:12 +02:00
Eric	6d8339dd97	Fix #444 - Use BENCHMARK_HAS_CXX11 over __cplusplus. (#446 ) * Fix #444 - Use BENCHMARK_HAS_CXX11 over __cplusplus. MSVC incorrectly defines __cplusplus to report C++03, despite the compiler actually providing C++11 or greater. Therefore we have to detect C++11 differently for MSVC. This patch uses `_MSVC_LANG` which has been defined since Visual Studio 2015 Update 3; which should be sufficient for detecting C++11. Secondly this patch changes over most usages of __cplusplus >= 201103L to check BENCHMARK_HAS_CXX11 instead. * remove redunant comment	2017-09-14 15:50:33 -06:00
Disconnect3d	2a05f248be	Improve README's basic usage example (#433 )	2017-09-14 09:31:35 +02:00
Andre Schroeder	24b8042733	Fix Markdown typos in readme. (#445 )	2017-09-13 15:42:45 -06:00
Roman Lebedev	886585a3b7	[RFC] Tools: compare-bench.py: print change% with two decimal digits (#440 ) * Tools: compare-bench.py: print change% with two decimal digits Here is a comparison of before vs. after: ```diff -Benchmark Time CPU Time Old Time New CPU Old CPU New ---------------------------------------------------------------------------------------------------------- -BM_SameTimes +0.00 +0.00 10 10 10 10 -BM_2xFaster -0.50 -0.50 50 25 50 25 -BM_2xSlower +1.00 +1.00 50 100 50 100 -BM_1PercentFaster -0.01 -0.01 100 99 100 99 -BM_1PercentSlower +0.01 +0.01 100 101 100 101 -BM_10PercentFaster -0.10 -0.10 100 90 100 90 -BM_10PercentSlower +0.10 +0.10 100 110 100 110 -BM_100xSlower +99.00 +99.00 100 10000 100 10000 -BM_100xFaster -0.99 -0.99 10000 100 10000 100 -BM_10PercentCPUToTime +0.10 -0.10 100 110 100 90 +Benchmark Time CPU Time Old Time New CPU Old CPU New +------------------------------------------------------------------------------------------------------------- +BM_SameTimes +0.0000 +0.0000 10 10 10 10 +BM_2xFaster -0.5000 -0.5000 50 25 50 25 +BM_2xSlower +1.0000 +1.0000 50 100 50 100 +BM_1PercentFaster -0.0100 -0.0100 100 99 100 99 +BM_1PercentSlower +0.0100 +0.0100 100 101 100 101 +BM_10PercentFaster -0.1000 -0.1000 100 90 100 90 +BM_10PercentSlower +0.1000 +0.1000 100 110 100 110 +BM_100xSlower +99.0000 +99.0000 100 10000 100 10000 +BM_100xFaster -0.9900 -0.9900 10000 100 10000 100 +BM_10PercentCPUToTime +0.1000 -0.1000 100 110 100 90 +BM_ThirdFaster -0.3333 -0.3333 100 67 100 67 ``` So the first ("Time") column is exactly where it was, but with two more decimal digits. The position of the '.' in the second ("CPU") column is shifted right by those two positions, and the rest is unmodified, but simply shifted right by those 4 positions. As for the reasoning, i guess it is more or less the same as with #426. In some sad times, microbenchmarking is not applicable. In those cases, the more precise the change report is, the better. The current formatting prints not so much the percentages, but the fraction i'd say. It is more useful for huge changes, much more than 100%. That is not always the case, especially if it is not a microbenchmark. Then, even though the change may be good/bad, the change is small (<0.5% or so), rounding happens, and it is no longer possible to tell. I do acknowledge that this change does not fix that problem. Of course, confidence intervals and such would be better, and they would probably fix the problem. But i think this is good as-is too, because now the you see 2 fractional percentage digits!1 The obvious downside is that the output is now even wider. * Revisit tests, more closely documents the current behavior.	2017-08-28 16:12:18 -07:00
Roman Lebedev	6e06648133	Attempting to resolve a submoduling issues... (#439 )	2017-08-28 16:10:19 -07:00
Roman Lebedev	a271c36af9	Drop Stat1, refactor statistics to be user-providable, add median. (#428 ) * Drop Stat1, refactor statistics to be user-providable, add median. My main goal was to add median statistic. Since Stat1 calculated the stats incrementally, and did not store the values themselves, it is was not possible. Thus, i have replaced Stat1 with simple std::vector<double>, containing all the values. Then, i have refactored current mean/stdev to be a function that is provided with values vector, and returns the statistic. While there, it seemed to make sense to deduplicate the code by storing all the statistics functions in a map, and then simply iterate over it. And the interface to add new statistics is intentionally exposed, so they may be added easily. The notable change is that Iterations are no longer displayed as 0 for stdev. Is could be changed, but i'm not sure how to nicely fit that into the API. Similarly, this dance about sometimes (for some fields, for some statistics) dividing by run.iterations, and then multiplying the calculated stastic back is also dropped, and if you do the math, i fail to see why it was needed there in the first place. Since that was the only use of stat.h, it is removed. * complexity.h: attempt to fix MSVC build * Update README.md * Store statistics to compute in a vector, ensures ordering. * Add a bit more tests for repetitions. * Partially address review notes. * Fix gcc build: drop extra ';' clang, why didn't you warn me? * Address review comments. * double() -> 0.0 * early return	2017-08-23 16:44:29 -07:00
Dominic Hamon	d70417994a	Allow the definition of 1k to be flexible. (#438 ) When generating a human-readable number for user counters, we don't generally expect 1k to be 1024. This is the default due to the more general purpose string utility. Fixes #437	2017-08-21 16:05:24 -07:00
Roman Lebedev	c7192c8a9a	compare_bench.py: fixup benchmark_options. (#435 ) `2373382284` reworked parsing, and introduced a regression in handling of the optional options that should be passed to both of the benchmarks. Now, unless the first optional argument starts with '-', it would just complain about that argument: Unrecognized positional argument arguments: '['q']' which is wrong. However if some dummy arg like '-q' was passed first, it would then happily passthrough them all... This commit fixes benchmark_options behavior, by restoring original passthrough behavior for all the optional positional arguments.	2017-08-18 10:55:27 -07:00
Victor Costan	902936033d	CMake: Fallback from try_run to try_compile when cross-compiling. (#436 )	2017-08-15 15:53:30 -07:00
Roman Lebedev	3347a20e0e	reporter_output_test: json: iterations is int, not float (#431 ) May be relevant for flakiness of win builds Noted by @KindDragon	2017-07-31 19:04:02 -06:00
Eric Fiselier	abafced990	Suppress -Wodr on C++03 tests when LTO is enabled. The benchmark library is compiled as C++11, but certain tests are compiled as C++03. When -flto is enabled GCC 5.4 and above will diagnose an ODR violation in libstdc++'s <map>. This ODR violation, although real, should likely be benign. For this reason it seems sensible to simply suppress -Wodr when building the C++03 test. This patch fixes #420 and supersede's PR #424.	2017-07-30 18:44:04 -06:00
Roman Lebedev	d474450b89	Tooling: generate_difference_report(): show old/new for both values (#427 ) While the percentages are displayed for both of the columns, the old/new values are only displayed for the second column, for the CPU time. And the column is not even spelled out. In cases where b->UseRealTime(); is used, this is at the very least highly confusing. So why don't we just display both the old/new for both the columns? Fixes #425	2017-07-25 09:09:26 -07:00
Roman Lebedev	b9be142d1e	Json reporter: don't cast floating-point to int; adjust tooling (#426 ) * Json reporter: passthrough fp, don't cast it to int; adjust tooling Json output format is generally meant for further processing using some automated tools. Thus, it makes sense not to intentionally limit the precision of the values contained in the report. As it can be seen, FormatKV() for doubles, used %.2f format, which was meant to preserve at least some of the precision. However, before that function is ever called, the doubles were already cast to the integer via RoundDouble()... This is also the case for console reporter, where it makes sense because the screen space is limited, and this reporter, however the CSV reporter does output some( decimal digits. Thus i can only conclude that the loss of the precision was not really considered, so i have decided to adjust the code of the json reporter to output the full fp precision. There can be several reasons why that is the right thing to do, the bigger the time_unit used, the greater the precision loss, so i'd say any sort of further processing (like e.g. tools/compare_bench.py does) is best done on the values with most precision. Also, that cast skewed the data away from zero, which i think may or may not result in false- positives/negatives in the output of tools/compare_bench.py * Json reporter: FormatKV(double): address review note * tools/gbench/report.py: skip benchmarks with different time units While it may be useful to teach it to operate on the measurements with different time units, which is now possible since floats are stored, and not the integers, but for now at least doing such a sanity-checking is better than providing misinformation.	2017-07-24 16:13:55 -07:00
Dominic Hamon	5b7683f49e	more clang tidy cleanups (#417 )	2017-07-15 00:21:20 +02:00
Dominic Hamon	e8fc2a2b8c	Google-style cleanups (#416 )	2017-07-13 18:33:43 +02:00
Tom Madams	ee3cfca651	Fix ThreadCPUUsage when running on RTEMS. (#414 ) Change ThreadCPUUsage to call ProcessCPUUsage if __rtems__ is defined. RTEMS real time OS doesn't support CLOCK_THREAD_CPUTIME_ID. See https://github.com/RTEMS/rtems/blob/master/cpukit/posix/src/clockgettime.c#L58-L59 Prior to this change, ThreadCPUUsage would fail when running on RTEMS with: ERROR: clock_gettime(CLOCK_THREAD_CPUTIME_ID, ...) failed	2017-07-06 15:59:13 -07:00
Eric	9d4b719dae	Make Benchmark a single header library (but not header-only) (#407 ) * Make Benchmark a single header library (but not header-only) This patch refactors benchmark into a single header, to allow for slightly easier usage. The initial reason for the header split was to keep C++ library components from being included by benchmark_api.h, making that part of the library STL agnostic. However this has since changed and there seems to be little reason to separate the reporters from the rest of the library. * Fix internal_macros.h * Remove more references to macros.h	2017-07-04 16:31:47 -06:00
Jern-Kuan Leong	710c2b89d8	Fix #403 HAVE_${VAR} not passed to makefile (#404 ) Add definition of ${VAR} to makefiles if specified as part of cmake parameter.	2017-06-16 14:46:11 -07:00
Eric	b8a2206fb2	Add ClearRegisteredBenchmark() function. (#402 ) * Add ClearRegisteredBenchmark() function. Since benchmarks can be registered at runtime using the RegisterBenchmark(...) functions, it makes sense to have a ClearRegisteredBenchmarks() function too, that can be used at runtime to clear the currently registered benchmark and re-register an entirely new set. This allows users to run a set of registered benchmarks, get the output using a custom reporter, and then clear and re-register new benchmarks based on the previous results. This fixes issue #400, at least partially. * Remove unused change	2017-06-14 09:16:53 -07:00
Eric	d6aacaf48f	Revert "Use NEW settings for CMP0063 policy (#399 )" (#401 ) This reverts commit `af542061c5`.	2017-06-13 18:42:32 -06:00
Tim	af542061c5	Use NEW settings for CMP0063 policy (#399 ) This removes warnings when using CMake >= 3.3 if you have symbol visibility set.	2017-06-13 18:42:07 -06:00
Yixuan Qiu	f3b3dd99be	Use the sample version of standard deviation (#383 ) * remove unnecessary weights * use sample standard deviation * add contributor information * remove redundant code * initialize variable to eliminate compiler warning	2017-06-05 10:32:15 -07:00
Eric	93bfabc8b8	Fix #342 : DoNotOptimize causes compile errors on older GCC versions. (#398 ) * Fix #342: DoNotOptimize causes compile errors on older GCC versions. DoNotOptimize uses inline assembly contraints to tell the compiler what the type of the input variable. The 'g' operand allows the input to be any register, memory, or immediate integer operand. However this constraint seems to be too weak on older GCC versions, and certain inputs will cause compile errors. This patch changes the constraint to 'X', which is documented as "any operand whatsoever is allowed". This appears to fix the issues with older GCC versions. However Clang doesn't seem to like "X", and will attempt to put the input into a register even when it can't/shouldn't; causing a compile error. However using "g" seems to work like "X" with GCC, so for this reason Clang still uses "g". * Try alternative formulation to placate GCC	2017-06-02 15:47:23 -07:00
David Kruger	15e9ebaf83	Associate the required include directory with the benchmark library (#393 ) Using target_include_directories CMake will implicitly add the the necessary include paths to targets which link against the benchmark library. This is useful when the benchmark repo is included as a subdirectory in another CMake build.	2017-05-23 08:40:31 -07:00
Dominic Hamon	febd0d7a7a	Remove unnecessary whitespace in travis yaml	2017-05-22 09:27:28 -07:00
Tushar Maheshwari	b1f33d44ea	Add macOS builds to .travis.yml (#389 )	2017-05-22 09:26:05 -07:00
Eric Fiselier	cb8a0cc10f	test commit	2017-05-03 23:43:16 -06:00
Dominic Hamon	4cfe790a25	Merge branch 'biojppm-compact'	2017-05-03 09:11:45 -07:00
Joao Paulo Magalhaes	ec6f03579e	Trying again to fix error caused by -Wunused-function. This thing with the pragma ignore was getting out of hand: now MinGW (and probably GCC) was erroring too. So I chose to move the definition of IsZero() out of the anonymous namespace into benchmark.cc.	2017-05-03 00:05:15 +01:00
Joao Paulo Magalhaes	1735413188	Fix pragma clang ignore with gcc.	2017-05-02 23:35:46 +01:00
Joao Paulo Magalhaes	160770fd08	Fix dropped-style elses.	2017-05-02 23:30:36 +01:00
Joao Paulo Magalhaes	a31088632a	Fix (that is, ignore) clang compile error.	2017-05-02 23:25:22 +01:00
Joao Paulo Magalhaes	020bac985b	Extend tabular counter tests to different counter sets.	2017-05-02 23:00:45 +01:00
Joao Paulo Magalhaes	2506044902	Add unit test for counter sets.	2017-05-02 22:14:49 +01:00
Joao Paulo Magalhaes	ea019f3cd8	Allow different counter sets in CSV reporting.	2017-05-02 22:10:08 +01:00
Joao Paulo Magalhaes	3db6254c39	Console reporter: add /s prefix to counter rates.	2017-05-02 20:48:29 +01:00
Joao Paulo Magalhaes	cf20dc967f	Add test for tabular output of rate counters.	2017-05-02 20:47:41 +01:00
Joao Paulo Magalhaes	c69b385c9c	Add first unit test for benchmark_tabular_counters.	2017-05-02 20:33:28 +01:00
Joao Paulo Magalhaes	17a012d754	Fix: --benchmark_counters_tabular was not being passed to tests.	2017-05-02 20:31:54 +01:00
Joao Paulo Magalhaes	615151723e	Merge remote-tracking branch 'upstream/master' into compact	2017-05-02 18:54:37 +01:00
Dominic Hamon	da8cd74d85	Merge branch 'biojppm-test_usercounters'	2017-05-02 08:44:55 -07:00

1 2 3 4 5 ...

781 Commits