benchmark

mirror of https://github.com/google/benchmark.git synced 2025-03-15 19:50:09 +08:00

Author	SHA1	Message	Date
Yuri Khan	892f29589d	Initialize help hook before actually parsing the command line (#1447 )	2022-07-26 16:33:32 +01:00
maochongxin	ef7f75fb18	simplified code (#1439 )	2022-07-21 12:34:02 +01:00
Dominic Hamon	e27c93073f	use target_compile_definitions (#1440 )	2022-07-21 11:50:01 +01:00
Dominic Hamon	7b3ac07517	Stop generating the export header and just check it in (#1435 ) * Stop generating the export header and just check it in * format the new header * support windows * format the header again * avoid depending on internal macro * ensure we define the right thing for windows static builds * support older cmake * and for tests	2022-07-20 20:34:39 +01:00
Dominic Hamon	d845b7b3a2	Also fix the SOVERSION for benchmark_main	2022-07-19 09:14:35 +01:00
Dominic Hamon	d4bc509bcd	Fix SOVERSION of shared library Fixes #1434	2022-07-18 18:19:05 +01:00
Cezary Skrzyński	4efcc47461	Suppress nvcc `offsetof` warning (#1429 ) * Suppress nvcc offsetof warning * Update AUTHORS and CONTRIBUTORS	2022-07-15 12:18:45 +01:00
Yuri Khan	4136c4a3c5	Expose default help printer function (#1425 ) * Pass the default help string into custom help printer Improves #1329. * Expose default help printer	2022-07-04 10:29:03 +01:00
Dominic Hamon	b7afda2cd2	Revert "Add possibility to ask for libbenchmark version number (#1004 ) (#1403 )" (#1417 ) This reverts commit `efadf67a12`.	2022-06-20 17:52:03 +01:00
Dominic Hamon	af7de865eb	Clarify that the cpu frequency is not used for benchmark timings. (#1414 ) * Clarify that the cpu frequency is not used for benchmark timings. Fixes #1310 * fix format (clang-format missed this...) * oops	2022-06-20 11:22:57 +01:00
Matthias Donaubauer	efadf67a12	Add possibility to ask for libbenchmark version number (#1004 ) (#1403 ) * Add possibility to ask for libbenchmark version number (#1004) Add a header which holds the current major, minor, and patch number of the library. The header is auto generated by CMake. * Do not generate unused functions (#1004) * Add support for version number in bazel (#1004) * Fix clang format #1004 * Fix more clang format problems (#1004) * Use git version feature of cmake to determine current lib version * Rename version_config header to version * Bake git version into bazel build * Use same input config header as in cmake for version.h * Adapt the releasing.md to include versioning in bazel	2022-06-20 09:45:50 +01:00
Dominic Hamon	920fa14898	fix some build warnings on type conversions	2022-06-08 10:32:20 +01:00
Matthdonau	6d50251d8e	Report large numbers in scientific notation in console reporter (#1303 ) (#1402 ) Report all time numbers > 10 digits in scientific notation with 4 decimal places. This is necessary since only 10 digits are currently reserved for the time columns (Time and CPU). If exceeding 10 digits the output isn't properly aligned anymore.	2022-05-27 09:29:53 +01:00
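The formatting rule described in the entry above can be sketched in a few lines. This is a minimal illustration, not the library's code: the helper name `FormatTimeSketch` and the fixed-notation fallback `%10.0f` are assumptions; only the 10-digit threshold and the 4 decimal places come from the commit message.

```cpp
#include <cassert>
#include <cstdio>
#include <string>

// Hypothetical helper (not part of the benchmark library): formats a time
// value for a column that reserves 10 characters.
std::string FormatTimeSketch(double time) {
  assert(time >= 0.0);  // this sketch only handles non-negative times
  char buf[64];
  if (time > 9999999999.0) {
    // More than 10 integer digits: scientific notation, 4 decimal places.
    std::snprintf(buf, sizeof(buf), "%.4e", time);
  } else {
    // Fits the column: fixed notation, right-aligned to 10 characters.
    // (The fallback format is an assumption made for this sketch.)
    std::snprintf(buf, sizeof(buf), "%10.0f", time);
  }
  return std::string(buf);
}
```

With a two-digit exponent, the scientific form is itself 10 characters wide, which is why it keeps the Time and CPU columns aligned.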
Matthdonau	7eb8c0fe45	Introduce warmup phase to BenchmarkRunner (#1130 ) (#1399 ) * Introduce warmup phase to BenchmarkRunner (#1130) In order to account for caching effects in user benchmarks introduce a new command line option "--benchmark_min_warmup_time" which allows to specify an amount of time for which the benchmark should be run before results are meaningful. * Adapt review suggestions regarding introduction of warmup phase (#1130) * Fix BM_CHECK call in MinWarmUpTime (#1130) * Fix comment on requirements of MinWarmUpTime (#1130) * Add basic description of warmup phase mechanism to user guide (#1130)	2022-05-23 13:50:17 +01:00
Zi Xuan Wu (Zeson)	6c46c9f593	Add support to get clock for new architecture CSKY (#1400 ) it's like what loongarch does to get cycle clock for CSKY by gettimeofday function.	2022-05-23 11:47:58 +01:00
Matthdonau	37be1e8252	Add option to get the verbosity provided by commandline flag -v (#1330 ) (#1397 ) * Add option to get the verbosity provided by commandline flag -v (#1330) * replace assert with test failure asserts are stripped out in non debug builds, and we run tests in non-debug CI bots. * clang-format my own tweak Co-authored-by: Dominic Hamon <dominichamon@users.noreply.github.com>	2022-05-17 17:59:36 +01:00
cui fliter	aecbdbff4f	fix some typos (#1393 ) Signed-off-by: cuishuang <imcusg@gmail.com>	2022-05-11 09:03:20 +01:00
Dominic Hamon	8d86026c67	Enable -Wconversion (#1390 ) Requires some casts here and there, but nothing unreasonable. Fixes #1268	2022-05-01 19:56:30 +01:00
Dominic Hamon	a162a38ca0	Filter out benchmarks that start with "DISABLED_" (#1387 ) * Filter out benchmarks that start with "DISABLED_" This could be slightly more elegant, in that the registration and the benchmark definition names have to change. Ideally, we'd still register without the DISABLED_ prefix and it would all "just work". Fixes #1365 * add some documentation	2022-05-01 10:41:34 +01:00
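As a rough illustration of the filtering described in the entry above, the prefix test can be written with the standard library alone. The helper `DropDisabled` is hypothetical and operates on plain name strings; how the library itself wires this into registration is not shown in this log.

```cpp
#include <cassert>
#include <string>
#include <vector>

// Hypothetical helper: returns the names that do not start with "DISABLED_".
std::vector<std::string> DropDisabled(const std::vector<std::string>& names) {
  const std::string prefix = "DISABLED_";
  std::vector<std::string> enabled;
  for (const std::string& name : names) {
    // rfind(prefix, 0) == 0 holds only when `name` begins with `prefix`.
    if (name.rfind(prefix, 0) != 0) enabled.push_back(name);
  }
  assert(enabled.size() <= names.size());  // filtering never adds names
  return enabled;
}
```

Note that only a leading `DISABLED_` counts in this sketch; a name such as `BM_DISABLED_C` is kept.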
Dominic Hamon	74ae567294	Small optimization to counter map management (#1382 ) * Small optimization to counter map management Avoids an unnecessary lookup. * formatting	2022-04-07 14:37:22 +01:00
Dominic Hamon	3eac3b60d2	getting sysinfo in line with Google style (#1381 ) * getting sysinfo in line with Google style * format tweak * fixing osx compile errors * more style tweaks	2022-04-07 10:50:52 +01:00
Brad Messer	60b16f11a3	Promote inclusive language. (#1360 )	2022-03-18 15:59:31 +00:00
Bensuperpc	4f77cf9e62	Fix float comparison and add float comparison warning (#1368 ) GCC warns about comparison with zero, clang does not.	2022-03-12 19:05:23 +03:00
Vy Nguyen	eacce0b503	Add SetBenchmarkFilter() to set --benchmark_filter flag value in user code (#1362 ) * Add SetBenchmarkFilter() to set --benchmark_filter flag value in user code. Use case: Provide an API to set this flag independently of the flag's implementation (i.e., absl flag vs benchmark's flag facility) * add test * added notes on Initialize()	2022-03-08 16:02:37 +00:00
Bátor Tallér	d08e7b6056	Allow setting the default time unit globally (#1337 ) * Add option to set the default time unit globally This commit introduces the `--benchmark_time_unit={ns\|us\|ms\|s}` command line argument. The argument only affects benchmarks where the time unit is not set explicitly. * Update AUTHORS and CONTRIBUTORS * Test `SetDefaultTimeUnit` * clang format * Use `GetDefaultTimeUnit()` for initializing `TimeUnit` variables * Review fixes * Export functions * Add comment	2022-03-04 11:07:01 +00:00
Sergiu Deitsch	e33986a000	restore BENCHMARK_MAIN() (#1357 )	2022-02-26 10:17:13 +00:00
Vincenzo Palazzo	c563644040	Introduce the possibility to customize the help printer function (#1342 ) * introduce the possibility to customize the help printer function Signed-off-by: Vincenzo Palazzo <vincenzopalazzodev@gmail.com> * fixed naming convention, and introduce the option function in the init method Signed-off-by: Vincenzo Palazzo <vincenzopalazzodev@gmail.com> * remove the macros to inject the helper function Signed-off-by: Vincenzo Palazzo <vincenzopalazzodev@gmail.com> * remove the default implementation, and introduce the nullptr Signed-off-by: Vincenzo Palazzo <vincenzopalazzodev@gmail.com>	2022-02-16 09:23:54 +00:00
Vy Nguyen	7b46d3ddbf	Check for macro existence before using (#1347 )	2022-02-14 17:00:44 +00:00
Sergiu Deitsch	9e47d070fe	annotate and export public symbols (#1321 )	2022-02-14 10:48:53 +00:00
Dominic Hamon	6e51dcbcc3	Expose default display reporter creation in public API (#1344 ) * Expose default display reporter creation in public API this is useful when a custom reporter wants to fall back on the default display reporter, but doesn't necessarily have access to the benchmark library flag configuration. * Make use of unique_ptr in the random interleaving test. * clang-format	2022-02-11 10:23:05 +00:00
Liqiang TAO	bdea5051b0	Add mutex when reading counters_ (Fixes #1335 ) (#1338 )	2022-02-07 13:03:19 +00:00
batortaller	1ee7bee6c5	Use Win32 API only for Win32 apps (#1333 )	2022-02-02 12:16:19 +03:00
Liqiang TAO	d0fbf8ac23	Cache PerfCounters instance in PerfCountersMeasurement (#1308 ) This patch fixes #1306, by reducing the pinned instances of PerfCounters. The issue is caused by creating multiple pinned events in the same thread, doing so results in the Snapshot(PerfCounterValues* values) failing, and that's now discoverable. Creating multiple pinned events is an unsupported behavior currently. The error would be detected at read() time, not perf_event_open() / ioctl() time. The unsupported behavior above is confirmed by Stephane Eranian @seranian, and he also pointed out the detection method. Finished this patch under the guidance of Mircea Trofin @mtrofin.	2022-01-25 10:14:20 +00:00
staffantj	0e78738a25	Destructor not returning is expected in some cases (#1316 ) * Address MSVC C4722 warning in tests Some test paths deliberately exit, and it appears that the appropriate declspec does not stop the compiler generating the C4722 warning as one might expect. Per https://github.com/google/benchmark/issues/826#issuecomment-851995549 this commit ignores the warning for the affected call site. * Fix up Formatting * Fix up formatting issue on pragmas * Fix up formatting issue on pragmas take 2 Co-authored-by: Staffan Tjernstrom <staffantj@users.noreply.github.com>	2022-01-10 15:44:42 +00:00
Roman Lebedev	3b3de69400	Fix `-DBENCHMARK_ENABLE_INSTALL=OFF` (Fixes #1275 ) (#1305 ) Otherwise this fails with ``` CMake Error at src/CMakeLists.txt:154 (export): export Export set "benchmarkTargets" not found. ``` This is what https://cmake.org/cmake/help/latest/guide/importing-exporting/index.html#exporting-targets seems to suggest. While there, really respect BENCHMARK_ENABLE_INSTALL, BENCHMARK_INSTALL_DOCS shouldn't override it.	2021-12-14 09:46:23 +00:00
Martin Storsjö	b000672793	Avoid errors due to "default label in switch which covers all enumeration values" in Windows codepath (#1302 ) This applies a fix that used to exist in LLVM's downstream copy of this library, from `948ce4e6ed`. I presume this warning isn't present if built with MSVC or Clang-cl, but it's printed in MinGW mode. As the benchmark library adds -Werror, this is a fatal error when built in MinGW mode.	2021-12-09 09:24:54 +00:00
dominc8	ab867074da	clang-tidy: readability-redundant and performance (#1298 ) * clang-tidy: readability-redundant-* * clang-tidy: performance-*	2021-12-06 11:18:04 +00:00
dominc8	680d3fdbb5	Add clang-tidy check (#1290 ) * Add clang-tidy.yml and .clang-tidy * Add mention to authors/contributors * Temp fix 2 clang-tidy issues * Enable clang-tidy on pull requests * Exclude gtest source files from clang-tidy	2021-11-25 15:47:44 +00:00
Dominic Hamon	ce92bbfb90	remove long-defunct cli parameter	2021-11-19 19:58:08 +00:00
Vy Nguyen	91ed7eea68	Disable clang-tidy (unused-using-decls) (#1287 ) The NOLINTBEGIN block only covers warnings on `long` types and other styling issues but not clang-tidies.	2021-11-19 11:12:59 +00:00
Vy Nguyen	8722d6f014	disable lint check where we know it'd fail (#1286 ) * disable lint check where we know it'd fail one less noisy presubmit * clang format * remove ws	2021-11-17 17:57:36 +00:00
Vy Nguyen	b5bb9f0675	Add Setup/Teardown option on Benchmark. (#1269 ) * Add Setup/Teardown option on Benchmark. Motivations: - feature parity with our internal library. (which has ~718 callers) - more flexible than coordinating setup/teardown inside the benchmark routine. * change Setup/Teardown callback type to raw function pointers * add test file to cmake file * move b.Teardown() up * add const to param of Setup/Teardown callbacks * fix comment and add doc to user_guide * fix typo * fix doc, fix test and add bindings to python/benchmark.cc * fix binding again * remove explicit C cast - that was wrong * change policy to reference_internal * try removing the bindings ... * clean up * add more tests with repetitions and fixtures * more comments * init setup/teardown callbacks to NULL * s/nullptr/NULL * removed unused var * change assertion on fixture_interaction::fixture_setup * move NULL init to .cc file	2021-11-17 16:51:55 +00:00
Dominic Hamon	b3c08f6ec3	check clang format on pull requests and merges (#1281 ) * check clang format on pull requests and merges * manage some formatting by hand * undo one format too many	2021-11-10 16:49:49 +00:00
Dominic Hamon	fcef4fb669	clang-format Google on {src/,include/} (#1280 )	2021-11-10 16:04:32 +00:00
Bensuperpc	431abd149f	Fix warning with MacOS (#1276 ) * Fix warning with MacOS Signed-off-by: Bensuperpc <bensuperpc@gmail.com> * Re-trigger GitHub actions Signed-off-by: Bensuperpc <bensuperpc@gmail.com> * Fix style Signed-off-by: Bensuperpc <bensuperpc@gmail.com> * Revert "Fix style" This reverts commit `1d5fe9ce87`. * Fix style only on changes Signed-off-by: Bensuperpc <bensuperpc@gmail.com>	2021-11-08 12:39:36 +00:00
Bensuperpc	329fb06d99	Fix error with Fix Werror=old-style-cast (#1272 ) * Fix Werror=old-style-cast Signed-off-by: Bensuperpc <bensuperpc@gmail.com> * Fix Werror=old-style-cast Signed-off-by: Bensuperpc <bensuperpc@gmail.com> * Fix Werror=old-style-cast Signed-off-by: Bensuperpc <bensuperpc@gmail.com> * Fix typo Signed-off-by: Bensuperpc <bensuperpc@gmail.com> * Fix build error with MacOS Signed-off-by: Bensuperpc <bensuperpc@gmail.com> * Revert "Fix build error with MacOS" This reverts commit `cee213bb95`.	2021-11-04 12:09:10 +00:00
Vy Nguyen	4f31803ebb	Fix un-initted error in test and fix change the API previously proposed to use std::string instead of raw char* (#1266 ) * Fix un-initted error in test. Found by -Werror,-Wsometimes-uninitialized * Update spec_arg_test.cc * additional change: - Change the API on GetBenchmarkFilter and the `spec` to std::string because google C++ styleguide internally kind of discouraged using raw const char*	2021-10-29 11:48:56 +01:00
Vy Nguyen	4f47ed2c9a	[RFC] Adding API for setting/getting benchmark_filter flag? (#1254 ) * [RFC] Adding API for setting/getting benchmark_filter flag? This PR is more of a Request-for-comment - open to other ideas/suggestions as well. Details: This flag has different implementations(absl vs benchmark) and since the proposal to add absl as a dependency was rejected, it would be nice to have a reliable (and less hacky) way to access this flag internally. (Actually, reading it isn't much a problem but setting it is). Internally, we have a sizeable number users to use absl::SetFlags to set this flag. This will not work with benchmark-flags. Another motivation is that not all users use the command line flag. Some prefer to programmatically set this value. * fixed build errors * fix lints again * per discussion: add additional RunSpecifiedBenchmarks instead. * add tests * fix up tests * clarify comment * fix stray : in test * more assertion in test * add test file to test/CMakeLists.txt * more test * make test ISO C++ compliant * fix up BUILD file to pass the flag	2021-10-27 08:52:57 +01:00
Vitaly Zaitsev	365670e432	Added Doxygen support. (#1228 ) Signed-off-by: Vitaly Zaitsev <vitaly@easycoding.org>	2021-10-25 12:32:33 +01:00
Byoungchan Lee	80d70ddd94	Fix -Wdeprecated-declarations warning once more. (#1256 ) In #1238, one of MemoryManager's Stop methods was marked as deprecated and this method is used in the same header. This change generated -Wdeprecated-declarations warning on every file that includes "benchmark.h". Use gcc's diagnostics to fix this warning.	2021-10-21 10:10:38 +01:00
Sergiu Deitsch	1be88c0683	cmake: allow to use package config from build directory	2021-10-19 11:11:11 +02:00
Sergiu Deitsch	eb9100bf41	cmake: make package config relocatable	2021-10-19 11:05:29 +02:00
Vy Nguyen	7fad964a94	Introduce additional memory metrics (#1238 ) - added total_allocs and net_allocs - updated reporter code to report these, if available.	2021-10-18 16:29:35 +01:00
Byoungchan Lee	f730846b0a	Fix -Wdeprecated-declarations warning triggered by clang-cl. (#1245 ) WebRTC uses Google Benchmarks as a dependency and uses Chromium's build infrastructure. Chromium is compiled using clang-cl on Windows, and the -Wdeprecated-declarations warning is triggered. Because clang-cl accepts gcc's diagnostic pragma and defines the __clang__ macro, using it can solve this issue. Bug: webrtc:13280	2021-10-18 11:31:51 +01:00
Sergiu Deitsch	59bbc7fd9d	cmake: eliminate redundant `target_include_directories` (#1242 )	2021-10-17 15:07:19 +03:00
Vitaly Zaitsev	1bd8098d3d	Optimized docs installation (#1225 ) * Use GNUInstallDirs to install docs. Signed-off-by: Vitaly Zaitsev <vitaly@easycoding.org> * Added an option to disable docs installation. Signed-off-by: Vitaly Zaitsev <vitaly@easycoding.org>	2021-09-08 18:40:25 +01:00
Roman Lebedev	4f8070590c	Console reporter: if statistic produces percents, format it as such (#1221 )	2021-09-06 11:33:27 +03:00
Roman Lebedev	45b194e4d4	Introduce Coefficient of variation aggregate (#1220 ) * Introduce Coefficient of variation aggregate I believe, it is much more useful / use to understand, because it is already normalized by the mean, so it is not affected by the duration of the benchmark, unlike the standard deviation. Example of real-world output: ``` raw.pixls.us-unique/GoPro/HERO6 Black$ ~/rawspeed/build-old/src/utilities/rsbench/rsbench GOPR9172.GPR --benchmark_repetitions=27 --benchmark_display_aggregates_only=true --benchmark_counters_tabular=true 2021-09-03T18:05:56+03:00 Running /home/lebedevri/rawspeed/build-old/src/utilities/rsbench/rsbench Run on (32 X 3596.16 MHz CPU s) CPU Caches: L1 Data 32 KiB (x16) L1 Instruction 32 KiB (x16) L2 Unified 512 KiB (x16) L3 Unified 32768 KiB (x2) Load Average: 7.00, 2.99, 1.85 ------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------ Benchmark Time CPU Iterations CPUTime,s CPUTime/WallTime Pixels Pixels/CPUTime Pixels/WallTime Raws/CPUTime Raws/WallTime WallTime,s ------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------ GOPR9172.GPR/threads:32/process_time/real_time_mean 11.1 ms 353 ms 27 0.353122 31.9473 12M 33.9879M 1085.84M 2.83232 90.4864 0.0110535 GOPR9172.GPR/threads:32/process_time/real_time_median 11.0 ms 352 ms 27 0.351696 31.9599 12M 34.1203M 1090.11M 2.84336 90.8425 0.0110081 GOPR9172.GPR/threads:32/process_time/real_time_stddev 0.159 ms 4.60 ms 27 4.59539m 0.0462064 0 426.371k 14.9631M 0.0355309 1.24692 158.944u GOPR9172.GPR/threads:32/process_time/real_time_cv 1.44 % 1.30 % 27 0.0130136 1.44633m 0 0.0125448 0.0137802 0.0125448 0.0137802 0.0143795 ``` Fixes https://github.com/google/benchmark/issues/1146 * Be consistent, it's CV, not 'rel std dev'	2021-09-03 18:44:10 +01:00
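The coefficient of variation named in the entry above is the standard deviation divided by the mean. The sketch below shows that arithmetic with the standard library only; the function names are illustrative, and the use of the sample standard deviation (dividing by n - 1) is an assumption of this sketch rather than something the commit message states.

```cpp
#include <cassert>
#include <cmath>
#include <vector>

// Arithmetic mean of the samples.
double Mean(const std::vector<double>& v) {
  assert(!v.empty());
  double sum = 0.0;
  for (double x : v) sum += x;
  return sum / static_cast<double>(v.size());
}

// Sample standard deviation (divides by n - 1); an assumption of this sketch.
double SampleStdDev(const std::vector<double>& v) {
  assert(v.size() >= 2);
  const double mean = Mean(v);
  double sq = 0.0;
  for (double x : v) sq += (x - mean) * (x - mean);
  return std::sqrt(sq / static_cast<double>(v.size() - 1));
}

// Coefficient of variation: standard deviation normalized by the mean.
double CoefficientOfVariation(const std::vector<double>& v) {
  return SampleStdDev(v) / Mean(v);
}
```

Because the result is normalized by the mean, scaling every sample by the same factor leaves it unchanged, which is the property the commit message relies on. In the output quoted above, 0.159 ms / 11.1 ms is about 1.43 %, consistent with the reported `cv` of 1.44 % once rounding of the displayed values is taken into account.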
Roman Lebedev	12dc5eeafc	Statistics: add support for percentage unit in addition to time (#1219 ) * Statistics: add support for percentage unit in addition to time I think, `stddev` statistic is useful, but confusing. What does it mean if `stddev` of `1ms` is reported? Is that good or bad? If the `median` is `1s`, then that means that the measurements are pretty noise-less. And what about `stddev` of `100ms` is reported? If the `median` is `1s` - awful, if the `median` is `10s` - good. And hurray, there is just the statistic that we need: https://en.wikipedia.org/wiki/Coefficient_of_variation But, naturally, that produces a value in percents, but the statistics are currently hardcoded to produce time. So this refactors things a bit, and allows a percentage unit for statistics. I'm not sure whether or not `benchmark` would be okay with adding this `RSD` statistic by default, but regardless, that is a separate patch. Refs. https://github.com/google/benchmark/issues/1146 * Address review notes	2021-09-03 15:36:56 +01:00
Dominic Hamon	2b093325e1	replace #warning with #pragma message (#1216 )	2021-08-24 15:08:15 +01:00
Vy Nguyen	dc1a97174d	Introduce accessors for currently public data members (threads and thread_index) (#1208 ) * [benchmark] Introduce accessors for currently public data members `threads` and `thread_index` Also deprecate the direct access to these fields. Motivations: Our internal library provides accessors for those fields because the styleguide disallows accessing classes' data members directly (even if they're const). There has been a discussion to simply move internal library to make its fields public similarly to the OSS version here, however, the concern is that these kinds of direct access would prevent many types of future design changes (eg how/whether the values would be stored in the data member) I think the consensus in the end is that we'd change the external library for this case. AFAIK, there are three important third_party users that we'd need to migrate: tcmalloc, abseil and tensorflow. Please let me know if I'm missing anyone else. * [benchmark] Introduce accessors for currently public data members `threads` and `thread_index` Also deprecate direct access to `.thread_index` and make threads a private field Motivations: Our internal library provides accessors for those fields because the styleguide disallows accessing classes' data members directly (even if they're const). There has been a discussion to simply move internal library to make its fields public similarly to the OSS version here, however, the concern is that these kinds of direct access would prevent many types of future design changes (eg how/whether the values would be stored in the data member) I think the consensus in the end is that we'd change the external library for this case. AFAIK, there are three important third_party users that we'd need to migrate: tcmalloc, abseil and tensorflow. Please let me know if I'm missing anyone else.	2021-08-23 09:06:57 +01:00
Nico Weber	8fd49d6671	Fix a -Wunreachable-code-aggressive warning (#1214 )	2021-08-19 16:09:49 +03:00
Dominic Hamon	990299fff8	install docs folder when installing library (#1212 ) fixes #470	2021-08-17 22:16:40 +01:00
Vy Nguyen	4124223bf5	Change the default value of `--benchmark_filter` from "." to <empty> (#1207 ) Both `.` and `<empty>` already mean "run all benchmarks" here, as commented on this flag's declaration (and below around line 448-449). So this is a NFC. On the other hand, this helps internally because, internally, if the flag is empty (or if it's not specified by a binary), we don't call RunSpecifiedBenchmarks. There is still a difference in what <empty> means internally (runs no benchmarks) and externally (runs all benchmarks). But we can work around this.	2021-08-03 17:11:47 +01:00
Braedy	1067dfc91e	Remove dead code from PredictNumItersNeeded (#1206 ) * Remove `min` with dead path. When `isSignificant` is false, the smallest value `multiplier` can approach is 14. Thus, min(10, multiplier) will always return 10. Addresses part one of #1205. * Remove always false condition. 1. `multiplier <= 1.0` implies `i.seconds >= min_time * 1.4` 2. By (1), `isSignficant` is true because `i.seconds > min_time * 1.4` implies `i.seconds > min_time` implies that `i.seconds / minTime > 1 > 0.1`. Thus, the ternary maintains the same multiplier value. 3. `ShouldReportResults` is always called before `PredictNumItersNeeded`, if `i.seconds >= min_time` then the loop is broken and `PredictNumItersNeeded` is never called. 4. 1 and 3 together imply that `multiplier <= 1.0` is never true. Addresses part 2 of #1205.	2021-07-29 08:59:46 +01:00
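The reasoning in the entry above can be checked against a small stand-in for the prediction step. This is a sketch reconstructed from the commit message, not the library's code: the function name `PredictNextIters`, its signature, the zero-duration guard of `1e-9`, and the iteration cap are all assumptions.

```cpp
#include <algorithm>
#include <cassert>
#include <cmath>
#include <cstdint>

// Hypothetical stand-in for the iteration prediction discussed above.
std::int64_t PredictNextIters(std::int64_t iters, double seconds,
                              double min_time) {
  assert(iters > 0 && min_time > 0.0);
  // Aim 40% past min_time; guard against a zero measured duration.
  double multiplier = min_time * 1.4 / std::max(seconds, 1e-9);
  // A run no longer than 10% of min_time is treated as not significant.
  const bool is_significant = (seconds / min_time) > 0.1;
  // In that case multiplier >= 14, so min(10, multiplier) is always 10.
  if (!is_significant) multiplier = 10.0;
  // Always grow by at least one iteration.
  const double next = std::max(multiplier * static_cast<double>(iters),
                               static_cast<double>(iters) + 1.0);
  const std::int64_t kMaxIterations = 1000000000;  // assumed cap
  return std::min<std::int64_t>(std::llround(next), kMaxIterations);
}
```

For example, 100 iterations that took 0.5 s against a 1 s minimum give a multiplier of 2.8 and a prediction of 280 iterations, while 100 iterations that took 0.01 s fall into the non-significant branch and are simply scaled by 10.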
Dominic Hamon	1fcb5c23d8	Don't return a reference when the callers all expect pointers. Fixes #1196	2021-07-01 09:39:09 +01:00
Mircea Trofin	94f845ec4f	Fix typos (#1194 )	2021-06-28 17:07:54 +01:00
Mircea Trofin	05a2ace713	Fix type warning on certain compilers (#1193 ) repetition_indices is populated with size_t values, so typing it accordingly.	2021-06-28 17:06:22 +01:00
Dominic Hamon	1799e1b9ec	prefix VLOG (#1187 )	2021-06-24 18:55:37 +01:00
Dominic Hamon	6a5bf081d3	prefix macros to avoid clashes (#1186 )	2021-06-24 18:21:59 +01:00
Dominic Hamon	5da5660429	Move flags inside the `benchmark` namespace (#1185 ) This avoids clashes with other libraries that might define the same flags.	2021-06-24 16:50:19 +01:00
Dominic Hamon	62937f91b5	Add missing trailing commas (#1182 ) * Add missing trailing commas Fixes #1181 * Better trailing commas	2021-06-18 17:31:47 +01:00
PCMan	c932169e76	Provide helpers to create integer lists for the given ranges. (#1179 ) This can be used together with ArgsProduct() to allow multiple ranges with different multipliers and mixing dense and sparse ranges. Example: BENCHMARK(MyTest)->ArgsProduct({ CreateRange(0, 1024, /*multi=*/32), CreateRange(0, 100, /*multi=*/4), CreateDenseRange(0, 4, /*step=*/1) }); Co-authored-by: Jen-yee Hong <pcmantw@google.com>	2021-06-16 12:56:24 +01:00
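To make the "product" in the entry above concrete, here is a standard-library sketch of combining several integer lists into every possible argument tuple. The helper `CartesianProduct` is hypothetical and only illustrates the idea; it does not reproduce how the library builds or orders its argument sets.

```cpp
#include <cassert>
#include <cstdint>
#include <vector>

// Hypothetical helper: every combination that takes one value from each
// list, with the last list varying fastest.
std::vector<std::vector<std::int64_t>> CartesianProduct(
    const std::vector<std::vector<std::int64_t>>& lists) {
  // Start with a single empty combination and extend it list by list.
  std::vector<std::vector<std::int64_t>> result(1);
  for (const auto& list : lists) {
    assert(!list.empty());  // an empty list would make the product empty
    std::vector<std::vector<std::int64_t>> next;
    for (const auto& prefix : result) {
      for (std::int64_t value : list) {
        std::vector<std::int64_t> combo = prefix;
        combo.push_back(value);
        next.push_back(combo);
      }
    }
    result = next;
  }
  return result;
}
```

The number of combinations is the product of the list sizes, so mixing a sparse range with a dense one grows the benchmark matrix quickly.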
Michael Lippautz	5b7518482c	benchmark_runner.h: Remove superfluous semi colon (#1178 ) Some downstream projects (e.g. V8) treat warnings as errors and cannot roll the latest changes.	2021-06-15 13:28:55 +01:00
Roman Lebedev	e991355c02	[NFCI] Drop warning to satisfy clang's -Wunused-but-set-variable diag (#1174 ) Fixes https://github.com/google/benchmark/issues/1172	2021-06-09 11:52:12 +03:00
huajingyun	f90215f1cc	Add support for new architecture loongarch (#1173 )	2021-06-08 10:26:24 +01:00
Roman Lebedev	fbc31405b2	Random interleaving of benchmark repetitions - the sequel (fixes #1051 ) (#1163 ) Inspired by the original implementation by Hai Huang @haih-g from https://github.com/google/benchmark/pull/1105. The original implementation had design deficiencies that weren't really addressable without redesign, so it was reverted. In essence, the original implementation consisted of two separable parts: * reducing the amount of time each repetition is run for, and symmetrically increasing repetition count * running the repetitions in random order While it worked fine for the usual case, it broke down when the user specified repetitions (it would completely ignore that request), or specified per-repetition min time (while it would still adjust the repetition count, it would not adjust the per-repetition time, leading to much greater run times) Here, like i was originally suggesting in the original review, i'm separating the features, and only dealing with a single one - running repetitions in random order. Now that the runs/repetitions are no longer in-order, the tooling may wish to sort the output, and indeed `compare.py` has been updated to do that: #1168.	2021-06-03 21:16:54 +03:00
Dominic Hamon	d17ea66551	Fix leak in test, and provide path to remove leak from library (#1169 ) * Fix leak in test, and provide path to remove leak from library * make doc change	2021-06-03 16:08:00 +01:00
Roman Lebedev	32cc607107	[NFCI] Make BenchmarkRunner non-internal to its .cpp file Currently the lifetime of a single BenchmarkRunner is constrained to a RunBenchmark(), but that will have to change for interleaved benchmark execution, because we'll need to keep it around to not forget how many repetitions of an instance we've done.	2021-06-03 16:56:15 +03:00
Roman Lebedev	520573fecb	[NFCI] RunBenchmarks(): extract FlushStreams()/Report() functions Based on original implementation by Hai Huang @haih-g in https://github.com/google/benchmark/pull/1105	2021-06-03 16:44:20 +03:00
Roman Lebedev	0c1da0a713	Make 'complexity reports' cache per-family, not global (#1166 ) While the current variant works, it assumes that all the instances of a single family will be run together, with nothing in between them. Naturally, that won't work once the runs may be interleaved.	2021-06-03 11:46:34 +03:00
Roman Lebedev	80a62618e8	Introduce per-family instance index (#1165 ) Much like it makes sense to enumerate all the families, it makes sense to enumerate stuff within families. Alternatively, we could have a global instance index, but i'm not sure why that would be better. This will be useful when the benchmarks are run not in order, for the tools to sort the results properly.	2021-06-02 23:45:41 +03:00
Roman Lebedev	4c2e32f1d0	Introduce "family index" field into JSON output (#1164 ) It may be useful for those wishing to further post-process JSON results, but it is mainly geared towards better support for run interleaving, where results from the same family may not be close-by in the JSON. While we won't be able to do much about that for outputs, the tools can and perhaps should reorder the results so that at least in their output they are in proper order, not run order. Note that this only counts the families that were filtered-in, so if e.g. there were three families, and we filtered-out the second one, the two families (which were first and third) will have family indexes 0 and 1.	2021-06-02 18:06:45 +03:00
Roman Lebedev	e0a080d00e	BenchmarkFamilies::FindBenchmarks(): correctly use std::vector<>::reserve() It takes the whole total new capacity, not the increase.	2021-06-02 13:28:05 +03:00
Roman Lebedev	a54ef37aea	Ensure that we print repetition count even when it was specified via flag `--benchmark_repetitions=`	2021-06-02 12:34:00 +03:00
Dominic Hamon	e025dd5a54	Revert "Implementation of random interleaving. (#1105 )" (#1161 ) This reverts commit `a6a738c1cc`.	2021-06-01 16:05:50 +01:00
Norman Heino	6f094ba13e	Fix perf counter argument parsing (#1160 ) * Fix argument order in StrSplit * Update AUTHORS, CONTRIBUTORS	2021-06-01 15:50:42 +01:00
Mariusz Wachowicz	db2de74cc8	Fix pedantic compilation flag violation (#1156 ) ';' after method definition was removed. Also, pedantic flag is now uncommented in CMakeLists.txt.	2021-05-21 09:48:20 +01:00
haih-g	a6a738c1cc	Implementation of random interleaving. (#1105 ) * Implementation of random interleaving. See http://github.com/google/benchmark/issues/1051 for the feature requests. Committer: Hai Huang (http://github.com/haih-g) On branch fr-1051 Changes to be committed: modified: include/benchmark/benchmark.h modified: src/benchmark.cc new file: src/benchmark_adjust_repetitions.cc new file: src/benchmark_adjust_repetitions.h modified: src/benchmark_api_internal.cc modified: src/benchmark_api_internal.h modified: src/benchmark_register.cc modified: src/benchmark_runner.cc modified: src/benchmark_runner.h modified: test/CMakeLists.txt new file: test/benchmark_random_interleaving_gtest.cc * Fix benchmark_random_interleaving_gtest.cc for fr-1051 Committer: Hai Huang <haih@google.com> On branch fr-1051 Your branch is up to date with 'origin/fr-1051'. Changes to be committed: modified: src/benchmark.cc modified: src/benchmark_runner.cc modified: test/benchmark_random_interleaving_gtest.cc * Fix macos build for fr-1051 Committer: Hai Huang <haih@google.com> On branch fr-1051 Your branch is up to date with 'origin/fr-1051'. Changes to be committed: modified: src/benchmark_api_internal.cc modified: src/benchmark_api_internal.h modified: src/benchmark_runner.cc * Fix macos and windows build for fr-1051. Committer: Hai Huang <haih@google.com> On branch fr-1051 Your branch is up to date with 'origin/fr-1051'. Changes to be committed: modified: src/benchmark_runner.cc * Fix benchmark_random_interleaving_test.cc for macos and windows in fr-1051 Committer: Hai Huang <haih@google.com> On branch fr-1051 Your branch is up to date with 'origin/fr-1051'. Changes to be committed: modified: test/benchmark_random_interleaving_gtest.cc * Fix int type benchmark_random_interleaving_gtest for macos in fr-1051 Committer: Hai Huang <haih@google.com> On branch fr-1051 Your branch is up to date with 'origin/fr-1051'. 
Changes to be committed: modified: test/benchmark_random_interleaving_gtest.cc * Address dominichamon's comments 03/29 for fr-1051 Committer: Hai Huang <haih@google.com> On branch fr-1051 Your branch is up to date with 'origin/fr-1051'. Changes to be committed: modified: src/benchmark.cc modified: src/benchmark_api_internal.cc modified: src/benchmark_api_internal.h modified: test/benchmark_random_interleaving_gtest.cc * Address dominichamon's comment on default min_time / repetitions for fr-1051. Also change sentinel of random_interleaving_repetitions to -1. Hopefully it fixes the failures on Windows. Committer: Hai Huang <haih@google.com> On branch fr-1051 Your branch is up to date with 'origin/fr-1051'. Changes to be committed: modified: src/benchmark.cc modified: src/benchmark_api_internal.cc modified: src/benchmark_api_internal.h * Fix windows test failures for fr-1051 Committer: Hai Huang <haih@google.com> On branch fr-1051 Your branch is up to date with 'origin/fr-1051'. Changes to be committed: modified: src/benchmark_api_internal.cc modified: src/benchmark_runner.cc * Add license blurb for fr-1051. Committer: Hai Huang <haih@google.com> On branch fr-1051 Your branch is up to date with 'origin/fr-1051'. Changes to be committed: modified: src/benchmark_adjust_repetitions.cc modified: src/benchmark_adjust_repetitions.h * Switch to std::shuffle() for fr-1105. Committer: Hai Huang <haih@google.com> On branch fr-1051 Your branch is up to date with 'origin/fr-1051'. Changes to be committed: modified: src/benchmark.cc * Change to 1e-9 in fr-1105 Committer: Hai Huang <haih@google.com> On branch fr-1051 Your branch is up to date with 'origin/fr-1051'. Changes to be committed: modified: src/benchmark_adjust_repetitions.cc * Fix broken build caused by bad merge for fr-1105. Committer: Hai Huang <haih@google.com> On branch fr-1051 Your branch is up to date with 'origin/fr-1051'. 
Changes to be committed: modified: src/benchmark_api_internal.cc modified: src/benchmark_runner.cc * Fix build breakage for fr-1051. Committer: Hai Huang <haih@google.com> On branch fr-1051 Your branch is up to date with 'origin/fr-1051'. Changes to be committed: modified: src/benchmark.cc modified: src/benchmark_api_internal.cc modified: src/benchmark_api_internal.h modified: src/benchmark_register.cc modified: src/benchmark_runner.cc * Print out reports as they come in if random interleaving is disabled (fr-1051) Committer: Hai Huang <haih@google.com> On branch fr-1051 Your branch is up to date with 'origin/fr-1051'. Changes to be committed: modified: src/benchmark.cc * size_t, int64_t --> int in benchmark_runner for fr-1051. Committer: Hai Huang <haih@google.com> On branch fr-1051 Your branch is up to date with 'origin/fr-1051'. Changes to be committed: modified: src/benchmark_runner.cc modified: src/benchmark_runner.h * Address comments from dominichamon for fr-1051 Committer: Hai Huang <haih@google.com> On branch fr-1051 Your branch is up to date with 'origin/fr-1051'. Changes to be committed: modified: src/benchmark.cc modified: src/benchmark_adjust_repetitions.cc modified: src/benchmark_adjust_repetitions.h modified: src/benchmark_api_internal.cc modified: src/benchmark_api_internal.h modified: test/benchmark_random_interleaving_gtest.cc * benchmar_indices --> size_t to make CI pass: fr-1051 Committer: Hai Huang <haih@google.com> On branch fr-1051 Your branch is up to date with 'origin/fr-1051'. Changes to be committed: modified: src/benchmark.cc * Fix min_time not initialized issue for fr-1051. Committer: Hai Huang <haih@google.com> On branch fr-1051 Your branch is up to date with 'origin/fr-1051'. Changes to be committed: modified: src/benchmark_api_internal.cc modified: src/benchmark_api_internal.h * min_time --> MinTime in fr-1051. Committer: Hai Huang <haih@google.com> On branch fr-1051 Your branch is up to date with 'origin/fr-1051'. 
Changes to be committed: modified: src/benchmark_api_internal.cc modified: src/benchmark_api_internal.h modified: src/benchmark_runner.cc * Add doc for random interleaving for fr-1051 Committer: Hai Huang <haih@google.com> On branch fr-1051 Your branch is up to date with 'origin/fr-1051'. Changes to be committed: modified: README.md new file: docs/random_interleaving.md Co-authored-by: Dominic Hamon <dominichamon@users.noreply.github.com>	2021-05-20 17:09:16 +01:00
Mircea Trofin	e539e807da	[PFM] Extend perf counter support to multi-threaded cases. (#1153 ) * Extend perf counter support to multi-threaded cases. * Docs update * const-ed Snapshot	2021-05-19 09:49:05 +01:00
Dominic Hamon	3b508fad1f	Refactor `BenchmarkInstance` (#1148 ) * Refactor BenchmarkInstance (precursor to #1105) * fix bazel (debug) build * clang-format on header * fix build error on g++-4.8	2021-05-10 17:12:09 +01:00
Roman Lebedev	a2e8a8a9db	Clean -Wreserved-identifier instances (#1143 )	2021-05-06 20:31:14 +01:00
Mircea Trofin	e0826edea7	Fix StrSplit empty string case (#1142 ) This also fixes #1135. Because StrSplit was returning a vector with an empty string, it was treated by PerfCounters::Create as a legitimate ask for setting up a counter with that name. The empty vector is understood by PerfCounters as "just return NoCounters()".	2021-05-06 19:12:36 +01:00
Dominic Hamon	d0c227ccfd	Add API to benchmark allowing for custom context to be added (#1137 ) * Add API to benchmark allowing for custom context to be added Fixes #525 * add docs * Add context flag output to JSON reporter * Plumb everything into the global context. * Add googletests for custom context * update docs with duplicate key behaviour	2021-05-05 12:08:23 +01:00
Dominic Hamon	33c133a206	Add `benchmark_context` flag that allows per-run custom context. (#1127 ) * Add `benchmark_context` flag that allows per-run custom context. Add support for key-value flags in general. Added test for key-value flags. Added `benchmark_context` flag. Output content of `benchmark_context` to base reporter. Solves the first part of #525. * Docs and better help	2021-05-04 14:36:11 +01:00
Mircea Trofin	376ebc2635	Support optional, user-directed collection of performance counters (#1114 ) * Support optional, user-directed collection of performance counters The patch allows an engineer wishing to drill into the root causes of a regression, for example. Currently, only single threaded runs are supported. The feature is a build-time opt in, and then a runtime opt in. The engineer may run the benchmark executable, passing a list of performance counter names (using libpfm's naming scheme) at the command line. The counter values will then be collected and reported back as UserCounters. This is different from #240 in that it is a benchmark user opt-in, and the counter collection is transparent to the benchmark. Currently, this is only supported on platforms where libpfm is supported. libpfm: http://perfmon2.sourceforge.net/ * 'Use' values param in Snapshot when BENCHMARK_OS_WINDOWS This is to avoid unused parameter warning-as-error * Added missing include for <vector> in perf_counters.cc * Moved doc to docs * Added license blurbs	2021-04-28 09:25:29 +01:00
Dominic Hamon	264976def3	Fix windows warning on type conversion (#1121 )	2021-04-27 08:24:27 +01:00
Roman Lebedev	c05843a9f6	[sysinfo] Fix CPU Frequency reading on AMD Ryzen CPU's (#1117 ) Currently, i get: ``` Run on (32 X 7326.56 MHz CPU s) CPU Caches: L1 Data 32 KiB (x16) L1 Instruction 32 KiB (x16) L2 Unified 512 KiB (x16) L3 Unified 32768 KiB (x2) ``` which seems mostly right, except that the frequency is rather bogus. Yes, i guess the CPU could theoretically achieve that, but i have 3.6GHz configured, and scaling disabled. So we clearly read the wrong thing. With this fix, i now get the expected ``` Run on (32 X 3598.53 MHz CPU s) CPU Caches: L1 Data 32 KiB (x16) L1 Instruction 32 KiB (x16) L2 Unified 512 KiB (x16) L3 Unified 32768 KiB (x2) ```	2021-04-23 14:33:22 +03:00
Matt Armstrong	69054ae50e	Use fewer ramp up repetitions when KeepRunningBatch is used (#1113 ) Use the benchmark's reported iteration count when estimating iterations for the next repetition, rather than the requested iteration count. When the benchmark uses KeepRunningBatch the actual iteration count can be larger than the one the runner requested. Prior to this fix the runner was underestimating the next iteration count, sometimes significantly so. Consider the case of a benchmark using a batch size of 1024. Prior to this change, the benchmark runner would attempt iteration counts 1, 10, 100 and 1000, yet the benchmark itself would do the same amount of work each time: a single batch of 1024 iterations. The discrepancy could also contribute to estimation errors once the benchmark time reached 10% of the target. For example, if the very first batch of 1024 iterations reached 10% of benchmark_min_time, the runner would attempt to scale that to 100% from a basis of one iteration rather than 1024. This bug was particularly noticeable in benchmarks with large batch sizes, especially when the benchmark also had slow set up or tear down phases. With this fix in place it is possible to use KeepRunningBatch to achieve a kind of "minimum iteration count" feature by using a larger fixed batch size. For example, a benchmark may build a map of 500K elements and test a "find" operation. There is no point in running "find" just 1, 10, 100, etc., times. The benchmark can now pick a batch size of something like 10K, and the runner will arrive at the final max iteration count in noticeably fewer repetitions.	2021-04-20 07:16:05 +01:00
Chris Lalancette	07578d82e0	Shrink the tz_offset size to 41. (#1110 ) When building with gcc TSan on, and in Debug mode, we see a warning like: benchmark/src/timers.cc: In function ‘std::string benchmark::LocalDateTimeString()’: src/timers.cc:241:15: warning: ‘char* strncat(char*, const char*, size_t)’ output may be truncated copying 108 bytes from a string of length 127 [-Wstringop-truncation] 241 | std::strncat(storage, tz_offset, sizeof(storage) - timestamp_len - 1); | ~~~~~~~~~~~~^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ While this is essentially a false positive (we never expect the number of bytes in tz_offset to be too large), the compiler can't actually tell that. Shrink the size of tz_offset to a smaller, but still safe size to eliminate this warning. Signed-off-by: Chris Lalancette <clalancette@openrobotics.org>	2021-04-09 17:32:00 +01:00

625 Commits