benchmark

mirror of https://github.com/google/benchmark.git synced 2025-03-15 11:40:12 +08:00

Author	SHA1	Message	Date
Roman Lebedev	090faecb45	Use IterationCount in one more place Found in -UNDEBUG build	2019-05-13 22:42:18 +03:00
Roman Lebedev	f92903cc53	Iteration counts should be `uint64_t` globally. (#817 ) This is a shameless rip-off of https://github.com/google/benchmark/pull/646 I did promise to look into why that proposed PR was producing so much worse assembly, and so i finally did. The reason is - that diff changes `size_t` (unsigned) to `int64_t` (signed). There is this nice little `assert`: `7a1c370283/include/benchmark/benchmark.h (L744)` It ensures that we didn't magically decide to advance our iterator when we should have finished benchmarking. When `cached_` was unsigned, the `assert` was `cached_ UGT 0`. But we only ever get to that `assert` if `cached_ NE 0`, and naturally if `cached_` is not `0`, then it is bigger than `0`, so the `assert` is tautological, and gets folded away. But now that `cached_` became signed, the assert became `cached_ SGT 0`. And we still only know that `cached_ NE 0`, so the assert can't be optimized out, or at least it doesn't currently. Regardless of whether or not that is a bug in itself, that particular diff would have regressed the normal 64-bit systems, by halving the maximal iteration space (since we go from unsigned counter to signed one, of the same bit-width), which seems like a bug. And just so it happens, fixing this bug, fixes the other bug. This produces fully (bit-by-bit) identical state_assembly_test.s The filecheck change is actually needed regardless of this patch, else this test does not pass for me even without this diff.	2019-05-13 12:33:11 +03:00
Michał Janiszewski	b988639f31	Fix compilation for Android (#816 ) Android doesn't support `getloadavg`	2019-05-09 15:22:13 -07:00
Roman Lebedev	33d4404650	Don't read CMAKE_BUILD_TYPE if it is not there (#811 ) Weird, but seems consistent with the rest of cmake here.	2019-05-07 16:06:50 -07:00
Lockywolf	823d24630d	Add support for GNU Install Dirs from GNU Coding Standards. Fixes #807 (#808 ) * Add support for GNU Install Dirs from GNU Coding Standards * src/CMakeLists.txt: Added support for setting the standard variables, such as CMAKE_INSTALL_BINDIR. * Replace install destinations by the ones from GNU Coding Standards. * Set the default .cmake and .pc default path.	2019-05-01 09:13:33 +01:00
Dominic Hamon	13b8bdc2b5	Bump required cmake version from 2.x to 3.x (#801 )	2019-05-01 09:06:12 +01:00
Michael Tesch	588be0446a	escape special chars in csv and json output. (#802 ) * escape special chars in csv and json output. - escape \b,\f,\n,\r,\t,\," from strings before dumping them to json or csv. - also faithfully reproduce the sign of nan in json. this fixes github issue #745. * functionalize. * split string escape functions between csv and json * Update src/csv_reporter.cc Co-Authored-By: tesch1 <tesch1@gmail.com> * Update src/json_reporter.cc Co-Authored-By: tesch1 <tesch1@gmail.com>	2019-04-19 18:47:25 +01:00
Dominic Hamon	1d41de8463	Add command line flags tests (#793 ) Increase coverage	2019-04-17 17:08:52 +01:00
Hannes Hauswedell	415835e03e	fix master branch on BSD (#792 ) fix master branch on BSD add name to CONTRIBUTORS	2019-04-11 16:36:11 +01:00
Bryan Lunt	7a1c370283	Add process_time for better OpenMP and user-managed thread timing * Google Benchmark now works with OpenMP and other user-managed threading.	2019-04-09 13:01:33 +01:00
Daniel Harvey	e3666568a9	Negative ranges #762 (#787 ) * Add FIXME in multiple_ranges_test.cc * Improve handling of large bounds in AddRange. Due to breaking the loop too early, AddRange would miss a final multplier of 'mult' that was within the numeric range of T. * Enable negative values for Range argument Fixes #762. * Try to fix build of benchmark_gtest * Try some more to fix build * Attempt to fix format macros * Attempt to resolve format errors for mingw32 * Review feedback Put unit tests in benchmark::internal namespace Fix error reporting in multiple_ranges_test.cc	2019-03-26 10:50:53 +00:00
BaaMeow	478eafa36b	[JSON] add threads and repetitions to the json output (#748 ) * [JSON] add threads and repetitions to the json output, for better ide… [Tests] explicitly check for thread == 1 [Tests] specifically mark all repetition checks [JSON] add repetition_index reporting, but only for non-aggregates (i… * [Formatting] Be very, very explicit about pointer alignment so clang-format can not put pointers/references on the wrong side of arguments. [Benchmark::Run] Make sure to use explanatory sentinel variable rather than a magic number. * Do not pass redundant information	2019-03-26 09:53:07 +00:00
Michael Tesch	fae8726690	Replace JSON inf and nan with JS compliant Infinity and NaN	2019-03-19 10:12:54 +00:00
Daniel Harvey	f6e96861a3	BENCHMARK_CAPTURE() and Complexity() - naming problem (#761 ) Created BenchmarkName class which holds the full benchmark name and allows specifying and retrieving different components of the name (e.g. ARGS, THREADS etc.) Fixes #730.	2019-03-17 16:38:51 +03:00
Jilin Zhou	d205ead299	[#774 ] implement GetNumCPUs(), GetCPUCyclesPerSecond(), and GetCacheSizes() (#775 ) - On qnx platform, cpu and cache info is stored in a syspage struct which is different from other OS platform. - The fix has been verified on an aarch64 target running qnx 7.0. Fixes #774	2019-02-28 10:42:44 +00:00
Jilin Zhou	0ae233ab23	[#766 ] add x-compile support for QNX SDP7 (#770 ) Since googletest already supports x-compilation for QNX, it is nice to have google benchmark support it too. Fixes #766	2019-02-19 13:05:55 +00:00
Andriy Berestovskyy	4b9f43e2c4	Fix header lines length (#752 ) Commit `17a012d7` added a newline to the str, so the line built from str.length() is one character longer than it should be.	2019-01-13 17:26:49 +03:00
Eric	4528c76b71	Print at least three significant digits for times. (#701 ) Some benchmarks are particularly sensitive and they run in less than a nanosecond. In order for the console reporter to provide meaningful output for such benchmarks it needs to be able to display the times using more resolution than a single nanosecond. This patch changes the console reporter to print at least three significant digits for all results. Unlike the initial attempt, this patch does not align the decimal point.	2018-12-13 22:49:21 -05:00
Dominic Hamon	0ed529a7e3	Update documentation of benchmark_filter (#744 ) It should now match reality.	2018-12-13 11:14:50 +00:00
Jatin Chaudhary	47a5f77d75	#722 Adding Host Name in Reporting (#733 ) * Adding Host Name and test * Addressing Review Comments * Adding Test for JSON Reporter * Adding HOST_NAME_MAX for MacOS systems * Adding Explaination for MacOS HOST_NAME_MAX Addition * Addressing Peer Review Comments * Adding codecvt in windows header guard * Changing name SystemInfo and adding empty message incase host name fetch fails * Adding Comment on Struct SystemInfo	2018-12-11 11:23:02 +00:00
Tobias Ulvgård	1f3cba06e4	Update reference to complexity calculation (#723 )	2018-12-10 15:15:34 +00:00
Roman Lebedev	c9f2693ea9	StrFormat() is a printf-like function, mark it as such, fix fallout. (#727 ) Fixes #714.	2018-11-26 19:55:05 -05:00
Dominic Hamon	b5082bbd65	Merge branch 'report_loadavg' of https://github.com/atdt/benchmark into atdt-report_loadavg	2018-11-13 10:13:58 +00:00
Anton Gladky	c6193afe7e	Fix parsing of cpuinfo for s390 platform. (#712 ) s390 has another line structure for processor-field. It should be differently parsed.	2018-10-21 11:01:42 +03:00
Roman Lebedev	507c06e636	Aggregates: use non-aggregate count as iteration count. (#706 ) It is incorrect to say that an aggregate is computed over run's iterations, because those iterations already got averaged. Similarly, if there are N repetitions with 1 iterations each, an aggregate will be computed over N measurements, not 1. Thus it is best to simply use the count of separate reports. Fixes #586.	2018-10-18 17:17:14 +03:00
Roman Lebedev	99d1356c04	[NFC] BenchmarkRunner: always populate _report_aggregates_only bools. (#708 ) It is better to let the RunBenchmarks(), report() decide whether to actually only* output aggregates or not, depending on whether there are actually aggregates. It's subtle indeed. Previously, `BenchmarkRunner()` always said that "if there are no repetitions, then you should never output only the repetitions". And the `report()` simply assumed that the `report_aggregates_only` bool it received makes sense, and simply used it. Now, the logic is the same, but the blame has shifted. `BenchmarkRunner()` always propagates what those benchmarks would have wanted to happen wrt the aggregates. And the `report()` lambda has to actually consider both the `report_aggregates_only` bool, and it's meaningfulness. To put it in the context of the patch series - if the repetition count was `1`, but `_report_aggregates_only` was set to `true`, and we capture each iteration separately, then we will compute the aggregates, but then output everything, both the iteration, and aggregates, despite `_report_aggregates_only` being set to `true`.	2018-10-18 15:08:59 +03:00
Roman Lebedev	9cacec8e78	[NFC] RunBenchmarks(): s/has_repetitions/might_have_aggregates/ (#707 ) That is the real purpose of that bool. A follow-up change will make it consider something else other than repetitions.	2018-10-18 15:03:17 +03:00
Ilya A. Kriveshko	8503dfe537	benchmark_color: fix auto option (#559 ) (#699 ) As prevously written, "--benchmark_color=auto" was treated as true, because IsTruthyFlagValue("auto") returned true. The fix is to rely on IsColorTerminal test only if the flag value is "auto", and fall back to IsTruthyFlagValue otherwise. I also integrated force_no_color check into the same block.	2018-10-08 09:33:21 +01:00
Roman Lebedev	a8082de5df	[NFC] Refactor RunBenchmark() (#690 ) Ok, so, i'm still trying to get to the state when it will be a trivial change to report all the separate iterations. The old code (LHS of the diff) was rather convoluted i'd say. I have tried to refactor it a bit into small logical chunks, with proper comments. As far as i can tell, i preserved the intent of the code, what it was doing before. The road forward still isn't clear, but i'm quite sure it's not with the old code :)	2018-10-01 17:51:08 +03:00
Dominic Hamon	edc77a3669	Make State constructor private. (#650 ) The State constructor should not be part of the public API. Adding a utility method to BenchmarkInstance allows us to avoid leaking the RunInThread method into the public API.	2018-09-28 12:28:43 +01:00
Martin Storsjö	439d6b1c2a	Include sys/time.h for cycleclock.h when building on MinGW (#680 ) When building for ARM, there is a fallback codepath that uses gettimeofday, which requires sys/time.h. The Windows SDK doesn't have this header, but MinGW does have it. Thus, this fixes building for Windows on ARM with MinGW headers/libraries, while Windows on ARM with the Windows SDK still is broken.	2018-09-19 11:52:05 +01:00
Martin Storsjö	5261307982	[benchmark] Lowercase windows specific includes (#679 ) The windows SDK headers don't have self-consistent casing anyway, and many projects consistently use lowercase for them, in order to fix crosscompilation with mingw headers.	2018-09-18 09:42:20 +01:00
Roman Lebedev	1b44120cd1	Un-deprecate [SG]et{Item,Byte}sProcessed, re-implement as custom counters. (#676 ) As discussed with @dominichamon and @dbabokin, sugar is nice. Well, maybe not for the health, but it's sweet. Alright, enough puns. A special care needs to be applied not to break csv reporter. UGH. We end up shedding some code over this. We no longer specially pretty-print them, they are printed just like the rest of custom counters. Fixes #627.	2018-09-13 22:03:47 +03:00
Roman Lebedev	58588476ce	Track two more details about runs - the aggregate name, and run name. (#675 ) This is related to @BaaMeow's work in https://github.com/google/benchmark/pull/616 but is not based on it. Two new fields are tracked, and dumped into JSON: * If the run is an aggregate, the aggregate's name is stored. It can be RMS, BigO, mean, median, stddev, or any custom stat name. * The aggregate-name-less run name is additionally stored. I.e. not some name of the benchmark function, but the actual name, but without the 'aggregate name' suffix. This way one can group/filter all the runs, and filter by the particular aggregate type. I might need this for further tooling improvement. Or maybe not. But this is certainly worthwhile for custom tooling.	2018-09-13 15:08:15 +03:00
Roman Lebedev	c614dfc0d4	Display aggregates only. (#665 ) There is a flag `d9cab612e4/src/benchmark.cc (L75-L78)` and a call `d9cab612e4/include/benchmark/benchmark.h (L837-L840)` But that affects everything, every reporter, destination: `d9cab612e4/src/benchmark.cc (L316)` It would be quite useful to have an ability to be more picky. More specifically, i would like to be able to only see the aggregates in the on-screen output, but for the file output to still contain everything. The former is useful in case of a lot of repetition (or even more so if every iteration is reported separately), while the former is great for tooling. Fixes https://github.com/google/benchmark/issues/664	2018-09-12 16:26:17 +03:00
Roman Lebedev	f0901417c8	GetCacheSizesMacOSX(): use consistent types. (#667 ) I have absolutely no way to test this, but this looks obviously-good. This was reported by Tim Northover @TNorthover in http://lists.llvm.org/pipermail/llvm-commits/Week-of-Mon-20180903/584223.html > I think this breaks some 32-bit configurations (well, mine at least). > I was using Clang (from Xcode 10 beta) on macOS and got a bunch of > errors referencing sysinfo.cc:292 and onwards: > /Users/tim/llvm/llvm-project/llvm/utils/benchmark/src/sysinfo.cc:292:47: > error: non-constant-expression cannot be narrowed from type > 'std::__1::array<unsigned long long, 4>::value_type' (aka 'unsigned > long long') to 'size_t' (aka 'unsigned long') in initializer list > [-Wc++11-narrowing] > } Cases[] = {{"hw.l1dcachesize", "Data", 1, CacheCounts[1]}, > ^~~~~~~~~~~~~~ > > The same happens when self-hosting ToT. Unfortunately I couldn't > reproduce the issue on Debian (Clang 6.0.1) even with libc++; I'm not > sure what the difference is.	2018-09-05 12:20:18 +01:00
pseyfert	fbfc495d7f	add missing closing bracket in --help message (#666 )	2018-09-03 19:45:09 +03:00
Roman Lebedev	caa2fcb19c	Counter(): add 'one thousand' param. (#657 ) * Counter(): add 'one thousand' param. Needed for https://github.com/google/benchmark/pull/654 Custom user counters are quite custom. It is not guaranteed that the user always expects for these to have 1k == 1000. If the counter represents bytes/memory/etc, 1k should be 1024. Some bikeshedding points: 1. Is this sufficient, or do we really want to go full on into custom types with names? I think just the '1000' is sufficient for now. 2. Should there be a helper benchmark::Counter::Counter{1000,1024}() static 'constructor' functions, since these two, by far, will be the most used? 3. In the future, we should be somehow encoding this info into JSON. * Counter(): use std::pair<> to represent 'one thousand' * Counter(): just use a new enum with two values 1000 vs 1024. Simpler is better. If someone comes up with a real reason to need something more advanced, it can be added later on. * Counter: just store the 1000 or 1024 in the One_K values directly * Counter: s/One_K/OneK/	2018-08-29 21:11:06 +03:00
Roman Lebedev	d9cab612e4	[NFC] s/console_reporter/display_reporter/ (#663 ) There are two destinations: * display (console, terminal) and * file. And each of the destinations can be poplulated with one of the reporters: * console - human-friendly table-like display * json * csv (deprecated) So using the name console_reporter is confusing. Is it talking about the console reporter in the sense of table-like reporter, or in the sense of display destination?	2018-08-29 14:58:54 +03:00
Roman Lebedev	8688c5c4cf	Track 'type' of the run - is it an actual measurement, or an aggregate. (#658 ) This is only exposed in the JSON. Not in CSV, which is deprecated. This only supposed to track these two states. An additional field could later track which aggregate this is, specifically (statistic name, rms, bigo, ...) The motivation is that we already have ReportAggregatesOnly, but it affects the entire reports, both the display, and the reporters (json files), which isn't ideal. It would be very useful to have a 'display aggregates only' option, both in the library's console reporter, and the python tooling, This will be especially needed for the 'store separate iterations'.	2018-08-28 18:11:36 +03:00
Roman Lebedev	9a179cb93f	[NFC] Prefix "report(_)?mode" with Aggregation. (#656 ) This only specifically represents handling of reporting of aggregates. Not of anything else. Making it more specific makes the name less generic. This is an issue because i want to add "iteration report mode", so the naming would be conflicting.	2018-08-28 17:19:25 +03:00
BaaMeow	af441fc114	properly escape json names (#652 )	2018-08-16 09:47:09 -07:00
Kirill Bobyrev	f85304e4e3	Remove redundant default which causes failures (#649 ) * Remove redundant default which causes failures * Fix old GCC warnings caused by poor analysis * Use __builtin_unreachable * Use BENCHMARK_UNREACHABLE() * Pull __has_builtin to benchmark.h too * Also move compiler identification macro to main header * Move custom compiler identification macro back	2018-08-08 14:39:57 +01:00
Dominic Hamon	f965eab508	Memory management and reporting hooks (#625 ) * Introduce memory manager interface * Add memory stats to JSON reporter and a test * Add comments and switch json output test to int	2018-07-24 15:57:15 +01:00
Ori Livneh	da9ec3dfca	Include system load average in console and JSON reports High system load can skew benchmark results. By including system load averages in the library's output, we help users identify a potential issue in the quality of their measurements, and thus assist them in producing better (more reproducible) results. I got the idea for this from Brendan Gregg's checklist for benchmark accuracy (http://www.brendangregg.com/blog/2018-06-30/benchmarking-checklist.html).	2018-07-09 10:51:08 -04:00
Federico Ficarelli	0c21bc369a	Fix build with Intel compiler (#631 ) * Set -Wno-deprecated-declarations for Intel Intel compiler silently ignores -Wno-deprecated-declarations so warning no. 1786 must be explicitly suppressed. * Make std::int64_t → double casts explicit While std::int64_t → double is a perfectly conformant implicit conversion, Intel compiler warns about it. Make them explicit via static_cast<double>. * Make std::int64_t → int casts explicit Intel compiler warns about emplacing an std::int64_t into an int container. Just make the conversion explicit via static_cast<int>. * Cleanup Intel -Wno-deprecated-declarations workaround logic	2018-07-09 11:45:10 +01:00
Federico Ficarelli	5946795e82	Disable Intel invalid offsetof warning (#629 )	2018-07-03 10:13:22 +01:00
Roman Lebedev	b123abdcf4	Add Iteration-related Counter::Flags. Fixes #618 (#621 ) Inspired by these [two](`a1ebe07bea`) [bugs](`0891555be5`) in my code due to the lack of those i have found fixed in my code: * `kIsIterationInvariant` - `* state.iterations()` The value is constant for every iteration, and needs to be multiplied by the iteration count. * `kAvgIterations` - `/ state.iterations()` The is global over all the iterations, and needs to be divided by the iteration count. They play nice with `kIsRate`: * `kIsIterationInvariantRate` * `kAvgIterationsRate`. I'm not sure how meaningful they are when combined with `kAvgThreads`. I guess the `kIsThreadInvariant` can be added, too, for symmetry with `kAvgThreads`.	2018-06-27 15:45:30 +01:00
Marat Dukhan	505be96ab2	Avoid using CMake 3.6 feature list(FILTER ...) (#612 ) list(FILTER ...) is a CMake 3.6 feature, but benchmark targets CMake 2.8.12	2018-06-06 12:32:42 +01:00
Sergiu Deitsch	1301f53e31	cmake: use numeric version in package config (#611 )	2018-06-05 15:01:44 +01:00

1 2 3 4 5 ...

428 Commits