Currently the Trend-based threshold checks and end-of-test summary stats rely on saving all of the relevant metric values in memory. This is basically a large memory leak that can negatively affect long-running and/or HTTP-heavy load tests, as reported by this user on Slack.
For the moment, the best way to solve those issues without any loss of functionality, and with only a very tiny loss of precision, appears to be HdrHistogram. Here's an excerpt from the description on the HdrHistogram website:
A Histogram that supports recording and analyzing sampled data value counts across a configurable integer value range with configurable value precision within the range. Value precision is expressed as the number of significant digits in the value recording, and provides control over value quantization behavior across the value range and the subsequent value resolution at any given level.
[ ... ]
HDR Histogram is designed for recording histograms of value measurements in latency and performance sensitive applications. [ ... ] The HDR Histogram maintains a fixed cost in both space and time. A Histogram's memory footprint is constant, with no allocation operations involved in recording data values or in iterating through them. The memory footprint is fixed regardless of the number of data value samples recorded, and depends solely on the dynamic range and precision chosen. The amount of work involved in recording a sample is constant, and directly computes storage index locations such that no iteration or searching is ever involved in recording data values.
This is the original Java library by Gil Tene. He also has some great talks (this one and its earlier version, for example) that explain some of the common pitfalls when people measure latency and why he built HdrHistogram. They're also a very strong argument for why we should prioritize the arrival-rate-based VU executor... 😄
This is an MIT-licensed Go implementation of HdrHistogram, though it seems to be dead: the repo is archived, with no recent commits and unresolved issues and PRs. So we may need to fork that repo and maintain it, or re-implement the algorithm ourselves.
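To illustrate the idea, here's a minimal sketch (assuming that Go package, github.com/codahale/hdrhistogram) of how a histogram-backed Trend sink could record response times in constant memory and still answer the percentile queries that thresholds need. The `HistogramTrendSink` type and its methods are hypothetical, not existing k6 code:

```go
package main

import (
	"fmt"
	"time"

	"github.com/codahale/hdrhistogram"
)

// HistogramTrendSink is a hypothetical histogram-backed replacement for the
// current Trend sink, which stores every observed value.
type HistogramTrendSink struct {
	hist *hdrhistogram.Histogram
}

func NewHistogramTrendSink() *HistogramTrendSink {
	// Track values from 1µs to 1h with 3 significant digits of precision.
	// The memory footprint depends only on this range and precision,
	// not on how many samples are recorded.
	return &HistogramTrendSink{
		hist: hdrhistogram.New(1, int64(time.Hour/time.Microsecond), 3),
	}
}

// Add records a single measurement in constant time.
func (s *HistogramTrendSink) Add(d time.Duration) {
	// Out-of-range values return an error; ignored here for brevity.
	_ = s.hist.RecordValue(int64(d / time.Microsecond))
}

// P returns the given percentile (e.g. 95.0) as a duration.
func (s *HistogramTrendSink) P(pct float64) time.Duration {
	return time.Duration(s.hist.ValueAtQuantile(pct)) * time.Microsecond
}

func main() {
	sink := NewHistogramTrendSink()
	for i := 1; i <= 100000; i++ {
		sink.Add(time.Duration(i) * time.Microsecond)
	}
	fmt.Printf("p(95)=%s max=%dµs samples=%d\n",
		sink.P(95), sink.hist.Max(), sink.hist.TotalCount())
}
```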
Another thing that HdrHistogram may help with is exposing summary stats in the teardown() function, or outputting them in a JSON file at the end of the test. This is something that a lot of users have requested - #647, #351, and somewhat #355.
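For illustration only, here is a rough sketch of what an end-of-test JSON summary for a single Trend metric could look like when computed from an HdrHistogram. The field names and schema are made up, not an agreed-upon format:

```go
package main

import (
	"encoding/json"
	"fmt"

	"github.com/codahale/hdrhistogram"
)

// trendSummary is an illustrative shape for a per-metric summary; the actual
// JSON layout would need to be designed separately.
type trendSummary struct {
	Count int64   `json:"count"`
	Min   int64   `json:"min"`
	Max   int64   `json:"max"`
	Avg   float64 `json:"avg"`
	Med   int64   `json:"med"`
	P90   int64   `json:"p(90)"`
	P95   int64   `json:"p(95)"`
}

// summarize derives all of the usual end-of-test stats directly from the
// histogram, without ever touching the raw samples.
func summarize(h *hdrhistogram.Histogram) trendSummary {
	return trendSummary{
		Count: h.TotalCount(),
		Min:   h.Min(),
		Max:   h.Max(),
		Avg:   h.Mean(),
		Med:   h.ValueAtQuantile(50),
		P90:   h.ValueAtQuantile(90),
		P95:   h.ValueAtQuantile(95),
	}
}

func main() {
	h := hdrhistogram.New(1, 3600000000, 3) // 1µs to 1h, in µs
	for i := int64(1); i <= 1000; i++ {
		_ = h.RecordValue(i * 100)
	}
	out, _ := json.MarshalIndent(summarize(h), "", "  ")
	fmt.Println(string(out))
}
```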
Most of the difficulty there lies in exposing the raw data to the JS runtime (with HdrHistogram we could simply expose its API), and especially in implementing the stats calculation in a distributed execution environment (the current Load Impact cloud or the future native k6 cluster execution). Having trend metrics backed by HdrHistogram should allow us to avoid the need to schlep all of the raw metrics data between k6 instances (or require an external DB) at the end of a distributed test...
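As a rough sketch of that idea (again assuming the codahale/hdrhistogram package and its Export/Import/Merge functions), each instance could ship a compact snapshot of its histogram to the coordinator, which merges them and computes the final percentiles, without any raw samples being transferred. The "instances" and the wire format below are purely illustrative:

```go
package main

import (
	"encoding/json"
	"fmt"

	"github.com/codahale/hdrhistogram"
)

func main() {
	// Two "instances" record their own samples locally.
	inst1 := hdrhistogram.New(1, 3600000000, 3)
	inst2 := hdrhistogram.New(1, 3600000000, 3)
	for i := int64(1); i <= 1000; i++ {
		_ = inst1.RecordValue(i * 100) // 100µs .. 100ms
		_ = inst2.RecordValue(i * 200) // 200µs .. 200ms
	}

	// Each instance serializes only its histogram snapshot (bucket counts
	// plus configuration), not the individual samples, e.g. to send it to
	// the coordinating instance over the network.
	payload, _ := json.Marshal(inst2.Export())

	// The coordinator deserializes the snapshot and merges everything into
	// a single histogram, from which the final stats are computed.
	var snap hdrhistogram.Snapshot
	_ = json.Unmarshal(payload, &snap)
	merged := hdrhistogram.Import(&snap)
	merged.Merge(inst1)

	fmt.Printf("combined: count=%d p(95)=%dµs\n",
		merged.TotalCount(), merged.ValueAtQuantile(95))
}
```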