
perfdb


perfdb is a time series database optimized for performance measurements.

Why?

Yes, this is yet another time series database written in Go. There are also many excellent non-Go implementations such as cube, KairosDB or OpenTSDB. Unfortunately, most of them are designed for continuous monitoring and therefore address a different set of requirements. Many of them are also overly complicated or carry a heavy load of dependencies.

perfdb was created to address the day-to-day needs of performance benchmarking. Its storage is designed so that millions of samples can be aggregated and visualized accurately.

It does not aim to support flexible queries, but it produces nice SVG graphs and makes it easy to explore data via a convenient REST API.

Last but not least, perfdb is distributed as a single binary file with zero external dependencies.

Storing samples

Let's say you measure application latency several times per second. Each sample is a JSON document:

{
	"read_latency": 12.3
}

To persist measurements, send the following HTTP request:

curl -X POST http://localhost:8080/mydatabase -d '{"read_latency":12.3}'

where:

`mydatabase` is the name of the time series database. It's recommended to create a separate database for each benchmark.

In practice, you will likely use your favourite programming language rather than curl to send HTTP requests.
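For example, a minimal Go sketch (assuming the server is listening on localhost:8080, as above) could post the same sample like this:

package main

import (
	"bytes"
	"fmt"
	"net/http"
)

func main() {
	// Post a single read_latency sample to the "mydatabase" database.
	// The address and database name are taken from the curl example above.
	body := bytes.NewBufferString(`{"read_latency":12.3}`)
	resp, err := http.Post("http://localhost:8080/mydatabase", "application/json", body)
	if err != nil {
		panic(err)
	}
	defer resp.Body.Close()
	fmt.Println("status:", resp.Status)
}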

It's absolutely OK to create thousands of databases.

Aggregation and visualization

This API endpoint returns a JSON document with aggregated statistics (mean, percentiles, etc.):

$ curl -s http://127.0.0.1:8080/mydatabase/read_latency/summary | python -m json.tool
{
	"avg": 5.82248,
	"count": 200000,
	"max": 100,
	"min": 0,
	"p50": 3,
	"p80": 9,
	"p90": 14,
	"p95": 21,
	"p99": 40,
	"p99.9": 76
}

Please note that Python is used for demonstration purposes only.
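A Go sketch along these lines could fetch and decode the summary programmatically (the server address and metric name are the ones used above; the summary is assumed to be a flat JSON object of named numbers, as in the example output):

package main

import (
	"encoding/json"
	"fmt"
	"net/http"
)

func main() {
	// Fetch the aggregated summary for the read_latency metric.
	resp, err := http.Get("http://127.0.0.1:8080/mydatabase/read_latency/summary")
	if err != nil {
		panic(err)
	}
	defer resp.Body.Close()

	// Decode the flat JSON object of named statistics.
	var summary map[string]float64
	if err := json.NewDecoder(resp.Body).Decode(&summary); err != nil {
		panic(err)
	}
	fmt.Printf("count=%v avg=%v p99=%v\n", summary["count"], summary["avg"], summary["p99"])
}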

Finally, it is possible to generate heat map graphs in SVG format (use your browser to view):

http://127.0.0.1:8080/mydatabase/read_latency/heatmap

Each rectangle is a cluster of values. A darker color corresponds to a denser population. The legend on the right side of the graph (the vertical bar) helps to interpret the density.
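If you want to keep a graph, a short Go sketch like the following can download the SVG and save it to disk (the output filename is just an example):

package main

import (
	"io"
	"net/http"
	"os"
)

func main() {
	// Download the heat map for the read_latency metric.
	resp, err := http.Get("http://127.0.0.1:8080/mydatabase/read_latency/heatmap")
	if err != nil {
		panic(err)
	}
	defer resp.Body.Close()

	// Save the SVG to a local file for later viewing.
	out, err := os.Create("read_latency.svg")
	if err != nil {
		panic(err)
	}
	defer out.Close()

	if _, err := io.Copy(out, resp.Body); err != nil {
		panic(err)
	}
}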

Browsing data

To list all available databases, use the following request:

$ curl -s http://127.0.0.1:8080/ | python -m json.tool
[
	"mydatabase"
]

To list all metrics in a database, use a request similar to:

$ curl -s http://127.0.0.1:8080/mydatabase | python -m json.tool
[
	"read_latency",
	"write_latency"
]
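Both listing endpoints return a plain JSON array of strings, so one helper covers them. In the Go sketch below, listNames is a hypothetical helper written for illustration; it is not part of perfdb:

package main

import (
	"encoding/json"
	"fmt"
	"net/http"
)

// listNames decodes the JSON array of strings returned by the listing endpoints.
// It is a hypothetical helper for this example, not a perfdb API.
func listNames(url string) ([]string, error) {
	resp, err := http.Get(url)
	if err != nil {
		return nil, err
	}
	defer resp.Body.Close()

	var names []string
	if err := json.NewDecoder(resp.Body).Decode(&names); err != nil {
		return nil, err
	}
	return names, nil
}

func main() {
	databases, err := listNames("http://127.0.0.1:8080/")
	if err != nil {
		panic(err)
	}
	fmt.Println("databases:", databases)

	metrics, err := listNames("http://127.0.0.1:8080/mydatabase")
	if err != nil {
		panic(err)
	}
	fmt.Println("metrics:", metrics)
}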

Querying samples

Only bulk queries are supported. To get the list of samples, use a request similar to:

$ curl -s http://127.0.0.1:8080/mydatabase/read_latency | python -m json.tool

The output is a JSON document with all timestamps and values:

[
	[
		1437137708114,
		10
	],
	[
		1437137708118,
		15
	],
	[
		1437137708122,
		16
	]
]

The first value in each nested pair is the timestamp (the number of milliseconds elapsed since January 1, 1970 UTC).

The second value is the stored measurement (integer or float).
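A Go sketch along these lines can fetch and decode the samples. It assumes, based on the format shown above, that each sample is a two-element array of numbers and that timestamps are in milliseconds:

package main

import (
	"encoding/json"
	"fmt"
	"net/http"
	"time"
)

func main() {
	// Fetch all samples of the read_latency metric.
	resp, err := http.Get("http://127.0.0.1:8080/mydatabase/read_latency")
	if err != nil {
		panic(err)
	}
	defer resp.Body.Close()

	// Each sample is a [timestamp, value] pair; both elements decode as float64.
	var samples [][2]float64
	if err := json.NewDecoder(resp.Body).Decode(&samples); err != nil {
		panic(err)
	}

	for _, s := range samples {
		// Convert the millisecond timestamp to a time.Time for display.
		ts := time.UnixMilli(int64(s[0])).UTC()
		fmt.Printf("%s -> %v\n", ts, s[1])
	}
}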

Getting started

The latest stable perfdb binaries are available on the Releases page.

Just download the file for your platform and run it in a terminal:

$ ./perfdb 

The command above starts an HTTP listener on port 8080. By default, a folder named "data" is created in the current working directory.

It is possible to specify custom settings using CLI arguments:

$ ./perfdb -h
Usage of ./perfdb:
	-address string
		serve requests to this host:port (default "127.0.0.1:8080")
	-path string
		PerfDB data directory (default "data")
