Data migration

tool to import whisper data into InfluxDB

It works! There's 2 modes of operation:

standard: copies the from-to timerange from the best fitting whisper archive (the one with highest resolution that covers the range)
if you specify -all, ignores from/to, and copies all data from all archives. currently it copies from lowest resolution (longest range) to higher resolution (shorter range), because they overlap and so that the records from more accurate archives overwrite earlier, less accurate records with the same timestamp. Later, this can be extended to make the points from different archives go to different series, which is how you implement retention policies in InfluxDB.

options:

usage of whisper-to-influxdb:
  -all=false: copy all data from all archives, as opposed to just querying the timerange from the best archive
  -exclude="": don't process whisper files whose filename contains this string ("" disables the filter, and matches nothing
  -from=1412110823: Unix epoch time of the beginning of the requested interval. (default: 24 hours ago). ignored if all=true
  -include="": only process whisper files whose filename contains this string ("" is a no-op, and matches everything
  -influxDb="graphite": influxdb database
  -influxHost="localhost": influxdb host
  -influxPass="graphite": influxdb pass
  -influxPort=8086: influxdb port
  -influxPrefix="": prefix this string to all imported data
  -influxUser="graphite": influxdb user
  -influxWorkers=10: specify how many influx workers
  -skipUntil="": absolute path of a whisper file from which to resume processing
  -statsInterval=10: interval to display stats. by default 10 seconds.
  -until=1412197223: Unix epoch time of the end of the requested interval. (default: now). ignored if all=true
  -verbose=false: verbose output
  -whisperDir="/opt/graphite/storage/whisper/": location where all whisper files are stored
  -whisperWorkers=10: specify how many whisper workers

Data migration

Some metrics are converted from collectd/graphite to telegraf/influxdb format

This includes:

CPU
Load
Memory
haproxy (curl_json)
elasticsearch (curl_json)

Some metrics are converted from counters to derived gauges by collectd and are pushed to _derived column in influxdb to prevent wrong derived valued when you query them.

This includes:

disk
interfaces

Here is an example influxdb query to keep compat in your grafana:

SELECT mean("reads") as "reads", mean("writes") as "writes" FROM (SELECT non_negative_derivative(mean("reads"), 1s) AS "reads", non_negative_derivative(mean("writes"), 1s) AS "writes" FROM "diskio" WHERE ("host" =~ /^$host$/ AND "name" =~ /^$disk$/) AND $timeFilter GROUP BY time($__interval) fill(null)), (SELECT mean("reads_derived") as "reads", mean("writes_derived") as "writes" FROM "diskio" WHERE ("host" =~ /^$host$/ AND "name" =~ /^$disk$/) AND $timeFilter GROUP BY time($__interval) fill(null)) GROUP BY time($__interval)

Name		Name	Last commit message	Last commit date
Latest commit History 41 Commits
.gitignore		.gitignore
LICENSE		LICENSE
NOTICE		NOTICE
README.md		README.md
go.mod		go.mod
whisper-to-influxdb.go		whisper-to-influxdb.go

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Data migration

About

Releases

Packages

Languages

License

infopro-digital/whisper-to-influxdb

Folders and files

Latest commit

History

Repository files navigation

Data migration

About

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages