Influxdata is a platform for storing, collecting, visualizing and managing time-series data. Currently Influxdb is the most famous time series database. This article is for version 1.x only.
The following topics that I am going to cover in this article.
- Influxdb Installation
- Inserting Data
- Querying Data
- Tags in influxdb
- Group by time Query
- Deleting measurement or data from measurement
- Retention Policies
In influxdb the combination of timestamp and tags is composite keys. So lets start.
First of all install influxdb from its official download page. If you donot have admin rights then you can follow my article on how to install influxdb without root access.
Now open another terminal and start the CLI using command
$ influx Connected to http://localhost:8086 version 0.13.x InfluxDB shell 0.13.x >
To create a database
create database mydb
To view the list of all databases
show databases name: databases --------------- name _internal mydb
To use a particular database
To delete a particular database
drop database mydb
In influxdb we call tables as measurements and columns as fields.
We don’t need to define measurements(tables) and fields(columns). It will create measurements and add columns automatically when we insert data. It is also like a NoSQL database
Format for writing data is
measurementName field1=value1,field2=value2,field3=value3 timestamp
- Position of spaces and comma matters a lot.
- The timestamp is in nanoseconds. If we don’t provide timestamp it will assign the local current timestamp (in nanoseconds).
- By default it assumes all the numbers as doubles. For integer value we have to append
iat the end.
> insert measurementName field4=12i
- String values should be in double quotes.
> insert measurementName field5="qwqw"
- For boolean values use t, T, true, True, or TRUE for TRUE, and f, F, false, False, or FALSE for FALSE
> insert measurementName field6=T
- We can use
\character for escaping comma, space, equal and other special character in field (field) value
Note : Once you write a particular field in a particular datatype, you cannot change it back.
For more details refer official documentation
To select all fields from measurement
select * from measurementName
To select particular fields
select field1, field2 from measurement
Note : If your mesurement name or field name contains characters such as
=, then use double quotes
select "field1.name", "field2.name" from "measurement.name"
I suggest you to always use double quotes because some times influxdb doesn’t show you values if you donot use double quotes.
A typical usage of where clause
select * from measurement where field1 > 12 and field2 = 'sparta' and time > now() - 1d
We can also use
or logic using separaters
Supported comaparaters in influxdb are
<>not equal to
!=not equal to
!~doesn’t match against
Tags in influxdb
In the starting of the article I told you that in influxdb time is the primary key. This is wrong. The truth is In influxdb the combination of timestamp + tags is the primary key (better say composition key).
What is a tag?
According to influxdb official documentation tags are used to store metadata. Let me explain in my words.
Once I had to run a group by query based on a particular field. Then I came to know that you cannot just run group by query on any field. To use group by on a particular field, there are special type of fields known as tags. Its better not to call them fields. Its just for your understanding.
There can be different values of a field (column) but for a tag, its values should be limited. Lets say I have a measurement called students. Its fields are name, age, height and its tags are gender, grade.
Format for inserting row having tags in it
measurementName,tagKey1=tagValue1,tagKey2=tagValue2 field1=value1,field2=value2,field3=value3 timestamp
insert students,gender=male,grade=B age=23i,name="Ashish",height=11.2
Group by tag query example
select * from students group by grade, gender
name: students tags: gender=male, grade=B time age height name ---- --- ------ ---- 1584820680760917555 23 11.2 Ashish
Group by time Query
Suppose you are inserting room temperature value in influxdb for the last 24 hours at regular interval of 1 minute. Now you want to check the behaviour of room temperature that either it is increasing or decreasing.
One way is directly hitting the query
select temperature from stats where time > now() - 24h and check all the 1440 rows ( 24 * 60).
Another way is getting the average teperature of every hour for all 24 hours. In this way we have to analyze only 24 points. So for getting the type of result we use group by time()
select mean("temperature") from stats where time > now() - 24h group by time(1h)
You can change the aggregate function mean. There are various aggregate functions like mean, max, min, median, count, mode, distinct, sum, stddev, first, last, difference.
You can read about queries in details from its website
Deleting measurement or data from measurement
To delete a whole measurement
drop measurement measurementName
drop measurement students
To delete a specific row
In influxdb you can delete data from a measurement only on the basis on time or tags. You cannot specify field name.
delete from students where time = 1584822445492464254
delete from students where "grade"='B'
Here double quotes and single quote matters.
Lets say if you want to remove data older than 1 month automatically by influxdb then retention policies are used.
If you want Influx cli to show result in column or json format use these commands