Aggregation functions
AVG
¶
Applies to: stream, table Since: 0.6.0 | |
---|---|
1 |
|
Returns the average value for col1
.
See AVG in action
COLLECT_LIST
¶
Applies to: stream, table | |
---|---|
1 |
|
Returns an array containing all the values of col1
from each
input row (for the specified grouping and time window, if any).
The size of the result ARRAY can be limited to a maximum of
ksql.functions.collect_list.limit
entries, and any values beyond this
limit are ignored silently.
When used with SESSION
window, sometimes two session windows are merged
together into one, when a out-of-order record with a timestamp between
the two windows is processed. In this case, the record limit is calculated by
first considering all the records from the first window, then the out-of-order
record, then the records from the second window in the order they were
originally processed.
See COLLECT_LIST in action
COLLECT_SET
¶
Applies to: stream | |
---|---|
1 |
|
Returns an array containing the distinct values of col1
from
each input row (for the specified grouping and time window, if any).
The size of the result ARRAY can be limited to a maximum of
ksql.functions.collect_set.limit
entries, and any values beyond this
limit are ignored silently.
When used with a SESSION
window, sometimes two session windows are merged
together into one, when a out-of-order record with a timestamp between
the two windows is processed. In this case, the record limit is calculated by
first considering all the records from the first window, then the out-of-order
record, then the records from the second window in the order they were
originally processed.
COUNT
¶
Applies to: stream, table | |
---|---|
1 2 |
|
Counts the number of rows.
When col1
is specified, the count returned is the number of rows where
col1
is non-null.
When *
is specified, the count returned is the total number of rows.
See COUNT in action
COUNT_DISTINCT
¶
Applies to: stream Since: 0.7.0 | |
---|---|
1 |
|
Returns the approximate number of unique values of col1
in a group.
The function implementation uses HyperLogLog to estimate cardinalities of 10^9 with a typical standard error of 2 percent.
EARLIEST_BY_OFFSET
¶
Applies to: stream Since: 0.10.0 | |
---|---|
1 |
|
Return the earliest value for the specified column.
The earliest value in the partition has the lowest offset.
The optional ignoreNulls
parameter, available since version 0.13.0,
controls whether NULL
values are ignored. The default is to ignore
NULL
values.
Applies to: stream Since: 0.13.0 | |
---|---|
1 |
|
Returns the earliest N values for the specified column as an ARRAY
.
The earliest values in the partition have the lowest offsets.
The optional ignoreNulls
parameter, available since version 0.13.0,
controls whether NULL
values are ignored. The default is to ignore
NULL
values.
HISTOGRAM
¶
Applies to: stream, table | |
---|---|
1 |
|
Returns a map containing the distinct string values of col1
mapped to the number of times each one occurs for the given window.
The HISTOGRAM
function limits the number of distinct values which can be
counted to 1000, beyond which any additional entries are ignored.
When used with a SESSION
window, sometimes two session windows are merged
together into one, when a out-of-order record with a timestamp between
the two windows is processed. In this case, the 1000-record limit is calculated
by first considering all the records from the first window, then the
out-of-order record, then the records from the second window in the order they
were originally processed.
See HISTOGRAM in action
LATEST_BY_OFFSET
¶
Applies to: stream Since: 0.8.0 | |
---|---|
1 |
|
Returns the latest value for the specified column.
The latest value in the partition has the largest offset.
The optional ignoreNulls
parameter, available since version 0.13.0,
controls whether NULL
values are ignored. The default is to ignore
NULL
values.
Applies to: stream Since: 0.13.0 | |
---|---|
1 |
|
Returns the latest N values for the specified column as an ARRAY
.
The latest values have the largest offset.
The optional ignoreNulls
parameter, available since version 0.13.0,
controls whether NULL
values are ignored. The default is to ignore
NULL
values.
MAX
¶
Applies to: stream | |
---|---|
1 |
|
Returns the maximum value for a given column and window.
Rows that have col1
set to NULL
are ignored.
See MAX in action
MIN
¶
Applies to: stream | |
---|---|
1 |
|
Returns the minimum value for a given column and window.
Rows that have col1
set to NULL
are ignored.
See MIN in action
STDDEV_SAMP
¶
Applies to: stream, table Since: 0.16.0 | |
---|---|
1 |
|
the standard deviation. Use the STDDEV_SAMPLE
function to compute the standard deviation.
Returns the sample standard deviation for the column.
STDDEV_SAMPLE
¶
Applies to: stream, table Since: 0.25.0 | |
---|---|
1 |
|
Returns the sample standard deviation for the column.
SUM
¶
Applies to: stream, table | |
---|---|
1 |
|
Rows that have col1
set to NULL
are ignored.
See SUM in action
TOPK
¶
Applies to: stream | |
---|---|
1 |
|
Returns the Top K values for the given column and window.
Rows that have col1
set to NULL
are ignored.
Example | |
---|---|
1 2 3 4 |
|
TOPKDISTINCT
¶
Applies to: stream | |
---|---|
1 |
|
Returns the distinct Top K values for the given column and window.
Rows that have col1
set to NULL
are ignored.
Example | |
---|---|
1 2 3 4 |
|