How to compute AVG over a time window in Impala with OVER (PARTITION BY ... ORDER BY)
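I have a table in Impala with time information stored as Unix time at a frequency of 1 millisecond. I am trying to get AVG(), MIN() and MAX() over 10-second windows (but I don't want the width hard-coded; it could be 20 seconds, 30 seconds, and so on).
I am using a subquery to do this, but I am not getting the correct answer. Here is the data in my table: [image: Data in the Table]
I am using the following subquery to get AVG(), MIN() and MAX() for each 10-second window. I am using OVER (PARTITION BY ... ORDER BY) but not getting the correct result. My query is: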
SELECT DISTINCT *
FROM
  (SELECT ts,
          last_value(Table1.val1) OVER (PARTITION BY Table1.ts ORDER BY Table1.ts ROWS BETWEEN UNBOUNDED PRECEDING AND UNBOUNDED FOLLOWING) AS val1,
          AVG(Table1.val2) OVER (PARTITION BY Table1.ts ORDER BY Table1.ts ROWS BETWEEN UNBOUNDED PRECEDING AND UNBOUNDED FOLLOWING) AS val2,
          MIN(Table1.val3) OVER (PARTITION BY Table1.ts ORDER BY Table1.ts ROWS BETWEEN UNBOUNDED PRECEDING AND UNBOUNDED FOLLOWING) AS val3,
          MAX(Table1.val4) OVER (PARTITION BY Table1.ts ORDER BY Table1.ts ROWS BETWEEN UNBOUNDED PRECEDING AND UNBOUNDED FOLLOWING) AS val4
   FROM (SELECT cast(cast(unix_timestamp(cast(ts/1000 as TIMESTAMP))/10 as bigint)*10 as TIMESTAMP) AS ts,
                val1, val2, val3, val4
         FROM Sensor_Data.Table
         WHERE unit = 'Unit1'
           AND cast(ts/1000 as TIMESTAMP) BETWEEN '2020-11-29 22:30:00' AND '2020-12-01 01:51:00') AS Table1) AS Table2
ORDER BY ts
I need the following result:
Time                 Val1        Val2  Val3  Val4
2020-11-29 22:30:00  last_value  AVG   MIN   MAX
2020-11-29 22:30:10  last_value  AVG   MIN   MAX
2020-11-29 22:30:20  last_value  AVG   MIN   MAX
Can anyone tell me what is wrong with my Impala query?
Thanks!
Solution
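I think you just want aggregation, not window functions. A window function returns one row per input row, so it cannot by itself collapse each 10-second bucket into a single row; GROUP BY on the bucketed timestamp does: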
SELECT cast(cast(unix_timestamp(cast(ts/1000 as TIMESTAMP))/10 as bigint)*10 as TIMESTAMP) AS window_start,
       AVG(val2) AS val2,
       MIN(val3) AS val3,
       MAX(val4) AS val4
FROM Sensor_Data.Table
WHERE unit = 'Unit1' AND
      CAST(ts/1000 as TIMESTAMP) BETWEEN '2020-11-29 22:30:00' AND '2020-12-01 01:51:00'
GROUP BY cast(cast(unix_timestamp(cast(ts/1000 as TIMESTAMP))/10 as bigint)*10 as TIMESTAMP)
ORDER BY window_start
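To make the bucketing arithmetic easier to check, here is a minimal Python sketch of what the GROUP BY expression computes. It is not Impala code. The names `bucket_start`, `aggregate` and `width_s`, and the sample rows, are hypothetical and used only for illustration. It assumes `ts` is a positive Unix time in milliseconds, and it also returns the last `val1` per bucket, which the query above leaves out.

```python
from collections import defaultdict
from statistics import mean


def bucket_start(ts_ms, width_s=10):
    """Floor a Unix timestamp in milliseconds to the start of its window (in seconds)."""
    return (ts_ms // 1000) // width_s * width_s


def aggregate(rows, width_s=10):
    """rows: iterable of (ts_ms, val1, val2, val3, val4) tuples.

    Returns {window_start_s: (last val1, avg val2, min val3, max val4)}.
    """
    buckets = defaultdict(list)
    for row in rows:
        buckets[bucket_start(row[0], width_s)].append(row)

    result = {}
    for start, group in sorted(buckets.items()):
        # Order by the raw millisecond timestamp so "last" is well defined.
        group.sort(key=lambda r: r[0])
        result[start] = (
            group[-1][1],
            mean(r[2] for r in group),
            min(r[3] for r in group),
            max(r[4] for r in group),
        )
    return result


# Hypothetical sample rows; 1606689000 is 2020-11-29 22:30:00 UTC.
rows = [
    (1606689000000, 1, 10.0, 5, 7),
    (1606689004500, 2, 20.0, 3, 9),
    (1606689010001, 3, 30.0, 4, 8),
]
summary_10s = aggregate(rows, width_s=10)
summary_20s = aggregate(rows, width_s=20)
```

Changing `width_s` is all it takes to move from 10-second to 20-second or 30-second windows. In the Impala query, the equivalent change is replacing both occurrences of 10 in the bucketing expression (the `/10` and the `*10`) with the desired width in seconds.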