经典SQL之连续3天登陆
经典SQL1——连续3天登陆
无论大厂还是小厂,在SQL题面试中,面试官都很喜欢问连续3天登陆,那么今天就来手撕一下
建表
在Hive中简单建表
drop table if exists user_login_details_last_3_day;
create table user_login_details_last_3_day
(Userid int,login_date date
)
插入临时数据
简单插入几条测试数据
insert overwrite table user_login_details_last_3_day
select 1010123,'2021-08-02'
union all
select 1010123,'2021-08-03'
union all
select 1010123,'2021-08-04'
union all
select 1010122,'2021-11-02'
union all
select 1010122,'2021-12-09'
union all
select 1010199,'2021-01-01'
union all
select 1010199,'2021-04-23'
union all
select 1010199,'2021-09-10'
union all
select 1010199,'2021-09-11'
union all
select 1010199,'2021-09-12'
union all
select 1010155,'2021-08-02'
union all
select 1010155,'2021-08-04'
union all
select 1010155,'2021-08-05'
union all
select 1010155,'2021-08-07';
思路
判断一个用户是否连续3天登陆的关键点是如何判断连续
- 通过用户ID分组升序排序得出排序号rk
- 然后用登陆日期减去这个排序号得出一个日期dt
- 如果是连续登陆,那么这个dt是相同的
实现
select Userid,count(1) as cntfrom (select Userid,login_date,row_number()over(partition by Userid order by login_date) as rk from user_login_details_last_3_day) tgroup by Userid,date_sub(login_date,t.rk)
having count(1) >=3
结果如下:
发布评论