【SQL进阶】【CASE/IF、COUNT/SUM、多条记录拼接为一个内容】Day03:聚合分组查询
〇、今日内容概述
一、聚合函数
1、SQL类别高难度试卷得分的截断平均值【去最高最低分求平均】

自己的想法
SELECT tag, difficulty, ROUND((SUM(score)-MIN(score)-MAX(score))/(COUNT(score)-2),1) AS clip_avg_score FROM examination_info,exam_record WHERE examination_info.exam_id=exam_record.exam_id AND tag='hard' AND tag='SQL'报错:
Execution Error SQL_ERROR_INFO: "In aggregated query without GROUP BY, expression #2 of SELECT list contains nonaggregated column 'examination_info.difficulty'; this is incompatible with sql_mode=only_full_group_by"正确做法
SELECT tag, difficulty, ROUND((SUM(score)-MIN(score)-MAX(score))/(COUNT(score)-2),1) AS clip_avg_score FROM examination_info JOIN exam_record USING(exam_id) WHERE tag='SQL' AND difficulty='hard'
或
SELECT tag, difficulty, ROUND((SUM(score)-MIN(score)-MAX(score))/(COUNT(score)-2),1) AS clip_avg_score FROM examination_info,exam_record WHERE examination_info.exam_id=exam_record.exam_id AND difficulty='hard' AND tag='SQL'
2、统计作答次数

- 自己的想法
SELECT COUNT(*) AS total_pv, SUM((CASE WHEN score IS NULL AND submit_time IS NULL THEN 0 ELSE 1 )) AS complete_pv, COUNT(DISTINCT exam_id) AS complete_exam_cnt FROM exam_record - 报错
Execution Error SQL_ERROR_INFO: "You have an error in your SQL syntax; check the manual that corresponds to your MySQL server version for the right syntax to use near ')) AS complete_pv,\n COUNT(DISTINCT exam_id) AS complete_exam_cnt\nFROM exam_re' at line 7" - 正确做法
SELECT COUNT(*) AS total_pv, SUM((CASE WHEN score IS NULL AND submit_time IS NULL THEN 0 ELSE 1 END )) AS complete_pv, COUNT(DISTINCT exam_id AND score IS NOT NULL AND submit_time IS NOT NULL) AS complete_exam_cnt FROM exam_record - 原因:CASE ... WHEN ... THEN ... END
- 方法2:使用IF
SELECT COUNT(*) AS total_pv, SUM((CASE WHEN score IS NULL AND submit_time IS NULL THEN 0 ELSE 1 END )) AS complete_pv, COUNT(DISTINCT IF(score IS NOT NULL, exam_id, NULL)) AS complete_exam_cnt FROM exam_record
3、得分不小于平均分的最低分
- 分组的可以在分组内使用join


- 自己的思路
点击查看代码
SELECT
MIN(score) AS min_score_over_avg
FROM exam_record A
JOIN examination_info B
JOIN (SELECT exam_id,AVG(score) AS ex_score
FROM exam_record
GROUP BY exam_id) AVG_E
USING exam_id
WHERE
score<ex_score
AND
tag='SQL'
* 正确答案
点击查看代码
SELECT
MIN(score) AS min_score_over_avg
FROM exam_record er
JOIN examination_info ei
ON er.exam_id=ei.exam_id
WHERE
tag='SQL'
AND score>=
(SELECT AVG(score)
FROM exam_record er
WHERE
tag='SQL'
AND
er.exam_id=ei.exam_id
GROUP BY er.exam_id)
* 方法2:使用over函数☆
# 二、分组函数
## 1、平均**活跃天数**和月活人数


- 自己写的
点击查看代码
SELECT
DATE_FORMAT(submit_time,"%Y%m") AS month,
ROUND(SUM(IF(submit_time IS NOT NULL,1,0))/COUNT(DISTINCT uid),2) AS avg_active_days,
COUNT(DISTINCT uid) AS mau
FROM exam_record
WHERE
submit_time IS NOT NULL
AND
YEAR(submit_time)='2021'
GROUP BY month
* 陷阱在于九月份有个用户同一天做了两种卷子,直接count统计的话活跃天数会多一天,即用户ID和做题日期submit_time要同时去重才能得出正确的活跃天数.
* 正确答案
点击查看代码
SELECT
DATE_FORMAT(submit_time,"%Y%m") AS month,
ROUND(COUNT(DISTINCT uid,DATE_FORMAT(submit_time,"%Y%m%d"))/COUNT(DISTINCT uid),2) AS avg_active_days,
COUNT(DISTINCT uid) AS mau
FROM exam_record
WHERE
submit_time IS NOT NULL
AND
YEAR(submit_time)='2021'
GROUP BY month
## 2、月总刷题数和日均刷题数【拼接未知数据使用UNION】

- 自己写的【错误】:
点击查看代码
SELECT
DATE_FORMAT(submit_time,"%Y%m") AS submit_month,
COUNT(submit_time) AS month_q_cnt,
ROUND(COUNT(submit_time)/(
CASE
WHEN MONTH(submit_time)=1 THEN 31
WHEN MONTH(submit_time)=2 THEN 28
WHEN MONTH(submit_time)=3 THEN 31
WHEN MONTH(submit_time)=4 THEN 30
WHEN MONTH(submit_time)=5 THEN 31
WHEN MONTH(submit_time)=6 THEN 30
WHEN MONTH(submit_time)=7 THEN 31
WHEN MONTH(submit_time)=8 THEN 31
WHEN MONTH(submit_time)=9 THEN 30
WHEN MONTH(submit_time)=10 THEN 31
WHEN MONTH(submit_time)=11 THEN 30
WHEN MONTH(submit_time)=12 THEN 31
END
),3) AS avg_day_q_cnt
FROM practice_record
WHERE
submit_time IS NOT NULL
AND
YEAR(submit_time)=2021
GROUP BY submit_month
ORDER BY submit_month ASC
- 正确答案
点击查看代码
SELECT
DATE_FORMAT(submit_time,"%Y%m") submit_month,
COUNT(submit_time) month_q_cnt,
ROUND(COUNT(submit_time)/MAX(DAY(LAST_DAY(submit_time))),3) avg_day_q_cnt
-- 使用max实现去重
FROM practice_record
WHERE YEAR(submit_time)=2021
GROUP BY submit_month
UNION ALL
SELECT
"2021汇总" submit_month,
COUNT(submit_time) month_q_cnt,
ROUND(COUNT(submit_time)/31,3) avg_day_q_cnt
FROM practice_record
WHERE YEAR(submit_time)=2021
ORDER BY submit_month ASC
3、未完成试卷数大于1的有效用户


点击查看代码
SELECT
uid,
SUM(IF(er.submit_time IS NULL,1,0)) AS incomplete_cnt,
-- COUNT(CASE WHEN er.submit_time IS NULL THEN er.start_time ELSE NULL END) AS incomplete_cnt,
SUM(IF(er.submit_time IS NOT NULL,1,0)) AS complete_cnt,
**GROUP_CONCAT(DISTINCT CONCAT_WS(':',DATE_FORMAT(er.start_time,"%Y-%m-%d"),ei.tag) SEPARATOR ';') **AS detail
FROM exam_record er
LEFT JOIN examination_info ei
ON er.exam_id=ei.exam_id
WHERE YEAR(er.start_time)=2021
GROUP BY er.uid
HAVING
complete_cnt>=1
AND
incomplete_cnt<5
AND
incomplete_cnt>1
ORDER BY incomplete_cnt DESC
【SQL进阶】【CASE/IF、COUNT/SUM、多条记录拼接为一个内容】Day03:聚合分组查询的更多相关文章
- LINQ to SQL 语句(3) 之 Count/Sum/Min/Max/Avg
LINQ to SQL 语句(3) 之 Count/Sum/Min/Max/Avg [1] Count/Sum 讲解 [2] Min 讲解 [3] Max 讲解 [4] Average 和 Agg ...
- oracle通过sql随机取表中的10条记录
oracle通过sql随机取表中的10条记录: SELECT * FROM (SELECT * FROM T_USER ORDER BY DBMS_RANDOM.RANDOM()) WHERE Row ...
- LINQ to SQL语句(3)之Count/Sum/Min/Max/Avg
适用场景:统计数据吧,比如统计一些数据的个数,求和,最小值,最大值,平均数. Count 说明:返回集合中的元素个数,返回INT类型:不延迟.生成SQL语句为:SELECT COUNT(*) FROM ...
- SQL 父子表,显示表中每条记录所在层级
1.sqlserer 中有一张父子关系表,表结构如下: CREATE TABLE [dbo].[testparent]( [ID] [int] IDENTITY(1,1) NOT NULL, [nam ...
- MYSQL实现列拼接,即同一个字段,多条记录拼接成一条
一.首先,新建三张表 DROP TABLE IF EXISTS `article`; CREATE TABLE `article` ( `id` ) unsigned NOT NULL AUTO_IN ...
- 每天努力一点之SQL(二) count sum case when then group by
1. select sum(CASE WHEN A.[STATUS]=0 THEN 1 ELSE 0 end) as a1, sum(CASE A.[STATUS] WHEN 1 THEN 1 EL ...
- SQL —— 获取重复某个字段的第一条记录
----------用来双重排序,且获取唯一 go SELECT ROW_NUMBER() OVER (ORDER BY AScore DESC,ATime ASC) AS Rank, * FROM ...
- 一条sql获取每个类别最新的一条记录
1.初始化数据 create table Products ( id ,), name ), categroy int, addtime datetime , ) insert into Produc ...
- SQL学习笔记:选取第N条记录
Northwind数据库,选取价格第二高的产品. 有两种方法,一个是用Row_Number()函数: SELECT productname FROM ( productname, Row_Number ...
- sql 更新重复数据只取一条记录
select s.* from ( select *, row_number() over (partition by PersonnelAccount order BY Personnel ...
随机推荐
- linux中通过date命令获取昨天或明天时间的方法
date命令可以获取当前的时间,通过man,可以看到date有很多参数可以用,很容易做到格式化 # 获取当前日期 date +"%F" 或者 date +"%Y-%m-% ...
- Ubuntu20.04和Docker环境下安装Redash中文版
创建Ubunt20.04虚拟机,请参考:https://www.linuxidc.com/Linux/2020-03/162547.htm 一.安装基础环境: # 1.更换APT国内源 sudo se ...
- WSL 2 上启用微软官方支持的 systemd
以前折腾了很久的 genie 和 distrod 来实现 wsl2 上的 systemd.现在微软和Canonical联合声明发布了官方支持的systemd,之前的折腾貌似有点浪费时间了.如果微软不发 ...
- 「Tubian」Tubian0.41!支持Windows QQ微信!
Tubian 0.42已发布:https://www.cnblogs.com/tubentubentu/p/16745926.html Sourceforge.net下载:https://source ...
- Node.js(六)连接MongoDB进行数据访问
npm init -y(初始化项目) npm install mongodb --save(引入MongoDB) const { MongoClient } = require("mongo ...
- 空 Maven项目转成 Web项目 & SpringMVC调用其他 Module中的方法可能会遇到的小问题
SpringMVC调用其他 模块内的方法的 坑 下次别在阴沟里翻船啦.. 一共花费 4个小时,解决项目中的这个问题 OMG 1. 首先是 Maven新建工程 一般使用 Maven都是先创建 空工程 当 ...
- java集合框架复习----(1)
文章目录 1 .集合框架思维导图 一.什么是集合 二.collection接口 1 .集合框架思维导图 一.什么是集合 存放在java.util.*.是一个存放对象的容器. 存放的是对象的引用,不是对 ...
- 在vue项目中禁用eslint
文章目录 1.在创建项目的时候不自动使用eslint 2.在package.json中删除所有的eslint,然后重新install 3.按照图片注释(亲测可用) 在使用eslin进行规则验证时,一点 ...
- 知识图谱-生物信息学-医学顶刊论文(Bioinformatics-2021)-KG4SL:用于人类癌症综合致死率预测的知识图神经网络
5.(2021.7.12)Bioinformatics-KG4SL:用于人类癌症综合致死率预测的知识图神经网络 论文标题:KG4SL: knowledge graph neural network f ...
- SQL 报表制作和整形
本文章主要介绍制作报表的查询,这些查询通常需要考虑与报表相关的格式设置,还需使用多级聚合. 1.将结果集转置为一行(行转列) 将多行中的值转换为单行中的列. 情景:有一个员工表,统计出一个结果集,显示 ...