muyue123

2021年6月23日

摘要： # ratefrom pyspark.sql import SparkSession spark = SparkSession.builder. \ appName("study_structured_streaming"). \ enableHiveSupport(). \ config("spa 阅读全文

posted @ 2021-06-23 20:11 muyue123 阅读(57) 评论(0) 推荐(0) 编辑

2021年4月29日

行转列例子

摘要： create table test.t_20210429 ( app String, cnt Nullable(UInt32), per Nullable(UInt32) ) ENGINE=MergeTree() order by app; insert into test.t_20210429 v 阅读全文

posted @ 2021-04-29 18:56 muyue123 阅读(63) 评论(0) 推荐(0) 编辑

2021年4月28日

分组取topN

摘要： # 分组取topn create table t_0428(id UInt32,nm String,cnt UInt32) ENGINE=MergeTree() order by id; insert into t_0428 values(1,'a',100),(1,'b',101),(1,'c', 阅读全文

posted @ 2021-04-28 14:46 muyue123 阅读(239) 评论(0) 推荐(0) 编辑

2021年4月25日

spark中替换回车换行等

摘要：当要匹配特殊的隐藏字符\n \r \t ,等回车符、制表符时，需要通过使用四个 \ 进行转译。 regexp_replace(title, '\\\\n|\\\\\t|\\\\\r', ',') title 使用char(*)也可以进行处理 spark.sql("select regexp_repl 阅读全文

posted @ 2021-04-25 16:38 muyue123 阅读(1665) 评论(0) 推荐(0) 编辑

2021年3月1日

删除文件_通配符问题

摘要： awscli 里不能直接使用“*” aws s3 rm s3://s3://log-provision/08_nhk/mesh/temp/*/*/ver3/*1这样是不行的，需要使用--recursive和--exclude、--include.在--exclude、--include里使用“*”。阅读全文

posted @ 2021-03-01 16:54 muyue123 阅读(287) 评论(0) 推荐(0) 编辑

2021年2月24日

mysql建表方式_区分大小写以及支持中文

摘要：李林 1-11 19:58:17避免mysql大小写不敏感的建表方式：李林 1-11 19:58:18create table test.tmp_app_category_20210111(app_id varchar(500),title varchar(500),category varchar 阅读全文

posted @ 2021-02-24 13:52 muyue123 阅读(310) 评论(0) 推荐(0) 编辑

2021年2月4日

hive表中有数据但count结果为0

摘要： Hive 中 A 表存在数据, 但执行 select count(*) from A 返回结果为 0 原因参数 hive.compute.query.using.stats 默认为 false, 在参数优化时修改为 true 导致上述问题产生解决使用 select count(*) / cou 阅读全文

posted @ 2021-02-04 11:47 muyue123 阅读(1582) 评论(0) 推荐(0) 编辑

2021年2月1日

yarn的log查看方式

摘要： yarn logs -applicationId application_1493700892407_0007 阅读全文

posted @ 2021-02-01 15:58 muyue123 阅读(557) 评论(0) 推荐(0) 编辑

2021年1月14日

函数

摘要： SELECT modulo(10, 3) #求余数 SELECT modulo(10, 3) 阅读全文

posted @ 2021-01-14 18:05 muyue123 阅读(39) 评论(0) 推荐(0) 编辑

2021年1月6日

sparksql中的集合类型

摘要： df = spark.createDataFrame([('LC7-H6116BCF-R-GL-201116V750Fans', '张三', 88), ('语文', '张三', 92), ('英语', '张三', 77), ('数学', '王五', 65), ('语文', '王五', 87), (' 阅读全文

posted @ 2021-01-06 20:00 muyue123 阅读(213) 评论(0) 推荐(0) 编辑

公告