018.hive-hive两个表leftjoin,由于关联字段类型不同导致的数据错误(bigint、string)

018.hive-hive两个表leftjoin,由于关联字段类型不同导致的数据错误(bigint、string)

复制代码
select 
b1.member_id b1,b2.member_id b2
from tmp1_ads_cstm_member_balance b1
left join tmp2_ads_cstm_member_balance b2 on b1.member_id = b2.member_id
where b1.member_id = '1257829907772824682'
1257829907772824682  1257829907772824682
1257829907772824682  1257829907772824680

select 
b1.member_id b1,b2.member_id b2
from tmp1_ads_cstm_member_balance b1
left join tmp2_ads_cstm_member_balance b2 on cast(b1.member_id as string) = b2.member_id
where b1.member_id = '1257829907772824682'



hive> show create table tmp1_ads_cstm_member_balance;
OK
CREATE TABLE `tmp1_ads_cstm_member_balance`(
  `member_id` bigint, 
  `member_card` string, 
  `member_phone` string, 
  `member_name` string, 
  `register_time` timestamp, 
  `shop_name` string, 
  `shop_code` string, 
  `shop_type` int, 
  `organization_id` bigint, 
  `organization_name` string, 
  `activate_time` timestamp, 
  `tenant_id` bigint)
ROW FORMAT SERDE 
  'org.apache.hadoop.hive.ql.io.orc.OrcSerde' 
STORED AS INPUTFORMAT 
  'org.apache.hadoop.hive.ql.io.orc.OrcInputFormat' 
OUTPUTFORMAT 
  'org.apache.hadoop.hive.ql.io.orc.OrcOutputFormat'
LOCATION
  'hdfs://bi-stage/apps/hive/warehouse/saas.db/tmp1_ads_cstm_member_balance'
TBLPROPERTIES (
  'COLUMN_STATS_ACCURATE'='{\"BASIC_STATS\":\"true\"}', 
  'numFiles'='10', 
  'numRows'='1491161', 
  'rawDataSize'='582180585', 
  'totalSize'='31368949', 
  'transient_lastDdlTime'='1634792580')
  

hive> show create table tmp2_ads_cstm_member_balance;
OK
CREATE TABLE `tmp2_ads_cstm_member_balance`(
  `member_id` string COMMENT '????', 
  `balance` float COMMENT '????????', 
  `actual_balance` float COMMENT '??????', 
  `actual_recharge_amount` float COMMENT '????', 
  `actual_consume_amount` float COMMENT '??????', 
  `total_consume_amount` float COMMENT '???????', 
  `consume_total_count` int COMMENT '??????', 
  `give_total_amount` float COMMENT '????(??????????)', 
  `give_total_consume_amount` float COMMENT '??????(??????????)', 
  `total_amount` float COMMENT '?????', 
  `activity_give_total_amount` float COMMENT '??????(????/????)', 
  `event_give_total_amount` float COMMENT '??????(??????-????-????)', 
  `give_expired_amount` float COMMENT '??????', 
  `give_balance` float COMMENT '??????(??????????)', 
  `consume_total_pay_amount` double COMMENT '?????????)')
COMMENT '???????-??'
ROW FORMAT DELIMITED 
  FIELDS TERMINATED BY '\u0001' 
STORED AS INPUTFORMAT 
  'org.apache.hadoop.hive.ql.io.orc.OrcInputFormat' 
OUTPUTFORMAT 
  'org.apache.hadoop.hive.ql.io.orc.OrcOutputFormat'
LOCATION
  'hdfs://bi-stage/apps/hive/warehouse/saas.db/tmp2_ads_cstm_member_balance'
TBLPROPERTIES (
  'COLUMN_STATS_ACCURATE'='{\"BASIC_STATS\":\"true\"}', 
  'numFiles'='58', 
  'numRows'='23367', 
  'rawDataSize'='2696828', 
  'totalSize'='523061', 
  'transient_lastDdlTime'='1634792987')
复制代码

 

 

posted @   star521  阅读(1076)  评论(0编辑  收藏  举报
编辑推荐:
· Linux系列:如何用heaptrack跟踪.NET程序的非托管内存泄露
· 开发者必知的日志记录最佳实践
· SQL Server 2025 AI相关能力初探
· Linux系列:如何用 C#调用 C方法造成内存泄露
· AI与.NET技术实操系列(二):开始使用ML.NET
阅读排行:
· 无需6万激活码!GitHub神秘组织3小时极速复刻Manus,手把手教你使用OpenManus搭建本
· C#/.NET/.NET Core优秀项目和框架2025年2月简报
· 一文读懂知识蒸馏
· Manus爆火,是硬核还是营销?
· 终于写完轮子一部分:tcp代理 了,记录一下
点击右上角即可分享
微信分享提示