[TPC-DS] Trino + S3 + Hive + PostgreSQL performance testing: generating 500 GB of data (Part 6)
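[Notes]
- Before testing with a different data set, clear the old data left over from the previous test.
- Connect to PostgreSQL and clear the data in every table.
- Connect to Hive and clear the data in every table.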
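The cleanup could be scripted by generating one TRUNCATE statement per table. This is only a minimal sketch: the table list is an assumption (the ten tables that are uploaded in chunks later in this post), and `gen_truncate_sql` is a hypothetical helper name, not something from the original setup.

```shell
#!/bin/sh
# Assumed table list: the ten chunked tables named later in this post.
# Extend it with whatever other tables the test schema contains.
TABLES="catalog_returns catalog_sales customer_address customer customer_demographics inventory store_returns store_sales web_returns web_sales"

# Print one TRUNCATE statement for each table name passed as an argument.
gen_truncate_sql() {
    for t in "$@"; do
        printf 'TRUNCATE TABLE %s;\n' "$t"
    done
}

# Print the statements for review; nothing is executed here.
gen_truncate_sql $TABLES
```

The output can be saved to a file, reviewed, and then run with a SQL client on the PostgreSQL side. On the Hive side, TRUNCATE TABLE may be rejected for external tables, in which case the files under the table's S3 location would need to be removed instead.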
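[Summary]
- Generating data: use multiple disks together to generate the 500 GB of data, and run the generation in the background so that it is not interrupted.
- Writing data to S3: for each disk, edit the mys3 script so that customer_4_4.dat, customer_3_4.dat, customer_2_4.dat, and customer_1_4.dat are all written to the customer directory on S3.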
# Path where the generated data is stored (choose your own). Avoid the system disk; a data disk is fine.
/data1/trino/TpcdsData
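Step 1: Generate the data
# Generate the data in the background
Note: four servers were used, and the corresponding command is run from the same path on each server.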
#125
cd /root/trino/tpcds-kit/tools
nohup ./dsdgen -SCALE 500GB -DIR /data1/trino/TpcdsData -parallel 4 -child 4 &
#124
cd /root/trino/tpcds-kit/tools
nohup ./dsdgen -SCALE 500GB -DIR /data1/trino/TpcdsData -parallel 4 -child 3 &
#123
cd /root/trino/tpcds-kit/tools
nohup ./dsdgen -SCALE 500GB -DIR /data1/trino/TpcdsData -parallel 4 -child 2 &
#122
cd /root/trino/tpcds-kit/tools
nohup ./dsdgen -SCALE 500GB -DIR /data1/trino/TpcdsData -parallel 4 -child 1 &
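The four commands differ only in the -child value, so they can be produced from one template. The sketch below only prints the command lines and does not start dsdgen; `dsdgen_cmd` is a hypothetical helper name, and the values are copied from the commands above.

```shell
#!/bin/sh
# Values copied from the commands above.
SCALE=500GB
OUT_DIR=/data1/trino/TpcdsData
PARALLEL=4

# Print the dsdgen command line for one chunk (child) number.
dsdgen_cmd() {
    printf 'nohup ./dsdgen -SCALE %s -DIR %s -parallel %s -child %s &\n' \
        "$SCALE" "$OUT_DIR" "$PARALLEL" "$1"
}

# Print the command for every chunk, 1 through PARALLEL.
i=1
while [ "$i" -le "$PARALLEL" ]; do
    dsdgen_cmd "$i"
    i=$((i + 1))
done
```

Each server then runs only the line for its own chunk number from /root/trino/tpcds-kit/tools.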
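# Create a symbolic link on every server (Note: I like to keep the data in one place, so I added a symlink; this step is optional and depends on your needs)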
ln -sv /data1/trino/TpcdsData /root/trino/tpcds-kit
Check whether the data was generated successfully. #125 is used as the example; the check is the same on the other servers.
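One way to make this check repeatable is to look for the chunk file of each table that is uploaded in Step 2. This is a sketch under assumptions: `missing_chunk_files` is a hypothetical helper, the table list is the ten tables used in the upload commands, and an empty file is treated as a failure, which may not hold for every table at smaller scale factors.

```shell
#!/bin/sh
# The ten tables that Step 2 uploads as per-chunk files.
CHUNKED_TABLES="catalog_returns catalog_sales customer_address customer customer_demographics inventory store_returns store_sales web_returns web_sales"

# Print the name of every expected chunk file that is missing or empty.
# Arguments: data directory, child number, total number of chunks.
missing_chunk_files() {
    dir=$1
    child=$2
    parallel=$3
    for t in $CHUNKED_TABLES; do
        f="${t}_${child}_${parallel}.dat"
        # -s is true only when the file exists and has a size greater than zero.
        [ -s "$dir/$f" ] || echo "$f"
    done
}
```

On #125 the call would be `missing_chunk_files /data1/trino/TpcdsData 4 4`; no output means all ten files are present and non-empty.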

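Step 2: Write the data (to S3)
Note: change the file name in each command to match the data files on that disk, for example catalog_returns_4_4; everything else stays the same.
# 125: the contents of mys3 after editing are as follows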
aws --endpoint-url http://obs.cn-southwest-2.myhuaweicloud.com s3 cp /root/trino/tpcds-kit/TpcdsData/catalog_returns_4_4.dat s3://fastdata-tsdb/deepdevhwc/syw_postgreSQL_TPCds/catalog_returns/
aws --endpoint-url http://obs.cn-southwest-2.myhuaweicloud.com s3 cp /root/trino/tpcds-kit/TpcdsData/catalog_sales_4_4.dat s3://fastdata-tsdb/deepdevhwc/syw_postgreSQL_TPCds/catalog_sales/
aws --endpoint-url http://obs.cn-southwest-2.myhuaweicloud.com s3 cp /root/trino/tpcds-kit/TpcdsData/customer_address_4_4.dat s3://fastdata-tsdb/deepdevhwc/syw_postgreSQL_TPCds/customer_address/
aws --endpoint-url http://obs.cn-southwest-2.myhuaweicloud.com s3 cp /root/trino/tpcds-kit/TpcdsData/customer_4_4.dat s3://fastdata-tsdb/deepdevhwc/syw_postgreSQL_TPCds/customer/
aws --endpoint-url http://obs.cn-southwest-2.myhuaweicloud.com s3 cp /root/trino/tpcds-kit/TpcdsData/customer_demographics_4_4.dat s3://fastdata-tsdb/deepdevhwc/syw_postgreSQL_TPCds/customer_demographics/
aws --endpoint-url http://obs.cn-southwest-2.myhuaweicloud.com s3 cp /root/trino/tpcds-kit/TpcdsData/inventory_4_4.dat s3://fastdata-tsdb/deepdevhwc/syw_postgreSQL_TPCds/inventory/
aws --endpoint-url http://obs.cn-southwest-2.myhuaweicloud.com s3 cp /root/trino/tpcds-kit/TpcdsData/store_returns_4_4.dat s3://fastdata-tsdb/deepdevhwc/syw_postgreSQL_TPCds/store_returns/
aws --endpoint-url http://obs.cn-southwest-2.myhuaweicloud.com s3 cp /root/trino/tpcds-kit/TpcdsData/store_sales_4_4.dat s3://fastdata-tsdb/deepdevhwc/syw_postgreSQL_TPCds/store_sales/
aws --endpoint-url http://obs.cn-southwest-2.myhuaweicloud.com s3 cp /root/trino/tpcds-kit/TpcdsData/web_returns_4_4.dat s3://fastdata-tsdb/deepdevhwc/syw_postgreSQL_TPCds/web_returns/
aws --endpoint-url http://obs.cn-southwest-2.myhuaweicloud.com s3 cp /root/trino/tpcds-kit/TpcdsData/web_sales_4_4.dat s3://fastdata-tsdb/deepdevhwc/syw_postgreSQL_TPCds/web_sales/
#124
aws --endpoint-url http://obs.cn-southwest-2.myhuaweicloud.com s3 cp /root/trino/tpcds-kit/TpcdsData/catalog_returns_3_4.dat s3://fastdata-tsdb/deepdevhwc/syw_postgreSQL_TPCds/catalog_returns/
aws --endpoint-url http://obs.cn-southwest-2.myhuaweicloud.com s3 cp /root/trino/tpcds-kit/TpcdsData/catalog_sales_3_4.dat s3://fastdata-tsdb/deepdevhwc/syw_postgreSQL_TPCds/catalog_sales/
aws --endpoint-url http://obs.cn-southwest-2.myhuaweicloud.com s3 cp /root/trino/tpcds-kit/TpcdsData/customer_address_3_4.dat s3://fastdata-tsdb/deepdevhwc/syw_postgreSQL_TPCds/customer_address/
aws --endpoint-url http://obs.cn-southwest-2.myhuaweicloud.com s3 cp /root/trino/tpcds-kit/TpcdsData/customer_3_4.dat s3://fastdata-tsdb/deepdevhwc/syw_postgreSQL_TPCds/customer/
aws --endpoint-url http://obs.cn-southwest-2.myhuaweicloud.com s3 cp /root/trino/tpcds-kit/TpcdsData/customer_demographics_3_4.dat s3://fastdata-tsdb/deepdevhwc/syw_postgreSQL_TPCds/customer_demographics/
aws --endpoint-url http://obs.cn-southwest-2.myhuaweicloud.com s3 cp /root/trino/tpcds-kit/TpcdsData/inventory_3_4.dat s3://fastdata-tsdb/deepdevhwc/syw_postgreSQL_TPCds/inventory/
aws --endpoint-url http://obs.cn-southwest-2.myhuaweicloud.com s3 cp /root/trino/tpcds-kit/TpcdsData/store_returns_3_4.dat s3://fastdata-tsdb/deepdevhwc/syw_postgreSQL_TPCds/store_returns/
aws --endpoint-url http://obs.cn-southwest-2.myhuaweicloud.com s3 cp /root/trino/tpcds-kit/TpcdsData/store_sales_3_4.dat s3://fastdata-tsdb/deepdevhwc/syw_postgreSQL_TPCds/store_sales/
aws --endpoint-url http://obs.cn-southwest-2.myhuaweicloud.com s3 cp /root/trino/tpcds-kit/TpcdsData/web_returns_3_4.dat s3://fastdata-tsdb/deepdevhwc/syw_postgreSQL_TPCds/web_returns/
aws --endpoint-url http://obs.cn-southwest-2.myhuaweicloud.com s3 cp /root/trino/tpcds-kit/TpcdsData/web_sales_3_4.dat s3://fastdata-tsdb/deepdevhwc/syw_postgreSQL_TPCds/web_sales/
#123
aws --endpoint-url http://obs.cn-southwest-2.myhuaweicloud.com s3 cp /root/trino/tpcds-kit/TpcdsData/catalog_returns_2_4.dat s3://fastdata-tsdb/deepdevhwc/syw_postgreSQL_TPCds/catalog_returns/
aws --endpoint-url http://obs.cn-southwest-2.myhuaweicloud.com s3 cp /root/trino/tpcds-kit/TpcdsData/catalog_sales_2_4.dat s3://fastdata-tsdb/deepdevhwc/syw_postgreSQL_TPCds/catalog_sales/
aws --endpoint-url http://obs.cn-southwest-2.myhuaweicloud.com s3 cp /root/trino/tpcds-kit/TpcdsData/customer_address_2_4.dat s3://fastdata-tsdb/deepdevhwc/syw_postgreSQL_TPCds/customer_address/
aws --endpoint-url http://obs.cn-southwest-2.myhuaweicloud.com s3 cp /root/trino/tpcds-kit/TpcdsData/customer_2_4.dat s3://fastdata-tsdb/deepdevhwc/syw_postgreSQL_TPCds/customer/
aws --endpoint-url http://obs.cn-southwest-2.myhuaweicloud.com s3 cp /root/trino/tpcds-kit/TpcdsData/customer_demographics_2_4.dat s3://fastdata-tsdb/deepdevhwc/syw_postgreSQL_TPCds/customer_demographics/
aws --endpoint-url http://obs.cn-southwest-2.myhuaweicloud.com s3 cp /root/trino/tpcds-kit/TpcdsData/inventory_2_4.dat s3://fastdata-tsdb/deepdevhwc/syw_postgreSQL_TPCds/inventory/
aws --endpoint-url http://obs.cn-southwest-2.myhuaweicloud.com s3 cp /root/trino/tpcds-kit/TpcdsData/store_returns_2_4.dat s3://fastdata-tsdb/deepdevhwc/syw_postgreSQL_TPCds/store_returns/
aws --endpoint-url http://obs.cn-southwest-2.myhuaweicloud.com s3 cp /root/trino/tpcds-kit/TpcdsData/store_sales_2_4.dat s3://fastdata-tsdb/deepdevhwc/syw_postgreSQL_TPCds/store_sales/
aws --endpoint-url http://obs.cn-southwest-2.myhuaweicloud.com s3 cp /root/trino/tpcds-kit/TpcdsData/web_returns_2_4.dat s3://fastdata-tsdb/deepdevhwc/syw_postgreSQL_TPCds/web_returns/
aws --endpoint-url http://obs.cn-southwest-2.myhuaweicloud.com s3 cp /root/trino/tpcds-kit/TpcdsData/web_sales_2_4.dat s3://fastdata-tsdb/deepdevhwc/syw_postgreSQL_TPCds/web_sales/
#122
aws --endpoint-url http://obs.cn-southwest-2.myhuaweicloud.com s3 cp /root/trino/tpcds-kit/TpcdsData/catalog_returns_1_4.dat s3://fastdata-tsdb/deepdevhwc/syw_postgreSQL_TPCds/catalog_returns/
aws --endpoint-url http://obs.cn-southwest-2.myhuaweicloud.com s3 cp /root/trino/tpcds-kit/TpcdsData/catalog_sales_1_4.dat s3://fastdata-tsdb/deepdevhwc/syw_postgreSQL_TPCds/catalog_sales/
aws --endpoint-url http://obs.cn-southwest-2.myhuaweicloud.com s3 cp /root/trino/tpcds-kit/TpcdsData/customer_address_1_4.dat s3://fastdata-tsdb/deepdevhwc/syw_postgreSQL_TPCds/customer_address/
aws --endpoint-url http://obs.cn-southwest-2.myhuaweicloud.com s3 cp /root/trino/tpcds-kit/TpcdsData/customer_1_4.dat s3://fastdata-tsdb/deepdevhwc/syw_postgreSQL_TPCds/customer/
aws --endpoint-url http://obs.cn-southwest-2.myhuaweicloud.com s3 cp /root/trino/tpcds-kit/TpcdsData/customer_demographics_1_4.dat s3://fastdata-tsdb/deepdevhwc/syw_postgreSQL_TPCds/customer_demographics/
aws --endpoint-url http://obs.cn-southwest-2.myhuaweicloud.com s3 cp /root/trino/tpcds-kit/TpcdsData/inventory_1_4.dat s3://fastdata-tsdb/deepdevhwc/syw_postgreSQL_TPCds/inventory/
aws --endpoint-url http://obs.cn-southwest-2.myhuaweicloud.com s3 cp /root/trino/tpcds-kit/TpcdsData/store_returns_1_4.dat s3://fastdata-tsdb/deepdevhwc/syw_postgreSQL_TPCds/store_returns/
aws --endpoint-url http://obs.cn-southwest-2.myhuaweicloud.com s3 cp /root/trino/tpcds-kit/TpcdsData/store_sales_1_4.dat s3://fastdata-tsdb/deepdevhwc/syw_postgreSQL_TPCds/store_sales/
aws --endpoint-url http://obs.cn-southwest-2.myhuaweicloud.com s3 cp /root/trino/tpcds-kit/TpcdsData/web_returns_1_4.dat s3://fastdata-tsdb/deepdevhwc/syw_postgreSQL_TPCds/web_returns/
aws --endpoint-url http://obs.cn-southwest-2.myhuaweicloud.com s3 cp /root/trino/tpcds-kit/TpcdsData/web_sales_1_4.dat s3://fastdata-tsdb/deepdevhwc/syw_postgreSQL_TPCds/web_sales/
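Since the four scripts differ only in the chunk number, the upload commands could also be generated instead of edited by hand. The following is a sketch, not the author's mys3 script: `gen_upload_cmds` is a hypothetical helper, and it only prints the `aws s3 cp` lines without running them. The endpoint, local directory, and bucket prefix are copied from the commands above.

```shell
#!/bin/sh
# Endpoint, local directory, and bucket prefix copied from the commands above.
ENDPOINT=http://obs.cn-southwest-2.myhuaweicloud.com
SRC_DIR=/root/trino/tpcds-kit/TpcdsData
DEST=s3://fastdata-tsdb/deepdevhwc/syw_postgreSQL_TPCds
UPLOAD_TABLES="catalog_returns catalog_sales customer_address customer customer_demographics inventory store_returns store_sales web_returns web_sales"

# Print the upload commands for one chunk.
# Arguments: child number, total number of chunks.
gen_upload_cmds() {
    for t in $UPLOAD_TABLES; do
        # The file name and the destination directory both come from the
        # same table name, so the two cannot drift apart.
        printf 'aws --endpoint-url %s s3 cp %s/%s_%s_%s.dat %s/%s/\n' \
            "$ENDPOINT" "$SRC_DIR" "$t" "$1" "$2" "$DEST" "$t"
    done
}
```

On #125, for example, `gen_upload_cmds 4 4 > mys3` would write the ten commands to the script, which can be reviewed before it is run.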
# Check that the data uploaded from 125 is present on S3
aws --endpoint-url http://obs.cn-southwest-2.myhuaweicloud.com s3 ls s3://fastdata-tsdb/deepdevhwc/syw_postgreSQL_TPCds/catalog_returns/
aws --endpoint-url http://obs.cn-southwest-2.myhuaweicloud.com s3 ls s3://fastdata-tsdb/deepdevhwc/syw_postgreSQL_TPCds/catalog_sales/
aws --endpoint-url http://obs.cn-southwest-2.myhuaweicloud.com s3 ls s3://fastdata-tsdb/deepdevhwc/syw_postgreSQL_TPCds/customer_address/
aws --endpoint-url http://obs.cn-southwest-2.myhuaweicloud.com s3 ls s3://fastdata-tsdb/deepdevhwc/syw_postgreSQL_TPCds/customer/
aws --endpoint-url http://obs.cn-southwest-2.myhuaweicloud.com s3 ls s3://fastdata-tsdb/deepdevhwc/syw_postgreSQL_TPCds/customer_demographics/
aws --endpoint-url http://obs.cn-southwest-2.myhuaweicloud.com s3 ls s3://fastdata-tsdb/deepdevhwc/syw_postgreSQL_TPCds/inventory/
aws --endpoint-url http://obs.cn-southwest-2.myhuaweicloud.com s3 ls s3://fastdata-tsdb/deepdevhwc/syw_postgreSQL_TPCds/store_returns/
aws --endpoint-url http://obs.cn-southwest-2.myhuaweicloud.com s3 ls s3://fastdata-tsdb/deepdevhwc/syw_postgreSQL_TPCds/store_sales/
aws --endpoint-url http://obs.cn-southwest-2.myhuaweicloud.com s3 ls s3://fastdata-tsdb/deepdevhwc/syw_postgreSQL_TPCds/web_returns/
aws --endpoint-url http://obs.cn-southwest-2.myhuaweicloud.com s3 ls s3://fastdata-tsdb/deepdevhwc/syw_postgreSQL_TPCds/web_sales/
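Once all four servers have finished uploading, each table directory should contain four chunk files. A small filter over the listing can report any that are absent. This is a sketch under assumptions: `missing_in_listing` is a hypothetical helper, and it assumes the object name is the last whitespace-separated field of each `aws s3 ls` output line.

```shell
#!/bin/sh
# Read an `aws s3 ls` listing on stdin and print each chunk file of the
# given table that does not appear in it.
# Arguments: table name, total number of chunks.
missing_in_listing() {
    table=$1
    total=$2
    listing=$(cat)
    i=1
    while [ "$i" -le "$total" ]; do
        f="${table}_${i}_${total}.dat"
        # Compare the last field of each line with the expected name,
        # as a fixed string matching the whole field.
        printf '%s\n' "$listing" | awk '{print $NF}' | grep -qxF "$f" || echo "$f"
        i=$((i + 1))
    done
}
```

For the customer table, the listing command above would be piped into `missing_in_listing customer 4`; no output means all four chunks are present.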
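1. Author: Syw
2. Source: http://www.cnblogs.com/syw20170419/
3. Copyright of this article is shared by the author and cnblogs (博客园). Reposting is welcome, but unless the author agrees otherwise, this notice must be retained and a link to the original article must be placed prominently on the page; otherwise the right to pursue legal liability is reserved.
4. If you find any errors in this article, please point them out so that others are not misled.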