07 2020 档案
摘要:出错如下 9-07-2020 17:22:44 CST dataset_task INFO - 20/07/29 17:22:44 INFO dataaccumulation.CalculateDataSet$: sql == select cardinality(merge(cast(hll AS
阅读全文
摘要:解决办法:使用* 验证: 在hdfs某个路径下新建两个文件夹,然后分别上传一个文件到新建好的子文件夹中 [user1@hdp ~]$ hdfs dfs -mkdir /tmp/test [user1@hdp ~]$ hdfs dfs -mkdir /tmp/test/data1 [user1@hdp
阅读全文
摘要:0xEF,0xBB,0xBF
阅读全文
摘要:先来看一下报错内容 20/07/17 10:20:07 INFO yarn.YarnAllocator: Will request 1 executor container(s), each with 1 core(s) and 1408 MB memory (including 384 MB of
阅读全文
摘要:space.db 20/07/16 17:24:59 INFO client.HiveClientImpl: Warehouse location for Hive client (version 1.2.2) is file:/data/shiseido/spark-warehouse 20/07
阅读全文
摘要:./bin/spark-submit --class org.apache.spark.examples.SparkPi --master yarn-client ./examples/jars/spark-examples_2.11-2.4.3.jar 1000
阅读全文
摘要:来源:https://azkaban.readthedocs.io/en/latest/createFlows.html Creating Flows This section covers how to create your Azkaban flows using Azkaban Flow 2.
阅读全文
摘要:mport org.apache.spark.sql.types.{StructType,StructField,StringType, IntegerType, LongType} import java.util.ArrayList import org.apache.spark.sql._ v
阅读全文