Installing Jupyter on Windows and configuring standalone PySpark

I got this working by following the posts below:

Installing Jupyter:

https://blog.csdn.net/lanyuelvyun/article/details/93499423

Running PySpark:

https://www.cnblogs.com/chenxiangzhen/p/10706258.html

Integrating Scala with Jupyter:

https://blog.csdn.net/u014612752/article/details/51789233

Deploying Spark and Jupyter on Windows 10:

https://www.cnblogs.com/wubdut/p/11552059.html

https://www.cnblogs.com/xuliangxing/p/7279662.html

Switching Python versions on Linux:

https://blog.csdn.net/weixin_43645287/article/details/109776871

pyspark: TypeError: an integer is required (got type bytes):

Summary:

Install Python; install Spark; copy the pyspark folder from Spark's python folder into Lib/site-packages under the local Python directory; install Hadoop and winutils.exe; install Jupyter; install py4j.
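The steps above can be double-checked with a small script before opening Jupyter. This is only a sketch under assumptions: the function name check_pyspark_setup is made up for illustration, and the post never says which environment variables were set, so SPARK_HOME and HADOOP_HOME (with winutils.exe under HADOOP_HOME\bin) are assumed here because that is the usual convention on Windows.

```python
import importlib.util
import os
from pathlib import Path


def check_pyspark_setup(env=None, modules=("pyspark", "py4j")):
    """Return a list of problems found; an empty list means every check passed.

    Assumptions (not stated in the post): Spark is located through SPARK_HOME
    and winutils.exe lives in the bin folder under HADOOP_HOME.
    """
    env = os.environ if env is None else env
    problems = []

    # winutils.exe is conventionally placed in %HADOOP_HOME%\bin.
    hadoop_home = env.get("HADOOP_HOME")
    if not hadoop_home:
        problems.append("HADOOP_HOME is not set")
    elif not (Path(hadoop_home) / "bin" / "winutils.exe").is_file():
        problems.append("winutils.exe not found under HADOOP_HOME/bin")

    # SPARK_HOME should point at the unpacked Spark directory.
    spark_home = env.get("SPARK_HOME")
    if not spark_home:
        problems.append("SPARK_HOME is not set")
    elif not Path(spark_home).is_dir():
        problems.append("SPARK_HOME does not point to a directory")

    # The copied pyspark folder and the pip-installed py4j must be importable.
    for name in modules:
        if importlib.util.find_spec(name) is None:
            problems.append(f"Python module '{name}' is not importable")

    return problems


if __name__ == "__main__":
    found = check_pyspark_setup()
    print("setup looks fine" if not found else "\n".join(found))
```

Running it in the same Python that Jupyter uses matters, since a different interpreter may have a different site-packages folder.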

Python 3.8 does not work; uninstall it and install 3.7 instead.

https://blog.csdn.net/weixin_43645287/article/details/109776235
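A version guard at the top of a notebook can catch this before PySpark fails with the TypeError. The sketch below assumes the installed Spark is a release that predates Python 3.8 support (the post does not say which Spark version was used); python_supported and the first_bad cutoff are hypothetical names chosen for illustration.

```python
import sys


def python_supported(version_info=sys.version_info, first_bad=(3, 8)):
    """Return True if the interpreter is older than the first unsupported version.

    first_bad=(3, 8) reflects the post's finding that 3.8 failed and 3.7 worked;
    adjust it if your Spark release supports newer Python versions.
    """
    return tuple(version_info[:2]) < tuple(first_bad)


if __name__ == "__main__":
    if python_supported():
        print("Python version should work with this Spark install")
    else:
        print("Python %d.%d is too new here; try 3.7" % sys.version_info[:2])
```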

Then run: pip3 install py4j -i http://pypi.douban.com/simple --trusted-host pypi.douban.com

Done.

Next step: how to run Spark + Scala in Jupyter

https://www.jb51.net/article/184487.htm

posted @ 2020-12-11 21:50  foolangirl