big data env setup
install Hadoop
CentOS:
https://www.vultr.com/docs/how-to-install-hadoop-in-stand-alone-mode-on-centos-7
https://www.linode.com/docs/databases/hadoop/how-to-install-and-set-up-hadoop-cluster/
http://www.michael-noll.com/tutorials/running-hadoop-on-ubuntu-linux-multi-node-cluster/
check nodes status
$HADOOP_HOME/bin/hadoop dfsadmin -report
https://ambari.apache.org/1.2.3/installing-hadoop-using-ambari/content/reference_chap2_1.html
Also can access name node web page with port 50070
install Spark
on CentOS:
https://aodba.com/how-to-install-apache-spark-in-centos-standalone/
https://bigdata-etl.com/how-to-install-apache-spark-standalone-in-centos/
https://www.tutorialspoint.com/apache_spark/apache_spark_installation.htm
https://gist.github.com/darcyliu/d47edccb923b0f03280a4cf8b66227c1
on Ubuntu:
install JAVA
install Scala
install Spark
转载请注明出处 http://www.cnblogs.com/mashuai-191/