You are viewing an old version of this page. View the current version.
Compare with Current
View Page History
« Previous
Version 17
Next »
data:image/s3,"s3://crabby-images/ca632/ca632df94235da7f7377d608ec98bf02bb91403d" alt=""
http://spark.apache.org
How to create an instance of Spark Cluster in the KASI Science Cloud
- Step 1 : Choose a spark cluster template
data:image/s3,"s3://crabby-images/33d60/33d601e69b9b94d35125efa22f8624b711890265" alt=""
- Step 2 : Use the default settings in most cases
data:image/s3,"s3://crabby-images/e8ccb/e8ccb6973c828e66f2632948bdce488c47ad7900" alt=""
- Step 3 : Choose a flavor and Set the number of slaves (minons), then Create
data:image/s3,"s3://crabby-images/d423c/d423cbac2ca68dc539de11e198695bef25c9acaa" alt=""
Connect to the Master-Node and Run some basic scripts
- root 으로 접속 후, nfs 디렉토리로 이동
alias
를 확인해보면, allon
과 alloff
명령어를 볼 수 있음. 이 명령어로 Spark+Hadoop Cluster를 Star/Stop 할 수 있음.data:image/s3,"s3://crabby-images/1795f/1795f5753e3d4d78827a3e8b0e11acb728dceadf" alt=""
allon
이 제대로 실행되었다면, http://master-node-ip:8080 와 http://master-node-ip:9870 에서 Spark와 Hadoop의 WebUI를 볼 수 있음.
여기까지 설정이 끝났으면, spark-submit
을 이용한 script 실행이 가능함. Jupyter Notebook을 이용한 interacitve shell mode를 이용하려면, 아래 설명한 추가 설정이 필요함.
SparkUI
data:image/s3,"s3://crabby-images/bad46/bad461daf6bb2aa2b7b917411a18e1440ae732d7" alt=""
HadoopUI
data:image/s3,"s3://crabby-images/23abb/23abb58acad03da7845fe714be55c00ce394e262" alt=""
- 여기까지 설정이 끝났으면,
spark-submit
을 이용한 script 실행이 가능함.
Introduction to Apache Spark