spark standalone work扩展

本文档详细介绍了如何在已有的Spark Standalone集群上进行扩展,包括添加新的Worker节点。首先确保所有节点配置了Java环境和下载了Spark安装包,并更新hosts文件。接着创建名为'spark'的用户,并实现免密登录。然后进行目录权限设置。在master和node1节点上完成特定的Spark配置。最后,逐步指导如何将node2作为新的Worker节点加入到集群中,以扩展Spark的工作能力。

摘要生成于 C知道 ,由 DeepSeek-R1 满血版支持, 前往体验 >

所有节点配置Java环境以及下载spark安装包

所有节点配置hosts文件

192.168.2.28 master
192.168.2.29 node1
192.168.2.30 node2

1. 创建spark用户

[root@master ~]# useradd spark
[root@node1 ~]# useradd spark

2. 配置免秘钥登录

测试:

[spark@master ~]$ ssh master    登录自己
Last login: Thu Feb  9 15:12:08 2017 from master
[spark@master ~]$ exit
logout
Connection to master closed.
[spark@master ~]$ ssh node1     登录node1
Last login: Thu Feb  9 14:55:46 2017
[spark@node1 ~]$ exit
logout
Connection to node1 closed.

3. 目录权限设置

master:
[spark@master ~]$ ll -d /opt/source/spark-2.0.2-bin-hadoop2.7
drwxr-xr-x 14 spark spark 4096 Feb  9 10:53 /opt/source/spark-2.0.2-bin-hadoop2.7

node1:
[spark@node1 ~]$ ll -d /opt/source/spark-2.0.2-bin-hadoop2.7
drwxr-xr-x 14 spark spark 4096 Feb  9 10:54 /opt/source/spark-2.0.2-bin-hadoop2.7

4. spark 配置(spark用户操作)

master节点操作:

[spark@master ~]$ cd /opt/spark/
[spark@master spark]$ cp conf/spark-env.sh.template conf/spark-env.sh
[spark@master spark]$ cp conf/slaves.template conf/slaves
[spark@master spark]$ vim conf/spark-env.sh
......
export JAVA_HOME=/opt/jdk
export SPARK_WORKER_CORES=1
export SPARK_WORKER_DIR=/home/spark/work
export SPARK_DAEMON_MEMORY=1G
export SHARK_MASTER_MEM=1G
export SPARK_CLASSPATH=$SPARK_CLASSPATH:/opt/lib/*:/opt/spark/lib/*
[spark@master spark]$ tail -n 2 /opt/spark/conf/slaves
master
node1

node1节点操作:

[spark@node1 spark]$ vim conf/spark-env.sh
......
export JAVA_HOME=/opt/jdk
export SPARK_WORKER_CORES=1
export SPARK_WORKER_DIR=/home/spark/work
export SPARK_DAEMON_MEMORY=1G
export SHARK_MASTER_MEM=1G
export SPARK_CLASSPATH=$SPARK_CLASSPATH:/opt/lib/*:/opt/spark/lib/*

启动spark

[spark@master spark]$ ./sbin/start-all.sh 
starting org.apache.spark.deploy.master.Master, logging to /opt/spark/logs/spark-spark-org.apache.spark.deploy.master.Master-1-master.out
node1: starting org.apache.spark.deploy.worker.Worker, logging to /opt/spark/logs/spark-spark-org.apache.spark.deploy.worker.Worker-1-node1.out
master: starting org.apache.spark.deploy.worker.Worker, logging to /opt/spark/logs/spark-spark-org.apache.spark.deploy.worker.Worker-1-master.out

这里写图片描述

扩展添加 work节点

添加node2 work 节点

  • 配置Java环境、hosts文件、spark安装包、创建用户、免密登录(略)
[spark@node2 ~]$ ll -d /opt/source/spark-2.0.2-bin-hadoop2.7
drwxr-xr-x 14 spark spark 4096 Feb  9 10:54 /opt/source/spark-2.0.2-bin-hadoop2.7

[spark@node2 spark]$ vim conf/spark-env.sh
......
export JAVA_HOME=/opt/jdk
export SPARK_WORKER_CORES=1
export SPARK_WORKER_DIR=/home/spark/work
export SPARK_DAEMON_MEMORY=1G
export SHARK_MASTER_MEM=1G
export SPARK_CLASSPATH=$SPARK_CLASSPATH:/opt/lib/*:/opt/spark/lib/*

[spark@node2 spark]$ tail -n 2 conf/slaves
# A Spark Worker will be started on each of the machines listed below.
node2

[spark@node2 spark]$ ./sbin/start-slave.sh spark://master:7077
starting org.apache.spark.deploy.worker.Worker, logging to /opt/spark/logs/spark-spark-org.apache.spark.deploy.worker.Worker-1-node2.out


这里写图片描述

评论
添加红包

请填写红包祝福语或标题

红包个数最小为10个

红包金额最低5元

当前余额3.43前往充值 >
需支付:10.00
成就一亿技术人!
领取后你会自动成为博主和红包主的粉丝 规则
hope_wisdom
发出的红包
实付
使用余额支付
点击重新获取
扫码支付
钱包余额 0

抵扣说明:

1.余额是钱包充值的虚拟货币,按照1:1的比例进行支付金额的抵扣。
2.余额无法直接购买下载,可以购买VIP、付费专栏及课程。

余额充值