配置客户端连接服务器上的hadoop集群,毕竟自己的机子性能有限,进行一些大规模的计算还是需要到服务器上进行,以下记录了配置客户端的全过程。
搭建环境:CentOS7+hadoop3.0.3+jdk8
前置需求
- 配置静态ip地址
- 修改主机名
- 配置主机名和ip地址映射
- 关闭防火墙,设置开机不自启动
- 安装并配置好jdk
注:以上操作在笔者之前的Linux相关笔记中均有介绍
正式搭建
- 创建相关目录
mkdir /usr/local/apps/ - 解压hadoop-3.0.3.tar.gz到apps/
tar -zxvf hadoop-3.0.3.tar.gz -C /usr/local/apps/ - 建立软连接
hadoop指向hadoop-3.0.3
ln -s hadoop-3.0.3 hadoop - 配置hadoop环境变量
vi ~/.bashrc
添加
export HADOOP_HOME=/usr/local/apps/hadoop-3.0.3
export HADOOP_PREFIX=$HADOOP_HOME
export HADOOP_CONF_DIR=$HADOOP_HOME/etc/hadoop
export LD_LIBRARY_PATH=$HADOOP_HOME/lib/native
export HADOOP_USER_NAME=xujie
export JAVA_HOME=/usr/local/apps/jdk1.8.0_101
export PATH=$PATH:$HADOOP_HOME/bin:$HADOOP_HOME/sbin
重新加载
source ~/.bashrc 修改配置文件core-site.xml
vi /usr/local/apps/hadoop-3.0.3/etc/hadoop/core-site.xml
在configuration标签中添加属性123456<configuration><property><name>fs.defaultFS</name><value>hdfs://computer1.cloud.briup.com:9000</value></property></configuration>修改配置文件mapred-site.xml
vi /usr/local/apps/hadoop-3.0.3/etc/hadoop/mapred-site.xml
在configuration标签中添加属性12345678910111213141516171819202122<configuration><property><name>mapreduce.framework.name</name><value>yarn</value></property><property><name>mapreduce.app-submission.cross-platform</name><value>true</value></property><property><name>mapreduce.admin.user.env</name><value>HADOOP_MAPRED_HOME=$HADOOP_HOME</value></property><property><name>yarn.app.mapreduce.am.env</name><value>HADOOP_MAPRED_HOME=$HADOOP_HOME</value></property><property><name>mapreduce.application.classpath</name><value>/opt/hadoop/etc/hadoop:/opt/hadoop/share/hadoop/common/lib/*:/opt/hadoop/share/hadoop/common/*:/opt/hadoop/share/hadoop/hdfs:/opt/hadoop/share/hadoop/hdfs/lib/*:/opt/hadoop/share/hadoop/hdfs/*:/opt/hadoop/share/hadoop/mapreduce/lib/*:/opt/hadoop/share/hadoop/mapreduce/*:/opt/hadoop/share/hadoop/yarn:/opt/hadoop/share/hadoop/yarn/lib/*:/opt/hadoop/share/hadoop/yarn/*</value></property></configuration>修改配置文件yarn-site.xml
vi /usr/local/apps/hadoop-3.0.3/etc/hadoop/yarn-site.xml
在configuration标签中添加属性12345678910<configuration><property><name>yarn.resourcemanager.hostname</name><value>computer1.cloud.briup.com</value> #主节点ip</property><property><name>yarn.nodemanager.aux-services</name><value>mapreduce_shuffle</value></property></configuration>查看该集群根目录
hdfs dfs -ls /- 创建自己的家目录
hdfs dfs -mkdir /user/xujie - 测试计算pi的值
cd /usr/local/apps/hadoop/share/hadoop/mapreduce
hadoop jar hadoop-mapreduce-examples-3.0.3.jar pi 4 5000000
计算5000000次pi的值 - 在浏览器查看
172.16.0.4:8088
172.16.0.4:9870
最后更新: 2018年10月08日 18:25