8000 GitHub - non-one/HadoopDockerDemo: 通过docker部署hadoop-3.2.4集群,用于学习测试
[go: up one dir, main page]
More Web Proxy on the site http://driver.im/
Skip to content

non-one/HadoopDockerDemo

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

3 Commits
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Start your own Hadoop cluster with docker

hadoop-master hadoop-slave1 hadoop-slave2
HDFS NameNode
SecondaryNameNode
DataNode DataNode
YARN ResourceManager
LogServer
NodeManager NodeManager

Prepare

  • Docker(my version is 20.10.17)

    just make sure docker compose can work

  • Download hadoop-3.2.4.tar.gz and Put the file in the same level directory of Dockerfile

Quick start

  1. Clone the repository

  2. Build your own images

    docker compose build
  3. Start docker

    docker compose up -d

Verify

图片1.png

docker exec -it hadoop-master bash
./start-hadoop.sh
./run-wordcount.sh

If successful, you will see

图片2.png

MapReduce UI: http://<HOST IP>:8080

HDFS UI: http://:9870

Log UI: http://:19888

Q&A

  1. Can I use it offline?

    NO, the docker base image is ubuntu:18.04 that pulled from Internet,and use apt to install openssh, java8.

    In addition, at Dockerfile line-9,you can replace default sources.list to speed up apt installation.

  2. How to increase the number of DataNode/NodeManager?

    1. edit config/works file. Each line represents a Node.

    2. edit docker-compose.yml.

    3. go to Quick start step-2 rebuild your images.

refer

https://github.com/kiwenlau/hadoop-cluster-docker

About

通过docker部署hadoop-3.2.4集群,用于学习测试

Topics

Resources

Stars

Watchers

Forks

Packages

No packages published
0