hdfs

Java version support

Apache Hadoop 3.3 and upper supports Java 8 and Java 11 (runtime only)

  • Apache Hadoop from 3.0.x to 3.2.x now supports only Java 8

everyone bringing up a production cluster should include connecting to their organisation’s Kerberos infrastructure as a key part of the deployment.

Install Java 8

sudo apt update

sudo apt install openjdk-8-jdk

java -version

Download hadoop

  • Create new user
sudo addgroup hadoop
sudo adduser --ingroup hadoop hduser
sudo passwd hduser
sudo adduser hduser sudo
sudo cp /home/vuthanhdatt/Downloads/hadoop-3.3.6 /home/hduser/hadoop-3.3.6
sudo chown -r hduser:hadoop /home/hduser/hadoop-3.3.6

https://hadoop.apache.org/releases.html Download binary version.

Unzip tar -xzf hadoop-3.3.6.tar.gz

edit the file etc/hadoop/hadoop-env.sh

`export JAVA_HOME=java_install_path

If install Java with apt, then java install path should be in /usr/lib/jvm/. We can use command update-java-alternatives -l to list all the Java versions installed on system and their paths