Friday 9 October 2015

Hadoop working

Working with Hadoop an example  file.



https://drive.google.com/file/d/0B0CDNJOhbMlDNGtvYkl4aExzTVE/view?usp=sharing
Madhusudhana R K                                               HADOOP INSTALLATION





How to Configure HADOOP-1.0.4 on Ubuntu 14.04LTS

1. Configure SSH
$ sudo apt-get install openssh-server
$ sudo apt-get install openssh-client
$ ssh-keygen -t rsa -P ""
$ ssh localhost (if not installed, type $ sudo apt-get install localhost)

2. Disable IPV6
open the follwing file
$ sudo gedit /etc/sysctl.conf
#Add below contents
#disable ipv6
net.ipv6.conf.all.disable_ipv6 = 1
net.ipv6.conf.default.disable_ipv6 = 1
net.ipv6.conf.lo.disable_ipv6 = 1

3. Install Hadoop

move the hadoop-1.0.4 file to below path

$ /home/hadoop/apache/hadoop-1.0.4

4. Edit the .bashrc file
#Add the below contents
# Set Hadoop-related environment variablesexport
HADOOP_HOME=/home/hadoop/apache/hadoop-1.0.4
# Set JAVA_HOME (we will also configure JAVA_HOME directly for Hadoop later on)

export JAVA_HOME=/home/hadoop/apache/jdk1.7.0_25
# Add Hadoop & Java directory to PATH
export PATH=$PATH:$JAVA_HOME/bin:$HADOOP_HOME/bin


5. Goto following path to edit the configuration files

$ sudo gedit /home/hadoop/apache/hadoop-1.0.4/conf/

edit hadoop-env.sh
#add below lines
export HADOOP_OPTS=-Djava.net.preferIPv4Stack=true
export JAVA_HOME=/home/hadoop/apache/jdk1.7.0_25
export PATH=$PATH:$JAVA_HOME/bin

conf/core-site.xml
<configuration>
<property>
<name>fs.default.name</name>
<value>hdfs://localhost:9000</value>
</property>
</configuration>

conf/hdfs-site.xml :
<configuration>
<property>
<name>dfs.replication</name>
<value>1</value>
</property>
</configuration>


conf/mapred-site.xml :
<configuration>
<property>
<name>mapred.job.tracker</name>
<value>localhost:9001</value>
</property>
</configuration>

6. check for localhost

$ssh localhost
now start the namenode, go to below path
$ /home/hadoop/apache/hadoop-1.0.4

type

bin/hadoop namenode -format

you will receive information about the namenode starting.

Type “ bin/start-all.sh “to run all nodes

type jps

you will receive the following nodes.
3526 JobTracker
3084 NameNode
3433 SecondaryNameNode
5809 Jps
3726 TaskTracker
3260 DataNode


this complets your HADOOP installation !!!!!!!!!!!!!!!!!!!!!!!!




Madhusudhana R K
+91 9008836782 / 7353293143
rkmadhu619@gmail.com