Adventures in Web Development: Installing Hadoop 2 on a Mac

Tuesday, October 29, 2013

Installing Hadoop 2 on a Mac

I've had a lot of trouble getting Hadoop 2 and yarn 2 running on my MAC. There are some tutorials out there but they are often for
beta and alpha versions of the hadoop 2.0 family. These are the steps I used to get Hadoop 2.2.0 working on my MAC running OSX 10.9

Note: watch for version differences in this blog. It was written for Hadoop 2.2.0, we are currently on 2.6.2 so that will need to be changed throughout.

Get hadoop from http://www.apache.org/dyn/closer.cgi/hadoop/common/

make sure JAVA_HOME is set (if you have Java 6 on your machine):
export JAVA_HOME=`/usr/libexec/java_home -v1.6`
(Note your Java version should be 1.7 or 1.8)

point HADOOP_INSTALL to the hadoop installation directory
export HADOOP_INSTALL=/Applications/hadoop-2.2.0

And set the path
export PATH=$PATH:$HADOOP_INSTALL/bin:$HADOOP_INSTALL/sbin

You can test hadoop is found with
hadoop -version

make sure ssh is set up on your machine:
system preferences -> sharing -> remote login is ticked

try:
ssh @localhost

where is the name you used to logon.

in $HADOOP_INSTALL/etc these are the conf files I changed.

core-site.xml

 <configuration>  
 <property>  
   <name>fs.default.name</name>  
   <value>hdfs://localhost:9000</value>  
  </property>  
 </configuration>

hdfs-site.xml

 <configuration>  
 <property>  
   <name>dfs.replication</name>  
   <value>1</value>  
  </property>  
  <property>  
   <name>dfs.namenode.name.dir</name>  
   <value>file:/Users/Administrator/hadoop/namenode</value>  
  </property>  
  <property>  
   <name>dfs.datanode.data.dir</name>  
   <value>file:/Users/Administrator/hadoop/datanode</value>  
  </property>  
 </configuration>

Make the directories for the namenode and datanode data (note the file above and the mkdir below will need to reflect where you want to store the files, I've stored mine in the home directory of the Administrator user on my Mac).

mkdir -p /Users/Administrator/hadoop/namenode
mkdir -p /Users/Administrator/hadoop/datanode

hadoop namenode -format

yarn-site.xml

 <configuration>  
 <!-- Site specific YARN configuration properties -->  
 <property>  
 <name>yarn.resourcemanager.address</name>  
 <value>localhost:8032</value>  
 </property>  
 <property>  
 <name>yarn.nodemanager-aux-services</name>  
 <value>madpreduce.shuffle</value>  
 </property>  
 </configuration>

start-dfs.sh
start-yarn.sh
jps

should give
9430 ResourceManager
9325 SecondaryNameNode
9513 NodeManager
9225 DataNode
9916 Jps
9140 NameNode

if not check log files. If data node is not started and you get incompatible id's error, stop everything delete datanode directory and recreate
datanode directory

try a ls
hadoop fs -ls

if you get

ls: `.': No such file or directory

then there is no home directory in the hadoop file system. So

hadoop fs -mkdir /user
hadoop fs -mkdir /user/<username>
where is the name you are logged onto the machine with.

now change to $HADOOP_INSTALL directory and upload a file

hadoop fs -put LICENSE.txt

finally try a mapreduce job:

cd share/hadoop/mapreduce
hadoop jar ./hadoop-mapreduce-examples-2.2.0.jar wordcount LICENSE.txt out

28 comments:

MattOctober 31, 2013 at 4:42 AM
Thanks for the great post. It really helped me get started. Since I ran into a few problems while following your directions, I thought I'd post the problem and solutions here in case they are useful to anyone else.

Problem 1
------------
When executing 'hadoop version', I would get a error. I apologize that I didn't capture the exact error syntax but the gist was that the hadoop was complaining about the location of java_home.

Solution 1
------------
Instead of using

export JAVA_HOME=`/usr/libexec/java_home -v1.6`

I added the following to my .bash_profile file:

export JAVA_HOME="$(/usr/libexec/java_home)"

Problem 2
------------
When I perform an operation on the file system, I'd get errors that read "Unable to load realm info from SCDynamicStore".

Solution 2
------------
I added the following line to the bottom of the hadoop-env.sh file:

export HADOOP_OPTS="-Djava.security.krb5.realm=OX.AC.UK -Djava.security.krb5.kdc=kdc0.ox.ac.uk:kdc1.ox.ac.uk"

I also added the following to the yarn-env.sh file:

YARN_OPTS="$YARN_OPTS -Djava.security.krb5.realm=OX.AC.UK -Djava.security.krb5.kdc=kdc0.ox.ac.uk:kdc1.ox.ac.uk"

Hope this helps!
ReplyDelete
Replies
Web Developing ServicesNovember 28, 2013 at 9:57 AM
This is really essential information for web developers who are in the beginning stage of website developing.
Web Designing Companies India | Web Development Companies
ReplyDelete
Replies
Lars TijhuisDecember 1, 2013 at 5:48 PM
Thanks for your post!

I have been trying to configure it this whole Sunday afternoon, but whatever tutorial I try, I keep getting the following error:

======

13/12/01 18:44:32 INFO mapreduce.Job: Job job_1385919832889_0001 failed with state FAILED due to: Application application_1385919832889_0001 failed 2 times due to AM Container for appattempt_1385919832889_0001_000002 exited with exitCode: 127 due to: Exception from container-launch:
org.apache.hadoop.util.Shell$ExitCodeException:
at org.apache.hadoop.util.Shell.runCommand(Shell.java:464)
at org.apache.hadoop.util.Shell.run(Shell.java:379)
at org.apache.hadoop.util.Shell$ShellCommandExecutor.execute(Shell.java:589)
at org.apache.hadoop.yarn.server.nodemanager.DefaultContainerExecutor.launchContainer(DefaultContainerExecutor.java:195)
at org.apache.hadoop.yarn.server.nodemanager.containermanager.launcher.ContainerLaunch.call(ContainerLaunch.java:283)
at org.apache.hadoop.yarn.server.nodemanager.containermanager.launcher.ContainerLaunch.call(ContainerLaunch.java:79)
at java.util.concurrent.FutureTask$Sync.innerRun(FutureTask.java:334)
at java.util.concurrent.FutureTask.run(FutureTask.java:166)
at java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1145)
at java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:615)
at java.lang.Thread.run(Thread.java:722)

.Failing this attempt.. Failing the application.
13/12/01 18:44:32 INFO mapreduce.Job: Counters: 0
=====

Any ideas?
ReplyDelete
Replies
bgreene10123January 18, 2014 at 2:43 AM
Awesome! Worked perfectly for me! Thanks!
ReplyDelete
Replies
Bangaloreweb guruFebruary 5, 2014 at 4:38 PM
This comment has been removed by a blog administrator.
ReplyDelete
Replies
UnknownFebruary 26, 2014 at 10:24 AM
Much obliged concerning your answers. I recon issue 1 is on the grounds that the punctuation I had unequivocally sets the java form to 1.6 which may not have been introduced on your framework. A debt of gratitude is in order regarding the reply to issue.

best website design//Mobile Apps N Webs Development
ReplyDelete
Replies
Matt GardnerMarch 18, 2014 at 5:41 AM
Thanks for great article. All worked for me - and this was the first time I tried to get Hadoop up and running.
ReplyDelete
Replies
UnknownAugust 18, 2014 at 12:44 PM
I hope this information of installing process would be as best reference to install the hadoop for the people.I really grateful to this blog for updating useful things.
Web Designing Companies Bangalore | Website Development Company Bangalore
ReplyDelete
Replies
saranyaApril 28, 2018 at 10:48 AM
I believe there are many more pleasurable opportunities ahead for individuals that looked at your site.
selenium training in chennai
ReplyDelete
Replies
AnonymousJune 5, 2018 at 8:08 AM
This comment has been removed by the author.
ReplyDelete
Replies
AnonymousJune 5, 2018 at 8:11 AM
This is extremely great information for these blog!! And Very good work. It is very interesting to learn from to easy understood. Thank you for giving information. Please let us know and more information get post to link.Devops interview Questions
ReplyDelete
Replies
service careFebruary 27, 2019 at 11:29 AM
Thanks for posting useful information.You have provided an nice article, Thank you very much for this one. And i hope this will be useful for many people.. and i am waiting for your next post keep on updating these kinds of knowledgeable things...Really it was an awesome article...very interesting to read..please sharing like this information......
samsung mobile service center in vadapalani
ReplyDelete
Replies
PavaniSeptember 6, 2019 at 7:00 AM
Thanks for sharing this valubale information with us Keep Blogging !!
Digital Marketing Course in Vizag
Seo Training in vizag
Seo services in vizag
Digital Marketing Course in vijayawada
Digital Marketing Course in Guntur
Digital Marketing Course in Tirupati
Balloon Decoration in Vizag

ReplyDelete
Replies
merlinMay 16, 2020 at 7:13 AM
The blog you shared about Web Development is very good. I expect more information from you like this blog. Thankyou.

Selenium Training in chennai | Selenium Training in annanagar | Selenium Training in omr | Selenium Training in porur | Selenium Training in tambaram | Selenium Training in velachery
ReplyDelete
Replies
RohitJune 1, 2020 at 12:00 PM
Great Article & Thanks for sharing.

Mitron App details
ReplyDelete
Replies
sureshDecember 16, 2020 at 4:51 PM
This is extremely great information for these blog!! And Very good work. It is very interesting to learn from to easy understood. Thank you for giving information.
DevOps Training in Chennai

DevOps Course in Chennai
ReplyDelete
Replies
KEMAL UZUNJune 26, 2021 at 2:49 AM
coin haber - koin haber - kripto para haberleri - coin haber - instagram video indir - instagram takipçi satın al - instagram takipçi satın al - tiktok takipçi satın al - instagram takipçi satın al - instagram takipçi satın al - instagram takipçi satın al - instagram takipçi satın al - instagram takipçi satın al - binance güvenilir mi - binance güvenilir mi - binance güvenilir mi - binance güvenilir mi - instagram beğeni satın al - instagram beğeni satın al - google haritalara yer ekleme - btcturk güvenilir mi - binance hesap açma - kuşadası kiralık villa - tiktok izlenme satın al - instagram takipçi satın al - sms onay - paribu sahibi - binance sahibi - btcturk sahibi - paribu ne zaman kuruldu - binance ne zaman kuruldu - btcturk ne zaman kuruldu - youtube izlenme satın al - torrent oyun - google haritalara yer ekleme - altyapısız internet - bedava internet - no deposit bonus forex - erkek spor ayakkabı - tiktok jeton hilesi - tiktok beğeni satın al - microsoft word indir - misli indir - instagram takipçi satın al
ReplyDelete
Replies
AnonymousSeptember 18, 2021 at 1:28 PM
i am very gladfully to u share a this kind of information with us u make a blog on web development. if you want to know about server hosting or interested in best Managed Dedicated Server you can ask us for more details and services.
ReplyDelete
Replies
AnonymousSeptember 18, 2021 at 1:56 PM
Hello friend your blog is very instructive, and it contains a very good amount of knowledge, knowing about Installing Hadoop 2 on a Mac. Web Hosting plays a very important role in the business world. And it is important to have the best hosting services. Buy the best Dedicated Server Hosting service for your website.
ReplyDelete
Replies
MikeFebruary 1, 2022 at 10:29 AM
This blog has really best knowledge for the Web development Dubai and also helped me to find out out new ways at my work, thanks for sharing the blog.
ReplyDelete
Replies
AnonymousApril 28, 2022 at 11:02 PM
mmorpg oyunlar
İnstagram Takipci Satın Al
tiktok jeton hilesi
Tiktok Jeton Hilesi
sac ekim antalya
takipçi
instagram takipçi satın al
metin2 pvp serverlar
INSTAGRAM TAKİPÇİ
ReplyDelete
Replies
AnonymousMay 17, 2022 at 5:33 PM
perde modelleri
NUMARA ONAY
MOBİL ÖDEME BOZDURMA
nft nasıl alınır
ankara evden eve nakliyat
trafik sigortası
DEDEKTÖR
SİTE KURMA
Ask romanlari
ReplyDelete
Replies
James AlvarezMay 4, 2023 at 12:17 PM
Great post! Such an informative and well-written piece. Are you looking for a gardener that cares for or designs your garden then click here for the best Gärtner service.
ReplyDelete
Replies

Add comment

Adventures in Web Development

Tuesday, October 29, 2013

Installing Hadoop 2 on a Mac

28 comments:

About Me

Blog Archive