I've been playing a bit with Hadoop. Below you'll find notes and screenshots showing how to install a simple development setup (on a virtual server).
You'll need
- A Linux Server. I'm using Ubuntu 12.04 LTS 64 bit, but any should do
- Java JDK 1.6_24 64 bit for Linux (jdk-6u24-linux-x64.bin) get it here
- Hadoop 1.0.4 (hadoop-1.0.4-bin.tar.gz) get it here
Install your server. I wont describe how to do that.
I'm using a virtual machine running on VirtualBox. After installing linux, you'll need to configure port forwarding. Open "Settings", select the "Network" tab and click "Port Forwarding":
Then configure the following four port forwards:
Now you're ready to copy Java and Hadoop to the server. I usually use command line tools: E.g. on Windows with PuTTY:
pscp -l ubuntu -P 9022 hadoop-1.0.4-bin.tar.gz jdk-6u24-linux-x64.bin 127.0.0.1:
Or on Linux:
scp hadoop-1.0.4-bin.tar.gz jdk-6u24-linux-x64.bin ubuntu@127.0.0.1:9022:
Now you need to log on to your server. On Windows you should use PuTTY; enter "Host Name" and "Port" and click "Open":
First install java. Make the file executable and run it as root using sudo:
A lot of files will be unpacked:
Finally you can press enter followed by a few commands to move java into /opt, create a softlink and create a profile for java:
In the profile set the JAVA_HOME environment variable and prepend Java bin directory to PATH:
Check the settings by source'ing and verifying the version of Java:
More:
More:
More:
More:
More:
More:
More:
More:
More:
More:
More:
More:
More:
More:
More:
More:
More:
More:
More:
More:
More:
More:
More:
More:
More:
More:
More:
More:
More:
More:
More:
More:
More:
More:
More:
More:
More:
More:
More:
More:
More:
More:
More:
All screenshots can be downloaded here.