This page explains how to run Stratosphere locally on a single machine.
Go to the downloads page and get the ready-to-run package. If you want to interact with Hadoop (e.g. HDFS or HBase), pick the Stratosphere package that matches your Hadoop version. If you are in doubt, or plan to work only with the local file system, pick the package for Hadoop 1.2.x.
Stratosphere runs on Linux, Mac OS X and Windows. The only requirement for a local setup is Java 1.6.x or higher. The following instructions assume a UNIX-like environment; for Windows, see Stratosphere on Windows.
You can check that Java is installed correctly by issuing the following command:
java -version
The command should output something comparable to the following:
java version "1.6.0_22"
Java(TM) SE Runtime Environment (build 1.6.0_22-b04)
Java HotSpot(TM) 64-Bit Server VM (build 17.1-b03, mixed mode)
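The version check above can also be scripted. The following is a minimal sketch, not part of Stratosphere: the function name java_version_ok is hypothetical, and it assumes the version is the quoted token on the first line of the java -version output, as in the sample above.

```shell
# Return 0 if the given Java version string is 1.6 or newer.
# Handles the legacy "1.x.y_z" scheme and the newer "9", "11.0.2" scheme.
java_version_ok() {
    version=$1
    major=${version%%.*}              # text before the first dot
    if [ "$major" = "1" ]; then
        rest=${version#*.}            # drop the leading "1."
        major=${rest%%.*}             # "6" for "1.6.0_22"
    fi
    major=${major%%[!0-9]*}           # strip suffixes such as "-ea"
    [ -n "$major" ] && [ "$major" -ge 6 ]
}

# java -version writes to stderr, so redirect it before parsing.
if command -v java >/dev/null 2>&1; then
    detected=$(java -version 2>&1 | sed -n '1s/.*"\(.*\)".*/\1/p')
else
    detected=""
fi

if java_version_ok "$detected"; then
    echo "Java $detected is sufficient"
else
    echo "Java 1.6 or newer not found (detected: '${detected:-none}')"
fi
```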
In local mode, Stratosphere works out of the box and you do not need to change the default configuration.
The default configuration uses your default Java installation. To override the Java runtime, set the environment variable JAVA_HOME or the configuration key env.java.home in conf/stratosphere-conf.yaml. Consult the configuration page for further details about configuring Stratosphere.
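As an illustration, the environment variable can be set in the shell before starting Stratosphere. The JDK path below is a hypothetical example, not a default; substitute the location of the runtime you want to use.

```shell
# Hypothetical JDK location; replace it with the path of your own runtime.
export JAVA_HOME=/usr/lib/jvm/java-6-openjdk

# Alternatively, the same path can be stored under the key env.java.home
# in conf/stratosphere-conf.yaml (see the configuration page for the syntax).
echo "JAVA_HOME is set to $JAVA_HOME"
```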
You are now ready to start Stratosphere. Unpack the downloaded archive and change to the newly created stratosphere directory. There you can start Stratosphere in local mode:
$ tar xzf stratosphere-*.tgz
$ cd stratosphere
$ bin/start-local.sh
Starting Nephele job manager
You can verify that the system is running by checking the log files in the log directory:
$ tail log/nephele-*-jobmanager-*.log
INFO ... - Initializing memory manager with 409 megabytes of memory
INFO ... - Trying to load eu.stratosphere.nephele.jobmanager.scheduler.local.LocalScheduler as scheduler
INFO ... - Setting up web info server, using web-root directory ...
INFO ... - Web info server will display information about nephele job-manager on localhost, port 8081.
INFO ... - Starting web info server for JobManager on port 8081
The JobManager also starts a web frontend on port 8081, which you can open in your browser at http://localhost:8081.
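The log check can be automated by searching for the startup message shown in the log excerpt above. This is a minimal sketch: the function name jobmanager_web_started is hypothetical, and it assumes the message text matches the sample output exactly.

```shell
# Return 0 if the given JobManager log reports that the web info server started.
jobmanager_web_started() {
    grep -q "Starting web info server for JobManager" "$1"
}

# Check every JobManager log, using the same path pattern as the tail command.
for logfile in log/nephele-*-jobmanager-*.log; do
    [ -f "$logfile" ] || continue     # pattern matched nothing: skip
    if jobmanager_web_started "$logfile"; then
        echo "$logfile: web frontend started"
    else
        echo "$logfile: no startup message yet"
    fi
done
```

Run it from the stratosphere directory after bin/start-local.sh has had a moment to write its log.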
To run Stratosphere on Windows, download, unpack and configure the Stratosphere archive as described above. You can then start the Stratosphere JobManager either with the Windows batch file or from Cygwin.
To start Stratosphere in local mode with the Windows batch file, open a command window, navigate to the bin directory of Stratosphere and run start-local.bat.
$ cd stratosphere
$ cd bin
$ start-local.bat
Starting Stratosphere job manager. Webinterface by default on http://localhost:8081/.
Do not close this batch window. Stop job manager by pressing Ctrl+C.
After that, open a second terminal to run jobs using pact-client.bat.
With Cygwin, start the Cygwin Terminal, navigate to your stratosphere directory and run the start-local.sh script:
$ cd stratosphere
$ bin/start-local.sh
Starting Nephele job manager
If you are installing Stratosphere from the git repository and you are using the Windows git shell, Cygwin can produce a failure similar to this one:
c:/stratosphere/bin/start-local.sh: line 30: $'\r': command not found
This error occurs because git automatically converts UNIX line endings to Windows-style line endings when running on Windows, while Cygwin can only handle UNIX-style line endings. The solution is to adjust the Cygwin settings so that it tolerates these line endings, by following these three steps:
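Start a Cygwin shell.
Determine your home directory by entering
cd; pwd
It will return a path under the Cygwin root path.
In that home directory, open the file .bash_profile in a text editor (create it if it does not exist) and append the following lines:
export SHELLOPTS
set -o igncr
Save the file and open a new bash shell.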
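A different way to address the same error is to strip the carriage returns from the affected scripts instead of changing the Cygwin settings. The following is a sketch of that alternative, not the procedure described above; the function names has_crlf and strip_cr are hypothetical.

```shell
# Return 0 if the file contains at least one carriage return.
has_crlf() {
    cr=$(printf '\r')
    grep -q "$cr" "$1"
}

# Remove every carriage return from the file, keeping its permissions.
strip_cr() {
    tmpfile=$(mktemp) || return 1
    tr -d '\r' < "$1" > "$tmpfile" && cat "$tmpfile" > "$1"
    status=$?
    rm -f "$tmpfile"
    return $status
}

# Convert the start scripts; run this from the stratosphere directory.
for script in bin/*.sh; do
    [ -f "$script" ] || continue      # pattern matched nothing: skip
    if has_crlf "$script"; then
        strip_cr "$script"
        echo "converted $script"
    fi
done
```

Note that git will report the converted scripts as modified, so the igncr setting may be the better choice if you plan to commit changes.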