This article describes how the create Map Reduce jobs on a cluster that has been provisioned on the Hadoop on Windows Azure Portal. For more information on running Map Reduce Jobs for an on-premise or Windows Azure Hadoop cluster, see the getting started guide for your cluster deployment type.
To run a job, you will need to first create your own jar file – compiled java code that contains your Map Reduce code. For more information on how to create a jar file, you can reference these links:
We have also made the hadoop-examples-0.20.203.1-SNAPSHOT.jar file available which is made use below to run the pi sample. Once, your logged into the portal, download the .jar file to your local computer by clicking the Samples tile, then click the Pi Estimator tile. The steps in this topic walk you through manually uploading a jar file from your local computer. However, you can just click Deploy to your cluster for the sample.
pi, 10, 100
This indicates that for the examples jar file, you will run the pi sample using iterations / threads of 10 with the number of jobs to aggregate over at 100. The parameters should look much like the screenshot below.
If you click on the task itself, (e.g. the “pi #1” tile), you can re-execute the job – similar to what you had done in Step #3.
Ed Price - MSFT edited Revision 11. Comment: Fonts and spacing
Wesley McSwain MSFT edited Revision 12. Comment: Put the table in step 3 in it's own paragraph so it reads better
Wesley McSwain MSFT edited Revision 13. Comment: cleaned up the formatting
Wesley McSwain MSFT edited Revision 16. Comment: correcting image formats
BradSevertson edited Revision 24. Comment: Got rid of EMR language.
Adding tags. Great article!