Deployment of Hadoop-based Services on Windows and on Windows Azure

Apache™ Hadoop™ is an open source framework from the Apache Software Foundation for distributed storage and processing of large datasets. It is designed to scale from a single server to thousands of machines, each offering local computation and storage. Data processing in Hadoop is distributed across a cluster of computers using a simple programming model, which makes it well suited to analyzing large unstructured datasets and finding relationships within them. For a complete reference on Hadoop, see hadoop.apache.org.

The core Hadoop project contains the following components:

  • Hadoop Common is the set of common utilities that support the other Hadoop-related subprojects.
  • Hadoop Distributed File System (HDFS) is the primary storage system used by Hadoop applications.
  • Hadoop MapReduce is the software framework for distributed processing of large unstructured datasets across a Hadoop cluster of computers.
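
The "simple programming model" behind MapReduce can be illustrated with a word count. The following is a minimal single-process sketch in Python, not Hadoop's actual API: the function names (map_phase, shuffle, reduce_phase, word_count) are hypothetical and chosen only for illustration. On a real cluster the map and reduce steps would run in parallel on many machines, and the framework would handle the grouping step between them.

```python
from collections import defaultdict
from typing import Dict, Iterable, Iterator, List, Tuple


def map_phase(line: str) -> Iterator[Tuple[str, int]]:
    """Map: emit a (word, 1) pair for every word in one input line."""
    for word in line.lower().split():
        yield word, 1


def shuffle(pairs: Iterable[Tuple[str, int]]) -> Dict[str, List[int]]:
    """Shuffle: group all emitted values by their key."""
    grouped: Dict[str, List[int]] = defaultdict(list)
    for key, value in pairs:
        grouped[key].append(value)
    return dict(grouped)


def reduce_phase(key: str, values: List[int]) -> Tuple[str, int]:
    """Reduce: collapse all values for one key into a single count."""
    return key, sum(values)


def word_count(lines: Iterable[str]) -> Dict[str, int]:
    """Run map, shuffle, and reduce in sequence (illustration only)."""
    pairs = (pair for line in lines for pair in map_phase(line))
    grouped = shuffle(pairs)
    return dict(reduce_phase(key, values) for key, values in grouped.items())


if __name__ == "__main__":
    # Hypothetical sample input; a real job would read its input from HDFS.
    sample = [
        "Hadoop stores data in HDFS",
        "Hadoop processes data with MapReduce",
    ]
    print(word_count(sample))
```

Because each map call sees only one line and each reduce call sees only one key, the work can in principle be split across machines without the steps needing to share state.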

Hadoop-based services for Microsoft Windows include the core components and the following Hadoop-related projects:

  • Hive is a data warehouse infrastructure that provides data summarization and ad hoc querying.
  • Pig is a high-level data-flow language and execution framework for parallel computation.
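
To give a feel for the kind of summarization these tools express, here is a rough Python sketch of a filter, group, and aggregate data flow. It is neither HiveQL nor Pig Latin, and it runs in a single process; the SALES records and the summarize_by_region function are hypothetical and exist only for illustration. Hive and Pig let you state steps like these at a high level and execute them on the cluster.

```python
from collections import defaultdict
from typing import Dict, Iterable, Tuple

# Hypothetical sample records: (region, product, amount).
SALES = [
    ("east", "widget", 10.0),
    ("west", "widget", 4.0),
    ("east", "gadget", 6.5),
    ("west", "gadget", 0.0),
]


def summarize_by_region(
    records: Iterable[Tuple[str, str, float]], min_amount: float = 0.0
) -> Dict[str, float]:
    """Filter out small amounts, group by region, and total each group."""
    totals: Dict[str, float] = defaultdict(float)
    for region, _product, amount in records:
        if amount > min_amount:  # filter step
            totals[region] += amount  # group and aggregate step
    return dict(totals)


if __name__ == "__main__":
    print(summarize_by_region(SALES))
```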

Microsoft provides Hadoop-based services packages for Windows and for Windows Azure. To get started using one of the packages, click on a link below:

 

 
