TechNet
Products
IT Resources
Downloads
Training
Support
Products
Windows
Windows Server
System Center
Microsoft Edge
Office
Office 365
Exchange Server
SQL Server
SharePoint Products
Skype for Business
See all products »
Resources
Channel 9 Video
Evaluation Center
Learning Resources
Microsoft Tech Companion App
Microsoft Technical Communities
Microsoft Virtual Academy
Script Center
Server and Tools Blogs
TechNet Blogs
TechNet Flash Newsletter
TechNet Gallery
TechNet Library
TechNet Magazine
TechNet Wiki
Windows Sysinternals
Virtual Labs
Solutions
Networking
Cloud and Datacenter
Security
Virtualization
Updates
Service Packs
Security Bulletins
Windows Update
Trials
Windows Server 2016
System Center 2016
Windows 10 Enterprise
SQL Server 2016
See all trials »
Related Sites
Microsoft Download Center
Microsoft Evaluation Center
Drivers
Windows Sysinternals
TechNet Gallery
Training
Expert-led, virtual classes
Training Catalog
Class Locator
Microsoft Virtual Academy
Free Windows Server 2012 courses
Free Windows 8 courses
SQL Server training
Microsoft Official Courses On-Demand
Certifications
Certification overview
Special offers
MCSE Cloud Platform and Infrastructure
MCSE: Mobility
MCSE: Data Management and Analytics
MCSE Productivity
Other resources
Microsoft Events
Exam Replay
Born To Learn blog
Find technical communities in your area
Azure training
Official Practice Tests
Support options
For business
For developers
For IT professionals
For technical support
Support offerings
More support
Microsoft Premier Online
TechNet Forums
MSDN Forums
Security Bulletins & Advisories
Not an IT pro?
Microsoft Customer Support
Microsoft Community Forums
Sign in
Home
Library
Wiki
Learn
Gallery
Downloads
Support
Forums
Blogs
Resources For IT Professionals
United States (English)
Россия (Pусский)
中国(简体中文)
Brasil (Português)
Skip to locale bar
Get this Tag RSS feed
Translate this page
Powered by
Microsoft® Translator
Popular Tags
Active Directory
AD
AD DS
adfs
ASP.NET
azure
BizTalk
BizTalk Server
BizTalk Server 2010
C#
Candidate for deletion
certification
cloud
core docs
de-DE
EAA
Ed Price
Ed's Stub Pages
en-US
ESA
es-ES
Excel
Exchange
Exchange 2010
fa-IR
Fernando Lugao Veltem
FIM
FIM 2010
FIM Resources
FIM-HELP
forefront
forums
fr-FR
Gokan Ozcifci
has code
has comment
has comments
has image
has Images
has Other Languages
has See Also
Has Table
Has TOC
Horizon_Net
How To
Hyper-V
id-ID
IIS
Italian Wiki Articles
it-IT
ja-JP
Jordano Mazzoni
Link Collection
Luciano Lima
Luigi Bruno
Lync Server 2010
MIISILMFIM MACAULAY
Multi Language Wiki Articles
needs work
operations manager
Pirated Content
Portal
Português Brasil
PowerShell
pt-BR
security
SharePoint
SharePoint 2010
SharePoint 2013
SharePoint Pirate
Small Basic
solucionando problemas
SQL Server
SQL Server 2012
stub
System Center
System Center 2012
TechNet Guru
TechNet Wiki
TechNet Wiki Featured Article
tonyso
Translated into Japanese
troubleshooting
tr-TR
vídeo
Video
Virtualization
VMM
Wiki
Windows
Windows 7
Windows 8
Windows Azure
Windows Server
Windows Server 2003
Windows Server 2008
Windows Server 2008 R2
Windows Server 2012
yottun8
اکتیو دایرکتوری
Browse by Tags
>
TechNet Articles
>
All Tags
>
HDFS
Tagged Content List
Wiki Page:
How to Import Data to Hadoop on Windows Azure from Windows Azure Marketplace
Maheshkumar S Tiwari
Before you use the Apache Hadoop on Windows Azure portal to import Windows Azure Marketplace data into Hadoop on Windows Azure, you must know the following information: User name: the live ID used to sign in to the marketplace. PassKey Sign in http://datamarket.azure.com with your live ID...
on
14 Sep 2013
Wiki Page:
Running HDInsight C# Hadoop Streaming Sample
Maheshkumar S Tiwari
MapReduce is a programming model designed for processing large volumes of data in parallel by dividing the work into a set of independent tasks. Most of the MapReduce jobs are written in Java. Hadoop provides a streaming API to MapReduce that enables you to write map and reduce functions in languages...
on
14 Sep 2013
Wiki Page:
HDInsight Services For Windows
Richard Mueller
This article is the main portal for technical information about HDInsight Services for Windows and related Microsoft technologies. It provides a brief overview of Apache Hadoop, as well as information for the HDInsight Services provided by Microsoft for deployment on both Windows and Windows Azure...
on
3 Sep 2013
Wiki Page:
Getting Started with the HDInsight Server Developer Preview
BradSevertson
Table of Contents Introduction Installation of Hadoop on Windows The Apache™ Hadoop™-based services on Windows dashboard Getting started with Microsoft Hadoop on Windows Load Some Data Running MapReduce Jobs Running Pig Jobs Running Hive Jobs Additonal Resources: Apache Hadoop, Hadoop on Windows, and...
on
7 Jun 2013
Wiki Page:
Installing the Developer Preview of HDInsight Services on Windows
BradSevertson
Introduction The HDInsight Server Developer Preview is an implementation of HDInsight on Windows. This Developer Preview of Apache™ Hadoop™-based services on Windows uses only a single node deployment. HDInsight Server provides a local development environment for the Windows Azure HDInsight Service...
on
7 Jun 2013
Wiki Page:
The Hadoop on Azure Pegasus Page Rank Sample
Carsten Siemens
Overview This tutorial shows how to deploy Pegasus from the Hadoop on Azure portal to compute the page rank for a simple 16-node graph. The rank calculated for a node is a measure of how well connected it is to the other nodes in the graph structure. A graph is type of abstract mathematical structure...
on
5 May 2013
Wiki Page:
HDInsight Services for Windows Azure QuickStart: Running Hadoop Jobs
Carsten Siemens
This tutorial shows two ways in which Hadoop MapReduce programs can be run on an Hadoop Distributed File System (HDFS) using HDInsight Services for Windows Azure. Use the Create Job UI to run MapReduce programs written in Java, contained in Hadoop jar files Use the Interactive JavaScript Console...
on
5 May 2013
Wiki Page:
Analyzing Twitter Data with Hive in HDInsight and SteamInsight
Carsten Siemens
In this tutorial you will query, explore, and analyze data from twitter using Apache™ Hadoop™-based Services for Windows Azure and a Hive query in Excel. Social web sites are one of the major driving forces for Big Data adoption. Public APIs provided by sites like Twitter are a useful source of data...
on
5 May 2013
Wiki Page:
Simple recommendation engine using Apache Mahout
Carsten Siemens
Apache Mahout™ is a machine learning library built for use in scalable machine learning applications. Recommender engines are some of the most immediately recognizable machine learning applications in use today. In this tutorial you use the Million Song Dataset to create song recommendations for users...
on
5 May 2013
Wiki Page:
Working With Data in Windows Azure HDInsight Service
Carsten Siemens
This tutorial covers several techniques for storing and importing data for use in Hadoop MapReduce jobs run with Windows Azure HDInsight Service ( formerly Apache™ Hadoop™-based Services for Windows Azure). Apache Hadoop is a software framework that supports data-intensive distributed applications...
on
5 May 2013
Wiki Page:
Introduction to HDInsight Services for Windows Azure
Carsten Siemens
Overview HDInsight Services for Windows Azure is a service that deploys and provisions Apache™ Hadoop™ clusters in the cloud, providing a software framework designed to manage, analyze and report on big data. Data is described as "big data" to indicate that it is being collected in ever...
on
5 May 2013
Wiki Page:
Hadoop on Azure WordCount Sample Tutorial
Jonathan Gao
Overview This tutorial shows two ways to use Hadoop on Azure to run a MapReduce program that counts word occurences in a text. First, with a Hadoop .jar file by using the Create Job UI. Second, with a query by using the fluent API layered on Pig that is provided by the Interactive Console . The...
on
22 Apr 2013
Wiki Page:
Analyzing Twitter Movie Data with Hive in HDInsight
Jonathan Gao
In this tutorial you will query, explore, and analyze data from twitter using Apache™ Hadoop™-based Services for Windows Azure and a Hive query in Excel. Social web sites are one of the major driving forces for Big Data adoption. Public APIs provided by sites like Twitter are a useful source of data...
on
11 Apr 2013
Wiki Page:
The Hadoop on Azure Sqoop Import Sample Tutorial
Ed Price - MSFT
Overview This tutorial shows how to use Sqoop to import data from a SQL database on Windows Azure to an Hadoop on Azure HDFS cluster. While Hadoop is a natural choice for processing unstructured and semi-structured data, such as logs and files, there may also be a need to process structured data...
on
10 Jan 2013
Wiki Page:
The Hadoop on Azure Pi Estimator Sample Tutorial
Luis Panzano Barbero
Overview This tutorial shows how to deploy a MapReduce program that uses a statistical (quasi-Monte Carlo) method to estimate the value of Pi. Points placed at random inside of a unit square also fall within a circle inscribed within that square with a probability equal to the area of the circle...
on
28 Oct 2012
Wiki Page:
The Hadoop on Azure Pegasus Degree Distribution Sample Tutorial
Michael Agranov
Overview This tutorial shows how to deploy Pegasus from the Hadoop on Azure portal to compute the degree of each node and the distribution of degrees for a simple 16-node graph. The degree distribution gives the number of nodes in the graph at each degree. The degree of a node in a network (or...
on
5 Oct 2012
Wiki Page:
The Hadoop on Azure Mahout Clustering Sample Tutorial
Michael Agranov
Overview This tutorial illustrates how to use Hadoop on Azure to do cluster analysis with Mahout. The various forms of cluster analysis attempt to answer the problem: given a collection of objects with values for a set of properties, devise a scheme for grouping them where similar ones are put...
on
4 Oct 2012
Wiki Page:
The Hadoop on Azure Mahout Classification Sample Tutorial
Michael Agranov
Overview This tutorial illustrates how to use Apache Mahout in Hadoop on Azure to do classification. Classification techniques attempt to answer the question: how much some object is or is not part of some type or category, or, whether it does or does not have some attribute. The sample used...
on
4 Oct 2012
Wiki Page:
Hadoop on Azure 10 GB GraySort Sample Tutorial
Michael Agranov
Overview This tutorial shows how to run a general purpose GraySort on a 10 GB file using Hadoop on Azure. A GraySort is a benchmark sort whose metric is the sort rate (TB/minute) that is achieved while sorting a very large amount of data, usually a 100 TB minimum. This sample uses a more modest 10...
on
3 Oct 2012
Wiki Page:
The Hadoop on Azure Pegasus Page Rank Sample Tutorial
BradSevertson
Overview This tutorial shows how to deploy Pegasus from the Hadoop on Azure portal to compute the page rank for a simple 16-node graph. The rank calculated for a node is a measure of how well connected it is to the other nodes in the graph structure. A graph is type of abstract mathematical structure...
on
3 Oct 2012
Wiki Page:
Restoring HDFS Metadata from a Backup
Ed Price - MSFT
Transcript In the event of a catastrophic failure, your Hadoop cluster can be restored from a backup. In a previous video, Brad Sarsfield demonstrated how to configure and use the namenode backup service to save a copy of your HDFS metadata to your Azure storage account. In this video, he shows...
on
15 May 2012
Page 1 of 1 (21 items)
Can't find it? Write it!
Post an Article