Browse by Tags

Tagged Content List
  • Wiki Page: Running HDInsight C# Hadoop Streaming Sample

    MapReduce is a programming model designed for processing large volumes of data in parallel by dividing the work into a set of independent tasks. Most of the MapReduce jobs are written in Java. Hadoop provides a streaming API to MapReduce that enables you to write map and reduce functions in languages...
  • Wiki Page: HDInsight Services For Windows

    This article is the main portal for technical information about HDInsight Services for Windows and related Microsoft technologies. It provides a brief overview of Apache Hadoop, as well as information for the HDInsight Services provided by Microsoft for deployment on both Windows and Windows Azure...
  • Wiki Page: Big Data – Beginning of New Era of Analytics in 2013

    In today’s digital world, data has been increasing at a very fast pace which demands organizations to focus on data and business intelligence. There is a drastic increase in technologies which rapidly evaluates massive amounts and varieties of data flowing from different devices, sensors, mobiles,...
  • Wiki Page: Microsoft HDInsight (Big Data) Solution

    This article serves as a single point repository for all content, resources, links, information and latest updates about Big Data, and Microsoft's activities around it. Contributions from everyone are most welcome. Note1: Please see, all the articles are listed in proper order. Articles...
  • Wiki Page: Analyzing Twitter Data with Hive in HDInsight and SteamInsight

    In this tutorial you will query, explore, and analyze data from twitter using Apache™ Hadoop™-based Services for Windows Azure and a Hive query in Excel. Social web sites are one of the major driving forces for Big Data adoption. Public APIs provided by sites like Twitter are a useful source of data...
  • Wiki Page: Working With Data in Windows Azure HDInsight Service

    This tutorial covers several techniques for storing and importing data for use in Hadoop MapReduce jobs run with Windows Azure HDInsight Service ( formerly Apache™ Hadoop™-based Services for Windows Azure). Apache Hadoop is a software framework that supports data-intensive distributed applications...
  • Wiki Page: Introduction to HDInsight Services for Windows Azure

    Overview HDInsight Services for Windows Azure is a service that deploys and provisions Apache™ Hadoop™ clusters in the cloud, providing a software framework designed to manage, analyze and report on big data. Data is described as "big data" to indicate that it is being collected in ever...
  • Wiki Page: Hadoop on Azure WordCount Sample Tutorial

    Overview This tutorial shows two ways to use Hadoop on Azure to run a MapReduce program that counts word occurences in a text. First, with a Hadoop .jar file by using the Create Job UI. Second, with a query by using the fluent API layered on Pig that is provided by the Interactive Console . The...
  • Wiki Page: How to Connect Excel to Hadoop on Azure via HiveODBC

    One key feature of Microsoft’s Big Data Solution is solid integration of Apache Hadoop with the Microsoft Business Intelligence (BI) components. A good example of this is the ability for Excel to connect to the Hive data warehouse framework in the Hadoop cluster. This section walks you through using...
  • Wiki Page: Analyzing Twitter Movie Data with Hive in HDInsight

    In this tutorial you will query, explore, and analyze data from twitter using Apache™ Hadoop™-based Services for Windows Azure and a Hive query in Excel. Social web sites are one of the major driving forces for Big Data adoption. Public APIs provided by sites like Twitter are a useful source of data...
  • Wiki Page: Hadoop

    This article gives a brief overview of Apache Hadoop and directs you to Wiki articles that can give you more in-depth information about specific areas of Hadoop. Table of Contents Overview WindowsAzure.com TechNet Wiki Articles Community Resources Blog Posts Overview Apache Hadoop is an open...
  • Wiki Page: HDInsight Scenario: Query a Web Log via HiveQL

    The purpose of this wiki post is to provide an example scenario on how to work with Hadoop on Azure, upload a web log sample file via secure FTP, and run some simple HiveQL queries. Important! This wiki topic may be obsolete. The wiki topics on Windows Azure HDInsight Service are no...
  • Wiki Page: Microsoft and Hadoop - Windows Azure HDInsight

    Traditionally Microsoft Windows used to be a sort of stepchild in Hadoop world – the ‘hadoop’ command to manage actions from command line and the startup/shutdown scripts were written in Linux/*nix in mind assuming bash. Thus if you wanted to run Hadoop on Windows, you had to install cygwin . Also...
Page 1 of 1 (13 items)
Can't find it? Write it!