Browse by Tags

Tagged Content List
  • Wiki Page: Mahout on Windows Azure - Machine Learning Using Microsoft HDInsight

    Introduction One of the Microsoft HDInsight key components is Mahout, a scalable machine learning library that provides a number of algorithms relying on the Hadoop platform. Machine learning supports a wide range of use cases from email spam filtering to fraud detection to recommending books...
  • Wiki Page: Online Classification for Big Data

    Highlighting my experience during coding of Stochastic Gradient Descent algorithm Stochastic Gradient Descent is an online classification algorithm. This algorithm proves to be very efficient in classification of huge big data problems. Unlike Logistic algorithm, which is somewhat...
  • Wiki Page: About Big Data Documentation

    The Big Data content team is a v-team. Currently, the writers working on Big Data are Jonathan Gao, Wesley McSwain, Larry Franks, and Brad Severtson. The content publishing manager is Mohamed Ibrahim. The content PM is Susan Joly. The content team wrote documentation for the first releases of Isotope...
  • Wiki Page: Big Data – Beginning of New Era of Analytics in 2013

    In today’s digital world, data has been increasing at a very fast pace which demands organizations to focus on data and business intelligence. There is a drastic increase in technologies which rapidly evaluates massive amounts and varieties of data flowing from different devices, sensors, mobiles,...
  • Wiki Page: Working with NoSQL Databases

    Table of Contents How to get started with NoSQL? What NoSQL databases are present today? .NET APIs Cassandra CouchDB MongoDB Tokyo Cabinet Further Reading See Also Other Languages Deutsch (de-DE) Italian (it-IT) Português (pt-BR) How to get started with NoSQL? Since 2009 NoSQL databases becomes...
  • Wiki Page: Key-value stores (No SQL Databases)

    A key - value store is a sub-category of NoSQL Databases (to start working with NoSQL Databases or to know what they are, refer : Working with NoSQL Databases ) They allow the application to store its data in a schema-less way. The data, however can be stored in a user-defined data type or object...
  • Wiki Page: Microsoft HDInsight (Big Data) Solution

    This article serves as a single point repository for all content, resources, links, information and latest updates about Big Data, and Microsoft's activities around it. Contributions from everyone are most welcome. Note1: Please see, all the articles are listed in proper order. Articles...
  • Wiki Page: A Lap Around HDInsight

    (cross-posted from The Blog @ Graemesplace and the Content Master Technology Blog ) I’m currently working with Microsoft’s Patterns and Practices team, researching and documenting best practices guidance for big data analysis with HDInsight. For those of you who may not know, HDInsight is Microsoft...
  • Wiki Page: How to Connect Excel to Hadoop on Azure via HiveODBC

    One key feature of Microsoft’s Big Data Solution is solid integration of Apache Hadoop with the Microsoft Business Intelligence (BI) components. A good example of this is the ability for Excel to connect to the Hive data warehouse framework in the Hadoop cluster. This section walks you through using...
  • Wiki Page: Introduction to the Hadoop Services on Azure Hive Console (video)

    The Microsoft deployment of Apache Hadoop for Windows lets you set up a private Hadoop cluster on Azure. One of the included administration/deployment tools is an Interactive Console for JavaScript and Hive. This video introduces the Interactive Hive console. Developer Lengning Liu demonstrates running...
  • Wiki Page: Run the Pi Estimator Sample on Hadoop Services for Windows Azure (video)

    http://youtu.be//w0BpLawwmKI Hadoop-based Services for Windows Azure includes several samples you can use for learning and testing.In this video, Developer Brad Sarsfield walks you through the Pi Estimator sample. See Also More Videos about Hadoop Services on Windows and Windows Azure ...
  • Wiki Page: Microsoft and Hadoop - Windows Azure HDInsight

    Traditionally Microsoft Windows used to be a sort of stepchild in Hadoop world – the ‘hadoop’ command to manage actions from command line and the startup/shutdown scripts were written in Linux/*nix in mind assuming bash. Thus if you wanted to run Hadoop on Windows, you had to install cygwin . Also...
  • Wiki Page: Big Data Documentation

    The Big Data content team is a v-team. Currently, the writers working on Big Data are Jonathan Gao, Wesley McSwain, Larry Franks, and Brad Severtson. The content publishing manager is Mohamed Ibrahim. The content PM is Susan Joly. The content team wrote documentation for the first releases of Isotope...
  • Wiki Page: Use PowerPivot to Access Hive on Windows Azure (video)

    This screencast shows you how to use Excel PowerPivot to access data from Hive on Windows Azure. It includes opening the ODBC Server port for a Hadoop cluster on Windows Azure, downloading and installing the Hive ODBC Driver, and creating an ODBC DSN pointing to a Hive data warehouse running on Windows...
  • Wiki Page: Use Excel Hive Add-in to Access Hive on Windows Azure (video)

    This screencast shows how to use Excel Hive Add-in to import data from Hive on Windows Azure. It includes opening the HiveODBC port of Hadoop Services on Windows Azure, importing sample data into Hive on Windows Azure, and access the data using the add-in. You can use the same procedure to connect to...
  • Wiki Page: Part 2: 10GB GraySort - Terasort (video)

    Transcript Hadoop-based Services for Windows Azure includes several samples you can use for learning and testing. One sample is the 10GB GraySort which is a scaled-down version of the Hadoop Terasort benchmark. There are three jobs to run and in this video, Developer Brad Sarsfield walks you through...
  • Wiki Page: Part 3: 10GB GraySort - Teravalidate (video)

    Transcript Hadoop-based Services for Windows Azure includes several samples you can use for learning and testing. One sample is the 10GB GraySort which is a scaled-down version of the Hadoop Terasort benchmark. There are three jobs to run and in this video, Developer Brad Sarsfield walks you through...
  • Wiki Page: Back Up Hadoop HDFS Metadata to Azure

    Transcript To protect against catastrophic failure, always backup your Hadoop-based Services for Windows HDFS metadata. In this video, Brad Sarsfield shows you how to take periodic snapshots of your data and upload that data to an Azure storage account. In the next video, he shows you how to restore...
  • Wiki Page: Restoring HDFS Metadata from a Backup

    Transcript In the event of a catastrophic failure, your Hadoop cluster can be restored from a backup. In a previous video, Brad Sarsfield demonstrated how to configure and use the namenode backup service to save a copy of your HDFS metadata to your Azure storage account. In this video, he shows...
  • Wiki Page: Video Plan: Hadoop-based Service for Windows and Windows Azure

    Introduction This document describes the video content plans for Hadoop-based Service for Windows and Windows Azure. This Plan complements the Documentation Plan . Goals Identify what video content will be produced Identify when video content will be delivered Identify...
  • Wiki Page: Microsoft Big Data Community

    Link to the Microsoft Big Data Community Plan for FY12. Link to the Events tracking spreadsheet . Visit Project Heathrow for community engagement tips, tricks and strategies.
  • Wiki Page: Import Data from Windows Azure Marketplace to Hadoop Services on Azure

    http://youtu.be//3OOmV_d0Vsk This screencast shows how to import data from Windows Azure Marketplace to Apache Hadoop-based Services for Windows Azure. It demonstrates how to collect the Windows Azure Marketplace account information, and import the data by using the Apache Hadoop for Windows...
Page 1 of 1 (22 items)
Can't find it? Write it!