Hadoop allows businesses to quickly analyze very large data sets, and Microsoft Excel is a ready-to-go front end for data analysis. This lab explores a simple Excel front end to HBase MapReduce that lets you view HBase tables and execute MapReduce jobs. The goal is to make the add-in generic with respect to the column definitions and data in an HBase table.
HBase is a distributed, scalable, big-data storage and retrieval system developed as part of the Apache Hadoop project. This lab explores building a simple .NET LINQ provider for HBase, combining the Java API for creating, managing, and scanning HBase tables with the .NET LINQ provider interfaces. In addition, the project investigates LINQ support for HBase MapReduce jobs written in Java by adding an extension to the LINQ query syntax.
The Apache Hadoop framework enables distributed processing of very large data sets. Hadoop is written in Java and offers only limited ways to integrate mapreducers written in other languages. This lab demonstrates how you can use JNBridgePro to program directly against the Java-based Hadoop API to create .NET-based mapreducers.