Cloudera Day in DC: Cloudera Manager and Enterprise

Editor’s note: In this post,  provides more context on Big Data gleaned from the 26 Jan 2012 Cloudera Day in DC.-bg

Another valuable pannel at the DC Cloudera Day was Todd Lipcon’s look into Hadoop management software Cloudera Manager available through Cloudera Enterprise. Cloudera is in the business of making Hadoop, the open source Big Data storage and analysis platform, easier for enterprises to adopt and, though the first step is their Cloudera Distribution Including Apache Hadoop (CDH), enterpise deployments will likely need additional help managing their clusters. While Google can ship a bus of computer science PhDs in from Stanford whenever they have a problem, most businesses and government agencies don’t have those kinds of resources available. Cloudera Enterprise and Manager allows the rest of us to build Hadoop systems up, predict issues, solve problems, and make improvements.

Cloudera Manager is the first end-to-end management tool for Apache Hadoop. It’s available in a Free Edition for download from Cloudera’s website which allows users to install, configure, and perform basic management for Hadoop clusters up to 50 nodes. The Enterpise Edition, with a number of more advanced features, is available through Cloudera Enterprise subscription service, which also includes Cloudera Support.

Manager greatly reduces the chance of operator error. It runs checks and validations on your code and creates a complete audit trail of changes to the system. Manager annotates changes and correlates them with performance to measure results and uncover mistakes. If you do harm your cluster’s performance, manager can automatically roll back the changes.

Manager also provides insight into the performance of your Hadoop cluster by tracking trends and alerting the user if a job is running slower than usual and by how much. If Hadoop fails, it can tell you what events occurred and what was going on with the data when it happened. Like Splunk, Manager also tracks and allows searches on log data. It performs all of these functions with minimal overhead, requiring at most 1% CPU and often much less, and continued to perform well even in thousand node clusters.

At the end of his panel, Lipcon offered some insight into what comes next for Cloudera Enterprise and Manager. New capabilities are being developed to make the most of the upcoming CDH4. CDH4 will have a secondary name node in case the first fails, so the next version of Manager will provide failover management and multiple-namespace management. CDH4 will also implement an updated version of MapReduce, so Manager will include MapReduce2 service and configuration tools.

Cloudera, along with numerous other key players in Big Data, were also present at yesterday’s Carahsoft Government Big Data Forum. Check back for upcoming recaps of panels, speakers, and technology in the coming days and weeks.

CTOvision Pro Special Technology Assessments

We produce special technology reviews continuously updated for CTOvision Pro members. Categories we cover include:

  • Analytical Tools - With a special focus on technologies that can make dramatic positive improvements for enterprise analysts.
  • Big Data - We cover the technologies that help organizations deal with massive quantities of data.
  • Cloud Computing - We curate information on the technologies enabling enterprise use of the cloud.
  • Communications - Advances in communications are revolutionizing how data gets moved.
  • GreenIT - A great and virtuous reason to modernize!
  • Infrastructure  - Modernizing Infrastructure can have dramatic benefits on functionality while reducing operating costs.
  • Mobile - This revolution is empowering the workforce in ways few of us ever dreamed of.
  • Security  -  There are real needs for enhancements to security systems.
  • Visualization  - Connecting computers with humans.
  • Hot Technologies - Firms we believe warrant special attention.


Recent Research

Finding The Elusive Data Scientist In The Federal Space

DoD Public And Private Cloud Mandates: And insights from a deployed communications professional on why it matters

Intel CEO Brian Krzanich and Cloudera CSO Mike Olson on Intel and Cloudera’s Technology Collaboration

Watch For More Product Feature Enhancements for Actifio Following $100M Funding Round

Navy Information Dominance Corps: IT still searching for the right governance model

DISA Provides A milCloud Overview: Looks like progress, but watch for two big risks

Innovators, Integrators and Tech Vendors: Here is what the government hopes they will buy from you in 2015

Navy continues to invest in innovation: Review their S&T efforts here

MSPA Unified Certification Standard For Cloud Service Providers: Is This A Commercial Version of FedRamp?

Watch Ben Fry And His Visualizations: Multiple use-cases come to mind, including national security efforts

Agenda And More Details for 4-5 March NIST Data Science Symposium

Actionable Insights From AFCEA Western Conference and Exposition 2014