Cloudera Day in DC: Cloudera Manager and Enterprise

Editor’s note: In this post,  provides more context on Big Data gleaned from the 26 Jan 2012 Cloudera Day in DC.-bg

Another valuable pannel at the DC Cloudera Day was Todd Lipcon’s look into Hadoop management software Cloudera Manager available through Cloudera Enterprise. Cloudera is in the business of making Hadoop, the open source Big Data storage and analysis platform, easier for enterprises to adopt and, though the first step is their Cloudera Distribution Including Apache Hadoop (CDH), enterpise deployments will likely need additional help managing their clusters. While Google can ship a bus of computer science PhDs in from Stanford whenever they have a problem, most businesses and government agencies don’t have those kinds of resources available. Cloudera Enterprise and Manager allows the rest of us to build Hadoop systems up, predict issues, solve problems, and make improvements.

Cloudera Manager is the first end-to-end management tool for Apache Hadoop. It’s available in a Free Edition for download from Cloudera’s website which allows users to install, configure, and perform basic management for Hadoop clusters up to 50 nodes. The Enterpise Edition, with a number of more advanced features, is available through Cloudera Enterprise subscription service, which also includes Cloudera Support.

Manager greatly reduces the chance of operator error. It runs checks and validations on your code and creates a complete audit trail of changes to the system. Manager annotates changes and correlates them with performance to measure results and uncover mistakes. If you do harm your cluster’s performance, manager can automatically roll back the changes.

Manager also provides insight into the performance of your Hadoop cluster by tracking trends and alerting the user if a job is running slower than usual and by how much. If Hadoop fails, it can tell you what events occurred and what was going on with the data when it happened. Like Splunk, Manager also tracks and allows searches on log data. It performs all of these functions with minimal overhead, requiring at most 1% CPU and often much less, and continued to perform well even in thousand node clusters.

At the end of his panel, Lipcon offered some insight into what comes next for Cloudera Enterprise and Manager. New capabilities are being developed to make the most of the upcoming CDH4. CDH4 will have a secondary name node in case the first fails, so the next version of Manager will provide failover management and multiple-namespace management. CDH4 will also implement an updated version of MapReduce, so Manager will include MapReduce2 service and configuration tools.

Cloudera, along with numerous other key players in Big Data, were also present at yesterday’s Carahsoft Government Big Data Forum. Check back for upcoming recaps of panels, speakers, and technology in the coming days and weeks.

Sign up for your free CTOvision Pro trial today for unique insights, exclusive content and special reporting.

CTOvision Pro Special Technology Assessments

We produce special technology reviews continuously updated for CTOvision Pro members. Categories we cover include:

  • Analytical Tools - With a special focus on technologies that can make dramatic positive improvements for enterprise analysts.
  • Big Data - We cover the technologies that help organizations deal with massive quantities of data.
  • Cloud Computing - We curate information on the technologies enabling enterprise use of the cloud.
  • Communications - Advances in communications are revolutionizing how data gets moved.
  • GreenIT - A great and virtuous reason to modernize!
  • Infrastructure  - Modernizing Infrastructure can have dramatic benefits on functionality while reducing operating costs.
  • Mobile - This revolution is empowering the workforce in ways few of us ever dreamed of.
  • Security  -  There are real needs for enhancements to security systems.
  • Visualization  - Connecting computers with humans.
  • Hot Technologies - Firms we believe warrant special attention.

 

Recent Research

USN Quarterly Industry Day at Charleston: What you need to know to compete

Request Your Invite to the 20 May 2014 Andreessen Horowitz Fed Forum in DC

Amazon Hopeful that Fire TV will Spread

What The Enterprise IT Professional Needs To Know About Git and GitHub

3D Printing… At Home?

Tech Firms Seeking To Serve Federal Missions: Here is how to follow the money

Creating The New Cyber Warrior: Eight South Carolina Universities Compete

Mobile Gamers: Fun-Seeking but Fickle

Update from DIA CTO, CIO and Chief Engineer on ICITE and Enterprise Apps

Pew Report: Increasing Technology Use among Seniors

Finding The Elusive Data Scientist In The Federal Space

DoD Public And Private Cloud Mandates: And insights from a deployed communications professional on why it matters

solid