Monitoring HBase

HBase is a distributed, open-source NoSQL database, initially conceived as an open-source alternative to Google’s proprietary BigTable. Originally, HBase was part of the Hadoop project, but it was eventually spun off as a subproject. Given this legacy, it is not surprising that HBase is most often deployed on top of a Hadoop cluster (it uses HDFS as its underlying storage), although a case study suggests that it can run on top of Amazon Elastic Block Store (EBS) as well. These days HBase is used by Adobe, Facebook, Twitter, Yahoo and many other companies to process large amounts of data in real time, since it is well suited to storing the input and/or the output of MapReduce jobs.

Monitoring HBase with Monitis and JMX

Like most products written in Java, both HBase and Hadoop contain built-in JMX instrumentation, which theoretically allows us to use any JMX client to view their performance metrics. Naturally, Monitis has just what the doctor ordered – a generic JMX agent that can be configured to monitor any JMX-enabled process through a point-and-click web interface.

Enabling JMX in HBase

By default the HBase JMX interface is disabled, but it can be enabled with relatively few configuration changes, as explained in the official HBase documentation. For security reasons, and especially since we are not going to use JMX to modify metric values or invoke administrative operations on the MBeans, I highly recommend that you do not add controlRole to jmxremote.passwd and jmxremote.access as the documentation suggests. Also, make sure these two files are owned by the login under which the HBase daemons will run and that their permissions are 600; otherwise HBase will not start.
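As a rough illustration of the advice above, the following Python sketch writes the two files with only the read-only monitorRole entry and owner-only permissions, then checks them. The function names, the configuration directory and the password are hypothetical; the file contents follow the standard JMX password and access file format (role name followed by a password or an access level).

```python
import os
import stat


def write_jmx_auth_files(conf_dir, monitor_password):
    """Write read-only JMX credentials with owner-only (600) permissions."""
    files = {
        # Only monitorRole is defined; controlRole is deliberately left out.
        "jmxremote.passwd": f"monitorRole {monitor_password}\n",
        "jmxremote.access": "monitorRole readonly\n",
    }
    for name, content in files.items():
        path = os.path.join(conf_dir, name)
        # Create the file with mode 600 so the password is never world-readable.
        fd = os.open(path, os.O_WRONLY | os.O_CREAT | os.O_TRUNC, 0o600)
        with os.fdopen(fd, "w") as handle:
            handle.write(content)
        # Re-apply the mode in case the file already existed with looser bits.
        os.chmod(path, 0o600)


def permission_problems(path, expected_uid):
    """Return a list of problems found with a JMX auth file (empty if fine)."""
    info = os.stat(path)
    problems = []
    mode = stat.S_IMODE(info.st_mode)
    if mode != 0o600:
        problems.append(f"{path}: mode is {mode:o}, expected 600")
    if info.st_uid != expected_uid:
        problems.append(f"{path}: owned by uid {info.st_uid}, expected {expected_uid}")
    return problems
```

Run it as the account that starts the HBase daemons, for example `write_jmx_auth_files("/path/to/hbase/conf", "a-strong-password")`, and pass that account's uid to `permission_problems` to confirm ownership.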

Installing the Monitis JMX Agent

The agent is implemented as a JEE web application and is packaged accordingly as a .war file. To download, log on to your Monitis account and select Monitors -> Manage Monitors -> JMX Monitors:

The JMX Monitors window will open. At the bottom of it, you will find a link to download the JMX Agent:

The .war file should be deployed in a standard JEE servlet container. While I recommend Tomcat due to its small footprint and ease of use, there are other options. If you already use an application server such as JBoss or WebSphere, you can deploy the JMX agent in it. While Tomcat offers many ways to deploy a .war file, the easiest one is to copy the .war file to Tomcat’s webapps folder. If Tomcat is already running, you don’t need to restart it: with the default auto-deploy setting it will pick up and deploy the new .war file automatically.
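The copy step can be scripted. The sketch below assumes a stock Tomcat layout, in which the auto-deployment directory is `webapps` under the Tomcat home directory; the function name and both paths are hypothetical. It copies under a temporary name first and renames afterwards, so Tomcat never sees a half-written .war file.

```python
import os
import shutil
from pathlib import Path


def deploy_war(war_path, catalina_home):
    """Copy a .war file into Tomcat's webapps directory and return the target path."""
    webapps = Path(catalina_home) / "webapps"
    if not webapps.is_dir():
        raise FileNotFoundError(f"no webapps directory under {catalina_home}")
    target = webapps / Path(war_path).name
    # Copy under a name Tomcat ignores, then rename in one step.
    temporary = webapps / (target.name + ".tmp")
    shutil.copy2(war_path, temporary)
    os.replace(temporary, target)
    return target
```

Tomcat derives the context path from the file name, so a file called mon_jmx_agent.war (an assumed name, inferred from the agent's URL) would be served under /mon_jmx_agent.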

Another consideration is which machine to deploy the monitor on. The HBase master would be the natural choice, but your decision is going to be influenced by your specific network topology and corporate standards. In any case, the JMX agent should be able to access the HBase master on TCP port 10101 and, optionally, the Region Servers (data nodes) on port 10102. Additionally, Tomcat needs to be accessible on port 8080 (the default).
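Before deploying, it can save time to confirm that these ports are actually reachable from the machine that will host the agent. The following is a minimal sketch using plain TCP connections; the host names in the usage note are placeholders, and a successful connection only shows that something is listening, not that it is HBase.

```python
import socket


def port_open(host, port, timeout=3.0):
    """Return True if a TCP connection to host:port succeeds within the timeout."""
    try:
        with socket.create_connection((host, port), timeout=timeout):
            return True
    except OSError:
        # Covers refused connections, timeouts and name resolution failures.
        return False


def check_endpoints(endpoints, timeout=3.0):
    """Map each endpoint label to True/False depending on reachability."""
    return {
        label: port_open(host, port, timeout)
        for label, (host, port) in endpoints.items()
    }
```

For example, `check_endpoints({"master JMX": ("hbase-master", 10101), "region server JMX": ("hbase-rs1", 10102), "Tomcat": ("agent-host", 8080)})` returns a dictionary showing which of the three connections succeeded.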

Once the JMX agent .war file is deployed, go to http://<server_name>:8080/mon_jmx_agent. You should see the JMX agent’s login page which looks like this:

Enter your Monitis credentials (the same ones you use to log in to your account on monitis.com) and click Login.

Creating an HBase Monitor in Monitis

Once it logs you in, the JMX Agent will prompt you to enter an Agent Name:

The Agent Name is used to uniquely identify the JMX Agent instance within Monitis. The metadata about the monitors is associated with your account on monitis.com, and the JMX agent will automatically download and run any existing monitors previously defined for this agent name. For this reason, you want to choose a unique name for each JMX agent deployment. Once you enter a meaningful name and click Save, you should see the JMX Parameters page:

Make sure you enter the correct JMX port number and credentials you configured in HBase and click Submit to go to the next page:
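For reference, a standard remote JMX client reaches a JVM's built-in management agent through a service URL built from the host and port. The sketch below shows that conventional URL form; whether the Monitis agent builds exactly this string internally is an assumption, and the function name is illustrative.

```python
def jmx_service_url(host, port):
    """Build the conventional RMI-based JMX service URL for a remote JVM."""
    if not 1 <= int(port) <= 65535:
        raise ValueError(f"invalid TCP port: {port}")
    return f"service:jmx:rmi:///jndi/rmi://{host}:{int(port)}/jmxrmi"
```

With the master port mentioned earlier, `jmx_service_url("hbase-master", 10101)` gives `service:jmx:rmi:///jndi/rmi://hbase-master:10101/jmxrmi`, which can also be pasted into a tool such as JConsole to verify the port and credentials independently of the agent.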

Select the hadoop domain from the drop-down; HBase’s MBeans live there for historical reasons. (You may also want to explore other domains, such as java.lang, which provides important information on the JVM’s internals.) Within the hadoop domain, select the HBase service -> RPC Statistics. The next screen shows an impressive number of metrics:

Under Monitor Name enter something meaningful. This is how your monitor will appear on monitis.com. Check interval is in minutes. Select the following attributes:

  • getNumOps
  • getAvgTime
  • getMinTime
  • getMaxTime
  • putNumOps
  • putAvgTime
  • putMinTime
  • putMaxTime

While most attribute names are self-explanatory, the MBean does not provide a meaningful description for the attributes, so feel free to examine the JMX section of the HBase book. Once you have selected all the metrics you are interested in, click on the Add Monitor button at the bottom of the page.
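The eight attributes listed above follow a simple pattern: an operation name (get or put) followed by a statistic. The sketch below generates the names from that pattern and, as one example of what can be derived from a sample, computes an operation-weighted average time across both operations. The sample dictionary is hypothetical data, and the time unit is whatever the MBean reports.

```python
OPERATIONS = ("get", "put")
STATS = ("NumOps", "AvgTime", "MinTime", "MaxTime")


def attribute_names(operations=OPERATIONS, stats=STATS):
    """Return attribute names such as getNumOps and putAvgTime, in list order."""
    return [f"{op}{stat}" for op in operations for stat in stats]


def combined_avg_time(sample, operations=OPERATIONS):
    """Average time per operation across all operations, weighted by NumOps.

    Returns None when no operations were recorded in the sample.
    """
    total_ops = sum(sample[f"{op}NumOps"] for op in operations)
    if total_ops == 0:
        return None
    weighted_time = sum(
        sample[f"{op}NumOps"] * sample[f"{op}AvgTime"] for op in operations
    )
    return weighted_time / total_ops
```

For a sample with 30 gets averaging 2 time units and 10 puts averaging 6, the combined average is (30 × 2 + 10 × 6) / 40 = 3.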

We are now ready to log on to Monitis and examine the data collected by our newly created monitor. If you are just logging on, Monitis will prompt you to add the new monitor, otherwise go to Monitors -> Manage Monitors -> JMX Monitors to open the familiar JMX Monitors screen:

Select the check box next to the new monitor and click Add to Window to open a new monitor window:

That’s it! As with any monitor, you can choose between multiple views and define notifications for your HBase performance metrics:


And finally, a few words about architecture. First, the JMX agent’s collector is implemented in the web application (the .war file). For this reason, you want to make sure that Tomcat (or whatever application server you deployed the agent’s .war file on) keeps running for as long as you need to collect data. You may want to monitor the application server process itself and make sure it starts up automatically when the system is booted. Second, when you log on to the JMX Agent’s web interface, your credentials are submitted over an unencrypted HTTP connection (at least with the default Tomcat setup). This might be OK if you are connecting over the corporate LAN; otherwise you should look into enabling HTTPS on Tomcat. Alternatively, you could front Tomcat with Apache and let it handle the encryption.
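One simple way to keep an eye on the application server is to request the agent's page periodically, for example from cron. The sketch below is an assumption-laden illustration: the function names are hypothetical, the path is the one used earlier in this post, and any 2xx or 3xx response is treated as "up".

```python
import urllib.error
import urllib.request


def agent_url(host, port=8080, scheme="http"):
    """Build the URL of the JMX agent's web interface."""
    return f"{scheme}://{host}:{port}/mon_jmx_agent"


def agent_responds(url, timeout=5.0):
    """Return True if the URL answers with a success or redirect status."""
    try:
        with urllib.request.urlopen(url, timeout=timeout) as response:
            return 200 <= response.status < 400
    except (urllib.error.URLError, OSError):
        # HTTP error statuses, refused connections and timeouts all end up here.
        return False
```

Once HTTPS is enabled, the same check works with `agent_url("agent-host", 8443, "https")`, assuming the connector listens on port 8443 and presents a certificate the client trusts.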

In this installment we introduced the Monitis JMX Agent and hopefully showed you how easy it is to monitor your HBase cluster. In a future post you will see how you can use the agent to monitor JBoss. Happy monitoring!


More Stories By Hovhannes Avoyan

Hovhannes Avoyan is the CEO of PicsArt, Inc.,
