Blog

HDInsight Of Azure

  • (5.0)
  • | 2903 Ratings
HDInsight Of Azure

Big data described as bulk information. Hadoop is an open-source, Java-based programming framework that supports the processing and storage of Big Data.  A computer cluster is a set of connected computers that can work together as a single system.  Hadoop Clusters are such type of computer clusters that can store, analyse big data which are structured and unstructured. Azure HDInsight deploys these Azure Hadoop clusters in the cloud using the Hortonworks Data Platform (HDP) Hadoop distribution. It also consists of Apache HBase which is a tabular NoSQL database that provides a real-time access to data in HDFS. Apache Storm is a stream analytics platform for processing real-time events like sensors. 

Features of Azure HDInsights

  • It is mainly used to create, manage and analyse data report statistics on big data. 
  • With the help of virtual machines, you can quickly deploy the system from the Azure portal.
  • You can implement any number of nodes in a cluster.
  • Pay for the service you used.
  • Reuse the cluster to another when a specific job is completed or you can stop using it.
  • The HDInsight service can work with on- Microsoft vendors.
  • It is cost-effective to collect and store structured or unstructured data.
  • Extracting undiscovered information from big quantities of unstructured data is easy.
  • Hadoop cluster can be built within minutes.
  • The RESTful API to perform create, read, update, delete (CRUD) operations on text or binary data, like video, audio and images given by the client.
  • The flat network storage system technology offers a high-speed connection between nodes and blob storage system.
  • The master-slave pattern of Insight allows central node or master node to operate and control the cluster centrally. The secondary nodes are integrated into Azure deployments.

The Insights provided by Azure HD are:

  • Disk Usage
  • Utilization of CPU
  • Cluster Load
  • Memory Used
  • Network Used
  • AJAX Calls of a website like no.of views, no.of clicks on particular event etc.

Azure HDinsights Storage Services

It is a general-purpose storage system connected to compute nodes. By storing the data in Azure Storage one has the benefits of data sharing, data achieving, geo-replication and elastic scaling capabilities.  These enable data recovery and redundancy.  The scale-out file system automatically scaled depending upon a number of nodes connected to the cluster.  Every time when a cluster is generated, there is no need to reload the data. Even after the original HDInsight cluster is deleted, you can still use the default storage container.

Azure HDinsights Storage Services

Limitation to Storage Services

  • The Insight Storage service account located in a different location other than the HDInsight cluster location is not supported.
  • Blob storage accounts are not supported.
  • Sharing the default Blob container with multiple HDInsight clusters might corrupt job history kind of cluster-specific information stored in Blob Container.

Live Scenario on Azure HDInsights

Let us consider a healthcare monitoring development and operational cycle.

Scenario on Azure HDInsights

The above is a health care monitoring process that happens in any hospital. Using Azure HDInsights, you can have time-to-time monitoring on each process, the status of servers and finally depicts faults and errors if occurred. Azure Insight is deployed in the healthcare product. 

After registering the hospital application in the Azure portal and when you start running it, you get the overall performance of healthcare application as given in the below figure.

Performance Metrics

Overall Application Performance Metrics

Fig: Overall Application Performance Metrics

It shows Browser metrics like page views, page load time, request on each page, each session etc 

Browser metrics

The failures and errors occurred while performing a task in the  application like server exceptions, page faults, data dependency failures etc 

Failures Metrics

Fig: Failures Metrics


Subscribe For Free Demo

Free Demo for Corporate & Online Trainings.

Anji Velagana
About The Author

Anji Velagana is working as a Digital Marketing Analyst and Content Contributor for Mindmajix. He writes about various platforms like Servicenow, Business analysis,  Performance testing, Mulesoft, Oracle Exadata, Azure, and few other courses. Contact him via anjivelagana@gmail.com and LinkedIn.


DMCA.com Protection Status
Close
Close