Apache Storm Interview Questions And Answers

The Mindmajix team has collected these questions from trainees who completed Apache Storm training and went on to clear interviews at MNCs around the world. The most common ones are listed below to help you prepare for your own interview. All the answers were written by our experienced trainers and cover the concepts interviewers expect.

Q1) Apache Storm vs. Kafka

Comparison | Storm | Kafka
Data source | Kafka and other databases | Facebook and Twitter
Data storage | Data exchange between input and output streams | EXT4/XFS file system
Dependency | Depends on ZooKeeper for coordination | Depends on ZooKeeper
Inventor | BackType (open-sourced by Twitter) | LinkedIn
Language support | All languages supported | Supports all, but Java is recommended
Latency | Millisecond latency | Depends on the data source; generally less than 1-2 seconds
Primary use | Stream processing | Message broker
Stream processing | Tuple-at-a-time (micro-batch with Trident) | Small-batch processing
Type | Real-time message processing | Distributed messaging system

Q2) What is Apache Storm?

Apache Storm is a free, open-source distributed system for real-time computation. It reliably processes unbounded streams of data, doing for real-time processing what Hadoop MapReduce does for batch processing.

Q3) What is the latest version of Apache Storm?

Version 1.2.1, released in February 2018, was the latest at the time this article was written.

Q4) Which programming languages are supported by Apache Storm?

Storm is not tied to a specific language. Topologies are defined as Thrift structures, and non-JVM components talk to Storm through its multi-lang protocol, so spouts and bolts can be written in any programming language.

Q5) What are the components of Apache Storm?

Nimbus, ZooKeeper, and Supervisor are the components of an Apache Storm cluster.

Q6) What is Nimbus used for?

Nimbus is the daemon that runs on the master node. It distributes the topology code around the cluster, assigns tasks to the worker nodes, and monitors them for failures. When a worker fails, Nimbus reassigns its work to other available workers.
Note: Nimbus is similar to the JobTracker in Hadoop.

Q7) What is Zookeeper?

ZooKeeper coordinates the nodes of a Storm cluster: Nimbus and the Supervisors communicate and keep their state through it. Because ZooKeeper is used only for coordination and not for passing messages, Storm places little load on it.

Q8) What is the Supervisor used for?

The Supervisor receives assignments from Nimbus through ZooKeeper and starts or stops worker processes accordingly. Nodes that run a Supervisor are also known as worker nodes.

Q9) What are the features of Apache Storm?

  • Reliable - Storm guarantees that every tuple is processed.
  • Scalable - A topology runs in parallel across a cluster of machines, so capacity grows by adding nodes.
  • Robust - Storm restarts workers that hit an error or fault, while the other workers on the node keep running uninterrupted.
  • Easy to operate - Standard configurations make it easy to deploy and use.
  • Quick - Each node can process one million 100-byte messages per second.

Q10) What are the components of data stream flow in Storm?

A data stream flows through three components:

  • Tuple - The unit of data that flows through the stream
  • Spout - The source of the stream
  • Bolt - Processes the tuples

Q11) How can log files be streamed?

First, configure a spout to read the log files and emit each line as a tuple. Then assign the stream to a bolt that analyses the lines.
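The spout-then-bolt flow can be sketched in plain Python. This is only an illustration of the data flow and does not use the Storm API; the function names, the sample lines, and the "date time LEVEL message" log format are all assumptions made for the example.

```python
from collections import Counter
from typing import Iterable, Iterator, Tuple


def log_line_spout(lines: Iterable[str]) -> Iterator[Tuple[str]]:
    """Plays the role of a spout: emit one tuple per non-empty log line."""
    for line in lines:
        line = line.rstrip("\n")
        if line:
            yield (line,)


def level_count_bolt(tuples: Iterable[Tuple[str]]) -> Counter:
    """Plays the role of a bolt: count the lines seen for each log level."""
    counts = Counter()
    for (line,) in tuples:
        # Assumed (hypothetical) format: "<date> <time> <LEVEL> <message>"
        parts = line.split(maxsplit=3)
        level = parts[2] if len(parts) >= 3 else "UNKNOWN"
        counts[level] += 1
    return counts


# Hypothetical log content standing in for a real file.
sample_log = [
    "2018-02-15 10:00:01 INFO worker started\n",
    "2018-02-15 10:00:02 ERROR connection refused\n",
    "\n",
    "2018-02-15 10:00:03 INFO tuple acked\n",
]

level_counts = level_count_bolt(log_line_spout(sample_log))
print(dict(level_counts))  # {'INFO': 2, 'ERROR': 1}
```

In a real topology the spout and the bolt would be separate components running in parallel on the cluster, with Storm moving the tuples between them.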

 


 

Q12) What are the types of stream groupings in Apache Storm?

Storm provides shuffle, fields, partial key, all, global, none, direct, and local-or-shuffle groupings.
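A grouping decides which task of the receiving bolt gets each tuple. The sketch below models four of the groupings in plain Python under simplifying assumptions: it is not Storm's implementation, the function names are made up for the example, and the CRC32 hash is just one stable hash chosen for illustration.

```python
import random
import zlib
from typing import List, Sequence


def shuffle_grouping(num_tasks: int, rng: random.Random) -> int:
    """Pick a random task. Storm additionally balances the load evenly."""
    return rng.randrange(num_tasks)


def fields_grouping(values: Sequence[object], num_tasks: int) -> int:
    """Same field values always map to the same task (hash, then modulo)."""
    key = "|".join(str(v) for v in values).encode("utf-8")
    return zlib.crc32(key) % num_tasks


def all_grouping(num_tasks: int) -> List[int]:
    """Every task receives a copy of the tuple."""
    return list(range(num_tasks))


def global_grouping(num_tasks: int) -> int:
    """The whole stream goes to a single task, the one with the lowest id."""
    return 0


rng = random.Random(0)  # seeded only to make the demo repeatable
print("shuffle ->", shuffle_grouping(4, rng))
print("fields  ->", fields_grouping(("alice",), 4), fields_grouping(("alice",), 4))
print("all     ->", all_grouping(4))
print("global  ->", global_grouping(4))
```

The property worth remembering for fields grouping is that the two calls with the same value always return the same task index, which is what makes per-key counting in a bolt possible.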

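Q13) What is topology.message.timeout.secs (TOPOLOGY_MESSAGE_TIMEOUT_SECS) used for?

It sets the maximum time a topology has to fully process a message emitted by a spout. If the message is not acknowledged within that time, Storm considers it failed and fails it on the spout, which can then replay it. The default is 30 seconds.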
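The timeout rule can be illustrated with a small bookkeeping class. This is a simplified model and not how Storm tracks tuples internally; the class name, method names, and message ids are hypothetical, and time is passed in explicitly so the example stays deterministic.

```python
from typing import Dict, List


class PendingTuples:
    """Tracks emitted messages and reports the ones that timed out."""

    def __init__(self, timeout_secs: float = 30.0) -> None:
        self.timeout_secs = timeout_secs
        self._emitted_at: Dict[str, float] = {}

    def emit(self, msg_id: str, now: float) -> None:
        # Remember when the spout emitted this message.
        self._emitted_at[msg_id] = now

    def ack(self, msg_id: str) -> None:
        # A fully processed message is no longer pending.
        self._emitted_at.pop(msg_id, None)

    def expire(self, now: float) -> List[str]:
        # Messages pending longer than the timeout are treated as failed.
        failed = [m for m, t in self._emitted_at.items()
                  if now - t > self.timeout_secs]
        for m in failed:
            del self._emitted_at[m]
        return failed


pending = PendingTuples(timeout_secs=30.0)
pending.emit("a", now=0.0)
pending.emit("b", now=0.0)
pending.ack("a")                   # "a" was processed in time
failed = pending.expire(now=31.0)  # "b" has been pending for 31 seconds
print(failed)                      # ['b']
```

A spout would typically re-emit the failed ids, which is how a timeout leads to a replay.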

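Q14) How do you use Apache Storm as a proxy server?

Storm is not a proxy server. The mod_proxy module belongs to Apache HTTP Server, a separate Apache project, which can act as a proxy server when that module is enabled. Storm is a stream processing system.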

Q15) What is ZeroMQ?

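ZeroMQ is a messaging library. Early versions of Storm used it as the transport that lets the tasks of a topology pass tuples to each other across worker processes; later releases use Netty by default.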

Q16) What is the command to kill a Storm topology?

storm kill {mindmajix_topology}
Where mindmajix_topology is the name of the topology.

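Q17) Why is Apache Storm not provided with SSL?

The premise applies to early Apache HTTP Server distributions, which shipped without SSL to avoid legal or bureaucratic issues around cryptography. Storm is a separate project, and its documentation covers securing a cluster, including Kerberos authentication.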

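Q18) Apache Storm has a search engine. True or False?

False. Storm does not include a search engine for data. When search is needed, a bolt typically writes the processed tuples to a separate search or indexing system.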

Q19) What Java elements are supported by Apache Storm?

Storm runs on the JVM, and Java is its primary API. Spouts, bolts, and topologies are typically written as Java classes.

Q20) What ASP (Active Server Pages) elements are supported by Apache Storm?

None. Apache Storm has no ASP support.

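Q21) What is a combiner aggregator?

A combiner aggregator is a Trident aggregator that combines a set of tuples into a single field. It defines three operations: init turns a tuple into a value, combine merges two values, and zero supplies the value used when there are no tuples.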
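Trident's CombinerAggregator interface is built around init, combine, and zero operations. The sketch below mirrors that shape in plain Python to show how a count is built up; the class, the helper function, and the sample partitions are illustrative assumptions, not Trident code.

```python
from functools import reduce
from typing import Iterable, Tuple


class CountCombiner:
    """Counts tuples using the init / combine / zero pattern."""

    def init(self, tup: Tuple) -> int:
        return 1          # each tuple contributes a count of one

    def combine(self, a: int, b: int) -> int:
        return a + b      # merge two partial counts

    def zero(self) -> int:
        return 0          # result when there are no tuples


def aggregate(combiner: CountCombiner, tuples: Iterable[Tuple]) -> int:
    """Fold a batch of tuples into one value with the combiner."""
    return reduce(combiner.combine,
                  (combiner.init(t) for t in tuples),
                  combiner.zero())


counter = CountCombiner()
partition_1 = [("a",), ("b",)]
partition_2 = [("c",)]

# Partial results can be computed per partition and combined afterwards.
partial_1 = aggregate(counter, partition_1)
partial_2 = aggregate(counter, partition_2)
total = counter.combine(partial_1, partial_2)
print(total)  # 3
```

Because combine is applied to partial results, each partition can be reduced locally before anything is sent over the network.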

Q22) What are the components of a running topology?

Three entities collectively run a topology in a cluster:

  • Worker processes
  • Executors
  • Tasks

A worker process executes a subset of a topology. It runs one or more executors that belong to one or more components (i.e., spouts or bolts).

An executor is a thread spawned by a worker process. It runs one or more tasks of the same component.

A task performs the actual data processing. The number of tasks of a component stays the same for the lifetime of the topology, while the number of executors can change. One executor may run several tasks, so there are never more threads than tasks:

#threads (less than or equal to) #tasks
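Storm's documentation states the relationship as #threads ≤ #tasks, since each executor thread runs one or more tasks. The sketch below spreads the tasks of one component over its executors as evenly as possible. The function name and the even-split strategy are assumptions for illustration, and rejecting more executors than tasks is a choice made for this sketch rather than Storm's documented behavior.

```python
from typing import List


def assign_tasks(num_tasks: int, num_executors: int) -> List[List[int]]:
    """Return, for each executor, the list of task ids it runs."""
    if num_executors < 1:
        raise ValueError("at least one executor is required")
    if num_executors > num_tasks:
        # Sketch-only rule: enforce #threads <= #tasks.
        raise ValueError("number of executors must not exceed number of tasks")

    base, extra = divmod(num_tasks, num_executors)
    assignment: List[List[int]] = []
    next_task = 0
    for i in range(num_executors):
        # The first `extra` executors take one additional task each.
        size = base + (1 if i < extra else 0)
        assignment.append(list(range(next_task, next_task + size)))
        next_task += size
    return assignment


print(assign_tasks(4, 2))  # [[0, 1], [2, 3]]
print(assign_tasks(5, 2))  # [[0, 1, 2], [3, 4]]
```

With 4 tasks and 2 executors, each thread runs two tasks of the component.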
