Apache Storm Interview Questions

Apache Storm is a distributed stream processing framework, originally written largely in Clojure. This blog collects technical interview questions on Apache Storm that interviewers have actually asked. We compiled the list after attending dozens of technical interviews at top-notch companies.

The Mindmajix team has gathered many questions from trainees who completed Apache Storm Training, cracked interviews at MNCs around the world, and were placed successfully. The most common of those questions are listed below for anyone preparing for an interview. The answers were written by our experienced trainers and are tailored to the concepts interviewers expect.

Top Apache Storm Interview Questions

Q1) Apache Storm Vs Kafka

Comparison | Storm | Kafka
Data source | Kafka and other databases | Facebook, Twitter, and similar feeds
Data storage | Does not store data; passes it from input streams to output streams | EXT4 or XFS file system
Dependency | Depends on ZooKeeper | Depends on ZooKeeper
Inventor | BackType, later acquired by Twitter | LinkedIn
Language support | All languages supported | Supports all, but Java is recommended
Latency | Millisecond latency | Depends on the data source; generally less than 1-2 seconds
Primary use | Stream processing | Message broker
Stream processing | Per-tuple processing; micro-batch processing with Trident | Small-batch processing
Type | Real-time message processing | Distributed messaging system

Q2) What is Apache Storm?

Apache Storm is a free, open-source, distributed real-time computation system. It reliably processes unbounded streams of data, doing for real-time processing what Hadoop MapReduce does for batch processing.

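Q3) What is the latest version of Apache Storm?

When this answer was written, the latest version was 1.2.1, released in February 2018. The 2.x series has been released since then, so check the Apache Storm downloads page for the current version.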

Q4) What programming languages can be used with Apache Storm?

Storm is not tied to a specific language. Topologies are defined through Thrift, and a multi-language protocol lets spouts and bolts be written in any programming language.

Q5) What are the components of Apache Storm?

Nimbus, ZooKeeper, and Supervisor are the main components of an Apache Storm cluster.

Q6) What is Nimbus used for?

Nimbus is the master daemon, so the machine it runs on is also known as the master node. Nimbus distributes code around the cluster, assigns tasks to the worker nodes, and monitors them for failures, reassigning work when a worker fails.
Note: Nimbus is similar to the JobTracker in Hadoop.

Q7) What is Zookeeper?

ZooKeeper coordinates the nodes of a Storm cluster: Nimbus and the Supervisors communicate through it and keep their state in it. Because ZooKeeper handles only coordination and not message passing, Storm places little load on it.

Q8) What is the use of a Supervisor?

A Supervisor runs on each worker node. It listens, through ZooKeeper, for work that Nimbus assigns to its machine and starts or stops worker processes accordingly. Supervisor nodes are also known as worker nodes.

Q9) What are the features of Apache Storm?

  • Reliable - Storm guarantees that every tuple is processed.
  • Scalable - Computation runs in parallel across a cluster of machines, so capacity grows by adding machines.
  • Robust - Storm restarts a worker that fails, and the other workers on the node keep running without interruption.
  • Easy to operate - Standard configurations make it easy to deploy and use.
  • Quick - A benchmark measured about one million 100-byte messages processed per second per node.

Q10) What are the components of data stream flow in Storm?

A stream in Storm flows through three components:

  • Tuple - The unit of data that flows through the stream
  • Spout - The source of the stream
  • Bolt - Processes tuples and may emit new ones
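
The relationship between these three components can be sketched in plain Python. This is only an illustration of the data flow, not Storm's API: the function names `sentence_spout`, `split_bolt`, and `count_bolt` and the sample sentences are hypothetical, and generators stand in for Storm's streams.

```python
from collections import Counter
from typing import Iterator, Tuple

def sentence_spout() -> Iterator[Tuple[str]]:
    """Spout: the data source. Emits one tuple per sentence (sample data)."""
    for sentence in ("storm processes streams", "streams of tuples"):
        yield (sentence,)

def split_bolt(stream):
    """Bolt: consumes sentence tuples and emits one tuple per word."""
    for (sentence,) in stream:
        for word in sentence.split():
            yield (word,)

def count_bolt(stream) -> Counter:
    """Bolt: consumes word tuples and keeps a count per word."""
    counts = Counter()
    for (word,) in stream:
        counts[word] += 1
    return counts

# Wire the spout into the two bolts, as a topology would.
word_counts = count_bolt(split_bolt(sentence_spout()))
print(word_counts)
```

In a real topology each of these stages would run in parallel on the cluster; here they simply run one after another in a single process.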

Q11) How can log files be streamed?

First, configure a spout to read the log file and emit each line as a tuple. Then assign a bolt to analyze each line.
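
A minimal sketch of that idea in plain Python, assuming a log format of date, time, level, and message. The sample log lines and the names `log_line_spout` and `level_count_bolt` are made up for illustration, and an in-memory `StringIO` stands in for the log file.

```python
import io
from collections import Counter

# Hypothetical log content; a real spout would read an actual file.
SAMPLE_LOG = io.StringIO(
    "2024-01-02 10:00:01 INFO service started\n"
    "2024-01-02 10:00:05 ERROR connection refused\n"
    "2024-01-02 10:00:09 INFO retrying\n"
)

def log_line_spout(log_file):
    """Spout: emits each non-empty log line as a one-field tuple."""
    for line in log_file:
        line = line.strip()
        if line:
            yield (line,)

def level_count_bolt(stream):
    """Bolt: analyzes each line by counting its log level (third field)."""
    counts = Counter()
    for (line,) in stream:
        parts = line.split(maxsplit=3)
        if len(parts) >= 3:
            counts[parts[2]] += 1
    return counts

level_counts = level_count_bolt(log_line_spout(SAMPLE_LOG))
print(level_counts)
```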

Q12) What are the types of stream groupings in Apache Storm?

Storm provides shuffle, fields, partial key, all, global, none, direct, and local-or-shuffle groupings.
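
A grouping decides which task of the receiving bolt gets each tuple. The sketch below illustrates the idea behind three of them in plain Python; it is not Storm's implementation. The round-robin choice for shuffle and the CRC32 hash for fields are assumptions made to keep the example deterministic (Storm's shuffle grouping distributes tuples randomly but evenly).

```python
import itertools
import zlib

def shuffle_grouping(num_tasks):
    """Spread tuples evenly across tasks (round-robin in this sketch)."""
    cycle = itertools.cycle(range(num_tasks))
    return lambda tup: next(cycle)

def fields_grouping(field_index, num_tasks):
    """Send tuples with the same field value to the same task."""
    def choose(tup):
        key = str(tup[field_index]).encode("utf-8")
        return zlib.crc32(key) % num_tasks
    return choose

def global_grouping():
    """Send the entire stream to a single task (task 0 here)."""
    return lambda tup: 0

tuples = [("alice", 1), ("bob", 2), ("alice", 3), ("carol", 4)]

by_user = fields_grouping(field_index=0, num_tasks=3)
fields_targets = [by_user(t) for t in tuples]

shuffle = shuffle_grouping(num_tasks=3)
shuffle_targets = [shuffle(t) for t in tuples]

to_one = global_grouping()
global_targets = [to_one(t) for t in tuples]
```

Both "alice" tuples land on the same task under the fields grouping, which is what makes per-key state such as counters possible.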

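Q13) What is TOPOLOGY_MESSAGE_TIMEOUT_SECS used for?

TOPOLOGY_MESSAGE_TIMEOUT_SECS (the topology.message.timeout.secs setting) is the maximum time a topology has to fully process a message emitted by a spout. If the message is not processed within that time, Storm treats it as failed. The default is 30 seconds.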
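
The timeout behaviour can be sketched as a table of pending messages that are failed once they are older than the timeout. This is a simplified model, not Storm's internal mechanism: the `PendingTuples` class and its methods are hypothetical, and the clock is passed in explicitly so the example is deterministic.

```python
class PendingTuples:
    """Tracks emitted messages until they are acked or time out."""

    def __init__(self, timeout_secs=30):
        self.timeout_secs = timeout_secs
        self.pending = {}  # message id -> time the spout emitted it

    def emit(self, msg_id, now):
        self.pending[msg_id] = now

    def ack(self, msg_id):
        # Fully processed: stop tracking the message.
        self.pending.pop(msg_id, None)

    def expire(self, now):
        """Return and drop every message that exceeded the timeout."""
        failed = [m for m, emitted in self.pending.items()
                  if now - emitted >= self.timeout_secs]
        for m in failed:
            del self.pending[m]
        return failed

tracker = PendingTuples(timeout_secs=30)
tracker.emit("msg-1", now=0)
tracker.emit("msg-2", now=0)
tracker.ack("msg-1")                   # processed in time
failed_at_10 = tracker.expire(now=10)  # nothing has timed out yet
failed_at_31 = tracker.expire(now=31)  # msg-2 was never acked
```

A spout typically reacts to a failed message by emitting it again, which is how a timeout turns into a retry.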

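Q14) How do you use Apache Storm as a proxy server?

You do not. Storm is a stream processing system, not a proxy server. The mod_proxy module belongs to Apache HTTP Server, a separate Apache project, which is the product that can act as a proxy server.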

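Q15) What is ZeroMQ?

ZeroMQ is a messaging library. Early Storm releases used it to pass tuples between the tasks of a topology that run in different worker processes. Later releases replaced it with a Netty-based transport.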

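Q16) What is the command to kill a Storm topology?

storm kill mindmajix_topology

Here, mindmajix_topology is the name of the topology. The optional -w flag sets how many seconds Storm waits between deactivating the topology and shutting down its workers.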

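Q17) Why is Apache Storm not provided with SSL?

This question and its stock answer come from Apache HTTP Server, which historically shipped without SSL to avoid legal and bureaucratic issues. It does not describe a design decision in Storm.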

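Q18) Apache Storm has a search engine. True or False?

False. Storm is a stream processing system and does not include a search engine. A topology can write its results to an external search index if search is needed.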

Q19) What Java elements are supported by Apache Storm?

Storm runs on the JVM and its primary API is Java, so spouts, bolts, and topologies are commonly written in Java.

Q20) What are ASP (Active Server Page) elements supported by Apache Storm?

There is no ASP support in Apache Storm.

Q21) What is a combiner aggregator?

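In Trident, a combiner aggregator combines a set of tuples into a single field. It converts each tuple into a value (init), merges values pairwise (combine), and supplies a default value when there are no tuples (zero).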
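
A counting aggregator is the usual example. The sketch below models the pattern in plain Python: the method names `init`, `combine`, and `zero` mirror Trident's CombinerAggregator interface, while the `CountCombiner` class and the `run_combiner` driver are hypothetical stand-ins for what Trident does internally.

```python
from functools import reduce

class CountCombiner:
    """Counts tuples: each tuple contributes 1, and partial counts add up."""

    def init(self, tup):
        # Value contributed by a single tuple.
        return 1

    def combine(self, val1, val2):
        # Merge two partial results.
        return val1 + val2

    def zero(self):
        # Result when there are no tuples at all.
        return 0

def run_combiner(aggregator, tuples):
    """Apply init to every tuple, then fold the values with combine."""
    values = [aggregator.init(t) for t in tuples]
    return reduce(aggregator.combine, values, aggregator.zero())

total = run_combiner(CountCombiner(), [("a",), ("b",), ("c",)])
empty_total = run_combiner(CountCombiner(), [])
print(total, empty_total)
```

Because combine only needs two partial results, partial aggregation can happen on each node before results are sent over the network.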

Q22) What are the components of running topology?

Three entities together run a topology in a Storm cluster:

  • Worker processes
  • Executors
  • Tasks

A worker process executes a subset of a topology. It belongs to one topology and runs one or more executors for one or more of that topology's components (i.e., spouts or bolts).

An executor is a thread spawned by a worker process. It runs one or more tasks of the same component.

A task performs the actual data processing. The number of tasks for a component stays the same for the lifetime of a topology, while the number of executors can change, so a component never has more executors (threads) than tasks:

#threads (less than or equal to) #tasks
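
As a small worked example of how tasks map onto executors, the sketch below spreads the tasks of one component over its executor threads. The function `assign_tasks` is hypothetical, the round-robin placement is an assumption for illustration rather than Storm's actual scheduling, and rejecting more executors than tasks is a choice made for this sketch.

```python
def assign_tasks(num_tasks, num_executors):
    """Spread a component's tasks over its executors (threads)."""
    if num_executors > num_tasks:
        # A thread with no task would have nothing to run.
        raise ValueError("executors (threads) must not exceed tasks")
    assignment = [[] for _ in range(num_executors)]
    for task_id in range(num_tasks):
        assignment[task_id % num_executors].append(task_id)
    return assignment

# A bolt configured with 4 tasks and 2 executors: 2 tasks per thread.
layout = assign_tasks(num_tasks=4, num_executors=2)
print(layout)
```

With 4 tasks and 2 executors, each thread runs two tasks of the same component one after another.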

Last updated: 02 Jan 2024
About Author

Ravindra Savaram is a Technical Lead at Mindmajix.com. His passion lies in writing articles on the most popular IT platforms including Machine learning, DevOps, Data Science, Artificial Intelligence, RPA, Deep Learning, and so on. You can stay up to date on all these technologies by following him on LinkedIn and Twitter.
