Course Name:- Simplifying data pipelines with Apache Kafka
Modules 1. Introduction to Apache Kafka
Question 1. Which of the following are a Kafka use case?
- Messaging
- All of the above
- Stream Processing
- Website Activity Tracking
- Log Aggregation
Question 2.A Kafka cluster is comprised of one or more servers which are called “producers”
- True
- False
Question 3.Kafka requires Apache ZooKeeper
- True
- False
Modules 2. kafka command line
Question 1. There are two ways to create a topic in Kafka, by enabling the auto.create.topics.enable property and by using the kafka-topics.sh script.
- True
- False
Question 2. Which of the following is NOT returned when –describe is passed to kafka-topics.sh?
- Configs
- None of the Above
- PartitionNumber
- ReplicationFactor
- Topic
Question 3. Topic deletion is disabled by default.
- True
- False
Module 3. kafka producer java API
Question 1. The setting of ack that provides the strongest guarantee is ack=1
- True
- False
Question 2.The KafkaProducer is the client that publishes records to the Kafka cluster.
- True
- False
Question 3. Which of the following is not a Producer configuration setting?
- batch.size
- linger.ms
- key.serializer
- retries
- None of the above
Module 4. Kafka consumer java API
Question 1. The Kafka consumer handles various things behind the scenes, such as:
- Failures of servers in the Kafka cluster
- Adapts as partitions of data it fetches migrates within the cluster
- Data management and storage into databases
- and b) only
- All of the Above
Question 2. If enable.auto.commit is set to false, then committing offsets is done manually, which provides gives you more control.
- True
- False
Question 3. Rebalancing is a process where group of consumer instances within a consumer group, coordinate to own mutally shared sets of partitions of topics that the groups are subscribed to.
- True
- False
Module 5. kafka connect and spark streaming
Question 1. Which of the following are Kafka Connect features?
- A common framework for Kafka connectors
- Automatic offset management
- REST interface
- Streaming/batch integration
- All of the above
Question 2. Kafka Connector has two types of worker nodes called standalone mode and centralized mode cluster
- True
- False
Question 3. Spark periodically queries Kafka to get the latest offsets in each topic and partition that it is interested in consuming form.
- True
- False
Simplifying data pipelines with Apache Kafka Cognitive Class Final Exam Answers:-
Question 1. If the auto.create.topics.enable property is set to false and you try to write a topic that doesn’t yet exist, a new topic will be created.
- True
- False
Question 2. Which of the following is false about Kafka Connect?
- Kafka Connect makes building and managing stream data pipelines easier
- Kafka Connect simplifies adoption of connectors for stream data integration
- It is a framework for small scale, asynchronous stream data integration
- None of the above
Question 3. Kafka comes packaged with a command line client that you can use as a producer.
- True
- False
Question 4. Kafka Connect worker processes work autonomously to distribute work and provide scalability with fault tolerance to the system.
- True
- False
Question 5. What are the three Spark/Kafka direct approach benefits? (Place the answers in alphabetical order.)
Question 6. Kafka Consumer is thread safe, as it can give each thread its own consumer instance
- True
- False
Question 7. What other open source producers can be used to code producer logic?
- Java
- Python
- C++
- All of the above
Question 8. If you set acks=1 in a Producer, it means that the leader will write the received message to the local log and respond after waiting for full acknowledgement from all of its followers.
- True
- False
Question 9. Kafka has a cluster-centric design which offers strong durability and fault-tolerance guarantees.
- True
- False
Question 10. Which of the following values of ack will not wait for any acknowledgement from the server?
- all
- 0
- 1
- -1
Question 11. A Kafka cluster is comprised of one or more servers which are called “Producers”
- True
- False
Question 12.What are In Sync Replicas?
- They are a set of replicas that are not active and are delayed behind the leader
- They are a set of replicas that are not active and are fully caught up with the leader
- They are a set of replicas that are alive and are fully caught up with the leader
- They are a set of replicas that are alive and are delayed behind the leader
Question 13. In many use cases, you see Kafka used to feed streaming data into Spark Streaming
- True
- False
Question 14. All Kafka Connect sources and sinks map to united streams of records
- True
- False
Question 15. Which is false about the Kafka Producer send method?
- The send method returns a Future for the Record Metadata that will be assigned to a record
- All writes are asynchronous by default
- It is not possible to make asynchronous writes
- Method returns immediately once record has been stored in buffer of records waiting to be sent