Saturday , July 27 2024

Exploring Spark’s GraphX cognitive class Exam Answers:-

Course Name :- Exploring Spark’s GraphX

Module 1: Introduction to Graph parallel

Question 1: GraphX extends RDDs, which allows users to use GraphX as a collection, but not as a graph!

  • True
  • False

Question 2: Which of the following statements is true?

  • Graph-Parallel is usually handled by Hadoop and Spark.
  • Graph-Parallel focuses on distributing data across different nodes and systems.
  • Data-Parallel is usually handled by Pregel, GraphLab and Giraph.
  • Data-Parallel focuses on efficiently executing graph algorithms.
  • None of the above

Question 3: GraphX unifies Data-Parallelism and Graph-Parallelism in one library.

  • True
  • False

Module 2:- Visualizing GraphX and Exploring Graph operators

Question 1 : The “degree” operator returns a VertexRDD[Int] containing the number of outgoing edges of each vertex.

  • True
  • False

Question 2: Which of the following is not an attribute of a Triplet class?

  • attr
  • id
  • srcAttr
  • srcId
  • None of the above

Question 3 : Other libraries such as Gephi or GraphLab can help GraphX with visualization.

  • True
  • False

Module 3 :- Modifying GraphX

Question 1: We must run the “partitionBy” function before running the “groupEdges” operator.

  • True
  • False

Question 2 : Which of following is among the PartitionStrategies provided by GraphX?

  • EdgePartition2D
  • RandomVertexCut
  • EdgePartition1D
  • CanonicalRandomVertexCut
  • All of the above

Question 3: To improve efficiency, GraphX reuses portions of the graph which are unaffected by a modifier.

  • True
  • False

Module 4 :- Neighborhood Aggregation Caching

Question 1: AggregateMessages is the only neighborhood aggregation function provided by GraphX.

  • True
  • False

Question 2: Which of the following is not an attribute of TripletFields?

  • TripletFields.None
  • TripletFields.DstOnly
  • TripletFields.EdgeOnly
  • TripletFields.All
  • None of the Above

Question 3: The ClassTag is optional for aggregateMessages if the message is a String.

  • True
  • False

Exploring Spark’s GraphX Cognitive  class final Exam  Answers:-

Question 1 : To instantiate a Graph, you need at LEAST 2 RDDs.

  • True
  • False

Question 2: pageRank is a graph algorithm that ranks the edges of the graph by correlating their relation with vertices, in terms of both quality and quantity.

  • True
  • False

Question 3: The numEdges operator returns an EdgesRDD[Long].

  • True
  • False

Question 4: Which of the following ClassTypes are returned from mapTriplets, assuming Graph[VD, ED] is the original?

  • Graph[VD, ED]
  • Graph[VD2, ED]
  • Graph[VD, ED2]
  • Graph[VD2, ED2]
  • None of the Above

Question 5: The reverse operator returns a graph in which the direction of all edges are reversed.

  • True
  • False

Question 6 : Which of the following ClassTypes are returned from mapTriplets, assuming Graph[VD, ED] is the original?

  • Graph[VD, ED]
  • Graph[VD2, ED]
  • Graph[VD, ED2]
  • Graph[VD2, ED2]
  • None of the Above

Question 7 : Caching graphs that are only used infrequently can slow computations.

  • True
  • False

Question 8: Which of the following is required to define aggregateMessages?

  • sendMsg
  • mergeMsg
  • tripletFields
  • sendMsg and mergeMsg
  • All of the Above

Question 9 : Triplets are a required parameter when instantiating a Graph.

  • True
  • False

Question 10 : When defining the merge parameter for groupEdges (Int), which of the following is a valid definition for merge = (Edge1, Edge2)?

  • Edge1
  • Edge1 * Edge2
  • Edge1 – Edge2 / Edge1
  • Edge1 + Edge2
  • All of the Above

Question 11: In a tuple, the first parameter returned by the “degrees” operator is the degree info, and the second parameter is the vertexid.

  • True
  • False

Question 12 : Data-Parallel is usually handled by Pregel, GraphLab, and Giraph.

  • True
  • False

Question 13 : Which of the following is true about GraphX?

  • GraphX does not have built-in visualization functions.
  • GraphX is a Graph-Processing library built into Apache Spark.
  • GraphX extends the RDD class which allows us to use GraphX as a graph or a collection.
  • GraphX is mainly a graph processing library.
  • All of the above

Question 14 : By using the mapTriplets function, we are only able to modify the edge attribute.

  • True
  • False

Question 15: Which of the following is true about the EdgeContext class?

  • It has access to vertex attributes, but not to edge attributes.
  • It has access to edge attributes, but not to vertex attributes.
  • It has sendToDst, sendToSrc, and sendToAll functions.
  • It is the same as the EdgeTriplet Class.
  • None of the above

About Machine Learning

Check Also

Python for Data Science Cognitive Class Exam Answers:-

Course Name:- Python for Data Science Module 1. Python Basics Question 1. What is the …

Leave a Reply

Your email address will not be published. Required fields are marked *