Apache Spark Interview Questions

Saghir Hussain 12:11

Spark Architecture
Client mode vs Cluster mode
RDD vs Dataframe vs Dataset
SparkContext vs SparkSession
map() vs flatmap()
reduce() vs reduceByKey()
Performance Techniques
Repartition vs Colesece
Order By vs Sort By
Persist vs Cache
Skewness and Salting
Map side Join
Spark configuration for joining two large tables
map() vs mapPartitions()
Broadcast and Accumuator variables
What is lineage and DAG?
Relation between driver, executor, memory, cores, partitions, stage, job, task using example

14 Comments

Anonymous24 June 2021 at 08:31
https://github.com/gjeevanm/SparkDataSkewness/blob/master/src/main/scala/com/gjeevan/DataSkew/RemoveDataSkew.scala
ReplyDelete
Replies
Saghir Hussain25 June 2021 at 05:32
https://www.linkedin.com/posts/gauravpatil95_thinkhadoop-hive-spark-activity-6813879213045665792-xQ37
ReplyDelete
Replies
Saghir Hussain28 June 2021 at 18:36
https://dzone.com/articles/dynamic-partition-pruning-in-spark-30
ReplyDelete
Replies
Saghir Hussain1 July 2021 at 10:05
https://www.linkedin.com/feed/update/urn:li:activity:6816216259940663296
ReplyDelete
Replies
Anonymous3 July 2021 at 09:41
https://www.linkedin.com/posts/aparup-chatterjee_apache-spark-30-dpp-activity-6816719664182366208-i6bV
ReplyDelete
Replies
Anonymous7 July 2021 at 20:33
https://github.com/ankurchavda/SparkLearning
ReplyDelete
Replies
Anonymous11 July 2021 at 06:56
https://www.java-success.com/spark-interview-qas-with-coding-examples-in-scala-part-1/
ReplyDelete
Replies
Anonymous16 July 2021 at 08:47
https://www.linkedin.com/posts/yusuf-didighar-64922a166_spark-join-internals-activity-6821458775623507968-2Zjz
ReplyDelete
Replies
Anonymous17 July 2021 at 19:22
https://stackoverflow.com/questions/32356143/what-does-setmaster-local-mean-in-spark
ReplyDelete
Replies
Anonymous25 July 2021 at 07:14
https://medium.com/@deepa.account/spark-udfs-and-its-deterministic-nature-b69e3dfc020e
ReplyDelete
Replies
Anonymous25 July 2021 at 07:36
https://luminousmen.com/post/hadoop-yarn-spark
ReplyDelete
Replies
Anonymous28 July 2021 at 09:29
https://www.linkedin.com/posts/mayank-ahuja-4b3a23105_kafka-schema-activity-6825815707146575873-h3Vj
ReplyDelete
Replies
Anonymous2 August 2021 at 09:56
https://www.analyticsvidhya.com/blog/2020/11/8-must-know-spark-optimization-tips-for-data-engineering-beginners/
ReplyDelete
Replies
Venu23 September 2021 at 18:33
I am giving spark training in Hyderabad, thanks to share valuable spark interview questions to learn spark in Hyderabad . . If you share answers also its really helpful
ReplyDelete
Replies

Add comment

Apache Spark Interview Questions

Post a Comment

14 Comments

Popular Posts

Kaggle : ecommerce-events-history-in-cosmetics-shop17:08

LeetCode SQL Question#175 - Combine Two Tables11:09

Apache Spark Interview Questions12:11

C Language

Categories

Tags

Footer Menu Widget

Apache Spark Interview Questions

You may like these posts

Post a Comment

14 Comments

Social Plugin

Popular Posts

Kaggle : ecommerce-events-history-in-cosmetics-shop17:08

LeetCode SQL Question#175 - Combine Two Tables11:09

Apache Spark Interview Questions12:11

C Language

Categories

Tags

Footer Menu Widget

Social Footer Widget