Big data developer Interview Questions
249
Big Data Developer interview questions shared by candidates
Longest palindromes in a string SQL quesiton
4 Answers↳
What level of experience they are looking for?
↳
It was for New Grad
↳
What visa status they were looking for ?

Logic based questions. Scenarios. Code reviews
3 Answers↳
Hi, I have an interview with Siemens next week ,could you please guide me a bit as how should I prepare.it is for the role "Big Data Software Engineer".I just wanted to know what type of Java question I should expect in the online exam. Regards, Tan Sah Less
↳
For the HireVue just basic Java stuff like Design Patterns, Threads, Multithreading and so Less
↳
Hi, I have cleared HireVue and have panel interview scheduled with Siemens. Can you please tell me what to expect?? Its for Big Data Engineer position. Less

what are python generators?
2 Answers↳
USER_ID_LIST=[1,2,3,4,5,6,7,8,9] def get_user_ids(): for id in USER_ID_LIST: yield id if __name__ == "__main__": user_ids = get_user_ids() print("First Loop") for user_id in user_ids: print(user_id) if user_id == 5: break print("Second Loop") for user_id in user_ids: print(user_id) ------output----- First Loop 1 2 3 4 5 Second Loop 6 7 8 9 Less
↳
Generator functions allow you to declare a function that behaves like an iterator. Generators introduce the yield statement to Python. It works a bit like return because it returns a value. The difference is that it saves the state of the function. The next time the function is called, execution continues from where it left off, with the same variable values it had before yielding. USER_ID_LIST=[1,2,3,4,5,6,7,8,9] def get_user_ids(): for id in USER_ID_LIST: yield id if __name__ == "__main__": user_ids = get_user_ids() print("First Loop") for user_id in user_ids: print(user_id) if user_id == 5: break print("Second Loop") for user_id in user_ids: print(user_id) Less

Implement a poorly specified and undefined variation of a Quad-tree and write a parsing function to initialize it from a 2d array.
1 Answers↳
There are a few ways to implement this but it really depends on a very specific definition of how the quadtree should behave. As this clarity wasn't provided I avoided a generic solution and provided a 4x4 specific version in Python code. Less

Questions asked were of good standard that reflects working people have good knowledge in the field. Mostly questions were related to spark, hive, hadoop, and some other component as well
1 Answers↳
All rounds were good and i was able to clear it. To my shock I never get the offer. During HR round they said they will release the offer in few days. But I never got any update. It seems they have either cancelled the position or might have hired someone else. But this not ethical and they should have clear cut information regarding this. After few days i followed then i came to know that they will take 1 more round with some other clients. I said ok .But that interview never happened. Since I was holding other offers as well I didn't bothered about it. But out of curosity i called them again after few days and got same response that they will schedule interview once panel is free. Since they knew that I was in Notice period eventually I will stop chasing them after some times and they were right. I forget about them also. But such kind of response was not expected. Please have guts to say to candidate that yes they have wasted our times and efforts and they can't offer any position now. Less



Hive Static partition vs Dynamic partition
1 Answers↳
Insert input data files individually into a partition table is Static Partition, fast, can alter the partition, set hive.mapred.mode = strict Single insert to partition table is known as a dynamic partition, slow, can’t perform alter, don’t know how many columns Less


Print every number divisible by both 3 and 5 up to 100 Sample output: 0 15 30 45 60 75 90
1 Answers↳
for x in xrange(101): If x%15 == 0: print x