
Wipro Lead Data Engineer Interview Questions and Answers

Updated 6 Dec 2024

7 Interview questions

A Lead Data Engineer was asked 7mo ago
Q. How would you build an ETL pipeline to read JSON files that are irregularly dropped into storage, transform the data, and match the schema?
Ans. 

Design an ETL pipeline to handle irregularly timed JSON file uploads for data transformation and schema matching.

  • Use a cloud storage service (e.g., AWS S3) to store incoming JSON files.

  • Implement a file watcher or event-driven architecture (e.g., AWS Lambda) to trigger processing when new files arrive.

  • Utilize a data processing framework (e.g., Apache Spark or Apache Beam) to read and transform the JSON data.

  • Define ...
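The steps above can be sketched in plain Python. This is a minimal, hedged illustration of the transform-and-schema-match stage only: `TARGET_SCHEMA`, the field names, and the sample payload are hypothetical, and `process_file` stands in for whatever handler a storage event trigger (e.g. an AWS Lambda) would invoke when a file lands at an arbitrary time.

```python
import json

# Hypothetical target schema: column name -> casting function.
TARGET_SCHEMA = {"order_id": int, "amount": float, "store_id": str}

def transform_record(raw):
    """Coerce one raw JSON record to the target schema.

    Missing or null fields become None; unexpected extra fields are dropped,
    so every output record matches the schema exactly.
    """
    return {
        col: cast(raw[col]) if col in raw and raw[col] is not None else None
        for col, cast in TARGET_SCHEMA.items()
    }

def process_file(payload):
    """Entry point an event-driven trigger would call for each new JSON file,
    regardless of when it arrives."""
    records = json.loads(payload)
    return [transform_record(r) for r in records]

# Simulate a file dropped into storage at an irregular time.
dropped = json.dumps([
    {"order_id": "41", "amount": "19.5", "store_id": 7, "extra": "ignored"},
    {"order_id": 42, "store_id": "7"},  # "amount" missing -> None
])
rows = process_file(dropped)
```

In a real pipeline the same handler would be wired to the storage service's object-created events, so irregular arrival times need no scheduling logic at all.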

A Lead Data Engineer was asked 7mo ago
Q. Write an SQL query using window functions to find the highest sale amount per day for each store.
Ans. 

Use SQL window functions to identify the highest sale amount for each store per day.

  • Use the ROW_NUMBER() function to rank sales within each day and store.

  • Partition the data by store and date to isolate daily sales.

  • Order the sales in descending order to get the highest sale at the top.

  • Example SQL query: SELECT store_id, sale_date, sale_amount, ROW_NUMBER() OVER (PARTITION BY store_id, sale_date ORDER BY sale_amount...

A Lead Data Engineer was asked
Q. How does Kafka work with Spark Streaming?
Ans. 

Kafka is used as a message broker to ingest data into Spark Streaming for real-time processing.

  • Kafka acts as a buffer between data producers and Spark Streaming to handle high throughput of data

  • Spark Streaming can consume data from Kafka topics in micro-batches for real-time processing

  • Kafka provides fault-tolerance and scalability for streaming data processing in Spark
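The micro-batch idea in the second bullet can be shown with a toy simulation. This is not the Kafka or Spark API; it is a pure-Python sketch of how a consumer drains a topic into fixed-size batches on each interval, with the topic contents invented for the example.

```python
from itertools import islice

def micro_batches(stream, batch_size):
    """Group an unbounded record stream into micro-batches, the way
    Spark Streaming drains new Kafka records on each batch interval."""
    it = iter(stream)
    while batch := list(islice(it, batch_size)):
        yield batch

# Simulated Kafka topic: producers append, the consumer reads in order.
topic = [f"event-{i}" for i in range(7)]
batches = list(micro_batches(topic, batch_size=3))
# Each batch would then be processed as one small Spark job.
```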

A Lead Data Engineer was asked
Q. Write an SQL query to find the users who made purchases in 3 consecutive months within a year.
Ans. 

SQL query to find users who made purchases in 3 consecutive months within a year

  • Use a self join on the table to compare purchase months for each user

  • Group by user and year, then filter for counts of 3 consecutive months

  • Example: SELECT user_id FROM purchases p1 JOIN purchases p2 ON p1.user_id = p2.user_id WHERE p1.month = p2.month - 1 AND p2.month = p1.month + 1 GROUP BY p1.user_id, YEAR(p1.purchase_date) HAVING COUNT(DISTINCT...
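A complete, runnable version of the self-join approach, executed against SQLite via `sqlite3`. The sample data is hypothetical; the CTE first reduces purchases to distinct (user, year, month) rows, then two joins check that months m+1 and m+2 also exist in the same year.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.executescript("""
CREATE TABLE purchases (user_id INTEGER, purchase_date TEXT);
INSERT INTO purchases VALUES
  (1, '2024-01-15'), (1, '2024-02-03'), (1, '2024-03-20'),  -- Jan, Feb, Mar
  (2, '2024-01-10'), (2, '2024-03-11'), (2, '2024-04-02'),  -- gap in Feb
  (3, '2024-05-01'), (3, '2024-05-30');                     -- one month only
""")

query = """
WITH months AS (
  SELECT DISTINCT user_id,
         CAST(strftime('%Y', purchase_date) AS INTEGER) AS yr,
         CAST(strftime('%m', purchase_date) AS INTEGER) AS mon
  FROM purchases
)
SELECT DISTINCT m1.user_id
FROM months m1
JOIN months m2 ON m2.user_id = m1.user_id AND m2.yr = m1.yr AND m2.mon = m1.mon + 1
JOIN months m3 ON m3.user_id = m1.user_id AND m3.yr = m1.yr AND m3.mon = m1.mon + 2
ORDER BY m1.user_id;
"""
users = [row[0] for row in conn.execute(query)]
# Only user 1 has three consecutive months (Jan-Mar 2024).
```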

A Lead Data Engineer was asked
Q. What methods do you use to optimize Spark jobs?
Ans. 

Optimizing Spark jobs involves tuning configurations, partitioning data, caching, and using efficient transformations.

  • Tune Spark configurations for memory, cores, and parallelism

  • Partition data to distribute workload evenly

  • Cache intermediate results to avoid recomputation

  • Use efficient transformations like map, filter, and reduce

  • Avoid shuffling data unnecessarily
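As one concrete illustration of the tuning bullets, here is a hedged sample of Spark configuration properties one might adjust. All keys are standard Spark configuration names, but the values are placeholders that depend entirely on the workload and cluster, not recommendations.

```properties
# Executor sizing: memory and cores per executor (workload-dependent)
spark.executor.memory         8g
spark.executor.cores          4
# Parallelism of shuffle stages; tune to data volume and cluster size
spark.sql.shuffle.partitions  200
# Kryo is generally faster and more compact than Java serialization
spark.serializer              org.apache.spark.serializer.KryoSerializer
```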

A Lead Data Engineer was asked
Q. Write SQL to find the second highest salary of employees in each department.
Ans. 

SQL query to find the second highest salary of employees in each department

  • Use a subquery to rank the salaries within each department

  • Filter the results to only include the second highest salary for each department

  • Join the result with the employee table to get additional information if needed
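A runnable version of the approach above, using a `DENSE_RANK()` subquery in SQLite via `sqlite3` (so ties on the second-highest salary are all returned). The table and data are invented for the example.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.executescript("""
CREATE TABLE employees (emp_name TEXT, dept TEXT, salary REAL);
INSERT INTO employees VALUES
  ('ann', 'eng', 90), ('bob', 'eng', 80), ('cam', 'eng', 80), ('dee', 'eng', 70),
  ('eve', 'hr', 60), ('fay', 'hr', 50);
""")

query = """
SELECT dept, emp_name, salary
FROM (
  SELECT dept, emp_name, salary,
         DENSE_RANK() OVER (PARTITION BY dept ORDER BY salary DESC) AS rk
  FROM employees
) AS ranked
WHERE rk = 2          -- second highest distinct salary per department
ORDER BY dept, emp_name;
"""
rows = conn.execute(query).fetchall()
# Both 'bob' and 'cam' tie for second place in 'eng'.
```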

A Lead Data Engineer was asked
Q. Explain the architecture of Spark.
Ans. 

Spark is a distributed computing framework that provides an interface for programming entire clusters with implicit data parallelism and fault tolerance.

  • Spark is built around the concept of Resilient Distributed Datasets (RDDs) which are immutable distributed collections of objects.

  • It supports various programming languages like Java, Scala, Python, and R.

  • Spark provides high-level APIs like Spark SQL for structured...


Wipro Lead Data Engineer Interview Experiences

2 interviews found

Interview experience
5
Excellent
Difficulty level
Easy
Process Duration
Less than 2 weeks
Result
-

I applied via Approached by Company and was interviewed in Nov 2024. There was 1 interview round.

Round 1 - Technical

(3 Questions)

  • Q1. SQL question using window functions to find the highest sale amount per day for each store
  • Ans. 

    Use SQL window functions to identify the highest sale amount for each store per day.

    • Use the ROW_NUMBER() function to rank sales within each day and store.

    • Partition the data by store and date to isolate daily sales.

    • Order the sales in descending order to get the highest sale at the top.

    • Example SQL query: SELECT store_id, sale_date, sale_amount, ROW_NUMBER() OVER (PARTITION BY store_id, sale_date ORDER BY sale_amount DESC...

  • Answered by AI
  • Q2. Build an ETL pipeline to read JSON files that are dropped into storage at irregular times. How do you transform the data and match the schema?
  • Ans. 

    Design an ETL pipeline to handle irregularly timed JSON file uploads for data transformation and schema matching.

    • Use a cloud storage service (e.g., AWS S3) to store incoming JSON files.

    • Implement a file watcher or event-driven architecture (e.g., AWS Lambda) to trigger processing when new files arrive.

    • Utilize a data processing framework (e.g., Apache Spark or Apache Beam) to read and transform the JSON data.

    • Define a sch...

  • Answered by AI
  • Q3. Write PySpark code to join two tables, and explain the broadcast join and what it does.
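For context on this question: in PySpark the call is `large_df.join(broadcast(small_df), "store_id")`, with `broadcast` imported from `pyspark.sql.functions`; it ships the small table whole to every executor so the join runs map-side with no shuffle. Since that needs a running SparkSession, here is a pure-Python sketch of the underlying idea (a map-side hash join); the table names and data are hypothetical.

```python
# Conceptual equivalent of a broadcast join: the small table becomes an
# in-memory dict that every worker would receive, so each row of the
# large table is joined locally, without shuffling either side.
stores = [(7, "Pune"), (9, "Chennai")]                 # small table: broadcast
sales = [(1, 7, 100.0), (2, 9, 250.0), (3, 7, 80.0)]   # large table: stays partitioned

store_lookup = dict(stores)  # the "broadcast variable"

joined = [
    (sale_id, store_id, amount, store_lookup[store_id])
    for sale_id, store_id, amount in sales
    if store_id in store_lookup
]

# The real PySpark call (requires a SparkSession):
#   from pyspark.sql.functions import broadcast
#   sales_df.join(broadcast(stores_df), "store_id")
```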

Skills evaluated in this interview

Lead Data Engineer Interview Questions & Answers

Priyanshu Singh

posted on 17 Jun 2024

Interview experience
3
Average
Difficulty level
Moderate
Process Duration
Less than 2 weeks
Result
Selected

I applied via Approached by Company and was interviewed in May 2024. There was 1 interview round.

Round 1 - Technical

(6 Questions)

  • Q1. Explain the architecture of Spark.
  • Ans. 

    Spark is a distributed computing framework that provides an interface for programming entire clusters with implicit data parallelism and fault tolerance.

    • Spark is built around the concept of Resilient Distributed Datasets (RDDs) which are immutable distributed collections of objects.

    • It supports various programming languages like Java, Scala, Python, and R.

    • Spark provides high-level APIs like Spark SQL for structured data...

  • Answered by AI
  • Q2. Methods for optimizing Spark jobs
  • Ans. 

    Optimizing Spark jobs involves tuning configurations, partitioning data, caching, and using efficient transformations.

    • Tune Spark configurations for memory, cores, and parallelism

    • Partition data to distribute workload evenly

    • Cache intermediate results to avoid recomputation

    • Use efficient transformations like map, filter, and reduce

    • Avoid shuffling data unnecessarily

  • Answered by AI
  • Q3. Write SQL to find the second highest salary of employees in each department
  • Ans. 

    SQL query to find the second highest salary of employees in each department

    • Use a subquery to rank the salaries within each department

    • Filter the results to only include the second highest salary for each department

    • Join the result with the employee table to get additional information if needed

  • Answered by AI
  • Q4. Write SQL to find users who made purchases in 3 consecutive months within a year
  • Ans. 

    SQL query to find users who made purchases in 3 consecutive months within a year

    • Use a self join on the table to compare purchase months for each user

    • Group by user and year, then filter for counts of 3 consecutive months

    • Example: SELECT user_id FROM purchases p1 JOIN purchases p2 ON p1.user_id = p2.user_id WHERE p1.month = p2.month - 1 AND p2.month = p1.month + 1 GROUP BY p1.user_id, YEAR(p1.purchase_date) HAVING COUNT(DISTINCT MONT...

  • Answered by AI
  • Q5. How does Kafka work with Spark Streaming?
  • Q6. Fibonacci series
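For the last question, a minimal iterative answer (O(n) time, O(1) extra space beyond the output list) is usually expected:

```python
def fibonacci(n):
    """Return the first n Fibonacci numbers, starting from 0."""
    series = []
    a, b = 0, 1
    for _ in range(n):
        series.append(a)
        a, b = b, a + b  # slide the window forward by one term
    return series

print(fibonacci(8))  # -> [0, 1, 1, 2, 3, 5, 8, 13]
```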

Interview Preparation Tips

Interview preparation tips for other job seekers - Work on SQL and Spark basics.

Skills evaluated in this interview
