Upload Button Icon Add office photos
Engaged Employer

i

This company page is being actively managed by TCS Team. If you also belong to the team, you can get access from here

TCS Verified Tick

Compare button icon Compare button icon Compare

Filter interviews by

TCS Data Engineer Interview Questions and Answers

Updated 13 Jun 2025

78 Interview questions

A Data Engineer was asked 9mo ago
Q. What is Spark? Why it is so popular
Ans. 

Spark is a fast and general-purpose cluster computing system for big data processing.

  • Spark is popular for its speed and ease of use in processing large datasets.

  • It provides in-memory processing capabilities, making it faster than traditional disk-based processing systems.

  • Spark supports multiple programming languages like Java, Scala, Python, and R.

  • It offers a wide range of libraries for diverse tasks such as SQL, ...

A Data Engineer was asked
Q. 1.What is partition and bucketing. 2.diffrenece between union and union all. 3.spark architecture. 4.manage and external table in hive and diffirence. 5.sql and pyhton problem basic
Ans. 

Answers to interview questions for Data Engineer position.

  • 1. Partitioning is a way to divide a large dataset into smaller, more manageable parts based on a specific column or expression. Bucketing is a technique to further organize the data within each partition into smaller, equally-sized files based on a hash function.

  • 2. UNION combines the result sets of two or more SELECT statements, removing duplicate rows. UN...

Data Engineer Interview Questions Asked at Other Companies

asked in Sigmoid
Q1. Next Greater Element Problem Statement You are given an array arr ... read more
asked in LTIMindtree
Q2. If you are given cards numbered 1-1000 and 4 boxes, where card 1 ... read more
asked in Cisco
Q3. Optimal Strategy for a Coin Game You are playing a coin game with ... read more
asked in Sigmoid
Q4. Problem: Search In Rotated Sorted Array Given a sorted array that ... read more
asked in Sigmoid
Q5. K-th Element of Two Sorted Arrays You are provided with two sorte ... read more
A Data Engineer was asked
Q. 1)Diffrence between partion and bucketing in hve 2) diffrence between internal and external tables 3) explain hive architecture 4) difference between cache and parsit 5) what is RDD
Ans. 

1) Partitioning is dividing data into smaller parts based on a column, while bucketing is dividing data into equal-sized files based on a hash function. 2) Internal tables store data in a default location managed by Hive, while external tables store data in a user-defined location. 3) Hive architecture consists of a metastore, driver, compiler, optimizer, and execution engine. 4) Cache stores data in memory for fa...

A Data Engineer was asked
Q. Oracle, python, pyspark, have you worked on sql? What is you job role at previous comapny?
Ans. 

Yes, I have experience working with Oracle, Python, PySpark, and SQL in my previous roles as a Data Engineer.

  • Worked extensively with Oracle databases for data storage and retrieval

  • Utilized Python for data manipulation, analysis, and automation tasks

  • Implemented data processing and analytics using PySpark

  • Proficient in writing and optimizing SQL queries for data extraction and transformation

What people are saying about TCS

View All
a senior associate
2w
Tata's lost its touch? TCS ain't what it used to be :-(
Tata is not the same after Sir Ratan Tata! TCS used to really look after its employees, even when they were on the bench. Now, things have changed and it's disappointing.
FeedCard Image
Got a question about TCS?
Ask anonymously on communities.
A Data Engineer was asked
Q. What is view in SQL and dense and dense rank
Ans. 

View is a virtual table created from a SQL query. Dense rank assigns a unique rank to each row in a result set.

  • A view is a saved SQL query that can be used as a table

  • Dense rank assigns a unique rank to each row in a result set, with no gaps between the ranks

  • Dense rank is used to rank rows based on a specific column or set of columns

  • Example: SELECT * FROM my_view WHERE column_name = 'value'

  • Example: SELECT column_na...

A Data Engineer was asked 7mo ago
Q. Queue & Stack Algorithm
Ans. 

Queue & Stack Algorithm involves data structures for storing and retrieving data in a specific order.

  • Queue follows First In First Out (FIFO) principle, like a line at a grocery store.

  • Stack follows Last In First Out (LIFO) principle, like a stack of plates.

  • Examples: Queue - BFS algorithm in graph traversal. Stack - Undo feature in text editors.

A Data Engineer was asked 7mo ago
Q. Rotation of Array
Ans. 

Rotate an array of strings by a given number of positions.

  • Create a new array and copy elements from the original array based on the rotation index.

  • Handle cases where the rotation index is greater than the array length by using modulo operation.

  • Example: Original array ['a', 'b', 'c', 'd', 'e'], rotate by 2 positions -> ['c', 'd', 'e', 'a', 'b']

Are these interview questions helpful?
A Data Engineer was asked 9mo ago
Q. Corrupt Record Handling in Spark
Ans. 

Corrupt record handling in Spark involves identifying and handling data that does not conform to expected formats.

  • Use DataFrameReader option("badRecordsPath", "path/to/bad/records") to save corrupt records to a separate location for further analysis.

  • Use DataFrame.na.drop() or DataFrame.na.fill() to handle corrupt records by dropping or filling missing values.

  • Implement custom logic to identify and handle corrupt re...

A Data Engineer was asked 9mo ago
Q. SCD 1 vs SCD 2
Ans. 

SCD 1 overwrites old data with new data, while SCD 2 keeps track of historical changes.

  • SCD 1 updates existing records with new data, losing historical information.

  • SCD 2 creates new records for each change, preserving historical data.

  • SCD 1 is simpler and faster, but can lead to data loss.

  • SCD 2 is more complex and slower, but maintains a full history of changes.

A Data Engineer was asked 10mo ago
Q. Optimizations in pyspark
Ans. 

Optimizations in pyspark involve techniques to improve performance and efficiency of data processing.

  • Use partitioning to distribute data evenly across nodes for parallel processing

  • Utilize caching to store intermediate results in memory for faster access

  • Avoid unnecessary shuffling of data by using appropriate join strategies

  • Optimize the execution plan by analyzing and adjusting the stages of the job

  • Use broadcast va...

TCS Data Engineer Interview Experiences

97 interviews found

Data Engineer Interview Questions & Answers

user image Anonymous

posted on 17 May 2025

Interview experience
1
Bad
Difficulty level
-
Process Duration
-
Result
-

I appeared for an interview in Apr 2025, where I was asked the following questions.

  • Q1. TCS Bhubaneswar Absolutely the Worst Walk-in Interview Experience Ever! I’ve never seen such a poorly organized and mismanaged interview process in my life. The entire setup was a disaster from the start. ...
  • Q2. The chaos, the waiting time, the heat, and the indifferent attitude of the panel made it one of the most frustrating and humiliating experiences ever. If you value your time, energy, and dignity — avoid th...

Interview Preparation Tips

Interview preparation tips for other job seekers - My honest advice: Don’t fall into this trap. Avoid this company’s walk-ins(Specially TCS Bhubaneswar)unless you want to be disrespected, misled, and left exhausted and disappointed. One of the Worst Walk-in Interview Experiences Ever – A Complete Disaster! I’ve attended many interviews in my career, but this was by far the worst and most mismanaged experience ever. The level of unprofessionalism and disorganization was beyond anything I could’ve imagined. They scheduled walk-in interviews for 15–17 different technologies on the same day without any proper planning. Around 700+ candidates showed up, and there was no seating arrangement, no air conditioning, not even fans. We were all left standing for hours under the harsh sun like cattle. No water, no shelter, no communication — it was pure chaos. Now, here's where it gets even more ridiculous: I went there specifically for a Snowflake position. The person who interviewed me claimed to be from the Snowflake panel — but as it turns out, there was no Snowflake requirement at all. Shockingly, two other candidates who were interviewed after me were told the same thing — that there’s no Snowflake opening, and if they wanted to proceed, they’d have to switch and interview for Azure Databricks instead. Is this fair? If you don’t have a requirement, and if you don’t have the right panel, then why call people in the first place? Why make hundreds of people wait from 9 AM to 5 PM in the heat with zero basic facilities and then casually tell them there’s no opening for the technology they applied for? And the panelists? Extremely rude and arrogant — they talked down to candidates as if we were wasting their time, when in fact they were wasting ours.

Data Engineer Interview Questions & Answers

user image Anonymous

posted on 17 Oct 2024

Interview experience
3
Average
Difficulty level
Moderate
Process Duration
-
Result
Not Selected

I applied via Walk-in

Round 1 - Technical 

(2 Questions)

  • Q1. Difference between rank and dense_rank, Left vs Left anti join
  • Ans. 

    Rank assigns unique ranks to rows, while dense_rank handles ties by assigning the same rank to tied rows. Left join includes all rows from the left table and matching rows from the right table, while left anti join includes only rows from the left table that do not have a match in the right table.

    • Rank assigns unique ranks to rows based on the specified order, while dense_rank handles ties by assigning the same rank to ...

  • Answered by AI
  • Q2. Python list comprehension, SQL query
Round 2 - Behavioral 

(1 Question)

  • Q1. Project related questions

Interview Preparation Tips

Interview preparation tips for other job seekers - No response from HR, even after clearing technical and managerial rounds

Skills evaluated in this interview

Data Engineer Interview Questions & Answers

user image Anonymous

posted on 20 Sep 2024

Interview experience
4
Good
Difficulty level
Easy
Process Duration
Less than 2 weeks
Result
Selected Selected

I applied via Recruitment Consulltant and was interviewed in Aug 2024. There were 2 interview rounds.

Round 1 - One-on-one 

(3 Questions)

  • Q1. Questions from pyspark - theoritical
  • Q2. Pyspark basic coding
  • Q3. SQL easy level coding question
Round 2 - One-on-one 

(1 Question)

  • Q1. 2 nd round was managerial. The interviewer asked about projects worked and assessed how I fit for the company

Interview Preparation Tips

Topics to prepare for TCS Data Engineer interview:
  • pyspark
  • databricks
  • SQL
  • projects

Data Engineer Interview Questions & Answers

user image Anonymous

posted on 10 Oct 2024

Interview experience
5
Excellent
Difficulty level
-
Process Duration
-
Result
-
Round 1 - Aptitude Test 

Focus of quantitative maths and aptitude a bit more

Round 2 - Technical 

(3 Questions)

  • Q1. Some basic questions on your comfortable programming knowledge
  • Q2. Some behaviour oriented questions
  • Q3. Some role related questions
Round 3 - HR 

(2 Questions)

  • Q1. Some behaviour oriented
  • Q2. Salary discussion

Data Engineer Interview Questions & Answers

user image Mohammed Suboor Ahmed

posted on 24 Nov 2024

Interview experience
5
Excellent
Difficulty level
Easy
Process Duration
Less than 2 weeks
Result
Selected Selected

I applied via LinkedIn and was interviewed in Oct 2024. There was 1 interview round.

Round 1 - Technical 

(2 Questions)

  • Q1. Reverse string in python list
  • Ans. 

    Reverse strings in a Python list

    • Use list comprehension to iterate through the list and reverse each string

    • Use the slice notation [::-1] to reverse each string

    • Example: strings = ['hello', 'world'], reversed_strings = [s[::-1] for s in strings]

  • Answered by AI
  • Q2. 2nd highest salary sql
  • Ans. 

    To find the 2nd highest salary in SQL, use the 'SELECT' statement with 'ORDER BY' and 'LIMIT' clauses.

    • Use the 'SELECT' statement to retrieve the salary column from the table.

    • Use the 'ORDER BY' clause to sort the salaries in descending order.

    • Use the 'LIMIT' clause to limit the result to the second row.

  • Answered by AI

Skills evaluated in this interview

Data Engineer Interview Questions & Answers

user image Anonymous

posted on 11 Oct 2024

Interview experience
5
Excellent
Difficulty level
Moderate
Process Duration
Less than 2 weeks
Result
Not Selected

I appeared for an interview in Sep 2024.

Round 1 - One-on-one 

(3 Questions)

  • Q1. Spark architecture
  • Q2. Databricks clusters
  • Q3. Copy activity and all features
Interview experience
4
Good
Difficulty level
Moderate
Process Duration
-
Result
-

I applied via Approached by Company and was interviewed in Sep 2024. There was 1 interview round.

Round 1 - Technical 

(2 Questions)

  • Q1. SCD 1 vs SCD 2
  • Ans. 

    SCD 1 overwrites old data with new data, while SCD 2 keeps track of historical changes.

    • SCD 1 updates existing records with new data, losing historical information.

    • SCD 2 creates new records for each change, preserving historical data.

    • SCD 1 is simpler and faster, but can lead to data loss.

    • SCD 2 is more complex and slower, but maintains a full history of changes.

  • Answered by AI
  • Q2. Corrupt Record Handling in Spark
  • Ans. 

    Corrupt record handling in Spark involves identifying and handling data that does not conform to expected formats.

    • Use DataFrameReader option("badRecordsPath", "path/to/bad/records") to save corrupt records to a separate location for further analysis.

    • Use DataFrame.na.drop() or DataFrame.na.fill() to handle corrupt records by dropping or filling missing values.

    • Implement custom logic to identify and handle corrupt records...

  • Answered by AI

Interview Preparation Tips

Topics to prepare for TCS Data Engineer interview:
  • Python
  • PySpark
  • SQL
  • ETL

Skills evaluated in this interview

Data Engineer Interview Questions & Answers

user image Anonymous

posted on 30 Aug 2024

Interview experience
3
Average
Difficulty level
-
Process Duration
-
Result
-
Round 1 - One-on-one 

(3 Questions)

  • Q1. What is oops concept
  • Ans. 

    Object-oriented programming (OOP) is a programming paradigm based on the concept of objects, which can contain data in the form of fields and code in the form of procedures.

    • OOP focuses on creating objects that interact with each other to solve a problem

    • Key concepts include encapsulation, inheritance, polymorphism, and abstraction

    • Encapsulation involves bundling data and methods that operate on the data into a single uni...

  • Answered by AI
  • Q2. Explain data engineer life cycle and its tools
  • Ans. 

    Data engineer life cycle involves collecting, storing, processing, and analyzing data using various tools.

    • Data collection: Gathering data from various sources such as databases, APIs, and logs.

    • Data storage: Storing data in databases, data lakes, or data warehouses.

    • Data processing: Cleaning, transforming, and enriching data using tools like Apache Spark or Hadoop.

    • Data analysis: Analyzing data to extract insights and mak...

  • Answered by AI
  • Q3. What types of spark join strategies
  • Ans. 

    Spark join strategies include broadcast join, shuffle hash join, and shuffle sort merge join.

    • Broadcast join is used when one of the DataFrames is small enough to fit in memory on all nodes.

    • Shuffle hash join is used when joining two large DataFrames by partitioning and shuffling the data based on the join key.

    • Shuffle sort merge join is used when joining two large DataFrames by sorting and merging the data based on the j...

  • Answered by AI

Interview Preparation Tips

Interview preparation tips for other job seekers - prepared hadoop ecosystem related question ,sql and one programing language (python,java,sacala)

Skills evaluated in this interview

Interview experience
4
Good
Difficulty level
-
Process Duration
-
Result
-
Round 1 - Technical 

(2 Questions)

  • Q1. What is Spark? Why it is so popular
  • Ans. 

    Spark is a fast and general-purpose cluster computing system for big data processing.

    • Spark is popular for its speed and ease of use in processing large datasets.

    • It provides in-memory processing capabilities, making it faster than traditional disk-based processing systems.

    • Spark supports multiple programming languages like Java, Scala, Python, and R.

    • It offers a wide range of libraries for diverse tasks such as SQL, strea...

  • Answered by AI
  • Q2. What is Clustering? what is difference between pods and nodes?
  • Ans. 

    Clustering is the process of grouping similar data points together. Pods are groups of one or more containers, while nodes are individual machines in a cluster.

    • Clustering is a technique used in machine learning to group similar data points together based on certain features or characteristics.

    • Pods in a cluster are groups of one or more containers that share resources and are scheduled together on the same node.

    • Nodes ar...

  • Answered by AI

Skills evaluated in this interview

Data Engineer Interview Questions & Answers

user image Himanshu Pilkhwal

posted on 19 Dec 2024

Interview experience
2
Poor
Difficulty level
Moderate
Process Duration
-
Result
No response
Round 1 - Technical 

(1 Question)

  • Q1. Regarding Pipeline scheduling multiple questions

TCS Interview FAQs

How many rounds are there in TCS Data Engineer interview?
TCS interview process usually has 1-2 rounds. The most common rounds in the TCS interview process are Technical, HR and Coding Test.
How to prepare for TCS Data Engineer interview?
Go through your CV in detail and study all the technologies mentioned in your CV. Prepare at least two technologies or languages in depth if you are appearing for a technical interview at TCS. The most common topics and skills that interviewers at TCS expect are Python, SQL, Spark, AWS and Big Data.
What are the top questions asked in TCS Data Engineer interview?

Some of the top questions asked at the TCS Data Engineer interview -

  1. what is an internal and external table in H...read more
  2. what is view in SQL and dense and dense r...read more
  3. How to deal with data quality iss...read more
How long is the TCS Data Engineer interview process?

The duration of TCS Data Engineer interview process can vary, but typically it takes about less than 2 weeks to complete.

Tell us how to improve this page.

Overall Interview Experience Rating

4/5

based on 101 interview experiences

Difficulty level

Easy 22%
Moderate 73%
Hard 5%

Duration

Less than 2 weeks 68%
2-4 weeks 18%
4-6 weeks 7%
6-8 weeks 5%
More than 8 weeks 2%
View more
TCS Data Engineer Salary
based on 6.3k salaries
₹3.7 L/yr - ₹10 L/yr
41% less than the average Data Engineer Salary in India
View more details

TCS Data Engineer Reviews and Ratings

based on 513 reviews

3.6/5

Rating in categories

3.5

Skill development

3.9

Work-life balance

2.5

Salary

4.6

Job security

3.5

Company culture

2.4

Promotions

3.3

Work satisfaction

Explore 513 Reviews and Ratings
Data Engineer

Chennai

7-12 Yrs

Not Disclosed

Sr. AWS Databricks Data Engineer

Kolkata,

Pune

+1

6-11 Yrs

Not Disclosed

AWS Databricks Data Engineer

Hyderabad / Secunderabad,

Bangalore / Bengaluru

+1

6-11 Yrs

Not Disclosed

Explore more jobs
System Engineer
1.1L salaries
unlock blur

₹1 L/yr - ₹9 L/yr

IT Analyst
65.6k salaries
unlock blur

₹7.7 L/yr - ₹12.7 L/yr

AST Consultant
53.5k salaries
unlock blur

₹12 L/yr - ₹20.6 L/yr

Assistant System Engineer
33.2k salaries
unlock blur

₹2.5 L/yr - ₹6.4 L/yr

Associate Consultant
32.9k salaries
unlock blur

₹16.2 L/yr - ₹28 L/yr

Explore more salaries
Compare TCS with

Amazon

4.0
Compare

Wipro

3.7
Compare

Infosys

3.6
Compare

Accenture

3.7
Compare
write
Share an Interview