Home
Communities
Companies
- Companies
  
  Discover best places to work
- Compare Companies
  
  Compare & find best workplace
- Add Office Photos
  
  Bring your workplace to life
- Add Company Benefits
  
  Highlight your company's perks
Reviews
- Company reviews
  
  Read reviews for 6L+ companies
- Write a review
  
  Rate your former or current company
Salaries
- Browse salaries
  
  Discover salaries for 6L+ companies
- Salary calculator
  
  Calculate your take home salary
- Are you paid fairly?
  
  Check your market value
- Share your salary
  
  Help other jobseekers
- Gratuity calculator
  
  Check your gratuity amount
- HRA calculator
  
  Check how much of your HRA is tax-free
- Salary hike calculator
  
  Check your salary hike
Interviews
- Company interviews
  
  Read interviews for 40K+ companies
- Share interview questions
  
  Contribute your interview questions
Jobs
Awards

VIEW WINNERS
- ABECA 2025
  
  VIEW WINNERS
  
  AmbitionBox Employee Choice Awards - 4th Edition
- ABECA 2024
  
  AmbitionBox Employee Choice Awards - 3rd Edition
- AmbitionBox Best Places to Work 2022
  
  2nd Edition
Participate in ABECA 2026

Add office photos

Engaged Employer

Cognizant

Compare

3.7

based on 54.4k Reviews

Video summary

Filter interviews by

Cognizant Associate Data Engineer Interview Questions and Answers

Updated 19 Oct 2021

11 Interview questions

An Associate Data Engineer was asked

Q. How do you delete duplicate rows in SQL?

Ans.

Deleting duplicate rows in SQL

Use the DISTINCT keyword in SELECT statement to retrieve unique rows
Use GROUP BY clause to group rows with same values and then use aggregate functions to select one row
Use the ROW_NUMBER() function to assign a unique number to each row and then delete the rows with duplicate numbers

An Associate Data Engineer was asked

Q. How do you find a process ID in Linux?

Ans.

To find process id in Linux, use the command 'ps -aux | grep '

Open the terminal
Type 'ps -aux' to list all running processes
Use 'grep ' to filter the process you are looking for
The process id (PID) will be listed in the second column

An Associate Data Engineer was asked

Q. In DataStage, how would you remove the header and trailer from a sequential data file?

Ans.

To remove header and trailer from a sequential data file in Datastage.

Use Sequential File stage in Datastage.
Set the 'Skip Rows' property to the number of header rows to be skipped.
Set the 'Trailer Rows' property to the number of trailer rows to be skipped.
Use a Transformer stage to remove any remaining header or trailer rows.
Use the 'Remove' function in the Transformer stage to remove the rows.

An Associate Data Engineer was asked

Q. How do you handle Out Of Memory (OOM) issues in Spark?

Ans.

Spark OutOfMemory (OOM) issues occur when the application exceeds memory limits, causing failures in processing data.

Increase executor memory: Use the configuration 'spark.executor.memory' to allocate more memory to executors.
Optimize data partitioning: Use 'repartition()' or 'coalesce()' to manage the number of partitions effectively.
Use broadcast variables: For large lookup tables, use 'sc.broadcast()' to reduce...

What people are saying about Cognizant

View All

a junior software engineer

Job offer in Malaysia - legit or scam?

Hey everyone, I received a job proposal from Mindgraph for a Junior Mainframe Developer position in Malaysia (onsite). Not sure if it's a real deal. They found my resume on Naukri and the offer includes: * Experience: 3+ years on cardlink, VSAM, CICS, JCL * Location: Malaysia (Accenture client in Kuala Lumpur) * Notice: 0-60 days * Benefits: One-way ticket, 1-week stay, medical insurance, visa. Has anyone heard of Mindgraph or had a similar experience? Note : This is a permanent position with Mindgragh and you need to work with our client Accenture - Malaysia (Kaula Lumpur) & we will provide one way Air Ticket from India - Malaysia, 1 Week Accommodation, Medical Insurance and will take care of the Visa process also. Any insights would be appreciated!

Got a question about Cognizant?

Ask anonymously on communities.

An Associate Data Engineer was asked

Q. How would you read data from a .log file and extract specific columns using regular expressions?

Ans.

Reading data from a .log file and extracting columns with a specific regex.

Use Python's built-in 're' module to define the regex pattern.
Open the .log file using Python's 'open' function.
Iterate through each line of the file and extract the desired columns using the regex pattern.
Store the extracted data in a data structure such as a list or dictionary.

An Associate Data Engineer was asked

Q. How would you find the count and profit from the data for the last 4 years?

Ans.

Calculate the count and profit from data over the last four years using SQL or data analysis tools.

Use SQL queries like 'SELECT COUNT(*)' to get the total count of records.
To calculate profit, use 'SUM(profit_column)' grouped by year.
Example SQL: 'SELECT YEAR(date_column), COUNT(*), SUM(profit_column) FROM sales WHERE date_column >= DATE_SUB(CURDATE(), INTERVAL 4 YEAR) GROUP BY YEAR(date_column);'
Consider filte...

An Associate Data Engineer was asked

Q. SORT BY ORDER BY CLUSTER BY DISTRIBUTE BY

Ans.

SORT BY, ORDER BY, CLUSTER BY, and DISTRIBUTE BY are SQL clauses used for data sorting and partitioning.

SORT BY is used to sort the result set in ascending or descending order based on one or more columns.
ORDER BY is used to sort the result set in ascending or descending order based on one or more columns. It is similar to SORT BY but can be used with other clauses like LIMIT and OFFSET.
CLUSTER BY is used to group...

Are these interview questions helpful?

An Associate Data Engineer was asked

Q. RDS VA DF VS DS

Ans.

RDS, VA, DF, VS, and DS are all acronyms related to data engineering.

RDS stands for Relational Database Service, a managed database service by AWS.
VA stands for Virtual Assistant, a software program that can assist with tasks.
DF stands for Dataflow, a managed service by Google Cloud for data processing.
VS stands for Virtual Server, a server that runs on a virtual machine.
DS stands for Datastore, a NoSQL document d...

An Associate Data Engineer was asked

Q. SMALL FILE PROBLEM

Ans.

Small file problem refers to the issue of having a large number of small files in a storage system.

Small files can cause inefficiencies in storage and processing.
Solutions include consolidating small files into larger ones or using a different storage system.
Examples include Hadoop's SequenceFile format and Amazon S3's object size optimization.

An Associate Data Engineer was asked

Q. SQL QUERIES WITH WINDOW FUNCTION

Ans.

SQL queries with window functions

Window functions perform calculations across a set of rows that are related to the current row
Common window functions include ROW_NUMBER, RANK, DENSE_RANK, and NTILE
Window functions are used with the OVER() clause to define the window or subset of rows to perform the calculation on

Cognizant Associate Data Engineer Interview Experiences

4 interviews found

Associate Data Engineer Interview Questions & Answers

Anonymous

posted on 11 Aug 2021

Interview Questionnaire

5 Questions

Q1. How to find 3rd Highest salary in Sql.

View 3 more answers

Q2. How to delete duplicate rows in Sql

View 3 more answers

Q3. Datastge - How will you remove Header and trailer from Sequential data file

View 1 more answer

Q4. How would you kill any job in Datastage

Ans.

To kill a job in Datastage

Stop the job manually from the Director client
Terminate the job from the command line using the dsjob command
Kill the job process from the operating system level
Delete the job from the Datastage repository

Answered by AI

Add your answer

Q5. How to find process id in linux

View 2 more answers

Skills evaluated in this interview

Associate Data Engineer Interview Questions & Answers

Anonymous

posted on 10 Jul 2021

Interview Questionnaire

6 Questions

Q1. Basics of HIVE AND SPARK

Add your answer

Q2. SQL QUERIES WITH WINDOW FUNCTION

Ans.

SQL queries with window functions

Window functions perform calculations across a set of rows that are related to the current row
Common window functions include ROW_NUMBER, RANK, DENSE_RANK, and NTILE
Window functions are used with the OVER() clause to define the window or subset of rows to perform the calculation on

Answered by AI

Add your answer

Q3. SORT BY ORDER BY CLUSTER BY DISTRIBUTE BY

Ans.

SORT BY, ORDER BY, CLUSTER BY, and DISTRIBUTE BY are SQL clauses used for data sorting and partitioning.

SORT BY is used to sort the result set in ascending or descending order based on one or more columns.
ORDER BY is used to sort the result set in ascending or descending order based on one or more columns. It is similar to SORT BY but can be used with other clauses like LIMIT and OFFSET.
CLUSTER BY is used to group data...

Answered by AI

Add your answer

Q4. SPARK OOM ISSUE

Ans.

Spark OutOfMemory (OOM) issues occur when the application exceeds memory limits, causing failures in processing data.

Increase executor memory: Use the configuration 'spark.executor.memory' to allocate more memory to executors.
Optimize data partitioning: Use 'repartition()' or 'coalesce()' to manage the number of partitions effectively.
Use broadcast variables: For large lookup tables, use 'sc.broadcast()' to reduce memo...

Answered by AI

Add your answer

Q5. SMALL FILE PROBLEM

Ans.

Small file problem refers to the issue of having a large number of small files in a storage system.

Small files can cause inefficiencies in storage and processing.
Solutions include consolidating small files into larger ones or using a different storage system.
Examples include Hadoop's SequenceFile format and Amazon S3's object size optimization.

Answered by AI

Add your answer

Q6. RDS VA DF VS DS

Ans.

RDS, VA, DF, VS, and DS are all acronyms related to data engineering.

RDS stands for Relational Database Service, a managed database service by AWS.
VA stands for Virtual Assistant, a software program that can assist with tasks.
DF stands for Dataflow, a managed service by Google Cloud for data processing.
VS stands for Virtual Server, a server that runs on a virtual machine.
DS stands for Datastore, a NoSQL document databa...

Answered by AI

Add your answer

Skills evaluated in this interview

Associate Data Engineer Interview Questions & Answers

akshaya rani

posted on 19 Oct 2021

I applied via Recruitment Consultant and was interviewed in Sep 2021. There were 3 interview rounds.

Interview Questionnaire

2 Questions

Q1. Azure activities

Add your answer

Q2. Project experience

Add your answer

Interview Preparation Tips

Interview preparation tips for other job seekers - Basics of azure must be very clear also scenario based questions.

Associate Data Engineer Interview Questions & Answers

Anonymous

posted on 24 Jun 2021

I applied via Recruitment Consultant and was interviewed before Jun 2020. There were 4 interview rounds.

Interview Questionnaire

6 Questions

Q1. Reading Data from a .log file and finding out each column with a specific regex.

Ans.

Reading data from a .log file and extracting columns with a specific regex.

Use Python's built-in 're' module to define the regex pattern.
Open the .log file using Python's 'open' function.
Iterate through each line of the file and extract the desired columns using the regex pattern.
Store the extracted data in a data structure such as a list or dictionary.

Answered by AI

Add your answer

Q2. Asked to find count and profit from the data for last 4 years

Ans.

Calculate the count and profit from data over the last four years using SQL or data analysis tools.

Use SQL queries like 'SELECT COUNT(*)' to get the total count of records.
To calculate profit, use 'SUM(profit_column)' grouped by year.
Example SQL: 'SELECT YEAR(date_column), COUNT(*), SUM(profit_column) FROM sales WHERE date_column >= DATE_SUB(CURDATE(), INTERVAL 4 YEAR) GROUP BY YEAR(date_column);'
Consider filtering ...

Answered by AI

Add your answer

Q3. Optimizations I can use

Ans.

Optimizations for data engineering

Use indexing to speed up queries
Partition data to improve query performance
Use caching to reduce data retrieval time
Optimize data storage format for faster processing
Use parallel processing to speed up data processing
Optimize network bandwidth usage
Use compression to reduce storage and network usage

Answered by AI

Add your answer

Q4. Python JSON reading

Ans.

Answering how to read JSON in Python.

Use the json module to load and parse JSON data
Use the json.loads() method to load JSON data from a string
Use the json.load() method to load JSON data from a file
Access JSON data using keys or indexes
Use the json.dumps() method to convert Python objects to JSON strings

Answered by AI

Add your answer

Q5. Array, List - Python scenarios

Add your answer

Q6. Pyspark configs

Add your answer

Interview Preparation Tips

Interview preparation tips for other job seekers - Very easy if you have prev work exp.

Skills evaluated in this interview

Interview questions from similar companies

Software Engineer Interview Questions & Answers

Infosys

Anonymous

posted on 5 Jun 2021

I applied via Company Website and was interviewed before Jun 2020. There were 7 interview rounds.

Interview Questionnaire

4 Questions

Q1. Project planning

Add your answer

Q2. Achievements

Add your answer

Q3. Coding questions

Add your answer

Q4. Many more other

Add your answer

Interview Preparation Tips

Interview preparation tips for other job seekers - Overall was very nice and easy

Software Engineer Interview Questions & Answers

TCS

Anonymous

posted on 28 Jan 2022

I applied via Campus Placement and was interviewed before Jan 2021. There were 2 interview rounds.

Round 1 - Aptitude Test

Good

Round 2 - Technical

(1 Question)

Q1. Basic question from C++.Some questions from Data structure and computer architecture.

Add your answer

Interview Preparation Tips

Topics to prepare for TCS Software Engineer interview:

Interview preparation tips for other job seekers - Prepare well. Aptitude is not very easy so you have to prepare well.

Software Engineer Interview Questions & Answers

Capgemini

Anonymous

posted on 20 Jul 2021

I applied via Campus Placement and was interviewed before Jul 2020. There were 4 interview rounds.

Interview Questionnaire

1 Question

Q1. Related to final year project and basic programming.

Add your answer

Interview Preparation Tips

Interview preparation tips for other job seekers - I applied for pool campus drive before covid in feb 2019. The whole interview process have done in one day.

The first round was basic aptitude and essay writing the level of difficulty is average and this is also elimination round.

The second was pseudo coding round It has mcq based programing question like arrays, looping and some data structure question.

Depending on the score in coding round they split us into two groups infra and non infra. Infra for cloud and infrastructure related roles and non-infra for programming related role.

The third round was technical Hr. The level of the interview is average he asked basic questions in c and java like inheritance, Polymorphism, encapsulates etc.. and some final project related questions.

The last round was General Hr. He is very friendly and ask about some basic hr question. More importantly know something about the company before getting into this round.

I cleared the interview and got my LOI after 2 months.

Capgemini follows the pre joining course for fresher as ADAPT. It's very useful those are from non IT background.

Clearing ADAPT is not mandatory but you need to participate.

Due to pandemic I got the offer letter on Feb 2021 and now I have completed my training and looking for project.

Are these interview questions helpful?

Software Engineer Interview Questions & Answers

Infosys

Anonymous

posted on 23 Jul 2021

I applied via LinkedIn and was interviewed before Jul 2020. There were 4 interview rounds.

Interview Questionnaire

4 Questions

Q1. Which technologies your interested to work

Add your answer

Q2. Question related to Java coding

Add your answer

Q3. Question from C language

Add your answer

Q4. Question from AI & ML

Add your answer

Interview Preparation Tips

Interview preparation tips for other job seekers - Prepare on all the latest technologies, brush your regular skills

Software Engineer Interview Questions & Answers

Capgemini

Anonymous

posted on 23 Oct 2021

I applied via Company Website and was interviewed before Oct 2020. There were 3 interview rounds.

Interview Questionnaire

1 Question

Q1. Tell me about your experience

Add your answer

Interview Preparation Tips

Interview preparation tips for other job seekers - Be confident adn clear when you answer

Software Engineer Interview Questions & Answers

HCLTech

Anonymous

posted on 3 Jul 2022

I applied via Amcat and was interviewed before Jul 2021. There were 2 interview rounds.

Round 1 - Aptitude Test

Refer R S Agarwal book for apptitude

Round 2 - One-on-one

(1 Question)

Q1. Write a c program on fractional numbers

Ans.

A C program to perform arithmetic operations on fractional numbers.

Use float or double data type to store fractional numbers.
Use scanf() to take input from the user.
Perform arithmetic operations like addition, subtraction, multiplication, and division.
Use printf() to display the result.

Answered by AI

Add your answer

Interview Preparation Tips

Interview preparation tips for other job seekers - Be prepared refer R S Agarwal book for apptitude test

Skills evaluated in this interview

Cognizant Interview FAQs

What are the top questions asked in Cognizant Associate Data Engineer interview?

Some of the top questions asked at the Cognizant Associate Data Engineer interview -

Datastge - How will you remove Header and trailer from Sequential data f...read more
How to delete duplicate rows in ...read more
How to find process id in lin...read more

Tell us how to improve this page.

Cognizant Interviews By Designations

Interview Questions for Popular Designations

TCS Interview Questions

3.6

• 11k Interviews

Accenture Interview Questions

3.8

• 8.6k Interviews

Infosys Interview Questions

3.6

• 7.9k Interviews

Wipro Interview Questions

3.7

• 6k Interviews

Capgemini Interview Questions

3.7

• 5k Interviews

Tech Mahindra Interview Questions

3.5

• 4.1k Interviews

HCLTech Interview Questions

3.5

• 4.1k Interviews

Genpact Interview Questions

3.8

• 3.4k Interviews

IBM Interview Questions

4.0

• 2.4k Interviews

DXC Technology Interview Questions

3.7

• 834 Interviews

View all

Cognizant Associate Data Engineer Salary

based on 126 salaries

₹4.5 L/yr - ₹11.8 L/yr

13% less than the average Associate Data Engineer Salary in India

View more details

Cognizant Salaries in India

Associate 73.1k salaries	₹5.1 L/yr - ₹14.5 L/yr
Programmer Analyst 56.1k salaries	₹2 L/yr - ₹9.2 L/yr
Senior Associate 54.7k salaries	₹8.4 L/yr - ₹28.6 L/yr
Senior Processing Executive 29.7k salaries	₹1.4 L/yr - ₹9 L/yr
Technical Lead 18k salaries	₹6 L/yr - ₹25.5 L/yr