Upload Button Icon Add office photos
Engaged Employer

i

This company page is being actively managed by Cognizant Team. If you also belong to the team, you can get access from here

Cognizant Verified Tick

Compare button icon Compare button icon Compare

Filter interviews by

Cognizant Associate Data Engineer Interview Questions and Answers

Updated 19 Oct 2021

11 Interview questions

An Associate Data Engineer was asked
Q. How do you delete duplicate rows in SQL?
Ans. 

Deleting duplicate rows in SQL

  • Use the DISTINCT keyword in SELECT statement to retrieve unique rows

  • Use GROUP BY clause to group rows with same values and then use aggregate functions to select one row

  • Use the ROW_NUMBER() function to assign a unique number to each row and then delete the rows with duplicate numbers

An Associate Data Engineer was asked
Q. How do you find a process ID in Linux?
Ans. 

To find process id in Linux, use the command 'ps -aux | grep '

  • Open the terminal

  • Type 'ps -aux' to list all running processes

  • Use 'grep ' to filter the process you are looking for

  • The process id (PID) will be listed in the second column

Associate Data Engineer Interview Questions Asked at Other Companies

asked in Cognizant
Q1. In DataStage, how would you remove the header and trailer from a ... read more
asked in Cognizant
Q2. How do you delete duplicate rows in SQL?
asked in Capgemini
Q3. How would you recommend a customer migrate to the cloud?
asked in Cognizant
Q4. How do you find a process ID in Linux?
asked in Cognizant
Q5. How would you read data from a .log file and extract specific col ... read more
An Associate Data Engineer was asked
Q. In DataStage, how would you remove the header and trailer from a sequential data file?
Ans. 

To remove header and trailer from a sequential data file in Datastage.

  • Use Sequential File stage in Datastage.

  • Set the 'Skip Rows' property to the number of header rows to be skipped.

  • Set the 'Trailer Rows' property to the number of trailer rows to be skipped.

  • Use a Transformer stage to remove any remaining header or trailer rows.

  • Use the 'Remove' function in the Transformer stage to remove the rows.

An Associate Data Engineer was asked
Q. How do you handle Out Of Memory (OOM) issues in Spark?
Ans. 

Spark OutOfMemory (OOM) issues occur when the application exceeds memory limits, causing failures in processing data.

  • Increase executor memory: Use the configuration 'spark.executor.memory' to allocate more memory to executors.

  • Optimize data partitioning: Use 'repartition()' or 'coalesce()' to manage the number of partitions effectively.

  • Use broadcast variables: For large lookup tables, use 'sc.broadcast()' to reduce...

What people are saying about Cognizant

View All
a junior software engineer
6d
Job offer in Malaysia - legit or scam?
Hey everyone, I received a job proposal from Mindgraph for a Junior Mainframe Developer position in Malaysia (onsite). Not sure if it's a real deal. They found my resume on Naukri and the offer includes: * Experience: 3+ years on cardlink, VSAM, CICS, JCL * Location: Malaysia (Accenture client in Kuala Lumpur) * Notice: 0-60 days * Benefits: One-way ticket, 1-week stay, medical insurance, visa. Has anyone heard of Mindgraph or had a similar experience? Note : This is a permanent position with Mindgragh and you need to work with our client Accenture - Malaysia (Kaula Lumpur) & we will provide one way Air Ticket from India - Malaysia, 1 Week Accommodation, Medical Insurance and will take care of the Visa process also. Any insights would be appreciated!
Got a question about Cognizant?
Ask anonymously on communities.
An Associate Data Engineer was asked
Q. How would you read data from a .log file and extract specific columns using regular expressions?
Ans. 

Reading data from a .log file and extracting columns with a specific regex.

  • Use Python's built-in 're' module to define the regex pattern.

  • Open the .log file using Python's 'open' function.

  • Iterate through each line of the file and extract the desired columns using the regex pattern.

  • Store the extracted data in a data structure such as a list or dictionary.

An Associate Data Engineer was asked
Q. How would you find the count and profit from the data for the last 4 years?
Ans. 

Calculate the count and profit from data over the last four years using SQL or data analysis tools.

  • Use SQL queries like 'SELECT COUNT(*)' to get the total count of records.

  • To calculate profit, use 'SUM(profit_column)' grouped by year.

  • Example SQL: 'SELECT YEAR(date_column), COUNT(*), SUM(profit_column) FROM sales WHERE date_column >= DATE_SUB(CURDATE(), INTERVAL 4 YEAR) GROUP BY YEAR(date_column);'

  • Consider filte...

An Associate Data Engineer was asked
Q. SORT BY ORDER BY CLUSTER BY DISTRIBUTE BY
Ans. 

SORT BY, ORDER BY, CLUSTER BY, and DISTRIBUTE BY are SQL clauses used for data sorting and partitioning.

  • SORT BY is used to sort the result set in ascending or descending order based on one or more columns.

  • ORDER BY is used to sort the result set in ascending or descending order based on one or more columns. It is similar to SORT BY but can be used with other clauses like LIMIT and OFFSET.

  • CLUSTER BY is used to group...

Are these interview questions helpful?
An Associate Data Engineer was asked
Q. RDS VA DF VS DS
Ans. 

RDS, VA, DF, VS, and DS are all acronyms related to data engineering.

  • RDS stands for Relational Database Service, a managed database service by AWS.

  • VA stands for Virtual Assistant, a software program that can assist with tasks.

  • DF stands for Dataflow, a managed service by Google Cloud for data processing.

  • VS stands for Virtual Server, a server that runs on a virtual machine.

  • DS stands for Datastore, a NoSQL document d...

An Associate Data Engineer was asked
Q. SMALL FILE PROBLEM
Ans. 

Small file problem refers to the issue of having a large number of small files in a storage system.

  • Small files can cause inefficiencies in storage and processing.

  • Solutions include consolidating small files into larger ones or using a different storage system.

  • Examples include Hadoop's SequenceFile format and Amazon S3's object size optimization.

An Associate Data Engineer was asked
Q. SQL QUERIES WITH WINDOW FUNCTION
Ans. 

SQL queries with window functions

  • Window functions perform calculations across a set of rows that are related to the current row

  • Common window functions include ROW_NUMBER, RANK, DENSE_RANK, and NTILE

  • Window functions are used with the OVER() clause to define the window or subset of rows to perform the calculation on

Cognizant Associate Data Engineer Interview Experiences

4 interviews found

Interview Questionnaire 

5 Questions

  • Q1. How to find 3rd Highest salary in Sql.
  • Q2. How to delete duplicate rows in Sql
  • Q3. Datastge - How will you remove Header and trailer from Sequential data file
  • Q4. How would you kill any job in Datastage
  • Ans. 

    To kill a job in Datastage

    • Stop the job manually from the Director client

    • Terminate the job from the command line using the dsjob command

    • Kill the job process from the operating system level

    • Delete the job from the Datastage repository

  • Answered by AI
  • Q5. How to find process id in linux

Skills evaluated in this interview

Interview Questionnaire 

6 Questions

  • Q1. Basics of HIVE AND SPARK
  • Q2. SQL QUERIES WITH WINDOW FUNCTION
  • Ans. 

    SQL queries with window functions

    • Window functions perform calculations across a set of rows that are related to the current row

    • Common window functions include ROW_NUMBER, RANK, DENSE_RANK, and NTILE

    • Window functions are used with the OVER() clause to define the window or subset of rows to perform the calculation on

  • Answered by AI
  • Q3. SORT BY ORDER BY CLUSTER BY DISTRIBUTE BY
  • Ans. 

    SORT BY, ORDER BY, CLUSTER BY, and DISTRIBUTE BY are SQL clauses used for data sorting and partitioning.

    • SORT BY is used to sort the result set in ascending or descending order based on one or more columns.

    • ORDER BY is used to sort the result set in ascending or descending order based on one or more columns. It is similar to SORT BY but can be used with other clauses like LIMIT and OFFSET.

    • CLUSTER BY is used to group data...

  • Answered by AI
  • Q4. SPARK OOM ISSUE
  • Ans. 

    Spark OutOfMemory (OOM) issues occur when the application exceeds memory limits, causing failures in processing data.

    • Increase executor memory: Use the configuration 'spark.executor.memory' to allocate more memory to executors.

    • Optimize data partitioning: Use 'repartition()' or 'coalesce()' to manage the number of partitions effectively.

    • Use broadcast variables: For large lookup tables, use 'sc.broadcast()' to reduce memo...

  • Answered by AI
  • Q5. SMALL FILE PROBLEM
  • Ans. 

    Small file problem refers to the issue of having a large number of small files in a storage system.

    • Small files can cause inefficiencies in storage and processing.

    • Solutions include consolidating small files into larger ones or using a different storage system.

    • Examples include Hadoop's SequenceFile format and Amazon S3's object size optimization.

  • Answered by AI
  • Q6. RDS VA DF VS DS
  • Ans. 

    RDS, VA, DF, VS, and DS are all acronyms related to data engineering.

    • RDS stands for Relational Database Service, a managed database service by AWS.

    • VA stands for Virtual Assistant, a software program that can assist with tasks.

    • DF stands for Dataflow, a managed service by Google Cloud for data processing.

    • VS stands for Virtual Server, a server that runs on a virtual machine.

    • DS stands for Datastore, a NoSQL document databa...

  • Answered by AI

Skills evaluated in this interview

I applied via Recruitment Consultant and was interviewed in Sep 2021. There were 3 interview rounds.

Interview Questionnaire 

2 Questions

  • Q1. Azure activities
  • Q2. Project experience

Interview Preparation Tips

Interview preparation tips for other job seekers - Basics of azure must be very clear also scenario based questions.

I applied via Recruitment Consultant and was interviewed before Jun 2020. There were 4 interview rounds.

Interview Questionnaire 

6 Questions

  • Q1. Reading Data from a .log file and finding out each column with a specific regex.
  • Ans. 

    Reading data from a .log file and extracting columns with a specific regex.

    • Use Python's built-in 're' module to define the regex pattern.

    • Open the .log file using Python's 'open' function.

    • Iterate through each line of the file and extract the desired columns using the regex pattern.

    • Store the extracted data in a data structure such as a list or dictionary.

  • Answered by AI
  • Q2. Asked to find count and profit from the data for last 4 years
  • Ans. 

    Calculate the count and profit from data over the last four years using SQL or data analysis tools.

    • Use SQL queries like 'SELECT COUNT(*)' to get the total count of records.

    • To calculate profit, use 'SUM(profit_column)' grouped by year.

    • Example SQL: 'SELECT YEAR(date_column), COUNT(*), SUM(profit_column) FROM sales WHERE date_column >= DATE_SUB(CURDATE(), INTERVAL 4 YEAR) GROUP BY YEAR(date_column);'

    • Consider filtering ...

  • Answered by AI
  • Q3. Optimizations I can use
  • Ans. 

    Optimizations for data engineering

    • Use indexing to speed up queries

    • Partition data to improve query performance

    • Use caching to reduce data retrieval time

    • Optimize data storage format for faster processing

    • Use parallel processing to speed up data processing

    • Optimize network bandwidth usage

    • Use compression to reduce storage and network usage

  • Answered by AI
  • Q4. Python JSON reading
  • Ans. 

    Answering how to read JSON in Python.

    • Use the json module to load and parse JSON data

    • Use the json.loads() method to load JSON data from a string

    • Use the json.load() method to load JSON data from a file

    • Access JSON data using keys or indexes

    • Use the json.dumps() method to convert Python objects to JSON strings

  • Answered by AI
  • Q5. Array, List - Python scenarios
  • Q6. Pyspark configs

Interview Preparation Tips

Interview preparation tips for other job seekers - Very easy if you have prev work exp.

Skills evaluated in this interview

Interview questions from similar companies

I applied via Company Website and was interviewed before Jun 2020. There were 7 interview rounds.

Interview Questionnaire 

4 Questions

  • Q1. Project planning
  • Q2. Achievements
  • Q3. Coding questions
  • Q4. Many more other

Interview Preparation Tips

Interview preparation tips for other job seekers - Overall was very nice and easy

I applied via Campus Placement and was interviewed before Jan 2021. There were 2 interview rounds.

Round 1 - Aptitude Test 

Good

Round 2 - Technical 

(1 Question)

  • Q1. Basic question from C++.Some questions from Data structure and computer architecture.

Interview Preparation Tips

Topics to prepare for TCS Software Engineer interview:
  • C++
Interview preparation tips for other job seekers - Prepare well. Aptitude is not very easy so you have to prepare well.

I applied via Campus Placement and was interviewed before Jul 2020. There were 4 interview rounds.

Interview Questionnaire 

1 Question

  • Q1. Related to final year project and basic programming.

Interview Preparation Tips

Interview preparation tips for other job seekers - I applied for pool campus drive before covid in feb 2019. The whole interview process have done in one day.

The first round was basic aptitude and essay writing the level of difficulty is average and this is also elimination round.

The second was pseudo coding round It has mcq based programing question like arrays, looping and some data structure question.

Depending on the score in coding round they split us into two groups infra and non infra. Infra for cloud and infrastructure related roles and non-infra for programming related role.

The third round was technical Hr. The level of the interview is average he asked basic questions in c and java like inheritance, Polymorphism, encapsulates etc.. and some final project related questions.

The last round was General Hr. He is very friendly and ask about some basic hr question. More importantly know something about the company before getting into this round.

I cleared the interview and got my LOI after 2 months.

Capgemini follows the pre joining course for fresher as ADAPT. It's very useful those are from non IT background.

Clearing ADAPT is not mandatory but you need to participate.

Due to pandemic I got the offer letter on Feb 2021 and now I have completed my training and looking for project.

Are these interview questions helpful?

I applied via LinkedIn and was interviewed before Jul 2020. There were 4 interview rounds.

Interview Questionnaire 

4 Questions

  • Q1. Which technologies your interested to work
  • Q2. Question related to Java coding
  • Q3. Question from C language
  • Q4. Question from AI & ML

Interview Preparation Tips

Interview preparation tips for other job seekers - Prepare on all the latest technologies, brush your regular skills

I applied via Company Website and was interviewed before Oct 2020. There were 3 interview rounds.

Interview Questionnaire 

1 Question

  • Q1. Tell me about your experience

Interview Preparation Tips

Interview preparation tips for other job seekers - Be confident adn clear when you answer

I applied via Amcat and was interviewed before Jul 2021. There were 2 interview rounds.

Round 1 - Aptitude Test 

Refer R S Agarwal book for apptitude

Round 2 - One-on-one 

(1 Question)

  • Q1. Write a c program on fractional numbers
  • Ans. 

    A C program to perform arithmetic operations on fractional numbers.

    • Use float or double data type to store fractional numbers.

    • Use scanf() to take input from the user.

    • Perform arithmetic operations like addition, subtraction, multiplication, and division.

    • Use printf() to display the result.

  • Answered by AI

Interview Preparation Tips

Interview preparation tips for other job seekers - Be prepared refer R S Agarwal book for apptitude test

Skills evaluated in this interview

Cognizant Interview FAQs

What are the top questions asked in Cognizant Associate Data Engineer interview?

Some of the top questions asked at the Cognizant Associate Data Engineer interview -

  1. Datastge - How will you remove Header and trailer from Sequential data f...read more
  2. How to delete duplicate rows in ...read more
  3. How to find process id in lin...read more

Tell us how to improve this page.

Interview Questions from Similar Companies

TCS Interview Questions
3.6
 • 11k Interviews
Accenture Interview Questions
3.8
 • 8.6k Interviews
Infosys Interview Questions
3.6
 • 7.9k Interviews
Wipro Interview Questions
3.7
 • 6k Interviews
Capgemini Interview Questions
3.7
 • 5k Interviews
Tech Mahindra Interview Questions
3.5
 • 4.1k Interviews
HCLTech Interview Questions
3.5
 • 4.1k Interviews
Genpact Interview Questions
3.8
 • 3.4k Interviews
IBM Interview Questions
4.0
 • 2.4k Interviews
DXC Technology Interview Questions
3.7
 • 834 Interviews
View all
Cognizant Associate Data Engineer Salary
based on 126 salaries
₹4.5 L/yr - ₹11.8 L/yr
13% less than the average Associate Data Engineer Salary in India
View more details

Cognizant Associate Data Engineer Reviews and Ratings

based on 11 reviews

3.7/5

Rating in categories

3.5

Skill development

3.5

Work-life balance

3.0

Salary

3.7

Job security

3.7

Company culture

3.0

Promotions

3.5

Work satisfaction

Explore 11 Reviews and Ratings
Associate
73.1k salaries
unlock blur

₹5.1 L/yr - ₹14.5 L/yr

Programmer Analyst
56.1k salaries
unlock blur

₹2 L/yr - ₹9.2 L/yr

Senior Associate
54.7k salaries
unlock blur

₹8.4 L/yr - ₹28.6 L/yr

Senior Processing Executive
29.7k salaries
unlock blur

₹1.4 L/yr - ₹9 L/yr

Technical Lead
18k salaries
unlock blur

₹6 L/yr - ₹25.5 L/yr

Explore more salaries
Compare Cognizant with

TCS

3.6
Compare

Infosys

3.6
Compare

Wipro

3.7
Compare

Accenture

3.8
Compare
write
Share an Interview