Upload Button Icon Add office photos

Citicorp

Compare button icon Compare button icon Compare

Filter interviews by

Citicorp Data Scientist Interview Questions and Answers

Updated 19 Apr 2024

7 Interview questions

A Data Scientist was asked
Q. What are variable reduction techniques?
Ans. 

Variable reducing techniques are methods used to identify and select the most relevant variables in a dataset.

  • Variable reducing techniques help in reducing the number of variables in a dataset.

  • These techniques aim to identify the most important variables that contribute significantly to the outcome.

  • Some common variable reducing techniques include feature selection, dimensionality reduction, and correlation analysi...

A Data Scientist was asked
Q. Which test is used in logistic regression to check the significance of the variable?
Ans. 

The Wald test is used in logistic regression to check the significance of the variable.

  • The Wald test calculates the ratio of the estimated coefficient to its standard error.

  • It follows a chi-square distribution with one degree of freedom.

  • A small p-value indicates that the variable is significant.

  • For example, in Python, the statsmodels library provides the Wald test in the summary of a logistic regression model.

Data Scientist Interview Questions Asked at Other Companies

Q1. for a data with 1000 samples and 700 dimensions, how would you fi ... read more
Q2. Special Sum of Array Problem Statement Given an array 'arr' conta ... read more
asked in Affine
Q3. You have a pandas dataframe with three columns filled with state ... read more
asked in Walmart
Q4. Describe the data you would analyze to solve cost and revenue opt ... read more
Q5. Clone a Linked List with Random Pointers Given a linked list wher ... read more
A Data Scientist was asked
Q. Explain the Gini coefficient.
Ans. 

Gini coefficient measures the inequality among values of a frequency distribution.

  • Gini coefficient ranges from 0 to 1, where 0 represents perfect equality and 1 represents perfect inequality.

  • It is commonly used to measure income inequality in a population.

  • A Gini coefficient of 0.4 or higher is considered to be a high level of inequality.

  • Gini coefficient can be calculated using the Lorenz curve, which plots the cum...

A Data Scientist was asked
Q. How do you check for multicollinearity in Logistic Regression?
Ans. 

Multicollinearity in logistic regression can be checked using correlation matrix and variance inflation factor (VIF).

  • Calculate the correlation matrix of the independent variables and check for high correlation coefficients.

  • Calculate the VIF for each independent variable and check for values greater than 5 or 10.

  • Consider removing one of the highly correlated variables or variables with high VIF to address multicoll...

What people are saying about Citicorp

View All
an assistant vice president
2w
EMIs: Are they expenses or actually investments?
Prove me wrong: EMIs aren't just expenses—they're investments! Home loan EMI = investment in security. Car loan EMI = investment in convenience. Education loan EMI = investment in your future. Business loan EMI = investment in your dreams. Healthcare Loan EMI = investment in extending life. Other EMIs? Investments in needs... or maybe stupidity. It's all about perspective. With discipline, EMIs can be a safe SIP alternative. But remember, every investment has risks. Diversify your EMIs!
Got a question about Citicorp?
Ask anonymously on communities.
A Data Scientist was asked
Q. Explain the logistic regression process.
Ans. 

Logistic regression is a statistical method used to analyze and model the relationship between a binary dependent variable and one or more independent variables.

  • It is a type of regression analysis used for predicting the outcome of a categorical dependent variable based on one or more predictor variables.

  • It uses a logistic function to model the probability of the dependent variable taking a particular value.

  • It is ...

What are the roles & responsibilities of a Data Scientist at Citicorp?

Solution Development

  • Design and implement analytics solutions
  • Enhance performance of existing solutions
  • Stay current with modeling community trends

Read full roles & responsibilities

A Data Scientist was asked
Q. What is R-squared, and how does R-squared differ from Adjusted R-squared?
Ans. 

R square is a statistical measure that represents the proportion of the variance in the dependent variable explained by the independent variables.

  • R square is a value between 0 and 1, where 0 indicates that the independent variables do not explain any of the variance in the dependent variable, and 1 indicates that they explain all of it.

  • It is used to evaluate the goodness of fit of a regression model.

  • Adjusted R squ...

Citicorp HR Interview Questions

204 questions and answers

Q. When can you join?
Q. Talk about yourself
Q. What is your reason for leaving your previous organization?
A Data Scientist was asked
Q. How to check outliers in a variable, what treatment should you use to remove such outliers
Ans. 

Outliers can be detected using statistical methods like box plots, z-score, and IQR. Treatment can be removal or transformation.

  • Use box plots to visualize outliers

  • Calculate z-score and remove data points with z-score greater than 3

  • Calculate IQR and remove data points outside 1.5*IQR

  • Transform data using log or square root to reduce the impact of outliers

Are these interview questions helpful?

Citicorp Data Scientist Interview Experiences

3 interviews found

Interview experience
4
Good
Difficulty level
-
Process Duration
-
Result
-

I appeared for an interview before Apr 2023.

Round 1 - Technical 

(1 Question)

  • Q1. Basic statistics
Round 2 - Technical 

(1 Question)

  • Q1. Project related

Interview Preparation Tips

Interview preparation tips for other job seekers - Donot join citi....no job security at all...I joined and was thrown in 3months due to their restructuring and budget issues.very bad management
Interview experience
3
Average
Difficulty level
Moderate
Process Duration
2-4 weeks
Result
-

I applied via Company Website and was interviewed before Feb 2023. There was 1 interview round.

Round 1 - Technical 

(1 Question)

  • Q1. ML concepts , regression, regularization etc

I applied via Walk-in and was interviewed in Mar 2020. There was 1 interview round.

Interview Questionnaire 

10 Questions

  • Q1. What is R square and how R square is different from Adjusted R square
  • Ans. 

    R square is a statistical measure that represents the proportion of the variance in the dependent variable explained by the independent variables.

    • R square is a value between 0 and 1, where 0 indicates that the independent variables do not explain any of the variance in the dependent variable, and 1 indicates that they explain all of it.

    • It is used to evaluate the goodness of fit of a regression model.

    • Adjusted R square t...

  • Answered by AI
  • Q2. Explain what do u understand by the team WOE and IV. What's the importance. Advantages and disadvantages
  • Ans. 

    WOE (Weight of Evidence) and IV (Information Value) are metrics used for feature selection and assessing predictive power in models.

    • WOE transforms categorical variables into continuous variables, making them more suitable for modeling.

    • IV quantifies the predictive power of a feature by measuring the separation between the good and bad outcomes.

    • For example, if a feature has an IV of 0.3, it indicates strong predictive po...

  • Answered by AI
  • Q3. What are variable reducing techniques
  • Ans. 

    Variable reducing techniques are methods used to identify and select the most relevant variables in a dataset.

    • Variable reducing techniques help in reducing the number of variables in a dataset.

    • These techniques aim to identify the most important variables that contribute significantly to the outcome.

    • Some common variable reducing techniques include feature selection, dimensionality reduction, and correlation analysis.

    • Fea...

  • Answered by AI
  • Q4. Which test is used in logistic regression to check the significance of the variable
  • Ans. 

    The Wald test is used in logistic regression to check the significance of the variable.

    • The Wald test calculates the ratio of the estimated coefficient to its standard error.

    • It follows a chi-square distribution with one degree of freedom.

    • A small p-value indicates that the variable is significant.

    • For example, in Python, the statsmodels library provides the Wald test in the summary of a logistic regression model.

  • Answered by AI
  • Q5. How to check multicollinearity in Logistic regression
  • Ans. 

    Multicollinearity in logistic regression can be checked using correlation matrix and variance inflation factor (VIF).

    • Calculate the correlation matrix of the independent variables and check for high correlation coefficients.

    • Calculate the VIF for each independent variable and check for values greater than 5 or 10.

    • Consider removing one of the highly correlated variables or variables with high VIF to address multicollinear...

  • Answered by AI
  • Q6. Difference between bagging and boosting
  • Ans. 

    Bagging and boosting are ensemble methods used in machine learning to improve model performance.

    • Bagging involves training multiple models on different subsets of the training data and then combining their predictions through averaging or voting.

    • Boosting involves iteratively training models on the same dataset, with each subsequent model focusing on the samples that were misclassified by the previous model.

    • Bagging reduc...

  • Answered by AI
  • Q7. Explain the logistics regression process
  • Ans. 

    Logistic regression is a statistical method used to analyze and model the relationship between a binary dependent variable and one or more independent variables.

    • It is a type of regression analysis used for predicting the outcome of a categorical dependent variable based on one or more predictor variables.

    • It uses a logistic function to model the probability of the dependent variable taking a particular value.

    • It is commo...

  • Answered by AI
  • Q8. Explain Gini coefficient
  • Ans. 

    Gini coefficient measures the inequality among values of a frequency distribution.

    • Gini coefficient ranges from 0 to 1, where 0 represents perfect equality and 1 represents perfect inequality.

    • It is commonly used to measure income inequality in a population.

    • A Gini coefficient of 0.4 or higher is considered to be a high level of inequality.

    • Gini coefficient can be calculated using the Lorenz curve, which plots the cumulati...

  • Answered by AI
  • Q9. Difference between chair and cart
  • Ans. 

    A chair is a piece of furniture used for sitting, while a cart is a vehicle used for transporting goods.

    • A chair typically has a backrest and armrests, while a cart does not.

    • A chair is designed for one person to sit on, while a cart can carry multiple items or people.

    • A chair is usually stationary, while a cart is mobile and can be pushed or pulled.

    • A chair is commonly found in homes, offices, and public spaces, while a c...

  • Answered by AI
  • Q10. How to check outliers in a variable, what treatment should you use to remove such outliers
  • Ans. 

    Outliers can be detected using statistical methods like box plots, z-score, and IQR. Treatment can be removal or transformation.

    • Use box plots to visualize outliers

    • Calculate z-score and remove data points with z-score greater than 3

    • Calculate IQR and remove data points outside 1.5*IQR

    • Transform data using log or square root to reduce the impact of outliers

  • Answered by AI

Interview Preparation Tips

Interview preparation tips for other job seekers - Explain the concept properly, if not able to explain properly then take a pause and try again with some examples. Be confident.

Skills evaluated in this interview

Interview questions from similar companies

I applied via Recruitment Consulltant and was interviewed before Aug 2021. There was 1 interview round.

Round 1 - Technical 

(1 Question)

  • Q1. Difference between CNN and MLP
  • Ans. 

    CNN is used for image recognition while MLP is used for general classification tasks.

    • CNN uses convolutional layers to extract features from images while MLP uses fully connected layers.

    • CNN is better suited for tasks that require spatial understanding like object detection while MLP is better for tabular data.

    • CNN has fewer parameters than MLP due to weight sharing in convolutional layers.

    • CNN can handle input of varying ...

  • Answered by AI

Interview Preparation Tips

Interview preparation tips for other job seekers - Brush up basic statistics . Also prepare atleast 2 , 3 ML algorithms for the interview.

Skills evaluated in this interview

I applied via Approached by Company and was interviewed before Sep 2021. There were 3 interview rounds.

Round 1 - Resume Shortlist 
Pro Tip by AmbitionBox:
Keep your resume crisp and to the point. A recruiter looks at your resume for an average of 6 seconds, make sure to leave the best impression.
View all tips
Round 2 - Technical 

(1 Question)

  • Q1. Projects and Data Science concepts
Round 3 - Technical 

(1 Question)

  • Q1. Python and coding skills

Interview Preparation Tips

Interview preparation tips for other job seekers - Be through with concepts - ML, stats, NLP

Interview Questionnaire 

3 Questions

  • Q1. Mainly resume based. In detail from the project.
  • Q2. Softmax vs sigmoid
  • Ans. 

    Softmax and sigmoid are both activation functions used in neural networks.

    • Softmax is used for multi-class classification problems, while sigmoid is used for binary classification problems.

    • Softmax outputs a probability distribution over the classes, while sigmoid outputs a probability for a single class.

    • Softmax ensures that the sum of the probabilities of all classes is 1, while sigmoid does not.

    • Softmax is more sensitiv...

  • Answered by AI
  • Q3. Logistics regression (multiclass)

Interview Preparation Tips

Interview preparation tips for other job seekers - Prepare the projects mentioned in your resume very well

Skills evaluated in this interview

Interview experience
4
Good
Difficulty level
Moderate
Process Duration
2-4 weeks
Result
No response

I appeared for an interview in Jun 2025, where I was asked the following questions.

  • Q1. Describe Rag and the steps in brief detail?
  • Q2. Explain hinge loss in SVM
  • Ans. 

    Hinge loss is a loss function used in SVM to maximize the margin between classes while penalizing misclassifications.

    • Hinge loss is defined as max(0, 1 - y * f(x)), where y is the true label and f(x) is the predicted score.

    • It penalizes predictions that are on the wrong side of the margin, encouraging correct classifications with a margin.

    • For example, if y = 1 and f(x) = 0.5, hinge loss = max(0, 1 - 1 * 0.5) = 0.5.

    • If y =...

  • Answered by AI
Interview experience
5
Excellent
Difficulty level
-
Process Duration
-
Result
-
Round 1 - Aptitude Test 

Many Mcq,s.Similar to cat exam

Round 2 - Case Study 

Ml case study . Eg loan default prediction

Interview experience
4
Good
Difficulty level
Hard
Process Duration
Less than 2 weeks
Result
Selected Selected

I applied via Campus Placement and was interviewed before Jul 2023. There were 3 interview rounds.

Round 1 - Aptitude Test 

Medium General Aptitude questions and technical(Big Data, Python etc.)

Round 2 - Technical 

(1 Question)

  • Q1. ML Algorithms (SVM, Random forest, bagging boosting, ridge, etc)
Round 3 - Technical 

(1 Question)

  • Q1. Deep equations and understading of DL and ML Algorithms
  • Ans. 

    Understanding deep equations and algorithms in DL and ML is crucial for a data scientist.

    • Deep learning involves complex neural network architectures like CNNs and RNNs.

    • Machine learning algorithms include decision trees, SVM, k-means clustering, etc.

    • Understanding the math behind algorithms helps in optimizing model performance.

    • Equations like gradient descent, backpropagation, and loss functions are key concepts.

    • Practica...

  • Answered by AI

Skills evaluated in this interview

Interview experience
1
Bad
Difficulty level
-
Process Duration
-
Result
-
Round 1 - One-on-one 

(1 Question)

  • Q1. How to extract numbers pre decimal point from a long list of decimalnumbers with efficiency
  • Ans. 

    Use string manipulation to efficiently extract numbers before the decimal point from a list of decimal numbers.

    • Split each decimal number by the decimal point and extract the number before it

    • Use regular expressions to match and extract numbers before the decimal point

    • Iterate through the list and extract numbers using string manipulation functions

  • Answered by AI

Skills evaluated in this interview

Citicorp Interview FAQs

How many rounds are there in Citicorp Data Scientist interview?
Citicorp interview process usually has 1-2 rounds. The most common rounds in the Citicorp interview process are Technical.
How to prepare for Citicorp Data Scientist interview?
Go through your CV in detail and study all the technologies mentioned in your CV. Prepare at least two technologies or languages in depth if you are appearing for a technical interview at Citicorp. The most common topics and skills that interviewers at Citicorp expect are Data Science, Machine Learning, Natural Language Processing, Credit Risk and Data Analytics.
What are the top questions asked in Citicorp Data Scientist interview?

Some of the top questions asked at the Citicorp Data Scientist interview -

  1. Which test is used in logistic regression to check the significance of the vari...read more
  2. What is R square and how R square is different from Adjusted R squ...read more
  3. How to check outliers in a variable, what treatment should you use to remove su...read more

Tell us how to improve this page.

Overall Interview Experience Rating

3.5/5

based on 2 interview experiences

Difficulty level

Moderate 100%

Duration

2-4 weeks 100%
View more
Citicorp Data Scientist Salary
based on 200 salaries
₹20.9 L/yr - ₹35 L/yr
80% more than the average Data Scientist Salary in India
View more details

Citicorp Data Scientist Reviews and Ratings

based on 20 reviews

2.5/5

Rating in categories

2.3

Skill development

3.1

Work-life balance

3.9

Salary

3.2

Job security

2.2

Company culture

2.1

Promotions

2.3

Work satisfaction

Explore 20 Reviews and Ratings
Assistant Vice President
5.4k salaries
unlock blur

₹28.3 L/yr - ₹45 L/yr

Assistant Manager
3.4k salaries
unlock blur

₹9.9 L/yr - ₹18.5 L/yr

Officer
3.1k salaries
unlock blur

₹17.5 L/yr - ₹31.5 L/yr

Vice President
2.8k salaries
unlock blur

₹40 L/yr - ₹65 L/yr

Manager
2.3k salaries
unlock blur

₹17 L/yr - ₹30 L/yr

Explore more salaries
Compare Citicorp with

JPMorgan Chase & Co.

3.9
Compare

Wells Fargo

3.8
Compare

HSBC Group

3.9
Compare

UBS

3.9
Compare
write
Share an Interview