I applied via LinkedIn and was interviewed before Aug 2023. There were 2 interview rounds.
To show the top 5 values in pandas, use the nlargest() function.
Call nlargest() with n set to 5 to get the five largest values of a column.
For example, df['column_name'].nlargest(5) returns the five largest values in the specified column, and df.nlargest(5, 'column_name') returns the full rows that hold those values.
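A minimal sketch of both forms follows. The DataFrame contents and the column names `product` and `sales` are made up for illustration; they are not part of the interview question.

```python
import pandas as pd

# Hypothetical sample data; column names are assumptions for illustration.
df = pd.DataFrame({
    "product": ["A", "B", "C", "D", "E", "F", "G"],
    "sales": [120, 340, 90, 410, 275, 60, 300],
})

# Five largest values of one column (a Series, sorted in descending order).
top5_values = df["sales"].nlargest(5)

# Five full rows ranked by the same column (a DataFrame).
top5_rows = df.nlargest(5, "sales")

print(top5_values.tolist())
print(top5_rows["product"].tolist())
```

The Series form is enough when only the values matter; the DataFrame form keeps the other columns alongside them.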
A scatter plot is a good representation for 3 numerical columns.
Use a scatter plot to show the relationship between the numerical columns.
Scatter plots are effective for visualizing correlations and patterns in data.
Each point on the plot represents one row: two columns set its x and y position, and the third can be encoded as color or marker size, or as a z axis in a 3D scatter plot.
I applied via Referral and was interviewed in Nov 2021. There was 1 interview round.
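In Spark, coalesce() reduces the number of partitions without a full shuffle, while repartition() performs a full shuffle to redistribute data across nodes. (This is different from the SQL COALESCE function, which returns the first non-null value from a list of columns.)
Coalesce reduces the number of partitions to the number you specify; it cannot increase it.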
Repartition increases or decreases the number of partitions.
Coalesce is a narrow transformation while repartition is a wide transformation.
Coalesce is used to optimize data for queries while repartition is used to balance data acros...
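The difference can be illustrated with a toy model in plain Python. This is not Spark code: partitions are modeled as lists, and the helper names `coalesce` and `repartition` are hypothetical stand-ins that only mimic the behavior described above (merging existing partitions versus redistributing every row).

```python
from typing import List

Partition = List[int]


def coalesce(partitions: List[Partition], n: int) -> List[Partition]:
    """Toy model: merge whole existing partitions into n groups (no full shuffle)."""
    if n >= len(partitions):
        # Without a shuffle, the partition count cannot grow.
        return [list(p) for p in partitions]
    merged: List[Partition] = [[] for _ in range(n)]
    for i, part in enumerate(partitions):
        # Neighbouring partitions are glued together; rows are not spread out.
        merged[i * n // len(partitions)].extend(part)
    return merged


def repartition(partitions: List[Partition], n: int) -> List[Partition]:
    """Toy model: redistribute every row round-robin into n new partitions (full shuffle)."""
    rows = [row for part in partitions for row in part]
    out: List[Partition] = [[] for _ in range(n)]
    for i, row in enumerate(rows):
        out[i % n].append(row)
    return out


# A skewed input: one large partition and three tiny ones.
skewed = [[1, 2, 3, 4, 5, 6], [7], [8], [9]]

coalesced_sizes = [len(p) for p in coalesce(skewed, 2)]
repartitioned_sizes = [len(p) for p in repartition(skewed, 2)]
print(coalesced_sizes)       # skew is preserved
print(repartitioned_sizes)   # rows are balanced
print(len(coalesce(skewed, 8)), len(repartition(skewed, 8)))
```

In this model coalesce is cheap but keeps the skew, while repartition touches every row and produces evenly sized partitions, which matches the narrow versus wide distinction above.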
Optimizing joins involves selecting appropriate join types, indexing tables, and minimizing data movement.
Choose the appropriate join type based on the size and structure of the tables being joined
Index the tables on the join columns to speed up the join process
Minimize data movement by selecting only the necessary columns and filtering rows before joining
Consider using denormalization or materialized views to precompu...
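Two of the points above (indexing the join column, and filtering and selecting columns before joining) can be sketched with Python's built-in sqlite3 module. The `customers` and `orders` tables, their columns, and the sample rows are assumptions made up for illustration.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.executescript("""
    CREATE TABLE customers (id INTEGER PRIMARY KEY, name TEXT, country TEXT);
    CREATE TABLE orders (id INTEGER PRIMARY KEY, customer_id INTEGER, amount REAL);
    -- Index the join column so lookups by customer_id can avoid a full scan.
    CREATE INDEX idx_orders_customer_id ON orders (customer_id);
""")
conn.executemany(
    "INSERT INTO customers VALUES (?, ?, ?)",
    [(1, "Asha", "IN"), (2, "Ben", "US"), (3, "Chen", "IN")],
)
conn.executemany(
    "INSERT INTO orders VALUES (?, ?, ?)",
    [(1, 1, 50.0), (2, 1, 20.0), (3, 2, 70.0), (4, 3, 10.0)],
)

# Filter rows and keep only the needed columns before the join,
# so less data flows into the join itself.
query = """
    SELECT c.name, SUM(o.amount) AS total
    FROM (SELECT id, name FROM customers WHERE country = 'IN') AS c
    JOIN orders AS o ON o.customer_id = c.id
    GROUP BY c.name
    ORDER BY c.name
"""
rows = conn.execute(query).fetchall()
print(rows)
```

Whether the index is actually used depends on the database engine and its planner, so treat this as a pattern to verify with the engine's query plan rather than a guaranteed speedup.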
RDD is a low-level distributed data structure while DataFrame is a high-level structured data abstraction.
Both are immutable; an RDD has no schema, while a DataFrame is organized into named columns with a schema
DataFrame queries are optimized by Spark's Catalyst optimizer; both RDDs and DataFrames can be cached in memory
RDDs are more flexible and can be used for complex data processing tasks
DataFrames are easier to use and provide a more concise syntax for data manipulation
RDDs are the...
I applied via Naukri.com and was interviewed in May 2024. There were 2 interview rounds.
An Excel file was given; I used SUMIFS, COUNTIFS, and INDEX-MATCH formulas.
I applied via LinkedIn and was interviewed before Oct 2022. There were 4 interview rounds.
Basic DSA question in Python, plus data engineering and SQL questions
Basic DSA question in Python and data engineering questions
Easy LeetCode data structure question
Code for calling a REST API and designing a rate limiter
Use a library like requests or axios to make the API call
Implement a token bucket algorithm for rate limiting
Set a maximum number of requests per second and queue excess requests
Consider using a distributed rate limiter for scalability
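A minimal single-process sketch of the token bucket mentioned above is shown here. The class name `TokenBucket`, its parameters, and the injectable `clock` argument are illustrative choices, not a reference implementation; queueing of excess requests and the distributed variant are not shown.

```python
import time


class TokenBucket:
    """Allow up to `capacity` requests in a burst, refilled at `rate` tokens per second."""

    def __init__(self, rate: float, capacity: float, clock=time.monotonic):
        self.rate = rate          # tokens added per second
        self.capacity = capacity  # maximum tokens the bucket can hold
        self.clock = clock        # injectable so the demo below is deterministic
        self.tokens = capacity    # start with a full bucket
        self.last = clock()

    def allow(self, cost: float = 1.0) -> bool:
        """Return True and consume tokens if the request may proceed."""
        now = self.clock()
        elapsed = now - self.last
        self.last = now
        # Refill in proportion to elapsed time, never above capacity.
        self.tokens = min(self.capacity, self.tokens + elapsed * self.rate)
        if self.tokens >= cost:
            self.tokens -= cost
            return True
        return False


# Deterministic demo with a fake clock instead of real time.
fake_now = [0.0]
bucket = TokenBucket(rate=2.0, capacity=3.0, clock=lambda: fake_now[0])

burst = [bucket.allow() for _ in range(4)]       # bucket holds 3 tokens
fake_now[0] += 1.0                               # one second passes: 2 tokens refill
after_wait = [bucket.allow() for _ in range(3)]
print(burst, after_wait)
```

A caller would check `bucket.allow()` before issuing each API request (for example with requests) and either wait or enqueue the request when it returns False.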
Design a database schema for Instagram.
Create tables for users, posts, comments, likes, and follows.
Use primary and foreign keys to establish relationships between tables.
Store user data such as username, email, and password securely.
Include fields for post content, location, and timestamp.
Track likes and comments on posts.
Implement a notification system for follows and likes.
Consider scalability and performance in the...
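One possible shape for these tables is sketched below using Python's built-in sqlite3 module. All table and column names are assumptions for illustration; the password is stored as a hash rather than plain text, and the notification system and any scaling concerns are left out.

```python
import sqlite3

SCHEMA = """
CREATE TABLE users (
    id            INTEGER PRIMARY KEY,
    username      TEXT NOT NULL UNIQUE,
    email         TEXT NOT NULL UNIQUE,
    password_hash TEXT NOT NULL            -- store a hash, never the raw password
);
CREATE TABLE posts (
    id         INTEGER PRIMARY KEY,
    user_id    INTEGER NOT NULL REFERENCES users(id),
    content    TEXT NOT NULL,
    location   TEXT,
    created_at TEXT NOT NULL DEFAULT CURRENT_TIMESTAMP
);
CREATE TABLE comments (
    id         INTEGER PRIMARY KEY,
    post_id    INTEGER NOT NULL REFERENCES posts(id),
    user_id    INTEGER NOT NULL REFERENCES users(id),
    body       TEXT NOT NULL,
    created_at TEXT NOT NULL DEFAULT CURRENT_TIMESTAMP
);
CREATE TABLE likes (
    user_id INTEGER NOT NULL REFERENCES users(id),
    post_id INTEGER NOT NULL REFERENCES posts(id),
    PRIMARY KEY (user_id, post_id)          -- one like per user per post
);
CREATE TABLE follows (
    follower_id INTEGER NOT NULL REFERENCES users(id),
    followee_id INTEGER NOT NULL REFERENCES users(id),
    PRIMARY KEY (follower_id, followee_id),
    CHECK (follower_id <> followee_id)      -- a user cannot follow themselves
);
"""

conn = sqlite3.connect(":memory:")
conn.execute("PRAGMA foreign_keys = ON")  # SQLite enforces foreign keys only when enabled
conn.executescript(SCHEMA)

# Hypothetical sample rows.
conn.execute("INSERT INTO users VALUES (1, 'asha', 'asha@example.com', '<hash>')")
conn.execute("INSERT INTO users VALUES (2, 'ben', 'ben@example.com', '<hash>')")
conn.execute("INSERT INTO posts (id, user_id, content, location) VALUES (1, 1, 'First post', 'Pune')")
conn.execute("INSERT INTO likes (user_id, post_id) VALUES (2, 1)")
conn.execute("INSERT INTO follows (follower_id, followee_id) VALUES (2, 1)")

like_count = conn.execute("SELECT COUNT(*) FROM likes WHERE post_id = 1").fetchone()[0]
print(like_count)
```

The composite primary keys on likes and follows prevent duplicate rows, and the foreign keys reject likes or comments that point at a missing user or post.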
I applied via LinkedIn and was interviewed in Oct 2024. There was 1 interview round.
I applied via Campus Placement and was interviewed in Jul 2020. There were 5 interview rounds.
posted on 26 Jul 2022
I applied via Company Website and was interviewed in Jun 2022. There were 2 interview rounds.
HTML, HTML5, CSS, Bootstrap, JavaScript
| Role | Salaries reported | Salary range |
|---|---|---|
| Business Analyst | 15 | ₹6.5 L/yr - ₹11.4 L/yr |
| Software Engineer | 14 | ₹5.4 L/yr - ₹18 L/yr |
| Software Developer | 7 | ₹15.8 L/yr - ₹25 L/yr |
| Technical Program Manager | 7 | ₹10.6 L/yr - ₹19.8 L/yr |
| Associate Product Manager | 6 | ₹18 L/yr - ₹26 L/yr |