
28 Observe.AI Jobs

Observe.ai - Technical Lead - Infrastructure Engineering (8-12 yrs)

8-12 years

Observe.AI

posted 3+ weeks ago

Job Description


Our Infrastructure/DevOps team is a dynamic group of skilled engineers operating in a fast-paced Agile environment. We manage a robust multi-region infrastructure across the globe, leveraging AWS, Kubernetes, and Harness for efficient deployments and seamless application runtime management.



Collaboration is at our core, with daily stand-ups and bi-weekly sprints ensuring alignment and continuous progress. Innovation thrives here; team members are encouraged to experiment with new technologies and share ideas that drive impactful solutions. We foster growth through mentorship programs, regular skill development workshops, and ample career advancement opportunities.



Responsibilities :



- Manage Self-Hosted Tools : Lead the transition from managed services to self-hosted Elasticsearch, Prometheus, and other critical infrastructure components to optimize performance and cost.


- Optimize AI Infrastructure : Work closely with ML engineers and data scientists to efficiently deploy and scale AI/ML models, ensuring high availability and low-latency inference.


- Infrastructure Scalability & Reliability : Design and implement scalable, fault-tolerant systems capable of handling large-scale AI workloads, distributed training, and high-throughput data pipelines.


- Technology Evaluation & Implementation : Continuously assess and introduce new technologies to enhance automation, reliability, and security in AI model deployment and training pipelines.



- CI/CD for AI Workflows : Enhance and automate ML model deployment pipelines using MLOps best practices and tools like Kubeflow, MLflow, and Argo Workflows.



- Observability & Monitoring : Implement and enhance monitoring, logging, and alerting strategies using Prometheus, Grafana, ELK, OpenTelemetry, etc., tailored for AI workloads.


Requirements :



- 8+ years of experience in DevOps, SRE, or Cloud Infrastructure roles, preferably in AI or data-intensive environments.


- Strong expertise in Kubernetes (EKS, AKS preferred) for deploying AI workloads and managing GPU and non-GPU clusters.


- Experience with self-hosting services like Elasticsearch, Prometheus, Grafana, Kafka, etc.


- Hands-on expertise in Infrastructure as Code (Terraform, CloudFormation).


- Deep understanding of cloud platforms (AWS, Azure, GCP) and AI-focused services like AWS SageMaker, Vertex AI, or Azure ML.


- Strong automation and scripting skills in Python, Bash, or Go.


- Experience with CI/CD tools (Jenkins, GitHub Actions, ArgoCD, etc.) with a focus on AI model deployment.


- Strong leadership and mentorship skills to guide DevOps and ML teams.


- FinOps expertise for optimizing GPU and AI cloud compute costs.


- Familiarity with service meshes (Istio, Linkerd) and API gateways.


- Knowledge of compliance frameworks (SOC 2, ISO 27001, etc.) for AI data pipelines.




Functional Areas: Other


What Observe.AI employees are saying about work life (based on 9 employees)

Flexible timing: 86%
Monday to Friday: 100%
No travel: 100%
Day Shift: 100%

Observe.AI Benefits

Free Transport
Child care
Gymnasium
Cafeteria
Work From Home
Free Food

Compare Observe.AI with (company: rating)

Intellect Design Arena: 3.9
Cohesity: 3.9
Celebal Technologies: 3.1
NoBrokerHOOD: 3.1
Innovaccer: 3.4
Vyapar: 3.5
Nowfloats Technologies: 3.2
ShopKirana: 3.8
Tata nexarc: 3.1
Classplus: 3.4
Fleetx.io: 3.7
KEKA TECHNOLOGIES: 3.3
PagarBook: 3.7
Signzy Technologies: 3.0
Leena AI: 3.0
Gameskraft: 4.0
BlueBinaries Engineering and Solutions: 3.0
yellow.ai: 3.2
One Trust: 2.9
Unicommerce Esolutions: 2.9

Similar Jobs for you

Security Lead at Observe.AI

9-12 Yrs

₹ 24-30 LPA

Cloud Platform Engineer at Vipsa talent Solutions

5-8 Yrs

₹ 15-18 LPA

Engineering Manager at CareerNet Technologies

10-15 Yrs

₹ 30-40 LPA

Senior AWS Cloud Engineer at Exigo Tech

7-8 Yrs

₹ 25-30 LPA

Cloud Infrastructure Engineer at Invimatic Technologies

6-8 Yrs

₹ 18-24 LPA

Cloud Infrastructure Engineer at Egen

8-14 Yrs

₹ 20-40 LPA

Site Reliability Engineer Lead at Pocket FM

6-8 Yrs

₹ 20-25 LPA

Engineering Manager at SatSure Analytics India Pvt. Ltd.

7-8 Yrs

₹ 21-24 LPA

Architect at Devkraft Technologies

8-12 Yrs

₹ 25-40 LPA

DevOps Lead at Freight Tiger

12-14 Yrs

₹ 27-40 LPA

Observe.AI San Francisco Office Location

San Francisco Office (Headquarters)
595 Market Street, Suite #1130, San Francisco 94105

Observe.ai - Technical Lead - Infrastructure Engineering (8-12 yrs)

8-12 Yrs

AWS, Kubernetes, Azure DevOps +4 more

3+ weeks ago·via hirist.com

Senior Machine Learning Engineer - NLP

3-8 Yrs

Bangalore / Bengaluru

Customer Service, Python, Automation Testing +5 more

2 days ago·via naukri.com

Observe.AI - Senior Infrastructure Security Engineer (3-4 yrs)

3-4 Yrs

Cyber Security, IAM, Information Security +3 more

1 week ago·via hirist.com

Software Development Engineer II - Backend Technologies (3-5 yrs)

3-5 Yrs

Python, SQL, MQ +2 more

1 week ago·via hirist.com

Observe.AI - Infrastructure Security Leader (9-12 yrs)

9-12 Yrs

Cyber Security, CCNA, Information Security +7 more

1 week ago·via hirist.com

Senior Infrastructure Security Engineer

3-4 Yrs

Bangalore / Bengaluru

Software Configuration Management, Customer Service, Python +7 more

2 weeks ago·via naukri.com

Software Development Engineer II - Backend (Python)

3-5 Yrs

Bangalore / Bengaluru

Medical Coding, Customer Service, Python +6 more

2 weeks ago·via naukri.com

Accountant - Contractor

3-8 Yrs

Bangalore / Bengaluru

Data Entry, Medical Coding, Customer Service +7 more

2 weeks ago·via naukri.com

Speech Analyst - II

2-5 Yrs

Bangalore / Bengaluru

Computer Science, Data Analysis, Data Analytics +6 more

2 weeks ago·via naukri.com

Observe.AI - Software Development Engineer IV - Backend Architecture (8-10 yrs)

8-10 Yrs

Python, Postgresql, System Design +2 more

3+ weeks ago·via hirist.com