Softpath Technologies - DevOps/Site Reliability Engineer - Kubernetes (6-9 yrs)
SOFTPATH TECHNOLOGIES
posted 2 weeks ago
Flexible timing
Role : DevOps/Site Reliability Engineer
Location : Pune
Timings : Full Time (As per company timings)
Notice Period : Immediate joiners only
Experience : 6-9 Years
Duration : 6 Months (Possible Extension)
Shift Timing : 11:30 AM to 9:30 PM IST
About the Role :
We are looking for a highly skilled and experienced DevOps / Site Reliability Engineer to join us on a contract basis. The ideal candidate will be hands-on with Kubernetes (preferably GKE), Infrastructure as Code (Terraform/Helm), and cloud-based deployment pipelines. This role demands a deep understanding of systems, proactive monitoring, and infrastructure optimization skills.
Key Responsibilities :
- Design and implement resilient deployment strategies (Blue-Green, Canary, GitOps).
- Configure and maintain observability tools (logs, metrics, traces, alerts).
- Optimize backend service performance through code and infra reviews (Node.js, Django, Go, Java).
- Tune and troubleshoot GKE workloads, HPA configs, ingress setups, and node pools.
- Build and manage Terraform modules for infrastructure (VPC, CloudSQL, Pub/Sub, Secrets).
- Lead or participate in incident response and root cause analysis using logs, traces, and dashboards.
- Reduce configuration drift and standardize secrets, tagging, and infra consistency across environments.
- Collaborate with engineering teams to enhance CI/CD pipelines and rollout practices.
Required Skills & Experience :
- 5-10 years in DevOps, SRE, Platform, or Backend Infrastructure roles.
- Strong coding/scripting skills and ability to review production-grade backend code.
- Hands-on experience with Kubernetes in production, preferably on GKE.
- Proficient in Terraform, Helm, GitHub Actions, and GitOps tools (ArgoCD or Flux).
- Deep knowledge of Cloud architecture (IAM, VPCs, Workload Identity, CloudSQL, Secret Management).
- Systems thinking: understands failure domains, cascading issues, timeout limits, and recovery strategies.
- Strong communication and documentation skills; capable of driving improvements through PRs and design reviews.
Tech Stack & Tools :
- Cloud & Orchestration : GKE, Kubernetes
- IaC & CI/CD : Terraform, Helm, GitHub Actions, ArgoCD/Flux
- Monitoring & Alerting : Datadog, PagerDuty
- Databases & Networking : CloudSQL, Cloudflare
- Security & Access Control : Secret Management, IAM
Driving Results :
- A strong individual contributor and a good team player.
- Flexible attitude towards work, adapting as needs change.
- Proactively identify & communicate issues and risks.
Other Personal Characteristics :
- Dynamic, engaging, self-reliant developer
- Ability to deal with ambiguity
- Maintains a collaborative and analytical approach
- Self-confident and humble
- Open to continuous learning
- Intelligent, rigorous thinker who can operate successfully amongst bright people
Functional Areas: Software/Testing/Networking