The Impact You’ll Drive
We are looking for a proactive and detail-oriented Site Reliability Engineer (SRE) to ensure the reliability, availability, and performance of Vegapay's production systems. The ideal candidate will be responsible for monitoring applications and infrastructure, managing production alerts, overseeing critical end-of-day processing, and driving incident resolution through effective triage and escalation.

The Hats You Will Wear
  • Monitor application health checks and ensure high system availability.
  • Monitor infrastructure health, including servers, databases, network components, and cloud resources.
  • Continuously track Grafana dashboards to identify performance bottlenecks, traffic spikes, and production anomalies.
  • Monitor and manage end-of-day (EOD) batch processing jobs, scheduled tasks, and cron executions.
  • Monitor end-of-day reporting jobs and ensure timely completion of core system and operational reports.
  • Review, acknowledge, and respond to production alerts across monitoring platforms.
  • Create, manage, and track incidents through closure, ensuring adherence to defined SLAs.
  • Perform initial triage, coordinate with engineering teams, and follow the escalation matrix for issue resolution.
  • Maintain incident logs, post-incident documentation, and operational runbooks.
  • Participate in on-call rotations and support critical production activities when required.
  • Identify opportunities for automation and operational efficiency improvements.

The Perfect Fit
  • 2+ years of experience in Site Reliability Engineering, Production Support, DevOps, or Infrastructure Operations.
  • Hands-on experience with monitoring and observability tools such as Grafana, Prometheus, CloudWatch, or similar platforms.
  • Familiarity with cloud platforms such as AWS, Azure, or GCP.
  • Experience monitoring scheduled jobs, batch processing systems, and cron-based workflows.
  • Knowledge of incident management processes and production support best practices.
  • Ability to work in a fast-paced, high-availability production environment.

The Problem We’re Solving
Financial institutions today are held back by legacy systems that are slow, rigid, and expensive to scale. Launching or evolving credit, lending, and UPI products often takes months, requires heavy engineering effort, and limits the ability to create personalized customer experiences.

At the same time, customer expectations have changed: speed, flexibility, and tailored financial products are no longer optional. Banks and fintechs need infrastructure that allows them to innovate quickly, adapt continuously, and scale without friction.

This is where we come in.

At Vegapay, we are building modern, configurable fintech infrastructure that enables banks, NBFCs, and enterprises to design, launch, and manage credit and payment programs with ease. Our platform brings together flexibility, speed, and control, helping our partners unlock new growth opportunities and deliver personalized banking experiences at scale.