Site Reliability Engineer

Accenture
+2
9th Floor, Viettel Tower, 285 Cach Mang Thang 8, District 10, Ho Chi Minh
Hybrid
Posted 20 days ago

Top 3 reasons to join us

  • Hybrid working
  • Wonderful and Human focus Environment
  • Global Training Program (Cloud, OS,...)

Job description

  • Maintaining services once they are live by measuring and monitoring availability, latency and overall system health. 
  • Evolving systems to improve reliability and velocity. 
  • Practicing sustainable incident response, through delivery of incident response playbooks & automated recovery mechanisms. 
  • Running continuous Improvement activities that improve service availability, recovery, efficiency & performance. 
  • Familiar with ITIL framework to track and manage production incidents, Problems, Changes and service requests. 

Your skills and experience

Must have:

  • Linux system administration. 
  • Scripting and automation. 
  • Web server administration (e.g. Apache / NGIX / Jetty etc). 
  • Cloud technologies (AWS/Azure) 
  • SQL skills  
  • Critical thinker and a problem solver 
  • API & Microservices Integration 

 

​Nice to have:

  • Docker and related technologies (ECR, etc.). 
  • Production Support experience
  • Experienced production support resource who is well versed with incident, problem, change and request management. 
  • AWS Cloud based web frontend application; java based microservices/APIs support experience 
  • Working knowledge of Service Management tools such as ServiceNow, Remedy 
  • Monitoring & logging - preferably Cloud watch, Splunk and AppDynamics 
  • PostgreSQL Document (JSON) based Database Query and Update Skills 
  • Working knowledge of AWS Services Like ECS Cluster, ALB, ASG, ECS, EFS, S3, CloudWatch, Cloud Trail, Dynamo DB etc. 
  • Basic Working knowledge of Docker, Kubernetes or other Containers 
  • Working knowledge of Jenkins jobs for CI CD Pipelines 
  • General SSL certificates knowledge and java key stores 
  • Well versed with compliance activities for cloud-based applications such as Disaster Recovery, Application and Database Backup and and Restore, Capacity Management Plans  
  • Experienced in supporting projects for go-live implementation, deployments, operational readiness activities (handover, change management, documentation review, release management)  

Why you'll love working here

  • Hybrid working
  • Global Training 
  • Performance bonus
  • Full social insurance, unemployment insurance, health insurance
  • Personal incident and health insurance
  • Wonderful and Human focus Environment
  • Dynamic and young work force
  • Great working environment (relax corners, PS4, ping-pong table etc.)
  • Allowances (lunch, parking, mobi, transportation)
  • Travel opportunities (Inter sites mobility)
  • Team activities (CSR, Coffee hours, Friday Talking, YEP, …)

A global professional services company with leading capabilities in digital, cloud and security

Company type
IT Service and IT Consulting
Company industry
IT Services and IT Consulting
Company size
301-500 employees
Country
Vietnam
Working days
Monday - Friday
Overtime policy
No OT