03 Feb
Luxoft
Xico
Site Reliability Engineer (SRE), Remote Mexico

Project Description
Do you like to work with existing and new software product development teams?
This position is to instrument end-to-end observability and visibility for business-critical systems with log ingestion, metrics, and traces.
You will work as a site reliability engineer (SRE), collaborating with product teams, infrastructure SMEs, DevOps engineers, and the proactive monitoring team to provide tailored dashboards of relevant service-level analytics for various product stakeholders.

Responsibilities
Work closely with software product development teams (ITSO, Product Owner, SME) to implement monitoring and observability instrumentation within their platforms.
Drive adoption of best practices in monitoring, alerting, automation, and site reliability.
Lead or contribute to engineering efforts from design to implementation, focusing on instrumentation of logs, metrics, and traces.
Drive the use of automation in software instrumentation as well as in response to service degradation events.
Identify and execute on opportunities to implement instrumentation in pre-production environments.
Proactively pursue continuous improvement and expansion in observability coverage, service reliability best practices, incident management, and problem management.

Skills

Must have
Production support experience as a developer for an e-commerce platform
Strong knowledge of and experience in Java
SRE experience
Scripting experience
5+ years of experience administering Linux and at least 2 years supporting production environments
Experience designing large-scale distributed solutions, including their capacity planning
Deep understanding of TCP/IP networking
Familiarity with SLA, SLO, and SLI terms
Experience with monitoring and alerting tools such as Grafana, Datadog, Prometheus, etc.
Strong knowledge of virtualization and containerization principles, including orchestration tools
Familiarity with CaC and IaC tools (Ansible, Salt, Terraform, Packer)
Familiarity with CI/CD tools (Jenkins, Azure DevOps)
Experience with relational and NoSQL DBMS
A clear understanding of Agile and DevOps culture and the kinds of problems they are intended to solve
Strong written and verbal communication skills
Understanding of information security principles
Understanding of popular deployment strategies (feature flags, blue/green, canary, dark launch, etc.)
Critical thinker and problem solver

Nice to have
Experience working with Azure
Previous experience working in SRE teams
Show the company your skills: fill in the form and add a personal touch to your cover letter. This will help the recruiter choose a candidate.