Senior System Engineer
The Senior System Engineer will work closely with Software R&D teams to design, deploy and operate components/services. The Senior System Engineer is responsible for setting standards for deployments at scale, infrastructure reliability and scalability, as well as continuously improving operations to ensure a consistently high-quality standard of work. You will be responsible for promoting and maintaining a high-performance culture and maintaining an inclusive and diverse workplace.
Administer, maintain, and automate systems to ensure reliability, resiliency, scalability and security.
Monitor and debug issues across the Mobile Service (applications, networks, databases), provide technical resolutions and root cause analysis for high severity incidents.
Work closely with Software R&D teams to design, deploy, and operate components/services that are automated, resilient, and scalable.
Continuously improve System Engineering operations (e.g. automate routine tasks, document new procedures etc.). Set standards for deployments at scale, infrastructure reliability and scalability.
Manage and grow existing team of System Engineering experts (specific part of the team focusing on one domain). Build and enrich an inclusive work environment comprised of people from diverse backgrounds.
Work closely with System Engineering Manager and dedicated recruiting staff to expand the team including interviewing candidates, participating in conferences/events, and on-boarding new employees.
Ad-hoc on-call support may be required (1 week per 2 months on average or less).Skills & Experience:
Bachelor's Degree in Computer Science
6+ years' experience as an Internet. distributed Systems Engineer or SRE managing a SaaS / PaaS environment
Deep experience managing Linux and/or databases
Experience in network technologies i.e. TCP/IP, Ethernet, UDP, DHCP, DNS, ARP, WAN routing.
Solid understanding of incident management, change management, and problem management.
Experience configuring and deploying VMs and Cloud Infrastructure i.e. AWS, Azure
Scripting language knowledge i.e. Bash or Python
Experience working with Ansible or similar automation tools.
Experience with Apache/Tomcat application maintenance.
At least 2+ year of team project/team coordination experience.
For more information on this role please contact Peter Raine at Reperio Human Capital (phone number removed)
Reperio Human Capital acts as an Employment Agency and an Employment Business