Software Site Reliability Engineer (SRE)
Job responsibilities:
1. System monitoring and alarm management:
- Implement and optimize the monitoring of key systems such as DMS, WMS, TIS, SAP, etc. to ensure smooth operation of the system at any point of time.
- Configure and manage the alarm mechanism to ensure that system anomalies can be detected and responded to in a timely manner.
2. Troubleshooting and Response:
- Responsible for rapid investigation and recovery of system failures to minimize the impact on business.
- Analyze and record the causes of failures, and continuously improve operation and maintenance processes and preventive measures.
3. Performance optimization and system tuning:
- Evaluate and optimize system performance, including database query optimization, application server tuning, resource utilization improvement, etc.
- Regularly conduct system health checks to identify and eliminate potential performance bottlenecks.
4. Automated Operation and Maintenance:
- Develop and maintain automated operation and maintenance tools and scripts to improve work efficiency and reduce human errors.
- Automate system deployment, expansion and management.
5. System Deployment and Versioning:
- Participate in and responsible for software version deployment, upgrade and rollback to ensure the stability and reliability of system version update.
- Maintain CI/CD pipeline and optimize code release process.
6. Log Management and Data Analysis:
- Implement and maintain a centralized log management system to collect, analyze and monitor system logs.
- Analyze system operation based on log data, identify potential problems and solve them.
7. Security and Compliance Management:
- Implement and maintain system security policies to ensure the system meets company and industry security standards and compliance requirements.
- Work with the security team to conduct regular security audits and vulnerability remediation.
8. Documentation and Knowledge Sharing:
- Write and update operation and maintenance documents, including system architecture diagrams, operation and maintenance manuals, and emergency plans.
- Share experiences and best practices within and across departments to enhance the overall technical capability of the team.
9. Resource Management and Capacity Planning:
- Monitor system resource utilization and perform capacity planning to ensure the system can accommodate future growth.
- Participate in IT budget formulation and execution to optimize resource allocation and control operating costs.
10. Cross-sector collaboration:
- Collaborate closely with development teams and business departments, participate in system design and review, and ensure the maintainability and scalability of system architecture.
- Provide operation and maintenance support and technical guidance to other technical teams and business users.
Qualification and requirements:
- Bachelor degree or above, with background in Computer Science, Information Technology or related field, fluent English and mandarin speaking, other European language is a plus
- At least 3 years of relevant working experience, with experience in operation and maintenance of DMS, WMS, TIS or SAP system is preferred.
- Proficient in Linux/Unix system, network configuration and optimization, database management (e.g. Oracle, MySQL, etc.).
- Experience with automated O&M tools (e.g. Ansible, Terraform, Jenkins, etc.).
- Familiarity with cloud computing platforms (e.g. AWS, Azure, GCP) and containerization technologies (e.g. Docker, Kubernetes).
- Excellent problem solving skills, able to respond and troubleshoot quickly in a high pressure environment.
- Good communication and coordination skills with teamwork spirit.
- ITIL certification or other relevant OPS certifications preferred.
We offer:
- Performance and experience-based competitive remuneration, pension plan.
- 25 holidays + option to purchase 5 extra holidays.
- Commuting allowance.
- Department & company-wide teambuilding events.
- An exciting opportunity to lead the European transition to Zero Emissions transportation and de-carbonization of the economy.
Our Purpose is to build a zero-emission future that reconnects humanity with nature and a World of clean air. We are looking for talent that connects with this mission and want to create positive impact by joining a diverse and dynamic team 🌏
- Department
- Aftersales PC
- Role
- Software Site Reliability Engineer (SRE)
- Locations
- Hoofddorp
Hoofddorp
About BYD Europe
As the first overseas subsidiary of BYD group, our main focus is to provide European customers with new energy vehicles, rechargeable batteries, solar panels, energy storage systems and other new energy products, as well as related after-sales services.
Software Site Reliability Engineer (SRE)
Loading application form