Job Requirements
Job benefits
-
Flexible work hours
Productivity curve is not something steady and consistent as it depends on each person's unique traits and preferences. At our company, as long as your team is in sync and your goal is hit, you can flexibly decide when you want to work.
-
Remote work options
Thanks to technology, we no longer have to be physically present at the office to be productive. Joining our company allows you to work anywhere without place-constraint.
-
Medical insurance
To ensure your health and wellbeing, you have various medical plans to choose from depending on your situation and unique needs. From partial up to full medical coverage, we got you covered.
This job post is managed by
Skills
Job description for Site Reliability Engineer at Brick - Financial Api
- Provide 2nd level support for customer incidents.
- Debug production reliability issues across services and levels of the stack and perform root cause analysis.
- Bridge between development and operations by applying a software engineering mindset to the system.
- Build self-service tools for user groups that rely on their services (e.g. Integrate alert mechanism to the system, create internal tools to automate or simplify workloads, build automatic provisioning of test environments, logs, and statistics visualization).
- Closely collaborate with developers to ensure the designed solution responds to non-functional requirements such as availability, performance, security, and maintainability (e.g. Log management for searching logs across multiple applications, define error standardization, etc).
- Prepare routine operation documentation.
- Closely collaborate with QA Engineer to perform site reliability testing for infrastructure and applications.
- Continuous improvement of software delivery pipeline for process efficiency.
- Planning and application of autoscaling for flexible scaling according to service growth and seasonality.
- Closely collaborate with DevOps, Engineers and Product team to provide technical assistance to improve system performance, capacity, reliability, and scalability.
- 2+ years of experience as (both software engineers & DevOps/infra).
- Experience working with the life-cycle of a help desk incident as 2nd level support.
- Experience with Linux OS (Red Hat and Debian Family, etc.) and software version control (Git) (Gitlab, Github, BitBucket).
- Have knowledge in distributed service architecture, such as load balancing, service discovery, distributed caching and distributed tracing.
- Experience analyzing, monitoring, and troubleshooting large-scale, high-traffic distributed systems.
- Have strong programming skills (Python, Java, Go) and experience with scripting languages (Shell script).
- Have knowledge about databases (PostgreSQL, Redis, NoSQL, etc).
- Have knowledge about messaging systems (RabbitMQ, Kafka).
- Strong knowledge about cloud technologies (AWS, Alicloud, GCP).
- Experience with monitoring tools like New Relic and ELK Stacks.
- Experience creating automation tools in Ansible or Jenkins.
- Experience with container technology and orchestration (Docker, Kubernetes).
- Fast learning ability, high initiative and a good team player.
- Detailed-oriented, cautious and prudent.
- Passion and high sense of responsibility for work.
- Experience with automation tools like Terraform, Packer.
- Experience with database optimization (PostgreSQL, Redis, NoSQL, etc).
- Experience with monitoring tools like Datadogs, Zabbix, Prometheus, etc
- Experience with load balancing tools like LVS, Nginx, Openresty or HAProxy
- Experience with container technology such as Docker, Kubernetes, Apache Mesos.
- Experience with High Availability system design.
Glints Safety Tips
Don't provide your bank or credit card details when applying for jobs. Legitimate employers and hiring managers do not require an application fee or expect you to pay for training.
Learn More