Deskripsi pekerjaan DevOps Engineer (Bandung ASAP) PT Karisma Zona Kreatifku (KAZOKKU)
DevOps Engineer
- Status: 12-Month Contract
- Location: Hybrid - Bandung (Siliwangi, Coblong)
- Start Date: ASAP
- Middle level
Detailed Requirements
- Hands-on experience administering Linux servers in production (RHEL / Rocky Linux / Ubuntu Server).
- Proficient with Linux storage management and troubleshooting.
- Experienced troubleshooting Linux boot issues: GRUB, dracut rescue, initramfs rebuild.
- Experienced reading and analyzing system logs (journalctl, dmesg, /var/log).
- Familiar with DR concepts: main site vs. DR site, failover flow, RPO/RTO.
- Experienced managing VMs: provisioning, snapshots, lifecycle, and troubleshooting.
- Hands-on with on-premise private cloud or hypervisor platforms or public clouds.
- Solid understanding of networking at the OS level: IP, routing, DNS, VLAN, firewall.
- Familiar with Kubernetes or Docker Swarm: deploying workloads, inspecting pods/services, basic troubleshooting.
- Familiar with CI/CD pipelines and able to follow and run existing pipeline workflows.
- Understands application request flow end-to-end (e.g. DNS → proxy → app → database).
- Have good communication skills and team awareness, positive and optimistic, and have a sense of responsibility.
- Excellent problem analysis and problem-solving skills, able and willing to seek challenges, acquire new knowledge.
Nice to Have:
- Direct experience with enterprise backup tools (Acronis, Veeam, etc.).
- Scripting or automation skills with Python or Ansible.
- Experience with monitoring stacks: Prometheus, Grafana, or Zabbix.
- Windows Server administration (user management, RDP/WinRM, event log analysis).
- Basic database operations: backup/restore for MySQL or PostgreSQL.
- Solid understanding of web application workflow and able to diagnose application problems.
- Experienced in application testing and integrity check.
Responsibilities:
- Maintain and monitor production Linux servers to ensure 24/7 uptime.
- Execute VM lifecycle management including provisioning and regular snapshots.
- Perform deep-level troubleshooting on Linux boot issues and storage failures.
- Collaborate with the development team to run and maintain CI/CD pipelines.
- Implement and test Disaster Recovery (DR) plans according to RPO/RTO targets.
- Manage containerized workloads using Kubernetes or Docker Swarm.
- Ensure network security and connectivity at the OS level across various environments.