Deskripsi pekerjaan Data Engineer PT Solusindo Digital Holistik
We are seeking a skilled Data Engineer with 2-5 years of experience to build and optimize data pipelines and work with cloud platforms, databases, and ETL tools.
Responsibilities:
- Write efficient SQL and Python code for data transformation and analysis.
- Design and implement ETL workflows using Apache Spark, Airflow, NiFi, AWS Glue, and similar tools.
- Work with Google Cloud Platform (BigQuery, Dataflow, Pub/Sub) and AWS (Redshift, S3, Glue, Lambda) for data storage and processing.
- Develop and manage data lakes (Apache Iceberg, Hadoop, Snowflake) and dimensional data models (star/snowflake schema, ER modeling).
- Build and optimize batch and streaming data pipelines (Spark, Flink, Beam).
- Handle performance optimization and schema evolution for data pipelines.
- Collaborate with cross-functional teams to integrate data into business applications.
Qualifications:
- 2-5 years of data engineering experience.
- Strong skills in SQL, Python, and ETL tools (Spark, Airflow, AWS Glue).
- Experience with cloud platforms (GCP, AWS) and data lakes (Iceberg, Hadoop, Snowflake).
- Knowledge of dimensional and ER modeling.
- Experience with relational and NoSQL databases (MySQL, PostgreSQL, MongoDB, DynamoDB).
- Familiarity with data pipeline performance optimization.
