Job Requirements
3 - 5 years of experience
This job post is managed by Jun Ting Lim
Job description for Data Engineer - Cloudera at InfoCepts
GovTech Singapore has awarded InfoCepts the provision of Data Science software and services for Government Ministries/Departments, statutory boards, organs of the state and participating entities.
We are hiring experienced data engineers to join us. We provide extensive training through bootcamps and on-the-job learning to prepare qualified candidates for an exciting career in Data Analytics in the Government and Public Sector.
About InfoCepts
InfoCepts is a global leader in end-to-end data & analytics solutions with nearly 20 years of experience, and was named Gartner’s 2020 and 2021 Customers’ Choice for Data & Analytics providers. We continue to grow rapidly year over year, now employing more than 1,000 people in offices across the globe. As we have grown, we have stayed true to our mission: to always help our customers stay modern so that they can make smart, data-driven decisions. Since 2004, we have deployed hundreds of high-performance analytics applications over web and mobile platforms, built several advanced analytics models, processed petabytes of data using Big Data technologies and delivered several high-impact business solutions. Driven by our vision of delivering great customer experiences, we are looking for professionals who are passionate about making the world a better place by leveraging the power of data.
What is a Data Engineer?
A data engineer is someone who has the knowledge and skills to design and build systems for collecting, storing and analyzing data at scale.
Roles and Responsibilities:
- Monitor cluster connectivity and performance
- Manage and review the Hadoop Distributed File System, YARN configuration files and log files
- Perform backup and recovery tasks
- Handle resource and security management matters
- Drive test execution and report on it to all relevant stakeholders, while documenting test cases and test results
- Strictly adhere to defined Defect Management Processes
- Investigate and resolve all related technical issues through conducting detailed analyses
- Provide root cause analyses for all technical issues as necessary
- Conduct detailed analyses on application change requests and provide estimation
- Support the coordination of software tool management, e.g. patches and version evaluation with product vendors
Skills Required:
- Tertiary education in Software Engineering, Computer Science and/or related fields
- 3 to 5 years of ETL development experience
- Relevant experience with Cloudera platform – Cloudera DataFlow (Ambari)
- Knowledge of other data ingestion tools such as Azure Databricks is a plus
- Strong understanding of distributed querying, performance tuning, horizontal scaling and data partitioning concepts
- Knowledge of languages and tools such as SQL, Oracle, R, MATLAB, and Python
- Accuracy and attention to detail, with the ability to perform data analysis and report acute observations
- Adept at writing queries and reports, and making presentations
- Team-working skills
- Verbal and written communication skills