Data Engineer (Onsite)
Công ty Cổ Phần nhân lực quốc tế HRI • Hà Nội
Job Description
We are seeking a Data Engineer with strong experience building scalable data solutions on AWS and Databricks. This role will be responsible for developing high-quality data pipelines, optimizing our Lakehouse environment, and enabling data-driven products across the bank.
Experience in financial services or banking is a strong advantage.
Yêu cầu công việc
1. Data Pipelines & Processing
- Develop and maintain ETL/ELT pipelines on Databricks using PySpark, Spark SQL and Delta Lake.
- Build reliable, scalable data ingestion frameworks integrating with AWS services such as:
o S3, Glue, Lambda, Step Functions
o Kafka/MSK or Kinesis (real-time ingestion)
o RDS/Redshift or on-prem databases
- Automate workflows using Databricks Workflows, Airflow, or similar orchestrators.
2. Lakehouse Architecture
- Implement and optimize Delta Lake–based storage, including:
o Delta tables
o Schema evolution
o ACID transactions
o Time travel and performance tuning
- Support data modeling for analytics, dashboards, machine learning, and regulatory reporting.
3. Data Quality, Security & Governance
- Enforce data quality checks using: Delta expectations, unit tests, and validation frameworks.
- Implement metadata, lineage, and governance via: AWS Glue Catalog, Unity Catalog (preferred), or similar.
- Ensure compliance with banking standards: PII protection, access control, auditability.
4. Stakeholder Collaboration
- Partner with Data Analysts, Data Scientists, Business Units, and Risk/Compliance teams to deliver business-ready datasets.
- Translate business requirements into technical solutions that align with enterprise data strategy.
5. Operational Excellence
- Monitor pipeline performance and optimize cost and compute (e.g., Databricks cluster
- policies).
- Troubleshoot production issues, ensuring platform stability and SLAs.
- Apply DevOps practices, CI/CD pipelines (Azure DevOps, GitHub Actions, Bitbucket etc.).
Qualifications
Required
- Bachelor’s degree in Computer Science, Information Systems, Engineering, or relatedfield.
- 2–5+ years of experience in data engineering.
- Strong hands-on skills with:
o AWS: S3, Glue, Lambda, IAM, Step Functions, Kinesis/MSK
o Databricks: PySpark, Spark SQL, Delta Lake, Workflows
o Python & SQL
o Spark
- Solid understanding of data modeling (relational, dimensional, domain-driven).
- Experience working in large-scale, distributed data environments.
- Preferred
- Experience in banking or financial services (core banking, payments, lending, regulatory reporting).
- Familiarity with:
o Unity Catalog
o Streaming architectures
o Terraform or AWS CDK
o Data quality frameworks (Great Expectations, Deequ, Databricks expectations)
- Experience working in regulated environments with strict compliance and security controls.
Key Skills
- Strong analytical and problem-solving abilities.
- Ability to work in collaborative, cross-functional teams.
- Clear communication skills with both technical and non-technical stakeholders.
- Proactive mindset and willingness to improve existing systems.
Quyền lợi
-
Competitive salary commensurate with skills and contributions.
-
Quarterly and annual performance-based bonuses, project completion bonuses, and annual salary review and adjustment.
-
In addition to base salary, other income includes overtime pay, lunch allowance, and business trip allowances.
-
Social insurance and health insurance in accordance with State regulations.
-
Periodic health check-up once a year.
-
Annual leave and public holidays in accordance with State regulations, with full pay.
-
Company covers expenses for job-related training courses required each year.
-
Rich extracurricular activities: football club, outings and excursions, birthday celebrations, and trade union activities.
-
Working location: Quang Trung, Hanoi.