Responsibilities
- Design and implement scalable data architectures and systems capable of managing large datasets and supporting real-time analytics.
- Extract, transform, and load data from multiple sources such as databases, warehouses, and data lakes.
- Maintain data accuracy and consistency through validation, cleansing, and robust security and privacy practices.
- Work closely with data scientists, analysts, and business stakeholders to understand data needs and deliver reliable access to information.
- Build and maintain ETL workflows, data pipelines, and other integration tools to ensure seamless data flow.
- Conduct data modeling and database design, optimizing structures and queries for speed and scalability.
- Monitor and manage data infrastructure to ensure optimal performance and uptime.
- Stay current with the latest data engineering technologies and recommend improvements or new tools when beneficial.
- Contribute to team collaboration through meetings, code reviews, and shared development initiatives.
Requirements
- Bachelor’s degree in Computer Science, Electrical Engineering, or a related field (Master’s preferred).
- Deep understanding of data management and database systems, including SQL, NoSQL, and Spark.
- At least 2 years of experience working with data integration technologies such as ETL, data pipelines, and data warehousing.
- Strong programming skills in Python and Java.
- Excellent analytical and problem-solving abilities with a focus on resolving complex data challenges.
- Effective communication and teamwork skills.
- Proven ability to manage multiple priorities and adapt to changing business needs.
- Commitment to data integrity, confidentiality, and fostering an inclusive, safe work environment.