Position Overview: We are seeking a highly skilled Cloud Data Engineer to join our team on a full-time contract basis. This role is ideal for someone with expertise in cloud data solutions, data pipelines, and data architecture. You will be responsible for designing and developing scalable data pipelines, implementing data models, and managing cloud-based data environments. The role requires proficiency with AWS, GitLab CI/CD, Snowflake, dbt, SQL, Python, and data governance technologies.
Key Responsibilities:
- Design, develop, and maintain ETL/ELT pipelines for diverse data sources and large-scale data processing.
- Create and manage data models and architecture, ensuring data integrity and performance.
- Apply best practices in data warehousing, governance, and lifecycle management.
- Collaborate with data scientists, analysts, and stakeholders to deliver high-quality data products.
- Implement CI/CD pipelines using GitLab for automated testing and deployment.
- Optimize Snowflake databases for performance, scalability, and cost-efficiency.
- Develop and maintain SQL queries and Python scripts for data extraction, loading, and transformation.
- Manage data workflows using orchestrators like Airflow and AutoMateNow.
- Oversee data governance and observability using tools like Monte Carlo and Collibra.
- Monitor and troubleshoot data pipeline performance for efficiency and reliability.
- Document all pipelines, models, and processes thoroughly.
Qualifications:
- Bachelor’s degree in Computer Science, Information Technology, or related field.
- Minimum 5 years of experience in data engineering or similar roles.
- Hands-on experience with AWS cloud services such as S3, Redshift, Lambda, Glue, and Data Pipeline.
- Expertise in Snowflake, SQL, Python, and Git.
- Proficiency in dbt and CI/CD pipeline development with GitLab.
- Experience with data orchestration tools like Airflow and AutoMateNow.
- Familiarity with data governance tools like Monte Carlo and Collibra.
Preferred Qualifications:
- AWS Certified Data Analytics or AWS Certified Solutions Architect.
- Experience with real-time data processing technologies (e.g., Kafka, Kinesis).
- Experience in regulated industries or manufacturing environments.
Location & Schedule: This is a hybrid role based in South San Francisco, requiring on-site presence several days per week.
If you’re a driven data engineer with a passion for cloud-based solutions and data optimization, we encourage you to apply!