Job Type: Contract
Contract Length: 3 months (with extension potential)
Pay Range: $60–$70/hour
Location: Remote (EST hours)
About the Opportunity:
Our client, a leader in data and analytics, is looking for a skilled Data Engineer to join their team for a 3-month engagement. This project involves building a canonical Master Data Management (MDM) identity layer on Snowflake, focused on mastering core entities—including Accounts, Contacts, Leads, Contracts, and Products—across multiple source systems. This is a high-impact, greenfield implementation role that requires a self-motivated professional who can hit the ground running, work from established architectural artifacts, and contribute to complex design decisions.
Key Responsibilities & Deliverables:
This role is focused on the successful implementation of a custom-built MDM solution. Your responsibilities will include:
- Schema Implementation: Design and implement the physical Snowflake schema from logical models, including DDL, RBAC, and PII column tagging.
- Pipeline Development: Build and maintain robust ingestion pipelines from source systems (Salesforce, D365) into the Snowflake environment.
- Identity Resolution: Implement crosswalk patterns as identity reference spines, and code deterministic and probabilistic entity-matching logic, including populating confidence scores and survivorship status fields.
- Writeback & Stewardship: Build the Salesforce ID writeback mechanism from the canonical model back to source CRM records and implement stewardship queues with conflict-flagging logic.
- Quality Assurance: Actively contribute to testing, data quality validation, and ongoing pipeline monitoring.
Qualifications:
We are looking for someone with a proven track record of successful data engineering engagements. The ideal candidate will have:
- 5+ years of experience in data engineering, with at least 2 years of hands-on experience using Snowflake (DDL, ELT, streams, tasks, RBAC).
- Demonstrable experience implementing or operating an MDM or entity resolution program.
- Solid SQL skills and proficiency with pipeline tooling (dbt is required; Python is strongly preferred).
- Experience with multi-source data ingestion from CRM and ERP systems (Salesforce, D365, or equivalent).
- A strong understanding of entity resolution concepts, including deterministic matching, probabilistic matching, blocking strategies, and confidence scoring.
- Ability to read and implement solutions from logical data models and formal data dictionaries.





