Summary of Purpose
The Data Engineer is a vital member of the application services team responsible for designing, building, and maintaining the infrastructure that supports the collection, storage, and analysis of large sets of data. They ensure that data flows smoothly from source to destination so that it can be processed and analyzed by other stakeholders.
Essential Functions
- Design and create architecture to support organizational work and objectives as it relates to enterprise data.
- Design, implement, deploy, and manage Elasticsearch clusters in a multi-cluster environment, both on-premises and within the cloud
- Develop and implement integration solutions from different sources, including databases, data virtualization, APIs and external systems.
- Develop and maintain scalable data pipelines and build out new API integrations to support continuing increases in data volume and complexity
- Collaborate with other teams to improve data models that feed business intelligence tools, increasing data accessibility and fostering data-driven decision making across the organization
- Implement processes and systems to monitor data quality, ensuring production data is always accurate and available for key stakeholders and business processes that depend on it
- Write complex queries for data analysis and ensure data integrity and quality
- Other duties as assigned
Knowledge, Skills, and Abilities
- Proficiency in SQL and experience with relational databases technologies.
- Proficiency in non-relational databases
- Knowledge of data modeling and data architecture
- Skills in programming languages such as Python, C#, or others
- Ability to work with cloud services, specifically AWS or Azure
- Strong analytical and problem-solving skills
- Excellent communication and collaboration abilities
- Familiarity with machine learning algorithms and data science techniques
- Understanding of ETL (Extract, Transform, Load) processes and data pipeline construction
- Commitment to data integrity and a keen eye for detail
- Data virtualization tools like Denodo
- Ability to design, build, and deploy data solutions that capture, explore, transform, and utilize data to support AI, ML, and BI
- Excellent business acumen and interpersonal skills; able to work across business lines to influence and effect change to achieve common goals
Education, Licenses, and Certifications
Required
High School Diploma or GED
Preferred
- Bachelor's degree in Computer Science, Mathematics, Data Science, or another relevant technical area
Experience
Required
- Six or more years of professional work experience, at least five of which in data management disciplines, including data integration, modeling, optimization and data quality, or other areas relevant to data engineering responsibilities and tasks.
- Three or more years of experience in data engineering or a related field
- Three or more years of experience in elastic indexes, SSIS, SSAS, et cetera
Additional Requirements
Work Context
- Prolonged periods of sitting at a desk and working on a computer
- Must be able to work under minimal supervision
- Must be able to sit, stand, walk, stoop, kneel, crouch, climb, and crawl
General Requirements
Behaviors and Assessments/Additional Requirements
- Employment is dependent upon successfully completing a pre-employment background check and drug and alcohol test
- This position may require obtaining unescorted access status
- This position requires direct or indirect access to certain export-controlled technology, for which INPO may be required to obtain an export license in accordance with applicable U.S. export control laws and regulations. If an export license is required, any offer of employment at INPO for this position is contingent upon receipt of the export license or authorization