Senior Data Quality Engineer

Job description

About the Team
Quality Engineering in Data Team plays a key role in validating the accuracy and reliability of the data gathered by the team from each part of the company and its integration into a Data Lake, and also analyzing data and verifying the management Dashboards, BI and KPI metrics. We are responsible for the end-to-end testing of the AI/ML Model developed to be ingested in the core product and varied Data Pipelines and APIs consumed by other team. We are also responsible to the Non-functional testing of the projects within data team.

About the Role

This is moderately complex, technical work developing, reviewing and correcting raw data & sampling methods to ensure reliability and validity of the data and testing Data warehouse integrity and Data team systems accuracy for Machine Learning model, Data Pipelines, Business Intelligence and Data Services.


Responsibilities
  • Data Quality Engineer will partner closely with the data creator, data stewards, data consumer to ensure the data is usable, accurate and compliant
  • Participate in mapping business data sources to appropriate repositories of data
  • Identify new critical data elements and metadata
  • Accountable for data discrepancies and mitigation, data availability and consistency
  • Design, Lead & manage the database implementation strategy and testing plans.
  • Leverage data quality best practices to design and maintain policies, methodologies, guidelines around data quality, data profiling & data cleansing
  • Maintain Documentation and Company wide Transparency on Data Quality

Requirements

Mandatory Qualifications
  • BS or MS in Computer Science/Engineering or equivalent relevant experience
  • 3+ years of experience in Backend testing and overall 6+ years
  • Knowledge of Backend / ETL testing is a must
  • Knowledge of data manipulation and transformation using queries. Preferably NoSQL
  • Experience in scripting using Python or Java is mandatory
  • Experience in Postman is mandatory
  • Familiar working with PyTest, GreatExpectations, dbtests is a plus
  • Familiar with Concept of Data Quality & Governance
  • Written communicator, able to document software behavior and submit bug reports
Preferred Qualifications
  • Experience in working with task management tools (e.g. Jira, Confluence)
  • Experience in working with test automation for backend and data load/volume test
  • Practical experience in using Big Data technologies
  • Familiarity with Concept of Data warehouse and OLAP