Data Quality

Project Information

July 2023

In this project, I implemented Extract, Transform, Load (ETL) processes in Alteryx to extract data from multiple sources, including Excel and CSV files. To ensure data integrity, I applied quality assurance checks and handled missing values and other gaps in the dataset.

A key component of the project involved creating an Alteryx workflow to pull and wrangle data from a database, applying data cleaning and normalization rules to enhance overall data quality.
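
As a rough illustration of what cleaning and normalization rules of this kind can look like, here is a minimal sketch in plain Python. It is not the Alteryx workflow itself: the field names (`title`, `year`, `rating`), the sample rows, and the list of missing-value placeholders are all assumptions made for the example.

```python
from typing import Optional

# Hypothetical raw records; field names and values are illustrative only.
RAW_ROWS = [
    {"title": "  the matrix ", "year": "1999", "rating": "8.7"},
    {"title": "Inception", "year": "", "rating": "N/A"},
    {"title": "PARASITE", "year": " 2019", "rating": "8.5 "},
]

# Assumed placeholders that should be treated as missing values.
MISSING_TOKENS = {"", "n/a", "na", "null", "none", "-"}


def clean_text(value: Optional[str]) -> Optional[str]:
    """Trim whitespace and map missing-value placeholders to None."""
    if value is None:
        return None
    stripped = value.strip()
    return None if stripped.lower() in MISSING_TOKENS else stripped


def normalize_row(row: dict) -> dict:
    """Apply the cleaning rule to each field and cast to consistent types."""
    title = clean_text(row.get("title"))
    year = clean_text(row.get("year"))
    rating = clean_text(row.get("rating"))
    return {
        # str.title() is a crude casing rule, used here only for illustration.
        "title": title.title() if title else None,
        "year": int(year) if year else None,
        "rating": float(rating) if rating else None,
    }


cleaned = [normalize_row(r) for r in RAW_ROWS]
```

After this step every missing value is represented the same way (`None`) and each field has a single type, which makes downstream quality checks simpler. A real rule set would need a more careful casing rule than `str.title()`, which mishandles apostrophes and acronyms.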

For visualization and analysis, I developed a Tableau dashboard with drill-down capabilities and filters. The dashboard supports detailed exploration of Data Quality (DQ) dimensions such as accuracy, validity, and completeness.
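
To make the DQ dimensions concrete, the sketch below shows one common way completeness and validity can be scored per field before they are charted. It is plain Python rather than anything taken from the Tableau dashboard, and the sample rows and validity rules (for example, the accepted year range) are assumptions for illustration. Accuracy is left out because it requires comparison against a trusted reference source.

```python
from typing import Callable, Dict, List, Optional

# Hypothetical cleaned records; values are illustrative only.
ROWS: List[dict] = [
    {"title": "The Matrix", "year": 1999, "rating": 8.7},
    {"title": "Inception", "year": None, "rating": 8.8},
    {"title": None, "year": 2019, "rating": 11.2},
    {"title": "Parasite", "year": 1850, "rating": 8.5},
]

# Assumed validity rules, one predicate per field.
RULES: Dict[str, Callable[[object], bool]] = {
    "title": lambda v: isinstance(v, str) and len(v) > 0,
    "year": lambda v: isinstance(v, int) and 1888 <= v <= 2023,
    "rating": lambda v: isinstance(v, float) and 0.0 <= v <= 10.0,
}


def completeness(rows: List[dict], field: str) -> float:
    """Share of rows in which the field is populated."""
    if not rows:
        return 0.0
    return sum(1 for r in rows if r.get(field) is not None) / len(rows)


def validity(rows: List[dict], field: str, rule: Callable[[object], bool]) -> float:
    """Share of populated values that satisfy the field's rule."""
    present = [r[field] for r in rows if r.get(field) is not None]
    if not present:
        return 0.0
    return sum(1 for v in present if rule(v)) / len(present)


scores = {
    field: {
        "completeness": completeness(ROWS, field),
        "validity": validity(ROWS, field, rule),
    }
    for field, rule in RULES.items()
}
```

Validity is measured only over populated values here, so a missing value lowers completeness without also being counted as invalid. That keeps the two dimensions independent when they are shown side by side.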

Additionally, I used Informatica Data Quality (IDQ) to profile data, identify anomalies, and establish data validation rules, supporting a proactive approach to maintaining data accuracy. The project demonstrated proficiency in Alteryx, Tableau, and IDQ, as well as a commitment to raising data quality standards and providing actionable insights through intuitive visualization.
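
The idea behind column profiling and anomaly identification can be sketched in a few lines of plain Python. This is not IDQ and does not reflect how IDQ is configured; the `RUNTIMES` column, the choice of summary statistics, and the interquartile-range (IQR) outlier rule are assumptions chosen for the example.

```python
import statistics
from typing import List, Optional

# Hypothetical runtime column in minutes; 990 is a deliberate data-entry error.
RUNTIMES: List[Optional[int]] = [95, 102, 110, 98, 105, 100, 990, None, 107]


def profile_column(values: List[Optional[int]]) -> dict:
    """Summarize a column: row count, nulls, distinct values, and range."""
    present = [v for v in values if v is not None]
    return {
        "count": len(values),
        "nulls": len(values) - len(present),
        "distinct": len(set(present)),
        "min": min(present) if present else None,
        "max": max(present) if present else None,
    }


def iqr_anomalies(values: List[Optional[int]], k: float = 1.5) -> List[int]:
    """Return values lying more than k * IQR outside the quartiles."""
    present = sorted(v for v in values if v is not None)
    if len(present) < 4:
        return []  # too few values to estimate quartiles meaningfully
    q1, _, q3 = statistics.quantiles(present, n=4)
    spread = q3 - q1
    low, high = q1 - k * spread, q3 + k * spread
    return [v for v in present if v < low or v > high]


profile = profile_column(RUNTIMES)
anomalies = iqr_anomalies(RUNTIMES)
```

The profile gives a quick view of nulls and range, and the flagged values are candidates for a validation rule (for instance, an upper bound on runtime) rather than automatic deletions.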

Business Opportunities: 

  • Data Enrichment: Implement processes to enhance existing movie/IMDb data by filling gaps and rectifying inaccuracies.

  • Marketing and Outreach: Develop a comprehensive marketing strategy to raise awareness about the improved data quality, targeting both end-users and potential industry partners.

  • Quality Assurance: Establish robust quality control measures to identify and rectify discrepancies in the data, ensuring high accuracy and reliability.

Key Skills:

  1. ETL Processes (Alteryx)

  2. Data Integrity Assurance

  3. Quality Assurance Practices

  4. Data Cleaning and Normalization

  5. Alteryx Workflow Design

  6. Tableau Dashboard Development

  7. Drill-Down Capabilities Implementation

  8. Data Quality (DQ) Dimensions Exploration (Accuracy, Validity, Completeness)

  9. Informatica Data Quality (IDQ) Profiling

  10. Anomaly Identification

  11. Data Validation Rule Creation

  12. Visualization and Analysis

  13. Database Data Extraction

  14. Data Wrangling


I desire to explore and understand the world through the lens of data.
