Forgot password?
|
|
|
|
We were unable to sign you in.
Please verify your user name and password and try again. If you do not have a TEC account, register now.


If you receive errors when attempting to view this white paper, please install the latest version of Adobe Reader.

" Melissa Data's Data Quality Suite operates like a data quality firewall ' instantly verifying, cleaning, and standardizing your contact data at point of entry, before it enters your database."
Source : Melissa Data

Resources Related to Six Steps to Manage Data Quality with SQL Server Integration Services:

Six Steps to Manage Data Quality with SQL Server Integration Services

Data Quality is also known as : Business Data Quality, Customer Data Quality, Data Quality, Data Quality Center, Data Quality Control, Data Quality Improvement, Data Quality Analysis, Data Quality Articles, Data Quality Assessment,
Data Quality Indicator, Data Quality Indicators, Data Quality Business Intelligence, Data Quality Initiatives, Data Quality Issues, Data Quality Management, Data Quality Measurement, Data Quality Measures, Data Quality Methodology, Data Quality Methods, Data Quality Metrics, Data Quality Model, Data Quality Objectives, Data Quality Plan, Data Quality Problems, Data Quality Process, Data Quality Products, Data Quality Program, Data Quality Project, Data Quality Report, Data Quality Reporting.

Six Steps to Managing Data Quality with

SQL Server Integration Services (SSIS) Introduction

A company's database is its most important asset. It is a collection of information on customers, suppliers, partners, employees, products, inventory, locations, and more. This data is the foundation on which your business operations and decisions are made; it is used in everything from booking sales, analyzing summary reports, managing inventory, generating invoices and forecasting. To be of greatest value, this data needs to be up-to-date, relevant, consistent and accurate — only then can it be managed effectively and aggressively to create strategic advantage.

Unfortunately, the problem of bad data is something all organizations have to contend with and protect against. Industry experts estimate that up to 60 percent or more of the average database is outdated, fl awed, or contains one or more errors. And, in the typical enterprise setting, customer and transactional data enters the database in varying formats, from various sources (call centers, web forms, customer service reps, etc.) with an unknown degree of accuracy. This can foul up sound decision-making and impair effective customer relationship management (CRM). And, poor source data quality that leads to CRM project failures is one of the leading obstacles for the successful implementation of Master Data Management (MDM) — where the aim is to create, maintain and deliver the most complete and consolidated view from disparate enterprise data.

The other major obstacle to creating a successful MDM application is the difficulty in integrating data from a variety of internal data sources, such as enterprise resource planning (ERP), business intelligence (BI) and legacy systems, as well as external data from partners, suppliers, and/or syndicators. Fortunately, there is a solution that can help organizations overcome the complex and expensive challenges associated with MDM — a solution that can handle a variety of data quality issues including data deduplication; while leveraging the integration capabilities inherent in Microsoft's SQL Server Integration Services (SSIS 2005/2008) to facilitate the assembly of data from one or more data sources. This solution is called Total Data Quality.

The Six Steps to Total Data Quality

The primary goal of an MDM or Data Quality solution is to assemble data from one or more data sources. However, the process of bringing data together usually results in a broad range of data quality issues that need to be addressed. For instance, incomplete or missing customer profile information may be uncovered, such as blank phone numbers or addresses. Or certain data may be incorrect, such as a record of a customer indicating he/she lives in the city of Wisconsin, in the state of Green Bay.

Setting in place a process to fi x these data quality issues is important for the success of MDM, and involves six key tasks: profiling, cleansing, parsing/standardization, matching, enrichment, and monitoring. The end result — a process that delivers clean, consistent data that can be distributed and confidently used across the enterprise, regardless of business application and system.

1. Profiling

As the first line of defense for your data integration solution, profiling data helps you examine whether your existing data sources meet the quality standards of your solution. Properly profiling your data saves execution time because you identify issues that require immediate attention from the start — and avoid the unnecessary processing of unacceptable data sources. Data profiling becomes even more critical when working with raw data sources that do not have referential integrity or quality controls.

There are several data profiling tasks: column statistics, value distribution and pattern distribution. These tasks analyze individual and multiple columns to determine relationships between columns and tables. The purpose of these data profiling tasks is to develop a clearer picture of the content of your data.

  • Column Statistics — This task identifies problems in your data, such as invalid dates. It reports average, minimum, maximum statistics for numeric columns
  • Value Distribution — Identifies all values in each selected column and reports normal and outlier values in a column
  • Pattern Distribution — Identifies invalid strings or irregular expressions in your data.
 

2. Cleansing

After a data set successfully meets profiling standards, it still requires data cleansing and de-duplication to ensure that all business rules are properly met. Successful data cleansing requires the use of flexible, efficient techniques capable of handling complex quality issues hidden in the depths of large data sets. Data cleansing corrects errors and standardizes information that can ultimately be leveraged for MDM applications.

3. Parsing and Standardization

This technique parses and restructures data into a common format to help build more consistent data. For instance, the process can standardize addresses to a desired format, or to USPS' specifications, which are needed to enable CASS Certified processing. This phase is designed to identify, correct and standardize patterns of data across various data sets including tables, columns and rows, etc.

4. Matching

Data matching consolidates data records into identifiable groups and links/merges related records within or across data sets. This process locates matches in any combination of over 35 different components — from common ones like address, city, state, ZIP', name, and phone — to other not-so-common elements like email address, company, gender and social security number. You can select from exact matching, Soundex, or Phonetics matching which recognizes phonemes like "ph" and "sh." Data matching also recognizes nicknames (Liz, Beth, Betty, Betsy, Elizabeth) and alternate spellings (Gene, Jean, Jeanne)

5. Enrichment

Data enrichment enhances the value of customer data by attaching additional pieces of data from other sources, including geocoding, demographic data, full-name parsing and genderizing, phone number verification, and email validation. The process provides a better understanding of your customer data because it reveals buyer behavior and loyalty potential.

  • Address Verification — Verify U.S. and Canadian addresses to highest level of accuracy — the physical delivery point using DPV' and LACSLink', which are now mandatory for CASS Certified processing and postal discounts.
  • Phone Validation — Fill in missing area codes, and update and correct area code/prefix. Also append lat/long, time zone, city, state, ZIP, and county.
  • Email Validation — Validate, correct and clean up email addresses using three levels of verification: Syntax; Local Database; and MXlookup. Check for general format syntax errors, domain name changes, improper email format for common domains (i.e. Hotmail, AOL, Yahoo) and validate the domain against a database of good and bad addresses, as well as verify the domain name exists through the MaileXchange (MX) Lookup, and parse email addresses into various components.
  • Name Parsing and Genderizing — Parse full names into components and determine the gender of the first name
  • Residential Business Delivery Indicator — Identify the delivery type as residential or business
  • Geocoding — Add latitude/longitude coordinates to the postal codes of an address
 

6. Monitoring

This real-time monitoring phase puts automated processes into place to detect when data exceeds pre-set limits. Data monitoring is designed to help organizations immediately recognize and correct issues before the quality of data declines. This approach also empowers businesses to enforce data governance and compliance measures.

Here is one scenario:
A customer of a hotel and casino makes a reservation to stay at the property using his full name, Johnathan Smith. So, as part of its customer loyalty-building initiative, the hotel's marketing department sends him an email with a free night's stay promotion, believing he is a new customer — unaware that the customer is already listed under the hotel's casino/gaming department as a VIP client — under a similar name John Smith.
The problem:
The hotel did not have a data quality process in place to standardize, clean and merge duplicate records to provide a complete view of the customer. As a result, the hotel was not able to leverage the true value of its data in delivering relevant marketing to a high value customer.

Supporting MDM

Along with setting up a Total Data Quality solution, you will need to deal with the other challenge of MDM — mainly, the deduplication of data from disparate sources with the integration provided by SSIS.

An MDM application that combines data from multiple data sources might hit a roadblock merging the data if there isn't a 'unique identifier' that is shared across the enterprise. This typically occurs when each data source system (i.e. an organization's sales division, customer service department, or call center) identifies a business entity differently.

There are three general categories or ways to organize your data so that it can ultimately be merged for MDM solutions — they are unique identifiers, attributes, and transactions.

Unique Identifiers — These identifiers define a business entity's master system of record. As you bring together data from various data sources, an organization must have a consistent mechanism to uniquely identify, match, and link customer information across different business functions. While data connectivity provides the mechanism to access master data from various source systems, it is the Total Data Quality process that ensures integration with a high level of data quality and consistency. Once an organization's data is cleansed, its unique identifiers can be shared among multiple sources. In essence, a business can develop a 'single customer view' — it can consolidate its data into a single customer view to provide data to its existing sources. This ensures accurate, consistent data across the enterprise.

Attributes — Once a unique identifier is determined for an entity, you can organize your data by adding attributes that provide meaningful business context, categorize the business entity into one or more groups, and provide more detail on the entity's relationship to other business entities. These attributes may be directly obtained from source systems.

While managing unique identifiers can help you cleanse duplicate records, you will likely need to cleanse your data attributes. In many situations, you will still need to perform data cleansing to manage conflicting attributes across different data sources.

Transactions — Creating a master business entity typically involves consolidating data from multiple source systems. Once you have identified a mechanism to bridge and cleanse the data, you can begin to categorize the entity based on the types of transactions or activities that the entity is involved in. When you work with transaction data, you will often need to collect and merge your data before building it into your MDM solution.

Building Support for Compliance and Data Governance

MDM applications help organizations manage compliance and data governance initiatives. Recent compliance regulations, such as Sarbanes-Oxley and HIPAA, have increased the need for organizations to establish and improve their data quality methodologies. Without a solid MDM program in place, it would be difficult to make sense of the data residing in multiple business systems. Having well-integrated and accurate data gives organizations a "central system of record" — allowing them to comply with government regulations as a result of gaining a better understanding of their customer information.

Conclusion

A business can't function on bad, faulty data. Without data that is reliable, accurate and updated, organizations can't confidently distribute their data across the enterprise — which could potentially lead to bad business decisions. Bad data also hinders the successful integration of data from a variety of data sources.

But developing a strategy to integrate data while improving its quality doesn't have to be costly or troublesome.

With a solid Total Data Quality methodology in place — which entails a comprehensive process of data profiling, cleansing, parsing and standardization, matching, enrichment, and monitoring — an organization can successfully facilitate an MDM application. Total Data Quality helps expand the meaning between data sets, consolidates information, and synchronizes business processes. It gives organizations a more complete view of customer information — unlocking the true value of its data, creating a competitive advantage and more opportunities for growth.

About Melissa Data Corp.

For more than 23 years Melissa Data has empowered direct marketers, developers and database professionals with tools to validate, standardize, de-dupe, geocode and enrich contact data for custom, Web and enterprise data applications. The company's flagship products include: Data Quality Suite, MatchUp', and MAILERS+4'. For more information and free trials, visit www.MelissaData.comor call 1-800-MELISSA.

About Total Data Quality Integration Toolkit (TDQ-IT)

TDQ-IT is a full-featured enterprise data integration platform that leverages SQL Server Integration Services (SSIS) to provide a flexible, affordable solution for total data quality and master data management (MDM) initiatives. For a free trial visit www.MelissaData.com/tdq

About Microsoft

Founded in 1975, Microsoft (Nasdaq "MSFT") is the worldwide leader in software, services and solutions that help people and businesses realize their full potential.

About Microsoft Integration Services and SQL Server

Microsoft Integration Services is a platform for building enterprise-level data integration and data transformations solutions. For SQL Server 2008 product information, visit Microsoft SQL Server 2008.

Searches related to Six Steps to Manage Data Quality with SQL Server Integration Services:
Business Data Quality | Customer Data Quality | Data Quality | Data Quality Analysis | Data Quality Articles | Data Quality Assessment | Data Quality Business Intelligence | Data Quality Center | Data Quality Control | Data Quality Improvement | Data Quality Management | Data Quality Measurement | Data Quality Measures | Data Quality Indicator | Data Quality Indicators | Data Quality Initiatives | Data Quality Issues |
Data Quality Methodology | Data Quality Methods | Data Quality Metrics | Data Quality Model | Data Quality Objectives | Data Quality Plan | Data Quality Problems | Data Quality Process | Data Quality Products | Data Quality Program | Data Quality Project | Data Quality Report | Data Quality Reporting | Data Quality Reports | Data Quality Research | Data Quality Review | Data Quality Service | Data Quality Services | Data Quality Software | Data Quality Solution | Data Quality Solutions | Data Quality Specialist | Data Quality Standards | Data Quality Statistics | Data Quality Strategy | Data Quality Survey | Data Quality System | Data Quality Technology | Data Quality Test | Data Quality Tool | Data Quality Tools | Ensure Data Quality | Enterprise Data Quality | Examining Data Quality | Good Data Quality | Highest Data Quality | Improving Data Quality | Measure Data Quality | Online Data Quality | SSIS Business Data Quality | SSIS Customer Data Quality | SSIS Data Quality | SSIS Data Quality Analysis | SSIS Data Quality Articles | SSIS Data Quality Assessment | SSIS Data Quality Business Intelligence | SSIS Data Quality Center | SSIS Data Quality Control | SSIS Data Quality Improvement | SSIS Data Quality Indicator | SSIS Data Quality Indicators | SSIS Data Quality Initiatives | SSIS Data Quality Issues | SSIS Data Quality Management | SSIS Data Quality Measurement | SSIS Data Quality Measures | SSIS Data Quality Methodology | SSIS Data Quality Methods | SSIS Data Quality Metrics | SSIS Data Quality Model | SSIS Data Quality Objectives | SSIS Data Quality Plan | SSIS Data Quality Problems | SSIS Data Quality Process | SSIS Data Quality Products | SSIS Data Quality Program | SSIS Data Quality Project | SSIS Data Quality Report | SSIS Data Quality Reporting | SSIS Data Quality Reports | SSIS Data Quality Research | SSIS Data Quality Review | SSIS Data Quality Service | SSIS Data Quality Services | SSIS Data Quality Software | SSIS Data Quality Solution | SSIS Data Quality Solutions | SSIS Data Quality Specialist | SSIS Data Quality Standards | SSIS Data Quality Statistics | SSIS Data Quality Strategy | SSIS Data Quality Survey | SSIS Data Quality System | SSIS Data Quality Technology | SSIS Data Quality Test | SSIS Data Quality Tool | SSIS Data Quality Tools | SSIS Ensure Data Quality | SSIS Enterprise Data Quality | SSIS Examining Data Quality | SSIS Good Data Quality | SSIS Highest Data Quality | SSIS Improving Data Quality | SSIS Measure Data Quality | SSIS Online Data Quality | Business Data Quality SSIS | Customer Data Quality SSIS | Data Quality SSIS | Data Quality Analysis SSIS | Data Quality Articles SSIS | Data Quality Assessment SSIS | Data Quality Business Intelligence SSIS | Data Quality Center SSIS | Data Quality Control SSIS | Data Quality Improvement SSIS | Data Quality Indicator SSIS | Data Quality Indicators SSIS | Data Quality Initiatives SSIS | Data Quality Issues SSIS | Data Quality Management SSIS | Data Quality Measurement SSIS | Data Quality Measures SSIS | Data Quality Methodology SSIS | Data Quality Methods SSIS | Data Quality Metrics SSIS | Data Quality Model SSIS | Data Quality Objectives SSIS | Data Quality Plan SSIS | Data Quality Problems SSIS | Data Quality Process SSIS | Data Quality Products SSIS | Data Quality Program SSIS | Data Quality Project SSIS | Data Quality Report SSIS | Data Quality Reporting SSIS | Data Quality Reports SSIS | Data Quality Research SSIS | Data Quality Review SSIS | Data Quality Service SSIS | Data Quality Services SSIS | Data Quality Software SSIS | Data Quality Solution SSIS | Data Quality Solutions SSIS | Data Quality Specialist SSIS | Data Quality Standards SSIS | Data Quality Statistics SSIS | Data Quality Strategy SSIS | Data Quality Survey SSIS | Data Quality System SSIS | Data Quality Technology SSIS | Data Quality Test SSIS | Data Quality Tool SSIS | Data Quality Tools SSIS | Ensure Data Quality SSIS | Enterprise Data Quality SSIS | Examining Data Quality SSIS | Good Data Quality SSIS | Highest Data Quality SSIS | Improving Data Quality SSIS | Measure Data Quality SSIS | Online Data Quality SSIS | MDM Business Data Quality | MDM Customer Data Quality | MDM Data Quality | MDM Data Quality Analysis | MDM Data Quality Articles | MDM Data Quality Assessment | MDM Data Quality Business Intelligence | MDM Data Quality Center | MDM Data Quality Control | MDM Data Quality Improvement | MDM Data Quality Indicator | MDM Data Quality Indicators | MDM Data Quality Initiatives | MDM Data Quality Issues | MDM Data Quality Management | MDM Data Quality Measurement | MDM Data Quality Measures | MDM Data Quality Methodology | MDM Data Quality Methods | MDM Data Quality Metrics | MDM Data Quality Model | MDM Data Quality Objectives | MDM Data Quality Plan | MDM Data Quality Problems | MDM Data Quality Process | MDM Data Quality Products | MDM Data Quality Program | MDM Data Quality Project | MDM Data Quality Report | MDM Data Quality Reporting | MDM Data Quality Reports | MDM Data Quality Research | MDM Data Quality Review | MDM Data Quality Service | MDM Data Quality Services | MDM Data Quality Software | MDM Data Quality Solution | MDM Data Quality Solutions | MDM Data Quality Specialist | MDM Data Quality Standards | MDM Data Quality Statistics | MDM Data Quality Strategy | MDM Data Quality Survey | MDM Data Quality System | MDM Data Quality Technology | MDM Data Quality Test | MDM Data Quality Tool | MDM Data Quality Tools | MDM Ensure Data Quality | MDM Enterprise Data Quality | MDM Examining Data Quality | MDM Good Data Quality | MDM Highest Data Quality | MDM Improving Data Quality | MDM Measure Data Quality | MDM Online Data Quality | Business Data Quality MDM | Customer Data Quality MDM | Data Quality MDM | Data Quality Analysis MDM | Data Quality Articles MDM | Data Quality Assessment MDM | Data Quality Business Intelligence MDM | Data Quality Center MDM | Data Quality Control MDM | Data Quality Improvement MDM | Data Quality Indicator MDM | Data Quality Indicators MDM | Data Quality Initiatives MDM | Data Quality Issues MDM | Data Quality Management MDM | Data Quality Measurement MDM | Data Quality Measures MDM | Data Quality Methodology MDM | Data Quality Methods MDM | Data Quality Metrics MDM | Data Quality Model MDM | Data Quality Objectives MDM | Data Quality Plan MDM | Data Quality Problems MDM | Data Quality Process MDM | Data Quality Products MDM | Data Quality Program MDM | Data Quality Project MDM | Data Quality Report MDM | Data Quality Reporting MDM | Data Quality Reports MDM | Data Quality Research MDM | Data Quality Review MDM | Data Quality Service MDM | Data Quality Services MDM | Data Quality Software MDM | Data Quality Solution MDM | Data Quality Solutions MDM | Data Quality Specialist MDM | Data Quality Standards MDM | Data Quality Statistics MDM | Data Quality Strategy MDM | Data Quality Survey MDM | Data Quality System MDM | Data Quality Technology MDM | Data Quality Test MDM | Data Quality Tool MDM | Data Quality Tools MDM | Ensure Data Quality MDM | Enterprise Data Quality MDM | Examining Data Quality MDM | Good Data Quality MDM | Highest Data Quality MDM | Improving Data Quality MDM | Measure Data Quality MDM | Online Data Quality MDM | BI Business Data Quality | BI Customer Data Quality | BI Data Quality | BI Data Quality Analysis | BI Data Quality Articles | BI Data Quality Assessment | BI Data Quality Business Intelligence | BI Data Quality Center | BI Data Quality Control | BI Data Quality Improvement | BI Data Quality Indicator | BI Data Quality Indicators | BI Data Quality Initiatives | BI Data Quality Issues | BI Data Quality Management | BI Data Quality Measurement | BI Data Quality Measures | BI Data Quality Methodology | BI Data Quality Methods | BI Data Quality Metrics | BI Data Quality Model | BI Data Quality Objectives | BI Data Quality Plan | BI Data Quality Problems | BI Data Quality Process | BI Data Quality Products | BI Data Quality Program | BI Data Quality Project | BI Data Quality Report | BI Data Quality Reporting | BI Data Quality Reports | BI Data Quality Research | BI Data Quality Review | BI Data Quality Service | BI Data Quality Services | BI Data Quality Software | BI Data Quality Solution | BI Data Quality Solutions | BI Data Quality Specialist | BI Data Quality Standards | BI Data Quality Statistics | BI Data Quality Strategy | BI Data Quality Survey | BI Data Quality System | BI Data Quality Technology | BI Data Quality Test | BI Data Quality Tool | BI Data Quality Tools | BI Ensure Data Quality | BI Enterprise Data Quality | BI Examining Data Quality | BI Good Data Quality | BI Highest Data Quality | BI Improving Data Quality | BI Measure Data Quality | BI Online Data Quality | Business Data Quality BI | Customer Data Quality BI | Data Quality BI | Data Quality Analysis BI | Data Quality Articles BI | Data Quality Assessment BI | Data Quality Business Intelligence BI | Data Quality Center BI | Data Quality Control BI | Data Quality Improvement BI | Data Quality Indicator BI | Data Quality Indicators BI | Data Quality Initiatives BI | Data Quality Issues BI | Data Quality Management BI | Data Quality Measurement BI | Data Quality Measures BI | Data Quality Methodology BI | Data Quality Methods BI | Data Quality Metrics BI | Data Quality Model BI | Data Quality Objectives BI | Data Quality Plan BI | Data Quality Problems BI | Data Quality Process BI | Data Quality Products BI | Data Quality Program BI | Data Quality Project BI | Data Quality Report BI | Data Quality Reporting BI | Data Quality Reports BI | Data Quality Research BI | Data Quality Review BI | Data Quality Service BI | Data Quality Services BI | Data Quality Software BI | Data Quality Solution BI | Data Quality Solutions BI | Data Quality Specialist BI | Data Quality Standards BI | Data Quality Statistics BI | Data Quality Strategy BI | Data Quality Survey BI | Data Quality System BI | Data Quality Technology BI | Data Quality Test BI | Data Quality Tool BI | Data Quality Tools BI | Ensure Data Quality BI | Enterprise Data Quality BI | Examining Data Quality BI | Good Data Quality BI | Highest Data Quality BI | Improving Data Quality BI | Measure Data Quality BI | Online Data Quality BI | SQL Business Data Quality | SQL Customer Data Quality | SQL Data Quality | SQL Data Quality Analysis | SQL Data Quality Articles | SQL Data Quality Assessment | SQL Data Quality Business Intelligence | SQL Data Quality Center | SQL Data Quality Control | SQL Data Quality Improvement | SQL Data Quality Indicator | SQL Data Quality Indicators | SQL Data Quality Initiatives | SQL Data Quality Issues | SQL Data Quality Management | SQL Data Quality Measurement | SQL Data Quality Measures | SQL Data Quality Methodology | SQL Data Quality Methods |
Home  |   Careers  |   Contact Us  |   Glossary  |   Special Offers  |   Software Features & Functions  |   Software Selection Shortcuts  |   Feedback  |   Terms of Use  |   Privacy Policy

©2012 Technology Evaluation Centers Inc. All rights reserved. Search powered by Google