Approach For MDM Data Governance
Approach For MDM Data Governance
Data
Platform
Platform is architecture (data model, integration model ..) and technology that supports storage, data integration, data quality, metadata
management, reporting, data profiling, workflows and data management etc.
MDM Complexities – to be dealt with and which we helped address..
• Multiple ownerships
Business / Functional • Clear understanding and demarcation of Change management and Data Governance initiatives
Complexity • MDM get diluted to address tactical priorities and loses larger view from Sr. Management
• Master Data Governance - managed in silos – separate LOBs having people doing redundant jobs
Technical & Architectural • Reference Data Governance – not an area of focus
• Managing multiple processes around master data & reference data across multiple regions for
Complexities different entities
• Data model design being independent of a MDM related program – entities & relationships identified
Governance / Data from a DW/legacy perspective
• Opting for a customized data model
Management Complexity • Not having proactive data governance
Wipro Mpower – Full lifecycle MDM methodology
MDM Dimensions
Business Data Technology Governance
• Business drivers &
• As-is data management
imperatives • Key master Data entities Master data landscape
processes
• Key business use cases • Data quality assessment assessment
• Data ownership
Assess • Enterprise initiatives
MDM • MDM Business capabilities • Master data definitions • MDM architecture Evaluate data governance
Strategy • Business sponsorship • Key attributes • Solution options organization models
Envision
Country specific
CRUD Rules
CRUD Rules
2. Define & assess Product Vendor strategy and POC for a complete Business Services for the CDO org
Enterprise architecture
implementation roadmap 3 roadmap for business
Metadata, Lineage, masking & leveraging IT
3. Helps CDO in Identifying & quantifying MDM business value to get security
Deliver ROI and TCO via
business buy-in for the program 4
value based themes
Source system Reporting
4. Defining technical solution implementation roadmap and calculating the strategy priorities Client-optimized MDM
5 technical architecture
TCO/ROI for the engagement Infrastructure Service advisory for
data strategy
5. Building a project cost element structures & cost model, technology 6
6. Help build POVs on opensource, data lake integration and Big Data 7
3
5
4
MDM and IT Governance
MDM Strategy and Business Case 1) Create governance organization and PMO CoE
1) High level requirements, source system analysis 2) Select IT admin, stewards/ global representatives
2) High level business case/ ROI (detailed extra) 3) Create new MDM, governance workflow
3) MDM governance recommendations applications
4) MDM road map and approach recommendations 4. Data profiling exploration & proactive DQ
1 scanning tools
MDM BEST PRACTICES
MDM Best Practices based on our MDM engagement experience
1. Build Business Case 2. Planning 3. Program and Data Governance
Analyze business problems that necessitate need Seek commitment time from business teams for Keep Data Governance as an integral component
for MDM implementation Requirements, Design Review, demos along with of the implementation
development, User Acceptance Establish roles and identify individuals
Define quantifiable returns
Focus on Continued Value Delivery – Source Enable change management along the
Poor data quality is often not noticed & the
Level Improvements, Smaller Iterations implementation path
impact of this is not known unless exposed
Build ‘POC’ and demo the ability of the system to Keep Business teams ‘engaged’ during the
Build consensus amongst business community ‘eclipse’ period
address One or Two use cases
about the data problems and the need to address
them Plan to familiarize the Core Business team on Engage with Business users and address criticism
MDM Terms and functionalities and concerns
Gain senior management commitment for
continued support for the implementation Avoid ‘Big Bang’ Approach Clearly define a business glossary for all MDM
entities and attributes
• Make sure the match requirements are well • Use 3rd party party tool for data enrichment • Plan adequate iterations for doing • Consider real-time data ingestion if
thrashed out as they will drive the model like Address Doctor for address enrichment, match rule tuning source provides data in real-time
and also the major effort required towards a and standardization
successful implementation • Leverage exact match column in fuzzy • Create business services for course
• Standardize all reference data elements like match rule for better performance. Limit grain MDM APIs by orchestrating
• Keep the database at a 3NF form and don’t country code, state code, phone numbers, trust configuration for important fine=grain services
normalize it completely as it limits flexibility, salutation columns only
performance and functionality • Profile source data and define
• Use Appropriate DQ tool’s cleansing • “BOT” can learn and take actions on appropriate data quality rules
• During design, it is useful to create a mix of capabilities & reference tables in MDM to such records by learning from actions
logical & physical model that contains key • Evolution of MDM architecture from
store “Standard” reference values taken by human data steward in past
attributes & related notes one style to another will require
• Implement robust validation rules in the • Consider data model design between changes to the data integration
• Articulate hierarchy and relationship application where data is generated like 2NF & 3NF; database hardware patterns
definition as they have direct impact on POS, Portal, etc. capacity, data volume must be
MDM data model and configuration • Define integration layer to handle
• Load only valid data in MDM. Data should be considered during design stage
unplanned exceptional scenarios /
corrected in source & re-loaded in MDM Data rejections
• Match requirements to drive Data model – A key • Establish the integration pattern such as service based • Setup VIP and Load Balancer in a clustered
value based model supports ‘OR’ conditions vs. Message Queue VS Batch or a combination of them environment for distribution of loads across
whereas column based model supports ‘AND’
conditions in matching
• Define reusable services that can be leveraged across nodes
the enterprise
• Avoid consolidating child tables unless there is a • Enable SSL to enable data security while data
specific business case
• Define a wrapper over MDM provided services and
in motion
expose wrapper to consuming applications
• MQ based integration can trigger a lot of messages • Consider using OOB; security model to
depending on type of events. This must be a enforce data authorization instead of
consideration when taking this approach customization