The ‘What’ and ‘Why’ of Metadata
Put simply, metadata is the DNA of your data, down to a granular level of audit trail and user history and defines the physical characteristics of each data element. It will give you information on data usage, data ownership and user access, and will also explain why the data was accessed, as well as where it came from. The story metadata tells is useful to all levels by users across all roles across an organization.
Metadata is generated on any change to your data, anywhere in your organisation’s enterprise data architecture, be it in data storage or data system. Metadata Management is like a data dictionary. And defines the fields and values, source, system, and is easily available as a web page to the business or IT.
The benefits of using metadata in business is limitless and will add value to your business. It will speed up your turnaround time with its organised visualisations and deliverables by providing you with a one-stop documentation on all the data in your system.
Three Categories of Data
Metadata can be split into three categories: Business, technical, or operational.
Technical metadata includes database system names, table and column names, data types and allowed values, and can also include primary and foreign key relations.
Eg: F_TRXN.INVC_DT where F_TRXN is table name and INVC_DT is the column name. Data type is DATE.
Operational metadata describes if the data is active or not, and if transformation or logic has been applied to it. Depending on the complexity of your Datawarehouse system, these three classifications can help you maintain the details of the system and can be your one-stop shop to trace anomalies and details.
Making Sense of Your Data
IBM Information Server The IBM Information Server suite stores metadata in a metadata repository tier as its own schema in a database and will maintain and manage your Technical and Operational Metadata in a Metadata Repository when integrated with IBM InfoSphere DataStage.
Information Server integrates all changes and updates into an Information Governance catalog. End users such as Analysts, Subject Matter Experts, architects, and administrators can then access this shared metadata repository via a suite of tools that flow through the InfoSphere Information Server suite. Changes made in one suite will automatically and instantly update and be visible throughout the entire suite.
To begin you need to catalog your data. To do this you will need an intelligent asset management tool that will provide you with a 360° view of your data and AI.