Forgot password?
|
|
|
|
We were unable to sign you in.
Please verify your user name and password and try again. If you do not have a TEC account, register now.
Read Comments Metadata Standards in the Marketplace -
Why Do I Care? (And Where Does Godzilla Fit In?)

M. Reed and D. Geller - May 16, 2000

Overview

Metadata ("data about data") is essential for data warehousing. In order to populate a database, extract data, or run a report, more is required than simply raw data. The tools involved must also "understand" the context, or meaning, of the data. This is one of the purposes of metadata. Consider 2.5. This could be an amount in dollars, yen, or euros. It could be a mail stop at a company. It could be the height in meters of a member of the Boston Celtics. Nothing about the number itself can tell you what it means. To interpret an item like this, you need metadata.

Metadata is a description of data that tells you how to interpret it. Sometimes the metadata description can be deduced from the structure or the column names in a database; sometimes it is found in the programs that use it, and sometimes it resides only in comments or text. Frequently it is a combination of these.

When dealing with a single database, the use of metadata is as natural as swimming is to a fish - and just as invisible as the water. However, when dealing with heterogeneous environments, things get much more opaque. As one TEC analyst put it, once you've begun building and maintaining a data warehouse, metadata problems begin to surface "like Godzilla lurching out of Tokyo harbor." Metadata becomes important in a data warehousing context because the value of a data warehouse derives from the possibility of having application suites that tie the source databases with the target data warehouse and produce the reports that give a data warehouse value.

If you write your own suite of programs or can find a single vendor solution that handles all of your needs, you might still be able to get away with ignoring the metadata. But in most cases you'll need to tie together applications from different vendors, possibly including some homegrown ones, to make your data warehouse work. It's at this point that you start paying attention to metadata and, more specifically, metadata standards.

A simplified definition of metadata: One type of metadata defines the meaning of data. It is made up of entities, attributes, and relationships. For example, a table is an entity. It has attributes such as a name. A column is also an entity. It also has attributes such as a datatype. A column is related to a specific table. This is an example of a relationship. Any database has hundreds or thousands of entities, each entity having many attributes and relationships. It is this information that is necessary for any useful manipulation of data.

This note gives a high-level overview of metadata from the viewpoint of warehouse tools and repositories, and the standards that have been promulgated to support its use and interchange.

Importance to the Customer

Why should an Information Technology manager be interested in standards for metadata? The data warehousing industry is rapidly building "suites" of applications that will allow for:

  1. The population of a data warehouse or data mart

  2. Intelligent reporting and analysis of that data.

In order to do this, the tools must be provided the metadata information about the source systems and the target data warehouse. Most suites on the market are incomplete, so it is common for customers to purchase different tools from different vendors. If each vendor tool has its own metadata store (often referred to as a "repository"), the customer has to supply metadata information individually to each tool, repetitively.

Current metadata repositories often use static metadata with batch file based exchange mechanisms. When tables change, each tool must have its metadata definitions refreshed. As the number of tools and the volatility of the data warehouse increase, this becomes extremely cumbersome.

Ideally, distributed metadata repositories could dynamically exchange information and keep each other synchronized. This would greatly reduce the customer's workload and ensure correct metadata across domains and life cycles. (Note: currently, even tools from the same vendor may not be able to share metadata).

Standards

In order for metadata to be useful, it must be represented in a format the data warehouse tool can understand. It must also be accessible to the tool. Over the years, vendors had many different methods of metadata representation and storage. Recently, they have formed standards bodies and reduced the number of standards to two. As Richard Soley, Chairman and CEO of the Object Management Group described it to TEC, "Better two standards than ten". It certainly could be argued that one would be even better than two, but that does not appear likely at this time.

OMG is the primary standards body in this scenario, and has presented a widely supported standard called CWMI (Common Warehouse Metadata Interchange).

The second standards body involved is the MetaData Coalition (MDC), to which Microsoft handed over it's standard, OIM (Open Information Model).

Much work has been done to bring the standards closer together, and an XMI (XML Metadata Interchange) "bridge" has been written to allow OIM-compliant products to interact with CWMI-compliant products. However, the standards are substantially different in their implementation. The major differential in the implementations is that OIM is based strictly on Microsoft standards and products, and CWMI is an open standard that will also work on UNIX, mainframes, and other systems.

Glossary of Terms

OMG: Object Management Group. An international organization founded in 1989 to endorse technologies as open standards for object-oriented applications. The consortium now includes over 800 members.

MDC: Meta Data Coalition. A consortium founded in 1995 with close to 50 vendors and end-users whose goal is to provide a tactical solution for metadata exchange.

CORBA: Common Object Request Broker Architecture. A standard from the OMG for communicating between distributed objects (objects are self-contained software modules). CORBA provides a way to execute programs (objects) written in different programming languages running on different platforms (i.e. UNIX, mainframe, Windows), no matter where they reside in the network. The CORBA standard competes to some degree with DCOM, although through COM-CORBA bridging products, both can be used in cooperation.

IIOP: Internet Inter-ORB Protocol. The CORBA messaging protocol used on a TCP/IP network. It allows programs (objects) to be run remotely in a network. IIOP links TCP/IP to CORBA's General Inter-ORB protocol (GIOP), which specifies how CORBA's Object Request Brokers (ORBs) communicate with each other. When a user accesses a Web page that uses a CORBA object, a small Java applet is downloaded into the web browser, which invokes the ORB to pass data to the object, execute the object and get the results back.

DCOM: Distributed Component Object Model. Microsoft's technology for distributed objects. DCOM defines the remote procedure call, which allows those objects to be run remotely over the network. DCOM only functions in a Microsoft Windows environment.

UML: Unified Modeling Language. An object-oriented design language from the OMG. Many design methodologies for describing object-oriented systems were developed in the late 1980s. UML "unifies" the popular methods into a single standard, including Grady Booch's work at Rational Software, James Rumbaugh's Object Modeling Technique and Ivar Jacobson's work on use cases. In only four years, UML has become the software industry's dominant modeling language. UML 1.3 was ratified at OMG's meeting in November 1999. Until June of 1999, Microsoft had their own "flavor" of UML, but they now adhere to the standard. MOF: Meta Object Facility. This OMG specification provides a set of CORBA interfaces that can be used to define and manipulate a set of interoperable metamodels. MOF is a key to integration of metamodels across domains.

XML: Extensible Markup Language. The World Wide Web Consortium's document format for the Web that is more flexible than the standard HTML (HyperText Markup Language) format. While HTML uses only predefined tags to describe elements within the page, XML allows tags to be defined by the developer of the page.

IDL: Interface Definition Language. A language used to describe the interface to a routine or function. For example, objects in the CORBA distributed object environment are defined by an IDL, which describes the services performed by the object and how the data is to be passed to it.

SGML: Standard Generalized Markup Language. An ISO standard for defining the format of a text document. An SGML document uses a separate Document Type Definition (DTD) file that defines the format codes, or tags, embedded within it.

DTD: Data Type Definitions. A language that describes the contents of an SGML document. The DTD is also used with XML, and the DTD definitions may be embedded within an XML document or in a separate file.

XMI: XML Metadata Interchange. A method for two MOF-compliant repositories to exchange information. When the OMG was questioned by TEC about Microsoft's position on support for XMI, Grady Booch, Chief Scientist of Rational Software and one of the developers of UML, stated that they "could not comment publicly". TEC received the same "no comment" response from every member of the OMG that we questioned.

CWMI: Common Warehouse Metadata Interchange. A universal data format for interchange of metadata between data warehouse and business intelligence products. Developed by the Object Management Group in conjunction with a consortium of over 700 vendors.

OIM: A competing standard to CWMI. Written by Microsoft and turned over to the MDC. Co-developed by over 20 companies. Based on standards such as SQL and COM, and used for metadata interchange.

BOTTOM LINE
Market Predictions

We believe that the variance between the MDC and the OMG standards will continue to shrink, due to market pressure from major customers who are growing tired of having to hand-craft integration strategies between business intelligence and data warehousing products (probability 80%). A single standard for "plug and play" metadata interchange will be a powerful market force in the future, especially for the vendor who implements it first.

Customers are becoming increasingly frustrated at the level of effort required to integrate different vendors "best of breed" tools. There will come a time very soon when they will refuse and vendors will be forced to work together, whether they get along with each other or not.

Vendor Recommendations

Ensure compliance with the standards promulgated by the Object Management Group. Much has been done to ensure interoperability between the two standards (i.e., the Microsoft XML Interchange Initiative), but it is not yet a perfect world.

Some vendors have created proprietary application programming interfaces to allow other vendors to interoperate (for example the Informatica MX2 API), but this limits customers to the use of tools whose vendors have developed the "plug-ins" to the Informatica product. Other vendors have taken a pure XML approach, which is a more open course of action.

User Recommendations

Customers must make their data warehousing vendors carefully articulate what standards have been used in the creation of their product, or if the product is completely proprietary. As the various tools are chosen, care must be taken to ensure that each additional tool interoperates with the others in the required manner.

In a heterogeneous environment, it becomes even more important, since the Microsoft solution is a Windows-only one unless bridging products are used. It is almost unheard of for any major company to have a completely homogeneous environment, since many legacy and mainframe systems are still in use (i.e., IBM MVS, DEC VMS, Adabas, MUMPS), therefore, metadata is extremely important to the customer and must be kept in mind at all times.

 


 
comments powered by Disqus


A Retail Sourcing Suite Built on Experience | One Vendor's Quest to Garner a Global Sourcing Ecosystem | Microsoft Dynamics AX 4.0 for Manufacturing Environments | Supplier Relationship Management: Benefits and Challenges | Software as a Service's Functional Catch-up | Software as a Service: Not without Caveats | Application Portfolio Management: Are You Getting the Most from your Enterprise Software? | Driving Factors in The Enterprise Applications Market | Understanding SOA, Web Services, BPM, and BPEL Part Two: BPEL and User Recommendations | Understanding SOA, Web Services, BPM, BPEL, and More Part One: SOA, Web Services, and BPM | Understand J2EE and .NET Environments Before You Choose | Outsourcing 101 - A Primer Part Three: Approaches and Recommendations | Financial Reporting, Planning, and Budgeting As Necessary Pieces of EPM Part Two: Challenges and User Recommendations | Financial Reporting, Planning, and Budgeting As Necessary Pieces of EPM Part One: Executive Summary | Has The BI Market Consolidation Been Crystal-Clearly Actuated? Part Three: Competition and User Recommendations. |
Has The BI Market Consolidation Been Crystal-Clearly Actuated? Part Two: Market Impact | Has The BI Market Consolidation Been Crystal-Clearly Actuated? | BI Market Consolidation Compared to ERP Market Consolidation | Analyse This | BPM Weaves Data And Processes Together For Real-time Revenues | SCE Leaders Partner To See Beyond Their Portfolio Part Three: Challenges and User Recommendations | SCE Leaders Partner To See Beyond Their Portfolios | The Art Of Distributed Development Of Multi-Lingual Three-Tier Internet Applications | The Case of A Boutique Vendor's Benefits of Focus - IRM Corporation | Why Systems Fail - The Dead-end of Dirty Data | Data Conversion in an ERP Environment | Continuous Data Quality Management: The Cornerstone of Zero-Latency Business Analytics Part 2: One Solution | Continuous Data Quality Management: The Cornerstone of Zero-Latency Business Analytics | What Makes Process Process? | Beware of Legacy Data - It Can Be Lethal | A Definition of Data Warehousing | The Next Big Thing or Integration-The Interaction Server Part 2: Possible Solutions | Shall Bifurcated Tack Reverse J.D. Edwards’ Bad Spell? | Enterprise Application Integration - Where Is It Now (And What Is It Now)? Part 2: Where Is It Now? | Enterprise Application Integration - Where Is It Now (And What Is It Now)? Part 1: What Is It Now? | The SOAP Opera Progresses - Helping XML to Rule the World | Can You Add New Life To an Old ERP System? | BEA Systems Announces WebLogic Integration | New Era of Networks Gets Blinded By the NEON | J.D. Edwards' QUEST To End Its String Of Pyrrhic Victories Part 2: The Implications | J.D. Edwards' QUEST To End Its String Of Pyrrhic Victories Part 1: The News | The Application Server War Escalates | EAI Vendor MITEM Integrates Legacy Systems With Siebel | Knosys Seeks Clarity With A Name Change | Computer Associates Jasmineii - When Is A Portal Not Just A Portal? | Hewlett Packard Makes Multiple Moves in Middleware | Where Is ERP Headed (Or Better, Where Should It Be Headed)? Part 2: Product Architecture and Web-Basing | EDI and XML Integration: Vitria Buys XMLSolutions | TIBCO Announces Results That Are 'Better Than Worse Than Expected' | Sagent Improves Its Image With SAS Partnership | Great Product: Too Bad The Architecture Doesn’t Fit | Seagate Software 'Crystallizes' Its New Name: Crystal Decisions | Informatica PowerCenter 5 Enables Enterprise Data Integration | EAI Market Consolidation Continues With Peregrine Acquisition of Extricity | IONA Purchases Netfish Technologies (And Much, Much More) | A New Era Dawns for Sybase | Evolutionary Technologies Does EAI (Always Did, We Just Didn’t Call It That) | Information Builders Did It iWay | GMAC Web-Enables Legacy Data With NEON Systems Shadow Direct | Business Objects Teams With TopTier For Analytics | Sun’s Java Won’t Be In Microsoft’s .NET – Complicate Your Integration? You .BET | Metagenix Reverse Engineers Data Into Information | Hummingbird Smells Nectar In The Corporate Portal Market | Mercator Continues to Suffer Turmoil - Can They Stay on the Map? | Tibco Takes a Pragmatic Approach to Multicasting | Talarian and NextSet Team for B2B Solutions | Informatica Powers Siebel’s New eBusiness Analytics | Implementation Acceleration Using Integration | BEA Systems Has A Broad Vision For E-Business Infrastructures | QueryObject Partners With Cognos | Knosys "in the Kno" With ProClarity 3.0 Analytical Platform | SPSS Has A New ShowCase | Did Sagent Technology Pull the Old 'Pump and Dump'? | Data Mining: The Brains Behind eCRM | Tempest Creates a Secure Teapot | Optum’s ConnectStream: First the Pieces Now the Glue | What’s Up with Computer Associates? | Now the Minnows are Eating the Minnows | Informix Decides to Start Analyzing Websites | EAI - The 'Crazy Glue' of Business Applications | SAS Institute Shoots for the Two-Stop-Shop with new Release of Warehouse Administrator | System Software Suppliers Slip Seriously | EAI Vendor Mercator Drops to a Lower Place on the Map | The Necessity of Data Warehousing | Syncsort Sigma Manages Database Aggregates | MicroStrategy 7 Hits the Street | CPortals Technologies Aims for the Middle | To BEA or Not to BEA: Is That the Question? | Informix Goes Vertical With Software Vendor ADRM | Evoke Software Releases Axio Data Integration Product | Vignette of an EAI Vendor (So to Speak) | Viador Teams With Business Objects | Applix Still Shows a Presence in the OLAP Market | Information Builders Announces New Release of WebFOCUS | webMethods Gets Active (Software That Is) | Sagent Technology Teams for Telco e-Business | EAI Vendor Active Software Activates Transactions | BMC Software Webs for the DBA | Business Objects Objects Again | Acta Gets Active | Parametric Technology Chills Out With Windchill Info*Engine V4 | Informix XML’s Its Metadata Transport Layer | EAI Vendor Extricity Teams with Moai to Automate E-Commerce Systems | Computer Associates Goes E-Business in a Big Way | IBM Moves into Enterprise Application Integration | Sybase Tag-Teams with Informatica | Mercator Software Extends EAI Solutions for Insurance with XML | EAI Vendor CrossWorlds Eases Middleware Customization | Brio Technology Expands Support for WML and XML | Ardent Software: Will Informix Merger Affect their Success? | MicroStrategy Hits a Big Speed Bump on the Information Superhighway | Aspen Follows Good Quarter With Internet Launch | Brio Technology Reports Record Second-QuarterEarnings | Sybase and MicroStrategy Team on Vertical Market Portal Applications | Informatica Conforms to Metadata Standard | Business Objects Outguns Brio Technology in Patent Dispute | Is There Finally a Metadata Exchange Standard on the Horizon? | Datawarehouse Vendors Moving Towards Application Suites | Microstrategy Moves Up with e-Business | Seagate Technology Refocuses its Software Business | The Market Rewards Ardent Software Initiatives | Hummingbird Announces Extraction and Portal Strategy for ERP | Sagent Technology Reports Strong Growth | Oracle8i Release 2 - Ready to Storm the Web | Sterling Software Sees the Light with Eureka:Intelligence | Brio Technology Enters the ETL Market | More Data is Going to the Cleaners | Informix to Acquire Ardent Software-Another Vendor's Attempt at End-to-End Data Warehousing | Informatica Heads for E-Business | Acta Technology Helps Add Business Intelligence Capabilities to Major ERP Vendors | Inprise/Borland Challenges Other Vendors to Open-Source Their Database Code | Informatica Goes Multinational With Support for Unicode | Bus-Tech Speeds up Mainframe DB2 Access | NEON Systems Moves Further into Enterprise Application Integration | Hummingbird Releases Genio 4.0 With Improved Support for Oracle, Business Objects, Cognos, and NCR | Business Objects Launches WebIntelligence Extranet | Analysis of Novell and EAI Vendor Talarian Alliance | Informix Holds Fire Sale on Linux Database | Resistance is Futile: Computer Associates Assimilates yet another Major Software Firm | systemfabrik Releases an EAI Product? | Saga Continues Roll Out of EAI Tools | NCR's Teradata Database Meets Windows 2000. A Match Made in Redmond? | BMC Software Gets Slapped with Class Action Lawsuit | Software Technologies Corporation (STC) Prepares to go Public | SAS/Warehouse 2.0 Goes Live | GE Comes to Lunch. Want to Guess Who the Appetizer Will Be? | Computer Associates Splashes Into the Data Warehousing Market with Platinum Technology Acquisition | Informatica Morphs into Enterprise Decision Support Vendor | Enterprise Application Integration - the Latest Trend in Getting Value from Data |


Use this index to search for white papers related to commonly used search terms A B C D E F G H I J K L M N O P Q R S T U V W X Y Z Others 
Recent Searches
A B C D E F G H I J K L M N O P Q R S T U V W X Y Z Others
A: 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26
B: 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19
D: 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19
E: 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22
F: 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27
G: 1 2 3 4 5 6 7
H: 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20
I: 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15
J: 1 2 3 4 5
K: 1 2 3 4
L: 1 2 3 4 5 6 7 8 9 10 11 12 13 14
M: 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20
N: 1 2 3 4 5 6 7 8
O: 1 2 3 4 5 6 7 8 9 10 11 12 13 14
P: 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19
Q: 1 2
R: 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17
T: 1 2 3 4 5 6 7 8 9 10 11 12 13
U: 1 2 3
V: 1 2 3 4
W: 1 2 3 4 5 6 7 8 9 10 11
X: 1
Y: 1
Z: 1
Others: 1 2 3


©2013 Technology Evaluation Centers Inc. All rights reserved. Search powered by Google