Abstract: 1 - Ontology and Database Design
Abstract: 1 - Ontology and Database Design
ABSTRACT
In this presentation we discuss the concept of semantic web and ontologies. Ontology
is an emerging concept, not only in the field of computer science but also medicine and
other areas. We will first talk about what is semantic web, what is ontology? Then we
will see an example of ontology and their design using an ontology building tool. Later
we will discuss its advantages and how it helps in designing a database design and also
how it helps in improving a database design. Ontology is a specification of a
Conceptualization and is the basic structure around which we can build a knowledgebase.
There are various tools available for developing ontologies and protg is one of the
most widely used ontology development editor which facilitates defining ontology
concepts (classes), properties, taxonomies(or concept hierarchy), and various restrictions,
as well as class instances and which supports several ontology representation languages,
including OWL.
2. SEMANTIC WEB
2.1 Introduction
The Semantic Web is a mesh of information linked up in such a way as to be
easily processable by machines, on a global scale. We can think of it as being an
efficient way of representing data on the World Wide Web, or as a globally linked
database. Semantic Web provides a common framework that allows data to be
shared and reused across application, enterprise, and reused across community
boundaries.
It is the new-generation Web that tries to represent information such that it can
be used by machines not just for display purposes, but for automation,
integration, and reuse across applications [2].It allows the representation and
exchange of information in a meaningful way [1].
The Semantic Web has been developing a layered architecture, was presented at
XML2000 by Tim Berners-Lee, director of the World Wide Web Consortium
(W3C) [1]:
Trust
Logic And Proof
Ontology
RDF Schema
RDF
XML Schema
XML
Unicode And URI
Fig 2.1.1: Semantic Web Layered Architecture
Unicode and URI: Unicode, the standard for computer character
representation, and URIs, the standard for identifying and locating resources
(such as pages on the Web), provide a baseline for representing characters used in
most of the languages in the world, and for identifying resources.
XML: XML and its related standards, such as Namespaces, and Schemas, form a
common means for structuring data on the Web but without communicating the
meaning of the data. These are well established within the Web already.
Resource Description Framework: RDF is the first layer of the Semantic Web
proper. RDF is a simple metadata representation framework, using URIs to
identify Web-based resources and a graph model for describing relationships
between resources. Several syntactic representations are available, including a
standard XML format.
RDF Schema: a simple type modeling language for describing classes of
resources and properties between them in the basic RDF model. It provides a
simple reasoning framework for inferring types of resources.
Ontologies: a richer language for providing more complex constraints on the
types of resources and their properties.
3 |Ontology and Database Design
The Semantic Web is best understood in comparison to the World Wide Web.
application consumption. The statements link together via constructs that can
form semantics, the meaning of the link. Thus, link semantics provide a defined
meaningful path rather than a user-interpreted one. The statements may also
contain logic that allows further interpretation and inference of the statements.
3. ONTOLOGY
3.1 Introduction
In simple terms, Ontology can be defined as a common vocabulary to share
information about a specific domain. This helps in understanding the concepts of
5 |Ontology and Database Design
the domain as well as helps the machine to interpret the definitions of concepts in
the domains and also the relations between them [8]. Ontology together with a set
of individual instances of classes constitutes a knowledge base [1]. Basic building
blocks of ontology design include:
classes or concepts
restrictions on slots
Ontology
sun
sun
sun
+
Intelligent
Services
Web of Information
(Current Web)
sun
sun
sun
Web of Knowledge
(Future Web)
Classes/Concepts
Relations/Properties/Slots/Attributes/Roles
Ontology
Structure
Instances/Individuals
Formal Axioms
the basis of their levels of generality, these three layers correspond to three
different types of ontologies, namely:
independent of what are the toppings). The programs use ontology for inductive
reasoning,
classification,
problem
solving
techniques,
facilitating
Buyer
Auction
System
Toys
Purchase goods
Fig 3.7.1: Auction Ontology (created using [5])
p
a ym
en
t
products
Seller
in
entering
valid
instances.
executing
queries,
etc.
4) Model exchange Protg supports models like XML, UML, RDF. The
models can be loaded and saved in any of these formats.
The figures below show how to define terms and relationships using Protg.
Figure 3 shows the terms in the computer science ontology and figure 4 shows
the relationships between these terms. Figure 4 shows the property editor for
defining the relationships and the constraints on the relationships and terms can
also be defined using Protg. The constraints on the terms can be seen in the
class editor. The blue tagged properties are the relationships between the terms.
Partial Ontology
The advantages of Protg that makes it a useful tool for ontology designers
as well as programmers are that for a designer this tool helps in finding mistakes
in an ontology which is similar to debugging a code. Designers can find
inconsistencies in the relationships using the tool. This tool is helpful for
programmers/developers
in
lot
of
ways
like
[3]
1) Developers can create a plug-in and test how the system works if an ontology
changes.
2) Plug-in provides an open testing framework. The code that is similar to the
JUnit test cases can be executed.
4. DATABASE DESIGN
4.1 Introduction
The process to produce a detailed data model of a database is known as
designing a database. The data model contains the entities, their attributes and
relationships with other entities [2]. The initial steps in designing a database
include determining the data stored in the database, determining the relationships
between the data elements [2].
A database designer finds difficulty in designing a database as a good design
depends on his understanding of the user requirements. The other problem is that
he might not have the required knowledge of the domain that is being modeled.
synonym, and related-to [10; 11]. As stated earlier ontology can capture the
integrity constraints needed for a database design.
Specifying the target domain and the initial domain knowledge: A target
domain needs to specified, for example an auction shown in section 3.7 is an
example of a domain.
Identifying terms: The terms in the application domain and their relationship
need to be identified and also domain relationships. For example, terms in
auction ontology are Person, Products, etc. A relationship in this ontology will
Analyzing the extracted features: When the extracted concepts are analyzed,
we are able to identify which terms belong together based on the frequency of
their appearances together. The relationships between the terms can be
generated by examining how close the terms appeared to each other.
The created ontology contains concepts and terms that usually involve binary
The steps for creating an E-R model using ontology are [5]:
Identifying the initial user terms The potential entities and transitive
the ontology. The best use of the term can be inferred from ontology by
The designer started by looking at the application requirements. The initial set of
terms and domain terms are then identified. Consistency checking is done in
order to find consistent terms and an E-R model is created. Fig.4.5.1 also shows
the inputs or information required during the process and what models are created
in the steps. The consistency check interacts with the designer during the process
as the designer may override inconsistencies.
development.
Study of various researches published in this area.
Comparison of Database Design with traditional methodology and with Ontology.
Identification of various tools for designing ontology.
Creating Auction Ontology using Protg 3.1.
Finding the valid information from the ontology.
6. CONCLUSION
A database design process begins with identifying the users requirements. The
requirements represent a users view of the application domain. From these requirements
23 |Ontology and Database Design
we identify a set of terms, definitions, relationships, and constraints for that application
domain. As stated in the paper a design is mainly two steps: creation and validation. We
can see that ontology helps in doing both the steps in an efficient manner and is also less
time consuming. We saw that with the help of ontology, containing knowledge about a
specific domain, we can create more complete and consistent database designs for that
application domain. It helps in both cases where we have a new model or an existing
model.
7. BIBLIOGRAPHY
[1] Noy and McGuinness ,Ontology Development 101: A Guide to Creating Your First
Ontology,
Stanford
University,
http://protege.stanford.edu/publications/ontology_development/ontology101-noymcguinness.html
[2] Database Design, http://en.wikipedia.org/wiki/Database_design
[3] Protg, http://protege.stanford.edu/
[4] Ontologies, AAAI, http://www.aaai.org/AITopics/html/ontol.html#cw
[5] Sugumaran and Storey, The Role of Domain Ontologies in Database
Design: An Ontology Management and Conceptual Modeling Environment, ACM
Transactions on Database Systems, Vol. 31, No. 3, September 2006, Pages 10641094.
[6] Sugumaran and Storey, An Ontology-Based Framework for Generating and
Improving Database Design, Springer-Verlag Berlin Heidelberg 2002
[7] Storey, V.C., Goldstein, R.C., Chiang, R.H.L., Dey, D., and Sundaresan, S.,
Database Design with Common Sense Business Reasoning and Learning, ACM
Transactions on Database Systems, Vol.22, No.4, December, 1997, pp.471-512.
[8] Chandrasekaran, Josephson, Benjamins. "What are Ontologies and Why do we need
them". IEEE Intelligent Systems, Jan/Feb 1999.
[9] Sowa, J.F., Ontology, metadata, and semiotics, Lecture Notes in AI #1867,
Springer-Verlag, Berlin, 2000, pp. 55-81.
[10] Mcguinness et al, Ontologies come of age, MIT Press, Cambridge, MA, 2002.
[11] Gomez-Perez, Tutorial on ontological engineering, Proceedings of the 16th
International Joint Conference on Artificial Intelligence, 1999, Sweden (July/Aug.).
[12] Time Berners-Lee, The Semantic Web Revisited, IEEE Intelligent Systems, 2006
[13] Teorey, Yang and Fry. A Logical Design Methodology for Relational Databases
Using the Extended Entity-Relationship Model, ACM Computing Surveys Vol 18 Issue
2, 1986, pp. 197-222.
[14] Sanjay Kumar Malik, Nupur Prakash, SAM Rizvi , Ontology Revisited: Design
and Development