Monday, April 4, 2011
08:30 AM - 04:45 PM
On a global scale, the Semantic Web promises a linked web of data. The same technology can be used on an enterprise scale to integrate data. But the realities of using Semantic Web standards for Enterprise Data Integration can be daunting, even to someone familiar with them. This tutorial explains how the simplicity of the RDF data model allows it to act as an interlingua between other linked data representations, providing a simple, unified paradigm for data integration. Information from any source format is translated into RDF in an information-preserving manner, without regard to specifics of its content. The resulting RDF is then merged with that of other data sources. Content-specific mappings are accomplished with SPARQL CONSTRUCT queries to identify patterns in the source data.
The tutorial will review the basics of RDF and SPARQL, emphasizing the features that make them particularly suitable for enterprise data integration. The bulk of the session will focus on data integration itself, including the following topics:
- Semantic Enterprise Data Integration lifecycle
- RDF basics, and why RDF is a good representation for integrating data
- The role of the URI in sharing data
- How RDF relates to familiar data representations: spreadsheets, XML, databases, email, etc.
- RDF and unstructured data
- SPARQL as a means for linking information
- Querying the integrated data
A Semantically-enabled Enterprise Database Management System marries the rich information discovery and data integration of semantic technologies with the mature, enterprise features of a commercial RDBMS. Applications in defense and intelligence, life sciences and clinical medicine, finance, publishing, and entertainment have RDF graphs running to tens of billions of triples, usually with associated relational, XML, text and/or spatial data. These applications require RDBMS-strength scalability, querying and security features with persistent beyond memory-scale inferencing. Learn how Oracle Database Semantic technologies has implemented scalable RDF storage, querying, and native inferencing that conforms with W3C standards and open source frameworks: RDF, RDFS, OWL 2, SPARQL, SKOS, Jena and Sesame, and SQL integration.
Dean Allemang, Chief Scientist at TopQuadrant Inc. is a frequent speaker at semantic technology conferences. He brings to this talk over 10 years experience working with customers on implementing solutions based on ontologies, with the last 5 years of his work focused on using the Semantic Web Standards. He developed the successful TopMIND Semantic Web training course, now in its fifth year with over 500 alumni. Along with his co-author Jim Hendler, Dean has just published Semantic Web for the Working Ontologist (Morgan-Kaufmann, 2008), a practitioner's guide to the Semantic Web.
Bill Beauregard is Senior Principal Product Manager for Oracle Database Semantic Technologies.