Skip to main content

OWL schema matching

Abstract

Schema matching is a fundamental issue to many database applications, such as query mediation and data warehousing. It becomes a challenge when different vocabularies are used to refer to the same real-world concepts. In this context, a convenient approach, sometimes called extensional, instance-based, or semantic, is to detect how the same real world objects are represented in different databases and to use the information thus obtained to match the schemas. Additionally, we argue that automatic approaches of schema matching should store provenance data about matchings. This paper describes an instance-based schema matching technique for an OWL dialect and proposes a data model for storing provenance data. The matching technique is based on similarity functions and is backed up by experimental results with real data downloaded from data sources found on the Web.

References

  1. 1.

    Bechhofer S, van Harmelen F, Hendler J, Horrocks I, McGuinness DL, Patel-Schneider PF, Stein LA (2004) OWL web ontology language reference. W3C recommendation. Last access on Dec 2008 at: http://www.w3.org/TR/owl-ref/

    Google Scholar 

  2. 2.

    Bilke A, Naumann F (2005) Schema matching using duplicates. In: Proceedings of the 21st international conference on data engineering, pp 69–80

    Google Scholar 

  3. 3.

    Brauner DF, Casanova MA, Milidiú RL (2006) Mediation as recommendation: an approach to the design of mediators for object catalogs. In: On the move to meaningful internet systems 2006: OTM 2006 workshops. Lecture notes in computer science, vol 4277. Springer, Berlin, pp 46–47

    Chapter  Google Scholar 

  4. 4.

    Brauner DF, Casanova MA, Milidiú RL (2007) Towards gazetteer integration through an instance-based thesauri mapping approach. In: Advances in geoinformatics; VIII Brazilian symposium on geoinformatics (GEOINFO), pp 235–245

    Google Scholar 

  5. 5.

    Brauner DF, Gazola A, Casanova MA (2008) Adaptive matching of database web services export schemas. In: Proceedings of the 10th international conference on enterprise information systems (ICEIS), pp 49–56

  6. 6.

    Brauner DF, Intrator C, Freitas JC, Casanova MA (2007) An instance-based approach for matching export schemas of geographical database Web services. In: Proceedings of the IX Brazilian symposium on geoinformatics (GEOINFO), pp 109–120

  7. 7.

    Casanova M, Breitman K, Brauner D, Marins A (2007) Database conceptual schema matching. Computer 40(10):102–104

    Article  Google Scholar 

  8. 8.

    Castano S, Ferrara A, Montanelli S, Racca G (2004) Semantic information interoperability in open networked systems. In: Proceedings of semantics for grid databases, first international IFIP conference (ICSNW). Lecture notes in computer science, vol 3226. Springer, Berlin, pp 215–230

    Google Scholar 

  9. 9.

    Chomicki J, Saake G (1998) Description logics for conceptual data modeling. In: Logics for databases and information systems. Springer, Berlin, chapter 8

    Chapter  Google Scholar 

  10. 10.

    Codd EF (1970) A relational model of data for large shared data banks. Commun ACM 13(6):377–387

    Article  Google Scholar 

  11. 11.

    Doan A, Domingos P, Halevy AY (2001) Reconciling schemas of disparate data sources: a machine-learning approach. In: Proceedings of the 2001 ACM SIGMOD international conference on management of data, vol 30, pp 509–520

  12. 12.

    Duchateau F, Bellahsène Z, Hunt E (2007) XBenchMatch: a benchmark for XML schema matching tools. In: Proceedings of the 33rd international conference on very large data bases, demo sessions: group 1, pp 1318–1321

  13. 13.

    Euzenat J, Shvaiko P (2007) Ontology matching. Springer, Berlin

    Google Scholar 

  14. 14.

    Gazola A (2008) A software infrastructure for catalog matching. Master’s thesis, Departamento de Informática, PUC-Rio

  15. 15.

    Gazola A, Brauner D, Casanova MA (2007) A mediator for heterogeneous gazetteers. In: Poster session of the 22nd Brazilian symposium on database

  16. 16.

    Horrocks I, Patel-Schneider PF, Boley H, Tabet S, Grosofand B, Dean M (2004) SWRL: A semantic web rule language combining OWL and RuleML. W3C member submission. Last access on Dec 2008 at: http://www.w3.org/Submission/SWRL/

  17. 17.

    Leme LAP, Brauner DF, Breitman KK, Casanova MA, Gazola A (2008) Matching object catalogues. J Innov Syst Softw Eng 4(4):315–328

    Article  Google Scholar 

  18. 18.

    Leme LAPP (2009) Conceptual schema matching based on similarity heuristics. DSc thesis (Advisor: Casanova MA), Department of Informatics, Pontifical Catholic University of Rio de Janeiro

  19. 19.

    Leme LAPP, Casanova MA, Breitman KK, Furtado AL (2008) Evaluation of similarity measures and heuristics for simple RDF schema matching. Monografias em Ciência da Computação MCC44/08, Department of Informatics, Pontifical Catholic University of Rio de Janeiro

  20. 20.

    Leme LAPP, Casanova MA, Breitman KK, Furtado AL (2009) Instance-based OWL schema matching. In: Proceedings of the 11th international conference on enterprise information systems. Lecture notes in business information processing, vol 24. Springer, Berlin, pp 14–26

    Google Scholar 

  21. 21.

    Madhavan J, Bernstein P, Doan A, Halevy A (2005) Corpus-based schema matching. In: Proceedings of the 21st international conference on data engineering, pp 57–68

  22. 22.

    OMG (2009) OMG unified modeling language, superstructure

  23. 23.

    Prud’hommeaux E, Seaborne A (2008) SPARQL query language for RDF. W3C recommendation. Last access on Dec 2008 at: http://www.w3.org/TR/rdf-sparql-query

  24. 24.

    Quine WV (1968) Ontological relativity. J Philos 65(7):185–212

    Article  Google Scholar 

  25. 25.

    Rahm E, Bernstein P (2001) A survey of approaches to automatic schema matching. VLDB J 10(4):334–350

    Article  Google Scholar 

  26. 26.

    Tversky A, Gati I (1978) Studies of similarity. Cogn Categ 1:79–98

    Google Scholar 

  27. 27.

    Wang J, Wen J, Lochovsky F, Ma W (2004) Instance-based schema matching for web databases by domain-specific query probing. In: Proceedings of the 13th international conference on very large data bases, pp 408–419

Download references

Author information

Affiliations

Authors

Corresponding author

Correspondence to Luiz André P. Paes Leme.

Rights and permissions

Open Access This article is distributed under the terms of the Creative Commons Attribution 2.0 International License ( https://creativecommons.org/licenses/by/2.0 ), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.

Reprints and Permissions

About this article

Cite this article

Leme, L.A.P.P., Casanova, M.A., Breitman, K.K. et al. OWL schema matching. J Braz Comput Soc 16, 21–34 (2010). https://doi.org/10.1007/s13173-010-0005-3

Download citation

Keywords

  • Schema matching
  • OWL
  • Similarity
  • Provenance