Skip to main content

An information retrieval application using ontologies


Searching for information in long videos can be a time-consuming experience. In this paper, we describe OnAIR, an ontology-aided information retrieval system applied to retrieve clips from video collections.

We used a video collection compiled from interviews with Ana Teixeira, a Brazilian artist. The interviews were made by Paula P. Braga, the domain expert. The interview is developed in the domain of contemporary art and the system uses a domain ontology to expand the queries with related terms. We tested the system with a battery of queries, and we veri.ed that the ontology contributes to the e.ciency improvement in terms of the relevance of retrieved documents. We designed the system to work in a domain-independent way, allowing us to move to other domains by just changing the underlying ontologies and video collections.


  1. [1]

    T. Andreasen, J. Nilsson, and H. Thomsen. Ontologybased querying. InProceedings of the Fourth International Conference on Flexible Query-Answering Systems, pages 15–26, Warsaw, Poland, Agosto 2000.

  2. [2]

    R. Baeza-Yates and B. Ribeiro-Neto.Modern Information Retrieval. Addison Wesley Longman, 1999.

  3. [3]

    W. Bailer, H. Mayer, H. Neuschmied, W. Haas, M. Lux, and W. Klieber. Content-based video retrieval and summarization using MPEG-7. InProceedings of the Internet Imaging V, pages 1–12, San Jose, CA, USA, Janeiro 2004.

  4. [4]

    B. Bassett and Histor Systems. Conversation with Jacques Lipchitz: A breakthrough in interactivity, 2001.

  5. [5]

    M. G. Brown, J. T. Foote, Gareth J. F. Jones, K. Sparck-Jones, and S. J. Young. Automatic content-based retrieval of broadcast news. InProceedings of the 3rd ACM Multimedia Conference, pages 35–43, San Francisco, USA, Novembro 1995.

  6. [6]

    R. Burke, K. Hammond, V. Kulyukin, S. Lytinen, N. Tomuro, and S. Schoenberg. Natural language processing in the faq finder system: Results and prospects. Technical report, AAAI Spring Symposium, 2002.

  7. [7]

    World Wide Web Consortium. RDF Resource De.nition Framework, 2004. http://www.w3. org/RDF/.

  8. [8]

    J. Gennari, M. Musen, R. Fergerson, W. Grosso, M. Crubézy, H. Eriksson, N. Noy, and S. Tu. The evolution of Protégé-2000: An environment for knowledgebased systems development.International Journal of Human-Computer Studies, 58(1):89–123, 2003.

    Article  Google Scholar 

  9. [9]

    N. Guarino, C. Masolo, and G. Vetere. Ontoseek: Content-based access to the web.IEEE Intelligent Systems, 14(3):70–80, Maio 1999.

    Article  Google Scholar 

  10. [10]

    L. Hirschman and R. Gaizauskas. Natural language question answering: the view from here.Natural Language Engineering, 7(4):275–300, 2001.

    Article  Google Scholar 

  11. [11]

    E. Hyvönen, A. Styrman, and S. Saarela. Ontologybased image retrieval. InTowards the semantic web and web services, Proceedings of XML Finland 2002 Conference, pages 15–27, Finland, 2002.

  12. [12]

    P. Jackson and I. Moulinier.Natural Language Processing for Online Applications: Text Retrieval, Extraction, and Categorization. John Benjamins Publishing Co, 2002.

  13. [13]

    L. Khan.Ontology-based Information Selection. PhD thesis, Department of Computer Science, University of Southern California, 2000.

  14. [14]

    G. Kline. High-tech sculptor has the answers.The News-Gazette Online, October 2001. Published in WWW in October 2001: story.cfm?Number=10249.

  15. [15]

    H. Knublauch, M. Musen, and A. Rector. Editing description logics ontologies with the Protégé OWL plugin. InInternational Workshop on Description Logics, Whistler, BC, Canada, 2004.

  16. [16]

    D. Lin. An information-theoretic definition of similarity. InProceedings of the 15th International Conference on Machine Learning, pages 296–304, San Francisco, USA, 1998. Morgan Kaufmann Publishers Inc., 1998.

  17. [17]

    M. Mauldin.Conceptual Information Retrieval: A case study in adaptive partial parsing. Kluwer Academic Publishers, 1991.

  18. [18]

    B. McBride. Jena: Implementing the rdf model and syntax specification. InProceedings of the Second International Workshop on the Semantic Web, Hong Kong, China, May 2001.

  19. [19]

    G. Miller. Wordnet: a lexical database for english.Commun. ACM, 38(11):39–41, 1995.

    Article  Google Scholar 

  20. [20]

    V. Orengo and C. Huyck. A stemming algorithm for the Portuguese language. InProceedings of the 8th International Symposium on String Processing and Information Retrieval(SPIRE) 2001, pages 186–193, 2001. An implementation of the algorithm in C is available at: PhDArea/rslp/RSLP.htm.

  21. [21]

    C. Paz-Trillo, R. Wassermann, and F. Kon. A patternbased tool for learning design patterns. Technical Report RT-MAC-2005-04, Instituto de Matemática e Estatística, Universidade de São Paulo, 2005. Available in:≈cpaz/rt-mac-2005-04.pdf.

  22. [22]

    J. Rabelo. Pergunte! uma interface em português para pergunta-resposta na web. Master’s thesis, Informatics Center, Federal University of Pernambuco, Brazil, 2004.

    Google Scholar 

  23. [23]

    G. Salton, A. Wong, and C. S. Yang. A vector space model for automatic indexing.Commun. ACM, 18(11):613–620, 1975.

    MATH  Article  Google Scholar 

  24. [24]

    M. Smith, C. Welthy, and D. McGuiness. OWL Web Ontology Language Guide. Technical report, World Wide Web Consortium, 2004. 2004/REC-owl-guide-20040210/.

  25. [25]

    R. Ueda. Ispell Dictionary for Brazilian Portuguese: br.ispell, 2002. Available at ~ueda/br.ispell/.

  26. [26]

    C. J. van Rijsbergen.Information Retrieval. Butterworths, 2nd edition, 1979.

  27. [27]

    M. Worring, A. Bagdanov, J. v. Gemert, J-M. Geusebroek, M. Hoang, A.Th. Schreiber, C.G.M. Snoek, J. Vendrig, J. Wielemaker, and A.W.M. Smeulders. Interactive indexing and retrieval of multimedia content. InProceedings of the 29th Conference on Current Trends in Theory and Practice of Informatics, volume 2540 ofLecture Notes in Computer Science, pages 135–148, Milovy, Czech Republic, 2002. Springer-Verlag,2002.

Download references

Author information



Rights and permissions

Open Access This article is distributed under the terms of the Creative Commons Attribution 2.0 International License ( ), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.

Reprints and Permissions

About this article

Cite this article

Paz-Trillo, C., Wassermann, R. & Braga, P.P. An information retrieval application using ontologies. J Braz Comp Soc 11, 17–31 (2005).

Download citation

  • Issue Date:

  • DOI:


  • Ontologies
  • Information Retrieval
  • Video Retrieval
  • Query Expansion