ISWC 2010 Tutorial: Semantic Search. 7-11 November 20010. Shanghai, China
Miriam Fernandez, Knowledge Media Institute, Open University, UK
Thanh Tran Duc, Institute AIFB, KIT, DE
Peter Mika, Yahoo! Barcelona
Semantic search has attracted a large community of researchers, practitioners and industry people. As a result of its popularity, the term semantic search is used in different context with different meaning. Taking the variety of work published recently into account, semantic search is not one single type of application but rather, refers to a broad range of systems, which involve the use of semantics. In this tutorial, we aim to provide a comprehensive overview on the different types of semantic search systems, and discuss the differences in the techniques underlying them. Both the application of Semantic Web technologies to the IR problem and vice versa, the application of IR techniques to Semantic Web problem are covered by this tutorial. In particular, focus is given to four topics of semantic search which have attracted much interest recently. The first is one is Semantic- enabled Document Retrieval, i.e. the application of Semantic Web technologies to the IR problem. The second is Semantic Data Retrieval, which concerns with (the application of IR techniques to) the retrieval of semantic data. While the use of semantics is the essential theme in these two major components of the tutorial, Hybrid Search is a complementary part that illustrates the convergence of search paradigms. Last but not least, the user factor in search will be discussed. The aim of the tutorial is to shed some lights on semantic search and provoke ideas for future development.
In recent years we have witnessed tremendous interest and substantial economic exploitation of semantic search technologies, both in academia and bussiness.
Recent advances in the field of semantic search from the semantic technologies research area have resulted in tools and standards that allow search and ranking in large metadata stores (data retrieval).
In parallel to these developments, in the past years we have also seen the emergence of important results in semantically enhancing traditional keyword-based IR models to improve the processes of textual and multimedia information retrieval (document retrieval)
An intermediate approach exist that is focus on the integration of textual and structured semantic data (hybrid search)
These different semantic search research lines create a need for broad audiences to acquire a solid understanding of current semantic search technologies.
Semantic search can be seen as a retrieval paradigm that is centered on the use of semantics. When a system incorporates the semantics entailed by the query and (or) the resources into the matching process, it essentially performs semantic search. By this definition, there is no single type of semantic search but rather a wide range of systems, employing and exploiting semantic models of varying expressivity.
Several commercial initiatives have also showed up in the market aiming to exploit semantics in order to enhance traditional keyword-based search paradigms. Among those initiatives we can highlight the appearance of search engines like: Hakia, PowerSet, FreeBase, AskMeKnow, Digger, Bibtext, wolfram alpha, etc
In this tutorial, participants will learn about the different models embraced under the umbrella of semantic search, test several queries over public available semantic search systems and discuss the advantages and drawbacks of this new paradigm, as well as its future directions. We will explain the theoretical background and give hands-on, step-by-step instructions on different semantic search systems
The next section contain the slides presented in the tutorial. Parts 1 and 2 aim to be an introduction to semantic search. Parts 3 to 7 define the semantic search process. Part 8 focus on the evaluation initiatives. Part 9 focus on the results presentation. For any feedback, comment or error detection in the slides, please contact the authors. Thx! :)
Thanh Tran Duc Webpage:  "Thanh Tran is currently leading the AIFB semantic search special interest group. He has worked as consultant and software engineer for IBM and Capgemini. His research interests are centered around next generation search applications on the Web, where topics range from formal models for web resources (KR), storage and query processing concepts for RDF and text (DB & IR) and user interfaces for web search. His work is published in numerous top-level conference proceedings (SIGMOD, ICDE, WWW, CIKM, ISWC) and earned the second prize at the Billion Triple Challenge 2008. He runs the series of workshops on semantic search and is currently working as researcher and project leader for AIFB for the EU project X-Media, and the German projects iGreen and CollabKloud."
Miriam Fernandez Webpage:  "Dr Miriam Fernandez receives her BS, MSc (first student of promotion) and PhD (cum laude european mention) at Universidad Autonoma de Madrid, Spain. Her research interest are focused on Information Retrieval and Semanic Web, and more specifically on how semantic technologies can help to enhance current keyword-based retrieval models, including personalization and contextualization techniques. During her PhD period she performed several internships in academia (Knowledge Media Institute, UK, 6 months) and industry (Google Zurich, Switzerland, 6 months) where she continued her research in semantic search and search quality respectively. She has actively participate in several European projects (aceMedia, Mesh, XMedia) and produce publications in top-level conference proceedings(ECIR, ESWC, WWW, ICSC) and journals(TKDE, TCSVT). Miriam joined the Knowledge Media Institute (KMi) in April 2009, where she is working on semantic search models and evaluation methodologies.
Peter Mika Peter received his Ph.D. degree (cum laude) at the Vrije Universiteit, Amsterdam. His interdisciplinary work in the field of Social Networks and the Semantic Web earned a Best Paper Award at the International Semantic Web Conference in Galway, 2006 and a First Prize at the Semantic Web Challenge of 2005. He is also author of the book “Social Networks and the Semantic Web”, published in 2007 by Springer Verlag. Peter is also the youngest elected member of the board of the Journal of Web Semantics. From November, 2005 he was the co-chair of the Semantic Web Challenge, together with Michael Uschold (2005), Jennifer Golbeck (2006), Jim Hendler (2007) and Chris Bizer (2008).