{"id":2521,"date":"2014-04-30T11:00:34","date_gmt":"2014-04-30T10:00:34","guid":{"rendered":"https:\/\/irsg.bcs.org\/informer\/?p=2521"},"modified":"2014-04-30T11:00:34","modified_gmt":"2014-04-30T10:00:34","slug":"towards-search-standardisation","status":"publish","type":"post","link":"https:\/\/archive-irsg.bcs.org\/informer\/?p=2521","title":{"rendered":"Towards Search Standardisation"},"content":{"rendered":"<p><img loading=\"lazy\" decoding=\"async\" class=\"alignright size-medium wp-image-2555\" src=\"https:\/\/irsg.bcs.org\/informer\/wp-content\/uploads\/mumia-300x139.jpg\" alt=\"\" width=\"300\" height=\"139\" \/> The EU-funded COST Network IC1002 (<a href=\"http:\/\/www.mumia-network.eu\/\">http:\/\/www.mumia-network.eu\/<\/a> ) is a four year (2010-2014) networking programme which aims to promote collaboration between researchers and professionals working on Multilingual and Multifaceted Information Access (MUMIA), principally in Information Retrieval, Machine Translation and related topics.  More than 250 scientists and professionals from 28 COST countries and 4 non COST countries participated in the Action activities during its operation.\u00a0 One of the areas in which the network is active is the development of standards for search systems.  This networking activity was mainly motivated by previous work and discussions that were developed inside the network about integrating various Information Retrieval and Natural Language Processing (IR\/NLP) technologies.  During early 2013 two internal Working Group meetings and a Workshop (at ECIR in Moscow) were organised to discuss the problems of Integrating IR\/NLP tools for professional search systems.<\/p>\n<p><!--more--><\/p>\n<p>These meetings and discussions during the workshop revealed the need for standards and protocols and led to the organization of another Working Group meeting to specifically discuss this challenge. This initial meeting on this topic was held in Thessaloniki, Greece on the 19 November 2013.  The meeting had the following objectives:<\/p>\n<ul>\n<li>Bring together different stakeholders to discuss and prioritize the \tneeds and challenges for developing standards and protocols in the \tdomain of search technologies.<\/li>\n<li>Explore the best ways \tto launch a standardization initiative for creating standards and \tprotocols for integrating  IR\/NLP search tools and technologies<\/li>\n<\/ul>\n<p>It is expected that the development of suitable standards will make it easier for researchers and practitioners to exchange software and to adapt functionalities from one system to another when building applications. At that meeting two sorts of standards were identified: Component-based standards (focussing on API\u2019s and the like) using massive decomposition of search systems and Conceptual-based standards, focussing more on defining concepts and data structures to facilitate \u201cstandard- enabled\u201d exchange of information between search\/NLP tools and also on coordination architectures and overall capabilities. Some invited participants with previous experience of getting standards approved and adopted provided the active MUMIA members with a great deal of useful information about standards bodies, the way they operate, and how standardisation activities work in practice. Topics which were discussed included:<\/p>\n<ol>\n<li>Scope \tand objectives of a potential standard in this area.<\/li>\n<li>Are we seeking system or conceptual standardization? Functional \tdecomposition: which components \/ interfaces \/ APIs would offer \tthemselves for standardization?<\/li>\n<li>Which standards exist that we would need to consider?<\/li>\n<li>IR\/NLP technologies that should be first prioritized in view of \tdeveloping such technical standard and protocols.<\/li>\n<li>Domains in which standards could usefully be adopted (e.g. web \tsearch, patent, medical, bibliographic etc) and how they would \tbenefit from standards.<\/li>\n<li>Experiences about using other standards in search systems \tdevelopment (e.g. open search protocol).<\/li>\n<\/ol>\n<p>A follow up meeting was organised in Amsterdam (Netherlands) co-located with ECIR 2014 on 11 and 12 April 2014.<\/p>\n<figure id=\"attachment_2559\" aria-describedby=\"caption-attachment-2559\" style=\"width: 300px\" class=\"wp-caption alignright\"><img loading=\"lazy\" decoding=\"async\" class=\"size-medium wp-image-2559\" src=\"https:\/\/irsg.bcs.org\/informer\/wp-content\/uploads\/10003171_10152376736049533_3491448404290361273_n-300x225.jpg\" alt=\"\" width=\"300\" height=\"225\" \/><figcaption id=\"caption-attachment-2559\" class=\"wp-caption-text\">MUMIA Meeting at ECIR 2014 in Amsterdam<\/figcaption><\/figure>\n<p>In the Amsterdam meeting there were 35 participants from 20 countries, including David Fisher from the LEMUR project, Iadh Ounis and Craig Macdonald from Terrier project, and Peter Mika from Yahoo! Efforts had been made to involve participants from Bing!, Google, Yandex, ElasticSearch and the Lucene\/SOLR community, but unfortunately these were unsuccessful, although some expressed significant interest in the standards activity.<\/p>\n<p>The meeting\u2019s principle mode of operation was in the <a href=\"http:\/\/www.theworldcafe.com\/method.html\">World Caf\u00e9 model<\/a>. We began with some introductory talks, covering the conclusions of the previous meeting, the experience of the development of search systems, and an introduction to the World Caf\u00e9 process. We moved on to a brainstorming session involving the whole group in which we identified, first of all, a long list of candidate topics we could discuss, and then a weeded and merged list of five specific topics on which we would focus of the rest of the meeting. Those topics were:<\/p>\n<ol>\n<li>Content Representation &amp; Text \tProcessing<\/li>\n<li>Indexing<\/li>\n<li>Input-Output and Adaptability<\/li>\n<li>Retrieval<\/li>\n<li>Low Hanging Fruits<\/li>\n<\/ol>\n<p>(Please bear with us if these topic labels are little obscure \u2013 they were more meaningful in the context of the meeting!).<\/p>\n<p>Each of the topics was assigned to a discussion table, with a host who on this occasion also acted as a recorder. Iadh Ounis assisted by Craig McDonald, Mike Salampasis, Fernando Loizides, Parth Gupta, and David Fisher kindly volunteered to act as table hosts respectively for each topic from the five identified. We had &#8220;switching times&#8221; every 20 minutes, and so participants spent 20 minutes at each table discussing each of the topics, with the table hosts summarising the conclusions of previous rounds at that table. People were encouraged to move fairly randomly between tables rather than stick together as a group. In this way everyone was able to contribute in every topic, to spot the key issues\/inhibitors in each of the identified topic. The first day concluded with a brief summary of the common themes and conclusions which had emerged during discussion at each table.<\/p>\n<p>The table hosts wrote up a short report on their topic overnight, and the early part of the second day was spent finalising these reports with participants.  We plan to more extensively work on these short reports and create a more comprehensive paper reporting about the outcomes of the two Working Group meetings. The standards meeting concluded mid-morning with a plenary session on next steps and actions. During this plenary session the decision which was taken in Thessaloniki to produce a \u201cwhite paper\u201d on standards and protocols for search systems was reinforced. The white paper will:<\/p>\n<ol type=\"a\">\n<li>describe best \tpractices to be potentially adopted from stakeholders of search \tindustry and also academia wanting to increase re-usability and \tinteroperability of their tools, and<\/li>\n<li>Propose \trecommendations for a future formal standardization activity.<\/li>\n<\/ol>\n<p><a name=\"_GoBack\"><\/a> A working draft of the White Paper and a programme for its development will appear on the Mumia Web Site (http:\/\/www.mumia-network.eu\/)  by June 2014.<\/p>\n<p>Acknowledgement: This piece was co-authored by John Tait.<\/p>\n<div>\n<p>&nbsp;<\/p>\n<\/div>\n<p>&nbsp;<\/p>\n","protected":false},"excerpt":{"rendered":"<p>The EU-funded COST Network IC1002 (http:\/\/www.mumia-network.eu\/ ) is a four year (2010-2014) networking programme which aims to promote collaboration between researchers and professionals working on Multilingual and Multifaceted Information Access (MUMIA), principally in Information Retrieval, Machine Translation and related topics. More than 250 scientists and professionals from 28 COST countries and 4 non COST countries&hellip; <a class=\"more-link\" href=\"https:\/\/archive-irsg.bcs.org\/informer\/?p=2521\">Continue reading <span class=\"screen-reader-text\">Towards Search Standardisation<\/span><\/a><\/p>\n","protected":false},"author":33,"featured_media":0,"comment_status":"closed","ping_status":"closed","sticky":false,"template":"","format":"standard","meta":{"footnotes":""},"categories":[201,208],"tags":[],"class_list":["post-2521","post","type-post","status-publish","format-standard","hentry","category-feature-article","category-spring-2014","entry"],"_links":{"self":[{"href":"https:\/\/archive-irsg.bcs.org\/informer\/index.php?rest_route=\/wp\/v2\/posts\/2521","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/archive-irsg.bcs.org\/informer\/index.php?rest_route=\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/archive-irsg.bcs.org\/informer\/index.php?rest_route=\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/archive-irsg.bcs.org\/informer\/index.php?rest_route=\/wp\/v2\/users\/33"}],"replies":[{"embeddable":true,"href":"https:\/\/archive-irsg.bcs.org\/informer\/index.php?rest_route=%2Fwp%2Fv2%2Fcomments&post=2521"}],"version-history":[{"count":0,"href":"https:\/\/archive-irsg.bcs.org\/informer\/index.php?rest_route=\/wp\/v2\/posts\/2521\/revisions"}],"wp:attachment":[{"href":"https:\/\/archive-irsg.bcs.org\/informer\/index.php?rest_route=%2Fwp%2Fv2%2Fmedia&parent=2521"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/archive-irsg.bcs.org\/informer\/index.php?rest_route=%2Fwp%2Fv2%2Fcategories&post=2521"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/archive-irsg.bcs.org\/informer\/index.php?rest_route=%2Fwp%2Fv2%2Ftags&post=2521"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}