Grid@CLEF
Grid@CLEF is an activity of the Cross-Language Evaluation Forum (CLEF), which is launching a new pilot track in the CLEF 2009 campaign. Information about the objectives, the task, the organization, and the enrolment procedure follows; for more information and updates, please visit the Grid@CLEF Web site at:
Objectives
Multilingual information access (MLIA) is increasingly part of many complex systems, such as digital libraries, intranet and enterprise portals, and Web search engines. The Cross-Language Evaluation Forum (CLEF) research community has been very active in designing, developing, and testing MLIA methods and techniques, constantly improving the performance of such components. But is this enough? Do we really know how MLIA components (stop lists, stemmers, IR models, relevance feedback, translation techniques, etc.) behave with respect to languages? Do we have a deep understanding of how these components interact when the language changes?
We are thus launching a cooperative effort in which a series of large-scale, systematic grid experiments will allow us to improve our understanding of MLIA systems and gain an exhaustive picture of their behaviour with respect to languages. In this way, we can exploit the valuable resources and experimental collections made available by CLEF over the years to gain more insight into the effectiveness of the various weighting schemes and retrieval techniques with respect to the languages, and to disseminate this knowledge to the relevant application and developer communities.
Task
This first-year task focuses on monolingual retrieval, i.e. querying topics against documents in the same language as the topics, in five European languages:
The selected languages will allow participants to test both Romance and Germanic languages, as well as languages with word compounding issues. Moreover, these languages have been extensively studied in the MLIA field, so it will be possible to compare and assess the outcomes of the first-year experiments against the existing literature. The reference scenario for Grid@CLEF 2009 concerns an IR system which consists of:
Instead of directly feeding the next component, as usually happens in a monolithic IR system, each component in the Grid@CLEF task is required to read its input from, and write its output to, XML files in a well-defined format. This choice allows participants to exchange these XML files and to build a whole experiment by chaining components that may belong to different IR systems. Therefore, the Grid@CLEF 2009 track has a twofold goal:
To facilitate participation in this first-year task, participants are required to take part in what we call the island mode, where all the components which constitute the IR system of the reference scenario are developed and run by the same participant. The participant is only requested to implement the XML messaging format for each of their own components and to publish all the intermediate results of these components on the online XML messaging exchange system. Participating in the Grid@CLEF 2009 pilot track is easy: you only need to join the island mode and produce as many experiments as possible.
Schedule
The tentative schedule for the Grid@CLEF 2009 track is as follows:
Track Coordinators
Advisory Committee
Enrolment
Registration for CLEF 2009 and enrolment in the Grid@CLEF 2009 pilot track open on 4 February. You can find more information on the main CLEF Web site at: http://www.clef-campaign.org/ under "CLEF 2009".

