Forward OGC® Testbed 11 Catalogue Service and Discovery Engineering Report

3 ebRIM Electronic business Registry Information Model OWL Web Ontology Language RDF Resource Description Framework SPARQL Semantic Web Query Language SWEET Semantic Web for Earth and Environmental Terminology DC Dublin Core PURL Persistent Universal Resource Locator URI Universal Resource Identifier

4.2 UML notation

Most diagrams that appear in this standard are presented using the Unified Modeling Language UML static structure diagram, as described in Subclause 5.2 of [OGC 06- 121r3].

4.3 Used parts of other documents

This document uses significant parts of documents OGC CSW 2.0.2 OGC 07-006r1 and OGC CSW 3.0 12-168r4. To reduce the need to refer to that document, this document copies some of those parts with small modifications. The copied part is cited in the document. 5 OGC IP Engineering Report on Catalogue Service Discovery overview This OGC Engineering Report on Catalogue Service Discovery compares the differences in the information model and in particular, the support to metadata discovery, between the CSW 2.0.2 and CSW 3.0. A clear advancement of CSW 3.0 is the detailed discussion on implementation and interface definition to support federated distributed search. We also examined different techniques to semantically enable the search and discovery component of the CSW catalogue. Figure 1 demonstrates the research questions being answered through this ER. 4 Figure 1. Research question to answer in the catalogue service ER We first conducted a systematic review and comparison of the two catalogue service standards. The comparison is generally based on the support capacity for open, distributed, and federated searches. These three search types are important for future catalogue implementation because 1 open search is widely supported by general search engines. Thus, enabling open search makes the metadata within the catalogue easily discoverable and shared; and 2 the rapid explosion of the Web makes it possible to connect physically distributed people and more importantly distributed geospatial assets. Geospatial data is an information resource known to be widely distributed in today’s computing paradigms. To support this broad distribution, various catalogue applications have been established. Well known ones include European Union’s INSPIRE 1 catalogue, the US data.gov 2 project initiated by the US White House, and the international effort GEOSS Global Earth Observation System of Systems 3 . These Web-based catalog services are hosted by different organizations and the metadata contained in them is most always on different topics. To increase the visibility of distributed geospatial dataservice resources, there is a need for a federated distributed search mechanism that can coordinate the data discovery process from not only one but many distributed catalogue instances. Many organizations or applications often require access to distributed catalogues. Each set of these catalogues is used in a federated search. In another words one federated search requires the support from a subset of all distributed catalogs, and different 1 http:inspire-geoportal.ec.europa.eu 2 http:data.gov 3 http:www.geoportal.orgwebguestgeo_home_stp