Add new comment

Standards, Schema and Protocols

Two sets of standards are relevant here. The first are those that allow data represented within BioCorder to become part of the Semantic Web. The second are those that BioCorder uses to store and exchange data with non-Semantic Web applications (e.g. the GBIF data portal). Data standards for the former are essentially those outlined by the W3C (World Wide Web Consortium) for the Semantic Web. Specifically these are URI’s (Universal Resource Identifiers, there are many kinds of these, BioCorder largely uses LSID’s), HTTP (Hypertext Transfer Protocol, the protocol that links up the web), and RDF (Resource Description Framework, the format for providing machine readable information on the Semantic Web). Internally specimen data within BioCorder is being developed around the Darwin Core, a simple exchange format for specimens that is compatible with GBIF. Other internationally recognised standards will be adopted as the project develops. However, in most cases these standards lack the breath required by the diverse data concepts stored by BioCorder. Indeed, limitations of standards like these were in part the motivation for the development of the Semantic Web. For further details see the BioCorder FAQ on "Limitations of existing standards to exchange data?"

Assembled below are a non-exhaustive list of schema and communication protocols of relevance to taxonomy and systematics:

  • Darwin Core2 - a set of data element definitions designed to support the sharing and integration of primary biodiversity data.
  • Access to Biological Collections Data (ABCD) - a standard for the access and exchanging of data about specimens and observations (primary biodiversity data).
  • Distributed Generic Information Retrieval (DiGIR) - a protocol for single point access to distributed data sources.
  • Global Biodiversity Information Facility (GBIF) - an organization facilitating the digitisation and global dissemination of primary biodiversity data.
  • Taxonomic Databases Working Group (TDWG) - an international collaboration to promote the wider and more effective dissemination of information "about the World's heritage of biological organisms for the benefit of the world at large".
  • Committee on Data for Science and Technology (CODATA) - a committee of the International Council for Science (ICSU), working to improve the quality, reliability, management and accessibility of data to all fields of science and technology.
  • A Biological Collection Access Service for Europe (BioCASE) - a web-based information service providing researchers with unified access to biological collections in Europe.
  • BioMoby - an international research project involving biological data hosts, biological data service providers, and coders who are exploring methodologies for biological data representation, distribution, and discovery.
  • Reply


    *

    • Allowed HTML tags: <a> <em> <strong> <cite> <code> <ul> <ol> <li> <dl> <dt> <dd>
    • Lines and paragraphs break automatically.