-
Notifications
You must be signed in to change notification settings - Fork 59
Web entity codebook
The web entity codebook is a configuration file which set for the corpus the way the users want to describe the web entities.
This is the reference which will be used through the system to handle metadata.
It will be first used by the user interface to dynamically built the metadata interface.
It will also be used by the memory structure as a specification about the metadata field of the web entity index
The codebook is a set of field.
Each field as this set of parameters :
- name String
- multiple boolean
- stock_type choice in ["string",]
- default_value choice in ["string",]
- interface_position int
- interface_mandatory boolean
- interface_vdex vocabIdentifier (see below)
- interface_input_type choice in ["tag","]
( name : language multiple : false stock_type : string default_value : french ), ( name : "plateform type" multiple : false stock_type : string default_value : "website" ),
VDEX is a good candidate for the vocabulary definition exchange : http://www.imsglobal.org/vdex/ http://en.wikipedia.org/wiki/IMS_VDEX
A crosswalk dictionnary is used to set the translation needed for normed webservices to pull data from the corpus. It will set the correspondance between :
- the system-built metadata and the user metadata from the codebook
- and a normed metadata format such as http://fr.wikipedia.org/wiki/Dublin_Core|Dublin-core
The codebook should be written in a strctured open format such as XML or JSON.
The memory structure will use the codebook to set/check the specification of the web entity metadata field
the crosswalk will be used by the metadata harvesting webservices to set the right format from the user codebook.