We need to store the license associated with each text as part of the metadata. This is essential for our end users and future researchers, so that they can both find the license and know how they may use the text in question. At present we have two types of texts:
- public texts (governmental etc.)
- texts licensed to SD for the corpus bank
Public texts can be used freely, including with regard to how much researchers are allowed to copy or cite from them, whereas the other texts are restricted according to the contracts we have.
The license type should also become an option in the web interface for uploading new texts, so that individuals giving texts to us can choose between a completely free usage model and our standard text licensing contract.
Corpus.dtd has been updated according to the decision made at the meeting on 14.11.2005. The following entry was added:
<!ELEMENT availability (free|license)>
<!ELEMENT free EMPTY>
<!ELEMENT license EMPTY>
<!ATTLIST license
    type (standard|other) "standard">
The availability is either free or restricted by a license. The license type defaults to the standard license. In some special cases there may be another type of license; the value "other" is reserved for that for the time being. If new license types are introduced, they should be added to the list.
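As a rough illustration of how a tool might read the availability entry described above, here is a minimal Python sketch. It is not taken from the actual xsl or cgi tools: the `document` root element, the sample strings and the function name `read_availability` are all assumptions made for the example. It also assumes the `type` attribute belongs to the `license` element. Note that Python's `xml.etree.ElementTree` does not apply DTD attribute defaults, so the sketch mirrors the "standard" default by hand.

```python
import xml.etree.ElementTree as ET

# Hypothetical sample documents. Only the availability element follows the
# DTD entry; the surrounding "document" root is an assumption.
FREE_DOC = "<document><availability><free/></availability></document>"
LICENSED_DOC = "<document><availability><license/></availability></document>"
OTHER_DOC = '<document><availability><license type="other"/></availability></document>'

KNOWN_LICENSE_TYPES = ("standard", "other")


def read_availability(xml_text):
    """Return ("free", None) or ("license", <type>) for one document."""
    root = ET.fromstring(xml_text)
    availability = root.find(".//availability")
    if availability is None:
        raise ValueError("no availability element found")

    if availability.find("free") is not None:
        return ("free", None)

    license_el = availability.find("license")
    if license_el is None:
        raise ValueError("availability must contain free or license")

    # ElementTree ignores DTD defaults, so fall back to "standard" here.
    license_type = license_el.get("type", "standard")
    if license_type not in KNOWN_LICENSE_TYPES:
        raise ValueError("unknown license type: " + license_type)
    return ("license", license_type)


if __name__ == "__main__":
    for doc in (FREE_DOC, LICENSED_DOC, OTHER_DOC):
        print(read_availability(doc))
```

If new license types are added to the DTD, the `KNOWN_LICENSE_TYPES` tuple in this sketch would need the same addition.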
I am leaving the bug open, since all the tools have to be updated as well.
The xsl- and cgi-tools have now been updated to handle the license information. The license type can be chosen in the web upload form or specified in the file-specific xsl-file.