Bug 272 - Missing original files in corpus db.
Summary: Missing original files in corpus db.
Alias: None
Product: Corpus
Classification: Unclassified
Component: Text corpus infrastructure
Version: unspecified
Hardware: All All
Importance: P2 - As soon as possible normal
Assignee: Børre Gaup
Depends on:
Reported: 2006-04-08 17:18 CEST by Saara Huhmarniemi
Modified: 2015-03-16 14:16 CET

Description Saara Huhmarniemi 2006-04-08 17:18:06 CEST
Some files in the gtbound hierarchy have no counterpart in orig. For example, orig/sme/facta contains 70 pdf, html and doc files in total, but the corresponding gtbound directory contains 119 xml files. The laws and news directories each contain about 10 files without an original. The xml files are dated Dec 27 and Nov 24.

Perhaps the original files were removed or renamed, and the generated files were left behind?
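
A check like the one described above could be scripted roughly as follows. This is only a sketch under assumptions that the report does not confirm: that each generated file sits at the same relative path as its original, and that its name is the original file name with ".xml" appended (for example foo.pdf becomes foo.pdf.xml). The function name find_orphans and its parameters are hypothetical.

```python
from pathlib import Path


def find_orphans(orig_dir, bound_dir, suffix=".xml"):
    """List generated files under bound_dir that have no original under orig_dir.

    Assumption (not confirmed by the report): a generated file has the same
    relative path as its original, with `suffix` appended to the file name.
    """
    orig_dir, bound_dir = Path(orig_dir), Path(bound_dir)
    orphans = []
    for generated in sorted(bound_dir.rglob("*" + suffix)):
        rel = generated.relative_to(bound_dir)
        # Strip the trailing suffix to recover the assumed original name.
        original = orig_dir / rel.parent / rel.name[: -len(suffix)]
        if not original.is_file():
            orphans.append(generated)
    return orphans
```

It could be called as find_orphans("orig/sme/facta", "gtbound/sme/facta"), where the second path is a guess at the layout of the corresponding gtbound directory. The result is a list of candidates to review before deleting anything, since a different naming scheme would make every file look orphaned.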
Comment 1 Børre Gaup 2006-05-09 11:33:25 CEST
They come from original files that were later renamed, and should be removed.
Comment 2 Saara Huhmarniemi 2006-05-13 10:43:19 CEST
They are now removed.