What do the big players offer: Microsoft, IBM, Oracle, ...
Metadata and classification
clustering and classification
clustering versus classification
classification in categories versus taxonomies
editing and managing categories/taxonomies
rule-based decision trees
k-nearest neighbour
support vector machine
Stratify Classification Server
Case Study: Belga News Agency
News is Belga's raw material and final product: its role is to classify,
synthesize, package, distribute and document both text and multimedia sources.
Customers want a fast or even real-time service, tailored to their needs -
a business case for automation is quickly made.
Belga uses a mix of search solutions: Oracle Intermedia, BRS, a third-party
(customized) MySQL search, and Autonomy. Stefaan Melis will explain
the search methods Belga uses for its distinct content sources, and discuss
the combined search solution that was implemented in a portal for news editors.
how search and navigation were designed with different kinds of users in mind (professionals versus citizens,
people looking for a specific document versus people looking for information using vague search terms...)
metadata is used by the team of editorialists (adding several tens of documents per day)
the task of adding metadata is made lighter by automatic summarization and attribution of keywords
how important the thesaurus and its continuing maintenance are
the automatic classification, indexing and search tools were selected and implemented
Roundup of this seminar, Conclusions & Summary, Final Questions and Answers