Lexicography for IBM Developing Norwegian Linguistic Resources in the 1980s

被引:0
|
作者
Engh, Jan [1 ]
机构
[1] Oslo Univ Lib, Oslo, Norway
来源
关键词
Lexicography; Norwegian language; natural language processing; IBM; application software;
D O I
暂无
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
In 1984, IBM and the University of Oslo set up a joint project, probably the first project of its kind in Norway. Its aim was to develop Norwegian language resources for IBM application software for PCs, midrange computers, and mainframes. The primary objective: to create a "base dictionary" module that would drive language sensitive functions. The technology was based on simple character sequence recognition; its great asset being high compaction and rapid access to correct data. The module was to be built on documented linguistic forms. The dictionary should cover the general part of the vocabulary, and a broad coverage module was created for Norwegian Bokmal. Later, one module for Nynorsk was developed as well. At that stage, however, the project had become a regular IBM project. In the following years, other linguistic functions were added and eventually, the result served as the foundation for a grammar and for machine translation. The project was terminated because of the corporate financial crisis of the late 1980s. Later, the dictionaries were transferred to the University of Oslo. They are now an integral part of the basic infrastructure for Norwegian academic computational linguistics.
引用
收藏
页码:258 / 270
页数:13
相关论文
共 50 条