iVia Notes on Working with Natural Language and WordNet to Provide Word Sense Disambiguation and to Help Map to Ontologies

There are several compendiums/nets/corpora of languages out there. WordNet is one for the English language. Homonyms, synonyms, antonyms. These can be used to provide background concepts/word sense disambiguation to yield more accurate classification. That.s the promise. E.g., jaguar the animal and Jaguar the car (very different clumpings of words associated with them in the background). These language nets can also be used to increase accuracy in mappings from natural language phrases to controlled languages. See Bhattacharyya, Ramakrishnan and Mihalcea among others below. Ankle deep linguistics./p>

WordNet Worth Pursuing???:
According to Soumen, general comments:
> Our interests are in its usage to increase accuracy in classification
> and in helping to better map natural language terms to controlled
> language terms. I'm looking at it now with an eye towards working on
> it in the next grant cycle, perhaps.

As far as "standard" benchmarks (Reuters, 20 newsgroups) go, the effect of
adding WN info has been mild or negative. I had a student (in CMU now as
it turns out) who got negligible gains using WN synsets as features as
against the word itself, and got accuracy losses upon expanding features a
bit in WN. Agrees with broad concerns that WN "connects up too much" and
you need to pick and choose carefully. We did have feature selection in
place, even so, could not make it really work. Perhaps if the
classification problem is more challenging, WN assistance could be more
valuable. Also, mapping to controlled language could be a more promising
application. The best use of WN I have found is in open-domain question
answering ("general knowledge") and WSD, esp. the is-a hierarchies. But
WN is not very complete for technical jargon wrt most specialized fields.

WordNet
http://www.cogsci.princeton.edu/~wn/
WordNet® is an online lexical reference system whose design is inspired by current psycholinguistic theories of human lexical memory. English nouns, verbs, adjectives and adverbs are organized into synonym sets, each representing one underlying lexical concept. Different relations link the synonym sets.

WordNet Bibliography
http://engr.smu.edu/~rada/wnb/

 
WordNets of the World
http://www.globalwordnet.org/gwa/wordnet_table.htm
 


Projects:

Senseval
http://www.senseval.org/
There are now many computer programs for automatically determining the sense of a word in context (Word Sense Disambiguation or WSD). The purpose of Senseval is to evaluate the strengths and weaknesses of such programs with respect to different words, different varieties of language, and different languages.

Suggested Upper Merged Ontology: IEEE
http://virtual.cvut.cz/kifb/en/
http://suo.ieee.org/
The SUO WG is developing a Standard that will specify an upper ontology to support computer applications such as data interoperability, information search and retrieval, automated inferencing, and natural language processing.
 
MEANING
Developing Multilingual Web-scale Language Technologies
http://www.lsi.upc.es/~nlp/projectes/meaning.html
MEANING will be concerned with automatically collecting and analysing language data from the WWW on a large scale, and building more comprehensive multilingual lexical knowledge bases to support improved word sense disambiguation (WSD).
 
Mimida
http://www.gittens.nl/SemanticNetworks.html
a WordNet-based mechanically-generated multilingual semantic network for more than 20 languages based on dictionaries found on the Web.
 
 
WordNet Software and/or interesting projects:

WordNet::Similarity
http://sourceforge.net/projects/wn-similarity

SenseRelate
http://senserelate.sourceforge.net/

WordNet
http://www-inf.enst.fr/~milc/wordnet.us.html

HyperDic: WordNet
http://www.hyperdic.net/

WordNet Component Object Model
http://wncom.sourceforge.net/
This package provides a access to the WordNet database though a self contained component that offers both a general-purpose interface that can be used by scripting languages such as Visual Basic or JScript and an advanced interface to be used in C or C++.

WordNet to SQL
http://www.highlyillogical.org/wn2sql/
http://wordnet2sql.infocity.cjb.net/

Rada Mihalcea WordNet related software
Open Mind Word Expert OMWE1.0 sense tagged data ; TWA sense tagged data set Word Alignment Resources ; SemCor 1.6, SemCor 1.7, SemCor 1.7.1, SemCor 2.0 ; Mappings among various WordNet versions ; Evaluation code for text filtering
http://www.cs.unt.edu/~rada/downloads.html

Senseval
http://www.senseval.org/
There are now many computer programs for automatically determining the sense of a word in context (Word Sense Disambiguation or WSD). The purpose of Senseval is to evaluate the strengths and weaknesses of such programs with respect to different words, different varieties of language, and different languages.

OpenCyc
http://www.opencyc.org/
http://sourceforge.net/project/showfiles.php?group_id=27274 (LGPL)
OpenCyc is the open source version of the Cyc technology, the world's largest and most complete general knowledge base and commonsense reasoning engine.
(see Tom O. Hara articles)
 
 
 
Papers:

Text Representation with WordNet Synsets using Soft Sense Disambiguation
Ganesh Ramakrishnan and Pushpak Bhattacharyya
Ingenierie des Systemes d'Information Journal (ISI-NIS Journal),
http://www.cse.iitb.ac.in/~pb/papers/isi-nis.pdf
.Text information processing depends critically on the proper representation of texts. A common and naive way of representing a text is as a bag of its component words. This representation suffers primarily from two drawbacks, viz., polysemy and synonymy which arise because of the ambiguity of the words and the lack of information about the relations between the words. This paper presents a model for representing a text in terms of the synsets in the WordNet- the lexical knowledge base of English words along with the semantic relations. These synsets stand for concepts which correspond to the words of the text. In particular, a soft sense disambiguation approach has been proposed. The text representation so obtained is found to convey the key ideas that the texts deal with. WordNet relations with other words in the sentence are exploited to disambiguate the senses. This scheme has been evaluated using a goodness measure based the information content of the representation of the text. As an actual application, the problem of text classification has been taken up, and the results are encouraging.. .. The conclusion is that WordNet does helps relate the words in a document and in the emergence of meaning through mutual reinforcement of related words. This method of ranking synsets for a text can find use in many other applications like clustering, question answering and summarization, some of which are ongoing. Future work consists in assigning weights top the edges of the semantic graph and incorporating verbs.

Soft Word Sense Disambiguation,
Ganesh Ramakrishnan, Prithviraj B.P., A. Deepa, Pushpak Bhattacharyya, and Soumen Chakrabarti,
International Conference on Global Wordnet (GWC 04), Brno, Czeck Republic, January, 2004
http://www.cse.iitb.ac.in/~pb/papers/soft-wsd.pdf
Word sense disambiguation is a core problem in many tasks related to language processing. In this paper, we introduce the notion of soft word sense disambiguation which states that given a word, the sense disambiguation system should not commit to a particular sense, but rather, to a set of senses which are not necessarily orthogonal or mutually exclusive. The senses of a word are expressed by its WordNet synsets, arranged according to their relevance. The relevance of these senses are probabilistically determined through a Bayesian Belief Network. The main contribution of
the work is a completely probabilistic framework for word-sense disambiguation with a semi-supervised learning technique utilising WordNet. WordNet can be customized to a domain using corpora from that domain. This idea applied to question answering has been evaluated on TREC data and the results are promising.

Automatic Lexicon Generation through Wordnet
Nitin Verma and Pushpak Bhattacharyya,
International Conference on Global Wordnet (GWC 04), Brno, Czeck Republic, January, 2004
http://www.fi.muni.cz/gwc2004/proc/81.pdf
A lexicon is the heart of any language processing system. Accurate words with grammatical and semantic attributes are essential or highly desirable for any application- be it machine translation, information extraction, various forms of tagging or text mining. However, good quality lexicons are difficult to construct requiring enormous amount of time and manpower. In this paper, we present a method for automatically generating multilingual Universal Word (UW) dictionaries (for English, Hindi and Marathi) from an input document- making use of English, Hindi and Marathi WordNets. The dictionary entries are in the form of Universal Words (UWs) which are language words (primarily English) concatenated with disambiguation information. The entries are associated with syntactic and semantic properties- most of which too are generated automatically. In addition to the WordNet, the system uses a word sense disambiguator, an inferencer and the knowledge base (KB) of the Universal Networking Language which is a recently proposed interlingua. The lexicon so constructed is sufficiently accurate and reduces the manual labor substantially

Pushpak Bhattacharyya
Personal Page: Work in Information Retrieval and Extraction . Word Sense Disambiguation/WordNet
http://www.cse.iitb.ac.in/~pb/pubs-tmir.html

Word Sense Disambiguation and Text Similarity Measurement Using WordNet ,
P. Bhattacharyya and Narayan Unny,
chapter in Real World Semanic Web Applications, IOS Press, Amsterdam, 2002. Vipual Kashyap and Leon Shklar (ed), ISBN: 1 58603 306 9
http://www.cse.iitb.ac.in/~pb/papers/ios_pushpak.ps

Text Categorization and Information Retrieval Using WordNet Senses
http://www.fi.muni.cz/gwc2004/proc/110.pdf
.In this paper we study the influence of semantics in the Text Categorization (TC) and Information Retrieval (IR) tasks. The K Nearest Neighbours (K-NN) method was used to perform the text categorization. The experimental results were obtained taking into account for a relevant term of a document its corresponding WordNet synset. For the IR task, three techniques were investigated: the direct use of a weighted matrix, the Singular Value Decomposition (SVD) technique in the Latent Semantic
Indexing (LSI) model, and the bisecting spherical k-means clustering technique. The experimental results we obtained taking into account the semantics of the documents, allowed for an improvement of the performance for the text categorization whereas they were not so promising for the IR task..... .With regard to the study of how the semantic 30-KNN performed, it can be remarked that when documents are indexed with WordNet synsets, the performance slightly improved. Therefore, the use of words which refer to the same concept is a research direction we plan to investigate further. As future work, it would be
interesting to carry out some experiments using other data sets (e.g. the TREC document collection). In these experiments, the two vector representations should be also combined, in order to take into account with different weights, terms and WordNet synsets at the same time. With regard to the poor performance we obtained for the IR task, it could be due to mainly three reasons. First, the queries of 15 words were pretty long (normal queries are 1.5 words on average) and such long queries implicitly have a disambiguation effect. We should expect better effect of using WordNet for the normal 1 or 2 queries. Second, the semantic lemmatisation related synonyms when they are in the same morphologic group: it should be combined with standard morphological lemmatisation because they could complement
each other. Moreover, also other relations could be exploited in the semantic lemmatisation, possibly including the contextual information of the glosses of all the hyponyms. Last, but not least, indexing by WordNet synsets can be very helpful for text retrieval tasks only if the error rate is below 30% and, unfortunately, the state-of-the-art of WSD techniques perform with error rates ranging from 30% to 60% which cannot guarantee better results than standard word indexing

Using Measures of Semantic Relatedness for Word Sense Disambiguation
Patwardhan, Banerjee and Pedersen, Appears in the Proceedings of the Fourth International Conference on Intelligent Text Processing and Computational Linguistics, February 17-21, 2003, Mexico City
http://www.d.umn.edu/~tpederse/Pubs/cicling2003-3.pdf
.This paper generalizes the Adapted Lesk Algorithm of Banerjee and Pedersen (2002) to a method of word sense disambiguation based on semantic relatedness. This is possible since Lesk's original Algorithm (1986) is based on gloss overlaps which can be viewed as a measure of semantic relatedness. We evaluate a variety of measures of semantic relatedness when applied to word sense disambiguation by carrying out
experiments using the English lexical sample data of Senseval-2. We find that the gloss overlaps of Adapted Lesk and the semantic distance measure of Jiang and Conrath (1997) result in the highest accuracy..

Text Classification Using WordNet Hypernyms
Sam Scott and Sam Matwin, 1998
http://acl.ldc.upenn.edu/W/W98/W98-0706.pdf
This paper describes a method of incorporating WordNet knowledge into text representation that can
lead to significant reductions in error rates on certain types of text classification tasks. The method uses the
lexical and semantic knowledge embodied in WordNet to move from a bag-of.words representation to a representation based on hypernym density. The appropriate value for the height of generalization parameter h depends on the characteristics of each classification task. A side benefit of the hypernym density representation is that the classification rules induced are often simpler and more comprehensible than rules induced using the bag-of-words. Our experience indicates that the hypernym density
representation can work well for texts that use an extended or unusual vocabulary, or are written by
multiple authors employing different terminologies. It is not likely to work well for text that is guaranteed
to be written concisely and efficiently, such as the text in Reuters-21578. In particular, hypernym
density is more likely to perform well on classification tasks involving narrowly defined and/or
semantically distant classes (such as SONGI and USENETI). In the case of classes that are broadly
defined and/or semantically related (such as SONG2 and USENET2) hypernym density does not always
outperform bag-of-words.

Word Sense Disambiguation Using Semantic Graphs
Narayan Unnikrishnan and P. Bhattacharyya
International Conference on Global WordNet (GWC 02), Mysore, India, January, 2002

Exploring automatic word sense disambiguation with decision lists and the Web
Agirre, Eneko and David Martinez.
In: Proceedings of the Semantic Annotation And Intelligent Annotation workshop organized by COLING, Luxembourg, 2000.
http://arxiv.org/abs/cs.CL/0010024
The most effective paradigm for word sense disambiguation, supervised learning, seems to be stuck because of the knowledge acquisition bottleneck. In this paper we take an in-depth study of the performance of decision lists on two publicly available corpora and an additional corpus automatically acquired from the Web, using the fine-grained highly polysemous senses in WordNet. Decision lists are shown a versatile state-of-the-art technique. The experiments reveal, among other facts, that SemCor can be an acceptable (0.7 precision for polysemous words) starting point for an all-words system. The results on the DSO corpus show that for some highly polysemous words 0.7 precision seems to be the current state-of-the-art limit. On the other hand, independently constructed hand-tagged corpora are not mutually useful, and a corpus automatically acquired from the Web is shown to fail.

WordNet enrichment with classification systems
Montoyo, Andrés, Manuel Palomar and German Rigau.
In: Proceedings of the NAACL 2001 Workshop on WordNet and Other Lexical Resources, Pittsburgh, June 2001. http://www.seas.smu.edu/~rada/mwnw/papers/WNW-NAACL-202.pdf
This paper presents a new method to enrich semantically WordNet with categories from general domain classification systems. The method is performed in two consecutive steps. First, a lexical knowledge word sense disambiguation process. Second, a set of rules to select the main concepts as representatives for each category. The method has been applied to label automatically WordNet synsets with Subject Codes from a standard news agencies classification system. Experimental results show than the proposed method achieves more than 95% accuracy selecting the main concepts for each category.


Enriching WordNet concepts with topic signatures.
Agirre, Eneko, Olatz Ansa, David Martinez, and Eduard Hovy.
In: Proceedings of the NAACL 2001 Workshop on WordNet and Other Lexical Resources, Pittsburgh, June 2001.
http://www.seas.smu.edu/~rada/mwnw/papers/WNW-NAACL-228.pdf
http://arxiv.org/ftp/cs/papers/0109/0109031.pdf
.This paper explores the possibility of enriching the content of existing ontologies. The overall goal is to overcome the lack of topical links among concepts in WordNet. Each concept is to be associated to a topic signature, i.e., a set of related words with associated weights. The signatures can be automatically constructed from the WWW or from sense-tagged corpora. Both approaches are compared and evaluated on a word sense disambiguation task. The results show that it is possible to construct clean signatures from the WWW using some filtering techniques..

Manual and automatic semantic annotation with WordNet.
Fellbaum, Christiane, Martha Palmer, Hoa Trang Dang, Lauren Delfs, and Susanne Wolf.
In: Proceedings of the NAACL 2001 Workshop on WordNet and Other Lexical Resources, Pittsburgh, June 2001.

Automatic WordNet mapping using Word Sense Disambiguation
Lee, Changki and Geunbae Lee and Jungyun Seo. In: Proceedings of the Joint SIGDAT Conference on Empirical Methods in Natural Language Processing and Very Large Corpora (EMNLP/VLC 2000), Hong Kong, 2000.
http://acl.ldc.upenn.edu/W/W00/W00-1318.pdf


Instance Based Learning with Automatic Feature Selection Applied to Word Sense Disambiguation,
Rada Mihalcea
in Proceedings of the 19th International Conference on Computational Linguistics COLING 2002, Taiwan, August 2002.
http://www.cs.unt.edu/~rada/papers/coling.2002.ps
Instance based learning with automatic feature selection is a new approach in the WSD field. The algorithm was implemented in a system that achieves excellent performance on the data released during the senseval-2 English lexical task. The feature selection process is completely automated and it practically creates a classifier tailored to the behaviour of each specific word.

Highly Accurate Bootstrapping Algorithm for Word Sense Disambiguation
Mihalcea, Rada and Dan I. Moldovan. In: International Journal on Artificial Intelligence Tools, vol.10, no.1-2, [pg.5-21], 2001.
http://www.worldscinet.com/ijait/10/1001n02/S0218213001000398.html
http://www.cs.unt.edu/~rada/papers/ijait01.ps.gz
In this paper, we present a bootstrapping algorithm for Word Sense Disambiguation which succeeds in disambiguating a subset of the words in the input text with very high precision. It uses WordNet and a semantic tagged corpus, for the purpose of identifying the correct sense of the words in a given text. The bootstrapping process initializes a set of ambiguous words with all the nouns and verbs in the text. It then applies various disambiguation procedures and builds a set of disambiguated words: new words are sense tagged based on their relation to the already disambiguated words, and then added to the set. This process allows us to identify, in the original text, a set of words which can be disambiguated with high precision; 55% of the verbs and nouns are disambiguated with an accuracy of 92%.

Bootstrapping Large Sense Tagged Corpora,
Rada Mihalcea,
in Proceedings of the 3rd International Conference on Languages Resources and Evaluations LREC 2002, Las Palmas, Spain, May 2002.
http://www.cs.unt.edu/~rada/papers/lrec.2002.ps
The performance of Word Sense Disambiguation systems largely depends on the availability of sense tagged corpora. Since the semantic annotations are usually done by humans, the size of such corpora is
limited to a handful of tagged texts. This paper proposes a generation algorithm that may be used to automatically create large sense tagged corpora. The approach is evaluated through comparative sense
disambiguation experiments performed on data provided during the SENSEVAL-2 English all words and English lexical sample tasks.

``An Automatic Method for Generating Sense Tagged Corpora''
Mihalcea, Rada and Dan I. Moldovan.
In: Proceedings of AAAI '99, [pg.461-466], Orlando, FL, July 1999.
http://www.cs.unt.edu/~rada/papers/aaai99.ps.gz
The unavailability of very large corpora with semantically disambiguated words is a major limitation
in text processing research. For example, statistical methods for word sense disambiguation of free text are known to achieve high accuracy results when large corpora are available to develop context rules, to train and test them. This paper presents a novel approach to automatically generate arbitrarily large corpora for word senses. The method is based on (1) the information provided in WordNet, used to formulate
queries consisting of synonyms or definitions of word senses, and (2) the information gathered
from Internet using existing search engines. The method was tested on 120 word senses and a precision
of 91% was observed.

``Word Semantics for Information Retrieval: moving one step closer to the Semantic Web''
Mihalcea, Rada and Dan I. Moldovan.
In: International Conference on Tools in Artificial Intelligence, November, 2001.

``AutoASC - A System for Automatic Acquisition of Sense tagged Corpora ''
Mihalcea, Rada and Dan I. Moldovan.
In: International Journal of Pattern Recognition and Artificial Intelligence, [pg.3-17], vol.14, no.1, 2000.

``A Semi-Complete Disambiguation Algorithm for Open Text''
Mihalcea, Rada and Dan I. Moldovan.
In: Proceedings of AAAI 2000, [pg.1085], Austin, TX, July 2000.

``An Iterative Approach to Word Sense Disambiguation''
Mihalcea, Rada and Dan I. Moldovan.
In: Proceedings of Flairs 2000, [pg.219-223] Orlando, FL, May 2000. http://www.seas.smu.edu/~rada/papers/flairs00.ps.gz

``Constructing Bayesian networks from WordNet for word-sense disambiguation: representational and processing issues.''
Wiebe, Janyce, Tom O'Hara and Rebecca Bruce.
In: Proceedings of the COLING/ACL Workshop on Usage of WordNet in Natural Language Processing Systems, Montreal, 1998. http://www.CS.NMSU.Edu/~wiebe/pubs/papers/siglex98.ps (also see OpenCyc)
This paper describes a probabilistic model that is formed from the integration of an analytical and empirical component. The analytical component is a Bayesian network derived from WordNet, and the empirical component is composed of compatible probabilistic models formulated from tagged training data. The components are integrated in a formal, uniform framework based on the semantics of causal dependence. The paper explores various representational issues that must be addressed.

"Classifying functional relations in Factotum via WordNet hypernym associations",
O'Hara, Tom, and Janyce Wiebe (2003),
in Proc. Fourth International Conference on Intelligent Text Processing and Computational Linguistics (CICLing-2003). (also see OpenCyc)
This paper describes how to automatically classify the functional relations from the Factotum knowledge base via a statistical machine learning algorithm. This incorporates a method for inferring prepositional relation indicators from corpus data. It also uses lexical collocations (i.e., word associations) and class-based collocations based on the WordNet hypernym relations (i.e., is-subset-of). The result shows substantial improvement over a baseline approach.

"A Comparison of Word- and Sense-based Text Categorization Using Several Classification Algorithms".
Ath. Kehagias, V. Petridis, V.G. Kaburlasos, and P. Fragkou.
J. of Int. Inf. Sys., vol.21, pp.227-247, 2003.
http://users.auth.gr/~kehagiat/kehPub/journal/2001WordsSenses.ps

``Induction of Classification from Lexicon Expansion: Assigning Domain Tags to WordNet Entries''
Chang, Echa and Chu-Ren Huang and Sue-Jin Ker and Chang-Hua Yang
In: Proceedings of the Coling 2002 Workshop ''SemaNet'02: Building and Using Semantic Networks'', Taipei, August 2002. http://www.cs.ust.hk/~hltc/semanet02/pdf/chang.pdf

Word-Sense Disambiguation using Statistical Models of {Roget}'s Categories Trained on Large Corpora,
David Yarowsky booktitle = "Proceedings of {COLING}-92", month = "July", address = "Nantes, France", pages = "454--460", year = "1992", url = "citeseer.ist.psu.edu/yarowsky92wordsense.html" }
Perhaps one of the simplest and oldest paper. Very highly cited. Yarowsky proposes a solution to the problem of WSD using a thesaurus in a supervised learning environment. Word associations are recorded and for an unseen text, the senses of words are detected from the learnt associations.
Keywords (my own): Naive Bayes, thesaurus based disambiguation

Word sense disambiguation using conceptual density,
E. Agirre and G. Rigau",. 1996. In Proceedings of COLING'96, pages 16--22, Copenhagen, Danmark.",
year = "1996", url = "citeseer.ist.psu.edu/agirre96word.html" }
Agirre uses a measure based on the proximity of the text words in WordNet (conceptual density) to disambiguate the words. Perhaps one of the first papers that proposed a model for exploiting the proximity of word senses in WordNet. However, this paper talks only about proximity based on hypernym relations.
Keywords : conceptual density, proximity in WordNet.


Word sense disambiguation using decomposable models
Rebecca Bruce and Janyce Wiebe, booktitle = "Proceedings of the {ACL}-94, 32nd Annual Meeting of the Association for Computational Linguistics", address = "Las Cruces, US", pages = "139--145",
year = "1994", url = "citeseer.ist.psu.edu/article/bruce94wordsense.html" }
Bruce and Weibe introduce the use of decomposable models (maximum-entropy, and logistic regression are some decomposable models. Log-linear models are decomposable) for incorporating a host of discrete and continuous valued features for sense disambiguation which is treated as a classification task.
The features used were : part of speech tags one and two places to the left and right of the ambiguous word, part of speech tag of the word and a collocation variable for each sense of the word (a binary feature)
The paper has a lots of maths but not much elaboration on WSD and experiments are far from satisfactory.
In my opinion - a bit hyped up !


Two Languages Are More Informative Than One,
Dagan, I., Itai, A., and Schwall, U., text = "Proceedings of the 29th Annual Meeting of the Association for Computational Linguistics (ACL-91), pp. 130-137", year = "1991", url = "http://acl.ldc.upenn.edu/P/P91/P91-1017.pdf" }
A good representative paper illustrating the use of bi-lingual corpus for word-sense-disambiguation. Central idea is that word in a language can be disambiguated by looking up its translation in another language. Scale of experiment was again very very small. It hinges on disambiguating words in one language usin statistical data about lexical (or surface) relations in another language.
But this paper exposes a chicken-egg problem ! One major requirement of WSD is for machine translation. And this paper requires machine translated documents for effective WSD !!



Word sense distinguishability and inter-coder agreement,
R. Bruce and J. Wiebe, (1998). Proceedings of the 3rd Conference on Empirical Methods in Natural
Language Processing (EMNLP-98). Association for Computational Linguistics SIGDAT, Granada, Spain, June 1998.", year = "1998", url = "citeseer.ist.psu.edu/bruce98wordsense.html" }
This paper tries and analyses the distinguishability between word senses within and across dictionaries (category distinguishability). It also analyses and presents tests for observer differences in sense tagging (of a highly ambiguous word - ``interest'').
Again scale of experiment is small



WASP-Bench: an MT Lexicographer's Workstation Supporting State-of-the-art
Lexical Disambiguation,
Adam Kilgariff and David Tugwell, url = "citeseer.ist.psu.edu/kilgariff01waspbench.html" }
This paper describes more of a work-bench for sense disambiguation than an algorithm. Mutual information between a word-sense and the words in context, is used as a measure of disambiguation. So this paper does not provide a novel algorithm as such. But it describes a tool for lexical disambiguation that was built on this idea. Moreover, the tool supports human-assisted sense disambiguation.