My primary research interest is the representation of semantic information and its use in natural language processing applications: computational lexical semantics. The difficulty of hand-crafting adequate semantic representations has limited the field of natural language processing to applications that can be contained within well-defined subdomains. The only escape from this limitation will be through automated or semi-automated methods of lexical acquisition, which presuppose a link between a distributional analysis of language and a well-founded theory of semantic representation.
I began with a thorough study of the usefulness of Lexical Conceptual Structures [Jackendoff72, 90] as a basis for computational lexical semantics, [Palmer81, Palmer83, Palmer90a]. The outcome of this study was embodied in the Pundit/Kernel text processing system, where the semantic representations proved extremely effective for reference resolution, temporal analysis, and the recovery of implicit information, [Palmer et al 85, Dahl et al 86]. This system was internationally recognized for providing path-breaking in-depth coverage of semantics and pragmatics, [Palmer et al 93]. However, porting the system to new domains revealed the limitations of the approach: primarily the fragility of the parser and the major time commitment required to create separate hand-crafted lexical entries for every slight sense variation of a particular lexical item, [Palmer 90b].
I am now investigating verb classifications such as Levin's verb classes, [Levin93], and WordNet, [Miller90, Miller91]. I believe that sets of semantic components can be associated with lexical items, in particular with the primary senses of verbs, that account for most of their syntactic behavior. Such associations can be implemented as sets of features, which provide a more flexible representation than rule-based Lexical Conceptual Structures, allowing for more robust processing and best partial matching, [Palmer and Wu95]. In addition, this approach should be more amenable to empirical methods, since a distributional analysis of syntactic frames should provide critical information about a verb's semantic classification, not just for English but for all languages, [Dang & Palmer99]. These semantic classifications, although potentially quite diverse, should share key cross-linguistic semantic components, as suggested by Talmy [Talmy90] and Jackendoff.
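To make the feature-set idea concrete, here is a minimal Python sketch, with invented feature and sense names, of how primary verb senses might be encoded as sets of semantic-component features and matched by degree of overlap rather than by exact rule application:

```python
# Minimal sketch: verb senses as feature sets (feature and sense names are
# illustrative, not drawn from any actual lexicon).

SENSES = {
    "run.01":   {"+motion", "+manner"},
    "enter.01": {"+motion", "+directed"},
    "break.01": {"+change_of_state", "+cause"},
}

def best_partial_match(observed, senses=SENSES):
    """Return the sense whose feature set overlaps most with the observed features.

    Unlike a rule-based representation, this degrades gracefully: a partial
    overlap still yields a best candidate.
    """
    return max(senses, key=lambda s: len(senses[s] & observed))

print(best_partial_match({"+motion", "+directed"}))  # -> enter.01
```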
This research, supported by NSF grant 9800658, specifically addresses questions of word sense distinctions with respect to verbs, and how regular extensions of meaning can be achieved through the adjunction of particular syntactic phrases. My students and I are developing VerbNet, based on a bilingual Korean/English lexicon, as well as a bilingual Portuguese/English lexicon; these resources make explicit the semantic components, argument structure, and sets of syntactic frames associated with individual lexical items. Many of these semantic components are cross-linguistic, [Palmer et al 96, Palmer et al 98a]. The lexical items in each language form natural groupings based on the presence or absence of semantic components and on their ability to occur or not occur within particular syntactic frames. These bilingual lexicons are being implemented as Feature-based Lexicalized Tree-Adjoining Grammars, [Bleam et al 98, Xia et al 98], but they are intended to be independent of any particular syntactic framework and should map readily onto many widely used formalisms, including CCG, HPSG, LFG, and GB. The English entries are mapped directly onto English WordNet senses.
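The following is a small sketch, with hypothetical field names and values, of the kind of lexical entry described above: semantic components, argument structure, a set of syntactic frames, and a pointer to WordNet senses. It is not the actual VerbNet schema, only an illustration of the information an entry makes explicit:

```python
# Illustrative entry structure only; field names and values are invented and do
# not reflect the actual VerbNet or bilingual lexicon schemas.

from dataclasses import dataclass, field

@dataclass
class LexicalEntry:
    lemma: str
    semantic_components: set           # e.g. {"+motion", "+manner"}
    argument_structure: list           # thematic roles, e.g. ["Agent"]
    syntactic_frames: list             # schematic frames the verb occurs in
    wordnet_senses: list = field(default_factory=list)

run_entry = LexicalEntry(
    lemma="run",
    semantic_components={"+motion", "+manner"},
    argument_structure=["Agent"],
    syntactic_frames=["NP V", "NP V PP.path"],
    wordnet_senses=["run#1"],
)
```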
Levin classes, although a valuable starting point for VerbNet, do not currently provide information that is complete or precise enough to inform lexical entries or to serve as a gold standard for clustering. Both Levin classes and WordNet have limitations that impede their utility as general classification schemes. We have developed a refinement of Levin classes, intersective Levin classes, which are more fine-grained and which exhibit more coherent sets of syntactic frames and associated semantic components, [Palmer et al 97]. Certain syntactic frames indicate the adjunction of prepositional phrases or adverbs that provide a regular extension of meaning to the core sense of many verbs. For example, we associate the directed motion feature with path prepositional phrases for manner of motion verbs (and for other classes, such as sound emission verbs). We have preliminary indications that the membership of our intersective sets is more compatible with WordNet classifications than that of the broader Levin classes, allowing us to attribute the semantic components and associated sets of syntactic frames to specific WordNet senses as well, thus enriching the WordNet representation and providing explicit criteria for word sense disambiguation. We are also finding interesting class correspondences between English and Portuguese, [Dang et al 98].
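A short sketch of the two ideas in this paragraph, with class contents and feature names invented for illustration: an intersective class is simply the set of verbs shared by several Levin classes, and a regular extension of meaning can be modeled as adding a feature when the triggering phrase (here a path PP) is adjoined:

```python
# Class memberships below are illustrative fragments, not the full Levin classes.

LEVIN = {
    "run-51.3.2":   {"run", "jog", "swim"},      # manner of motion
    "meander-47.7": {"run", "meander", "wind"},  # path-shape verbs
}

def intersective_class(*names):
    """Verbs belonging to all of the named Levin classes."""
    return set.intersection(*(LEVIN[n] for n in names))

def extend_sense(features, frame):
    """Add the directed motion component when a path PP is adjoined."""
    return features | {"+directed_motion"} if "PP.path" in frame else set(features)

print(intersective_class("run-51.3.2", "meander-47.7"))      # {'run'}
print(extend_sense({"+motion", "+manner"}, "NP V PP.path"))  # adds +directed_motion
```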
Related research interests include logic programming, artificial intelligence, multi-lingual information extraction and retrieval, and machine translation [Palmer et al 98b, Palmer, Rambow & Nasr98].
VerbNet also forms the basis of the Parameterized Action Representations used for natural language interaction with virtual humans, work carried out at the Human Simulation and Modeling Center.
Visiting Associate Professor, Department of Computer and Information Science