A Hybrid Approach to Extracting and Classifying Verb+Noun Constructions

Abstract : We present the main findings and preliminary results of an ongoing project aimed at developing a system for collocation extraction based on contextual morpho-syntactic properties. We explored two hybrid extraction methods: the first applies language-independent statistical techniques followed by linguistic filtering, while the second, available only for German, uses a set of lexico-syntactic patterns to extract collocation candidates. To define the extraction and filtering patterns, we studied a specific category of collocation, Verb-Noun (VN) constructions, using a model inspired by systemic functional grammar that proposes three levels of analysis: lexical, functional and semantic. From a tagged and lemmatized corpus, we identify contextual morpho-syntactic properties that help to filter the output of the statistical methods and to extract potentially interesting VN constructions (complex predicates vs. complex predicators). The extracted candidates are validated and classified manually.
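
The first method described in the abstract (a statistical pass followed by a linguistic filter) can be illustrated with a minimal Python sketch. Everything specific here is an assumption made for illustration and is not taken from the paper: the toy corpus, the choice of pointwise mutual information as the statistic, the use of determiner fixedness as the morpho-syntactic property, the thresholds, and the function names `collect_pairs`, `pmi` and `extract_candidates`.

```python
import math
from collections import Counter, defaultdict

# Hypothetical toy corpus: each sentence is a list of (lemma, POS tag) pairs,
# standing in for the output of a tagger and lemmatizer.
SENTENCES = [
    [("take", "VERB"), ("a", "DET"), ("decision", "NOUN")],
    [("take", "VERB"), ("a", "DET"), ("decision", "NOUN")],
    [("take", "VERB"), ("the", "DET"), ("cake", "NOUN")],
    [("make", "VERB"), ("a", "DET"), ("cake", "NOUN")],
    [("make", "VERB"), ("the", "DET"), ("cake", "NOUN")],
]


def collect_pairs(sentences, window=3):
    """Count verb-noun pairs and record the determiner seen before each noun."""
    pair_counts = Counter()
    determiners = defaultdict(Counter)
    for sentence in sentences:
        for i, (lemma, pos) in enumerate(sentence):
            if pos != "VERB":
                continue
            det = "<none>"
            # Look at the next `window` tokens and stop at the first noun.
            for other, other_pos in sentence[i + 1 : i + 1 + window]:
                if other_pos == "DET":
                    det = other
                elif other_pos == "NOUN":
                    pair_counts[(lemma, other)] += 1
                    determiners[(lemma, other)][det] += 1
                    break
    return pair_counts, determiners


def pmi(pair_counts):
    """Pointwise mutual information per pair (an assumed choice of statistic)."""
    total = sum(pair_counts.values())
    verbs, nouns = Counter(), Counter()
    for (verb, noun), count in pair_counts.items():
        verbs[verb] += count
        nouns[noun] += count
    return {
        (verb, noun): math.log2(count * total / (verbs[verb] * nouns[noun]))
        for (verb, noun), count in pair_counts.items()
    }


def extract_candidates(sentences, min_pmi=0.0, min_fixedness=0.8):
    """Statistical pass first, then a filter on a morpho-syntactic property."""
    pair_counts, determiners = collect_pairs(sentences)
    kept = []
    for pair, score in pmi(pair_counts).items():
        if score <= min_pmi:
            continue  # rejected by the statistical step
        # Assumed property: share of occurrences using the dominant determiner.
        fixedness = max(determiners[pair].values()) / pair_counts[pair]
        if fixedness >= min_fixedness:
            kept.append(pair)
    return sorted(kept)


if __name__ == "__main__":
    print(extract_candidates(SENTENCES))
```

In this sketch the statistical score and the linguistic property are kept in separate steps, so either could be swapped for another measure without touching the other. The surviving pairs would still need the manual validation and classification that the abstract mentions.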
Document type :
Conference papers

https://hal-univ-diderot.archives-ouvertes.fr/hal-01220400
Contributor : Christopher Gledhill
Submitted on : Monday, October 26, 2015 - 11:49:46 AM
Last modification on : Thursday, November 14, 2019 - 10:18:02 AM

Identifiers

  • HAL Id : hal-01220400, version 1

Citation

Amalia Todirascu, Dan Tufis, Ulrich Heid, Christopher Gledhill, Dan Stefânescu, et al. A Hybrid Approach to Extracting and Classifying Verb+Noun Constructions. The 6th edition of the Language Resources and Evaluation Conference (LREC 2008), May 2008, Marrakech, Morocco. ⟨hal-01220400⟩
