Extracting collocations in context: the case of verb-noun constructions in Romanian

Abstract : Verb-Noun (VN) constructions involve a ‘generic’ V plus a ‘specific’ N which is either (i) a structural qualifierof the Predicator as in make haste, or (ii) a syntactic Complement as in to make a suggestion. In each case the Nexpresses the semantic Range of the VN construction (Banks 2000, Gledhill 2007). However, despite muchresearch on ‘support verb’ or ‘light verb’ constructions, there is no one morpho-syntactic feature which allows usto distinguish these constructions from any other sequence of V plus N, at least in English. From the point of view of corpus linguistics, this lack of formal marking makes it hard to extract them on a semi-automatic basis.As part of an on-going lexicographic project, we have examined various computational models in order toextract VN constructions from multilingual corpora. One of our main findings is that statistical methods aloneare not sufficient; the collocations that are thrown up in the data extend from a few ‘relevant’ VN constructions,to rather too many ‘semi-relevant’ VN co-occurrences and ‘irrelevant’ noise.
Document type :
Conference papers
Complete list of metadatas

https://hal-univ-diderot.archives-ouvertes.fr/hal-01220405
Contributor : Christopher Gledhill <>
Submitted on : Monday, October 26, 2015 - 11:58:02 AM
Last modification on : Friday, January 4, 2019 - 5:33:30 PM

Identifiers

  • HAL Id : hal-01220405, version 1

Collections

Citation

Amalia Todirascu, Christopher Gledhill, Dan Stefânescu. Extracting collocations in context: the case of verb-noun constructions in Romanian. RANLP, Sep 2007, Borovets, Bulgaria. ⟨hal-01220405⟩

Share

Metrics

Record views

75