Diversity in logarithmic opinion pools

Smith, Andrew D.M.; Osborne, Miles

doi:10.1075/li.30.1.04smi

Article published In:

Named Entities: Recognition, classification and use
Edited by Satoshi Sekine and Elisabete Ranchhod
[Lingvisticæ Investigationes 30:1] 2007
► pp. 27–47

Diversity in logarithmic opinion pools

Andrew D.M. Smith | University of Edinburgh

Miles Osborne

Conditional random fields are state-of-the-art models for sequencing tasks such as named entity recognition. However, being globally conditioned, they have a tendency to overfit to a greater extent than other sequencing models. We introduce an approach to combat this overfitting called a logarithmic opinion pool (LOP). A LOP consists of a weighted combination of constituent models. We present the theory behind LOPs, and show that effective LOPs require constituent models that are diverse from one another. We examine different ways to introduce such diversity, including an approach that involves training the constituent models together, interactively. Our results show that, as expected from the underlying theory, explicitly optimising for constituent model diversity can improve performance over standard approaches to regularisation.

Published online: 10 August 2007

https://doi.org/10.1075/li.30.1.04smi

Cited by

Cited by 2 other publications

Coelho, Flávio Codeço, Claudia T. Codeço & Rustom Antia

2009. Dynamic Modeling of Vaccinating Behavior as a Function of Individual Beliefs. PLoS Computational Biology 5:7 ► pp. e1000425 ff.

Ray, Evan L., Jeffer E. Sasaki, Patty S. Freedson & John Staudenmayer

2018. Physical Activity Classification with Dynamic Discriminative Methods. Biometrics 74:4 ► pp. 1502 ff.

This list is based on CrossRef data as of 3 april 2024. Please note that it may not be complete. Sources presented here have been supplied by the respective publishers. Any errors therein should be reported to them.