Catalog Search
 
Advanced Search

My shopping cart cart icon
Your cart is empty

My wish list wishlist icon
Your wish list is empty



Last update:
9 February 2010

© John Benjamins
Home

Natural Language Processing for Online Applications

Text retrieval, extraction and categorization

Second revised edition

Peter Jackson and Isabelle Moulinier
Thomson Corporation

2007. x, 232 pp.
Publishing status: Available

HardboundIn stock
978 90 272 4992 0 / EUR 105.00 / USD 158.00
Add to shopping cart

PaperbackIn stock
978 90 272 4993 7 / EUR 33.00 / USD 49.95

Add to shopping cart

e-BookAvailable from e-book platforms
978 90 272 9244 5 / EUR 105.00 / USD 158.00
Ordering information

Add to wish list

This text covers the technologies of document retrieval, information extraction, and text categorization in a way which highlights commonalities in terms of both general principles and practical concerns. It assumes some mathematical background on the part of the reader, but the chapters typically begin with a non-mathematical account of the key issues. Current research topics are covered only to the extent that they are informing current applications; detailed coverage of longer term research and more theoretical treatments should be sought elsewhere. There are many pointers at the ends of the chapters that the reader can follow to explore the literature. However, the book does maintain a strong emphasis on evaluation in every chapter both in terms of methodology and the results of controlled experimentation.


Table of contents

Preface to the 2nd edition
ix
Chapter 1. Natural language processing
1
1.1 What is NLP?
2
1.2 NLP and linguistics
5
1.3 Linguistic tools
11
1.4 Plan of the book
20
Chapter 2. Document retrieval
23
2.1 Information retrieval
24
2.2 Indexing technology
25
2.3 Query processing
27
2.4 Evaluating search engines
45
2.5 Attempts to enhance search performance
52
2.6 The future ofWeb searching
59
Chapter 3. Information extraction
69
3.1 The message understanding conferences
70
3.2 Regular expressions
73
3.3 Finite automata in FASTUS
75
3.4 Context-free grammars
92
3.5 Limitations of current technology and future research
104
3.6 Summary of information extraction
110
Chapter 4. Text categorization
113
4.1 Overview of categorization tasks
115
4.2 Handcrafted rule based methods
120
4.3 Inductive learning for text classification
122
4.4 Nearest neighbor algorithms
144
4.5 Combining classifiers
147
4.6 Evaluation of text categorization systems
154
Chapter 5. Text mining
163
5.1 What is text mining?
164
5.2 Resolving reference and coreference
168
5.3 Automatic summarization
183
5.4 Testing of automatic summarization programs
204
5.5 Prospects for text mining and NLP
210
References
215
Index
227