I’ve watched quite a few Google tech talks in the last few months. There are lots available. Here are some of the ones I found interesting (If you have recommendations for good talks that I haven’t listed below please leave a comment). read more »
Last week I was got an email to my Gmail account about a meeting that I was going to attend. I added it to my calendar and noticed that Gmail extracted some of the information from the email and used it to fill out fields in the new calendar entry. This was pretty exciting as it is a real-life example of how information extraction can be useful. It got me wondering how they are doing the extraction and how Automated IE could be used to improve integration between different applications. read more »
Finn, A. (2006). A Multi-Level Boundary Classification Approach to Information Extraction. Phd thesis (University College Dublin). pdf
Abstract
Information Extraction (IE) is the process of identifying a set of pre-defined relevant items in text documents. We investigate the application of Machine Learning classification techniques to the problem of Information Extraction. In particular we use Support Vector Machines and several different feature-sets to build a set of classifiers for Information Extraction (IE). We show that this approach is competitive with current state-of-the-art Information Extraction algorithms based on specialized learning algorithms. read more »
ELIE is a tool for adaptive information extraction from text. It also provides a number of other text processing tools e.g. POS tagging, chunking, gazetteer, stemming. read more »
Finn, A. & Kushmerick, N. (2004). Multi-level Boundary Classification for Information Extraction. In Proc. European Conference on Machine Learning
(Pisa). pdf read more »
Finn, A. & Kushmerick, N. (2004). Information Extraction by Convergent Boundary Classification. AAAI-04 Workshop on Adaptive Text Extraction and Mining (San Jose). pdf read more »
Finn, A. & Kushmerick, N. (2003). Active learning selection strategies for information extraction. ECML-03 Workshop on Adaptive Text Extraction and Mining (Croatia). pdf read more »
Finn, A. & Kushmerick, N. (2003). Active learning strategies for information extraction. Poster submission(rejected) to International Joint Conference on Artificial Intelligence (Acapulco). pdf read more »