All Things Email

About | Contact

Extracting social networks and contact information from email and the Web

by Aron Culotta, Ron Bekkerman, Andrew McCallum

Conference on Email and Anti-Spam, 2004-07-30
Language: English

Note: Published at CEAS 2004.

External links

Full text: PDF

Information about this paper

Abstract

We present an end-to-end system that extracts a user s social network and its members contact information given the user s email inbox. The system identifies unique people in email, finds theirWeb presence, and automatically fills the fields of a contact address book using conditional random fields a type of probabilistic model well-suited for such information extraction tasks. By recursively calling itself on new people discovered on the Web, the system builds a social network with multiple degrees of separation from the user. Additionally, a set of expertise-describing keywords are extracted and associated with each person. We outline the collection of statistical and learning components that enable this system, and present experimental results on the real email of two users; we also present results with a simple method of learning transfer, and discuss the capabilities of the system for addressbook population, expert-finding, and social network analysis.

Creative Commons. Some Rights Reserved.
Copyright © 2004 Jochen Topf
Unless otherwise noted the contents on this site are licensed under the
Creative Commons Attribution-ShareAlike License.