INSPIRING FUTURES

A comparison of techniques for name matching.

Peng, Taoxin, Li, Lin and Kennedy, Jessie (2012) A comparison of techniques for name matching. International Journal on Computing (In press), 2 (1). ISSN 2010 2283

Full text not available from this repository. (Request a copy)

Abstract/Description

Information explosion is a problem for everyone nowadays. It is a great challenge to all kinds of businesses to maintain high quality of data in their information applications, such as data integration, text and web mining, information retrieval, search engine, etc. In such applications, matching names is one of the popular tasks. There are a number of name matching techniques available. Unfortunately, there is no existing name matching technique that performs the best in all situations. Therefore, a problem that every researcher or a practitioner has to face is how to select an appropriate technique for a given dataset. This paper analyses and evaluates a set of popular name matching techniques on several carefully designed different datasets. The experimental comparison confirms the statement that there is no clear best technique. Some suggestions have been presented, which can be used as guidance for researchers and practitioners to select an appropriate name matching technique in a given dataset.

Item Type: Article
Print ISSN: 2010 2283
Additional Information: This paper challenges a problem that every researcher or a practitioner has to face: how to select an appropriate name matching technique for a given dataset. It analyses and evaluates a set of popular name matching techniques on a number of carefully designed different datasets. A comprehensive experimental comparison confirms the statement that there is no clear best technique. Therefore, the selection of an appropriate name matching technique should depend on the nature of a dataset. Several suggestions have been presented, which can be used as guidance for such a selection. The work also introduces a number of further investigations.
Uncontrolled Keywords: Name matching; dataset;
University Divisions/Research Centres: Edinburgh Napier University, Institute for Informatics and Digital Innovation
Dewey Decimal Subjects: 000 Computer science, information & general works > 000 Computer science, knowledge & systems > 004 Data processing & computer science
Library of Congress Subjects: Q Science > QA Mathematics > QA76 Computer software
Item ID: 5116
Depositing User: Computing Research
Date Deposited: 12 Apr 2012 15:43
Last Modified: 22 Nov 2012 15:10
URI: http://researchrepository.napier.ac.uk/id/eprint/5116

Actions (login required)

View Item

Edinburgh Napier University is a registered Scottish charity. Registration number SC018373