Peng, Taoxin, Li, Lin and Kennedy, Jessie (2012) A comparison of techniques for name matching. International Journal on Computing (In press), 2 (1). ISSN 2010 2283
Full text not available from this repository. (Request a copy)Abstract/Description
Information explosion is a problem for everyone nowadays. It is a great challenge to all kinds of businesses to maintain high quality of data in their information applications, such as data integration, text and web mining, information retrieval, search engine, etc. In such applications, matching names is one of the popular tasks. There are a number of name matching techniques available. Unfortunately, there is no existing name matching technique that performs the best in all situations. Therefore, a problem that every researcher or a practitioner has to face is how to select an appropriate technique for a given dataset. This paper analyses and evaluates a set of popular name matching techniques on several carefully designed different datasets. The experimental comparison confirms the statement that there is no clear best technique. Some suggestions have been presented, which can be used as guidance for researchers and practitioners to select an appropriate name matching technique in a given dataset.
| Item Type: | Article |
|---|---|
| Print ISSN: | 2010 2283 |
| Additional Information: | This paper challenges a problem that every researcher or a practitioner has to face: how to select an appropriate name matching technique for a given dataset. It analyses and evaluates a set of popular name matching techniques on a number of carefully designed different datasets. A comprehensive experimental comparison confirms the statement that there is no clear best technique. Therefore, the selection of an appropriate name matching technique should depend on the nature of a dataset. Several suggestions have been presented, which can be used as guidance for such a selection. The work also introduces a number of further investigations. |
| Uncontrolled Keywords: | Name matching; dataset; |
| University Divisions/Research Centres: | Edinburgh Napier University, Institute for Informatics and Digital Innovation |
| Dewey Decimal Subjects: | 000 Computer science, information & general works > 000 Computer science, knowledge & systems > 004 Data processing & computer science |
| Library of Congress Subjects: | Q Science > QA Mathematics > QA76 Computer software |
| Item ID: | 5116 |
| Depositing User: | Computing Research |
| Date Deposited: | 12 Apr 2012 15:43 |
| Last Modified: | 22 Nov 2012 15:10 |
| URI: | http://researchrepository.napier.ac.uk/id/eprint/5116 |
Actions (login required)
| View Item |

Tools
Tools