banner



How To Clean Domain_6 Data

  • Home >
  • Proceedings >
  • Proceedings of the 2005 SIAM International Conference on Data Mining (SDM) >
  • 10.1137/1.9781611972757.24

Manage this Paper

Add to my favorites

Download Citations

Track Citations

Notify Me!

E-mail Alerts

RSS Feeds

Proceedings of the 2005 SIAM International Conference on Data Mining (SDM)


< Previous Chapter

Next Chapter >

Table of Contents

  • Abstract
  • PDF

Exploiting relationships for domain-independent data cleaning*†

Dmitri V. Kalashnikov, Sharad Mehrotra and Zhaoqi Chen

This Paper Appears in

Cover Image

Title Information

Published: 2005

ISBN: 978-0-89871-593-4

eISBN: 978-1-61197-275-7

Book Code: PR119

Pages: 12

*RelDC project (http://www.ics.uci.edu/∼dvk/RelDC)

†This work was supported in part by NSF grants 0331707, 0331690, and IRI-9703120.

Abstract

In this paper we address the problem of reference disambiguation. Specifically, we consider a situation where entities in the database are referred to using descriptions (e.g., a set of instantiated attributes). The objective of reference disambiguation is to identify the unique entity to which each description corresponds. The key difference between the approach we propose (called RelDC) and the traditional techniques is that RelDC analyzes not only object features but also inter-object relationships to improve the disambiguation quality. Our extensive experiments over two real datasets and also over synthetic datasets show that analysis of relationships significantly improves quality of the result.

Permalink: https://doi.org/10.1137/1.9781611972757.24

How To Clean Domain_6 Data

Source: https://epubs.siam.org/doi/10.1137/1.9781611972757.24

Posted by: perrymerhade80.blogspot.com

0 Response to "How To Clean Domain_6 Data"

Post a Comment

Iklan Atas Artikel

Iklan Tengah Artikel 1

Iklan Tengah Artikel 2

Iklan Bawah Artikel