[HTML][HTML] ERBlox: Combining matching dependencies with machine learning for entity resolution
Entity resolution (ER), an important and common data cleaning problem, is about detecting
data duplicate representations for the same external entities, and merging them into single
representations. Relatively recently, declarative rules called matching dependencies (MDs)
have been proposed for specifying similarity conditions under which attribute values in
database records are merged. In this work we show the process and the benefits of
integrating four components of ER:(a) Building a classifier for duplicate/non-duplicate record …
data duplicate representations for the same external entities, and merging them into single
representations. Relatively recently, declarative rules called matching dependencies (MDs)
have been proposed for specifying similarity conditions under which attribute values in
database records are merged. In this work we show the process and the benefits of
integrating four components of ER:(a) Building a classifier for duplicate/non-duplicate record …
[HTML][HTML] ERBlox: Combining Matching Dependencies with Machine Learning for Entity Resolution
L Bertossi - arXiv (Cornell University), 2015 - academia.edu
Entity resolution (ER), an important and common data cleaning problem, is about detecting
data duplicate representations for the same external entities, and merging them into single
representations. Relatively recently, declarative rules called matching dependencies (MDs)
have been proposed for specifying similarity conditions under which attribute values in
database records are merged. In this work we show the process and the benefits of
integrating three components of ER:(a) Classifiers for duplicate/non-duplicate record pairs …
data duplicate representations for the same external entities, and merging them into single
representations. Relatively recently, declarative rules called matching dependencies (MDs)
have been proposed for specifying similarity conditions under which attribute values in
database records are merged. In this work we show the process and the benefits of
integrating three components of ER:(a) Classifiers for duplicate/non-duplicate record pairs …
Showing the best results for this search. See all results