dc.contributor.author: Waal, Ton de
dc.date.accessioned: 2007-11-12T19:35:08Z
dc.date.available: 2007-11-12T19:35:08Z
dc.date.issued: 2005
dc.identifier.citation: Waal, Ton de. "Automatic error localisation for categorical, continuous and integer data". SORT, 2005, Vol. 29, núm. 1
dc.identifier.issn: 1696-2281
dc.identifier.uri: http://hdl.handle.net/2099/3757
dc.description.abstract: Data collected by statistical offices generally contain errors, which have to be corrected before reliable data can be published. This correction process is referred to as statistical data editing. At statistical offices, certain rules, so-called edits, are often used during the editing process to determine whether a record is consistent or not. Inconsistent records are considered to contain errors, while consistent records are considered error-free. In this article we focus on automatic error localisation based on the Fellegi-Holt paradigm, which says that the data should be made to satisfy all edits by changing the fewest possible number of fields. Adoption of this paradigm leads to a mathematical optimisation problem. We propose an algorithm for solving this optimisation problem for a mix of categorical, continuous and integer-valued data. We also propose a heuristic procedure based on the exact algorithm. For five realistic data sets involving only integer-valued variables we evaluate the performance of this heuristic procedure.
dc.format.extent: 57-100
dc.language.iso: eng
dc.publisher: Institut d'Estadística de Catalunya
dc.relation.ispartof: SORT. 2005, Vol. 29, Núm. 1 [January-June]
dc.rights: Attribution-NonCommercial-NoDerivs 2.5 Spain
dc.rights.uri: http://creativecommons.org/licenses/by-nc-nd/2.5/es/
dc.subject.other: Mathematical logic
dc.subject.other: Statistics
dc.subject.other: Artificial intelligence
dc.subject.other: Mathematical programming
dc.title: Automatic error localisation for categorical, continuous and integer data
dc.type: Article
dc.subject.lemac: Lògica matemàtica
dc.subject.lemac: Estadística
dc.subject.lemac: Intel·ligència artificial
dc.subject.lemac: Programació (Matemàtica)
dc.description.peerreviewed: Peer Reviewed
dc.subject.ams: Classificació AMS::03 Mathematical logic and foundations::03B General logic
dc.subject.ams: Classificació AMS::62 Statistics
dc.subject.ams: Classificació AMS::68 Computer science::68T Artificial intelligence
dc.subject.ams: Classificació AMS::90 Operations research, mathematical programming::90C Mathematical programming
dc.rights.access: Open Access
local.personalitzacitacio: true

