C-sanitized: a privacy model for document redaction and sanitization

David Sanchez; Montserrat Batet

by David Sanchez, Montserrat Batet

Released as a article .

2014

Abstract

Within the current context of Information Societies, large amounts of information are daily exchanged and/or released. The sensitive nature of much of this information causes a serious privacy threat when documents are uncontrollably made available to untrusted third parties. In such cases, appropriate data protection measures should be undertaken by the responsible organization, especially under the umbrella of current legislations on data privacy. To do so, human experts are usually requested to redact or sanitize document contents. To relieve this burdensome task, this paper presents a privacy model for document redaction/sanitization, which offers several advantages over other models available in the literature. Based on the well-established foundations of data semantics and the information theory, our model provides a framework to develop and implement automated and inherently semantic redaction/sanitization tools. Moreover, contrary to ad-hoc redaction methods, our proposal provides a priori privacy guarantees which can be intuitively defined according to current legislations on data privacy. Empirical tests performed within the context of several use cases illustrate the applicability of our model and its ability to mimic the reasoning of human sanitizers.
In text/plain format

Archived Files and Locations

application/pdf 717.3 kB
file_365xtigtqbaofjgtgiopnobu7u arxiv.org (repository)
web.archive.org (webarchive)

Read Archived PDF

Preserved and Accessible

Type article
Stage

submitted

Date 2014-06-17
Version v1
Language en ^?

arXiv 1406.4285v1

Work Entity
access all versions, variants, and formats of this works (eg, pre-prints)

Cite This

BibTeX
CSL-JSON
MLA
Harvard

Lookup Links

Worldcat
wikidata.org
CORE.ac.uk
Semantic Scholar
Google Scholar

Catalog Record
Revision: cf7ce185-2fdb-402d-bb6d-ab5b2467dddb
API URL: JSON

Edit Metadata View History

C-sanitized: a privacy model for document redaction and sanitization release_3j4oqidekjbgxbcikrctnxspb4

Abstract

Archived Files and Locations

C-sanitized: a privacy model for document redaction and sanitization `release_3j4oqidekjbgxbcikrctnxspb4`