Document Sanitization: Measuring Search Engine Information Loss and Risk of Disclosure for the Wikileaks cables
[chapter]
2012
Lecture Notes in Computer Science
In this paper we evaluate the effect of a document sanitization process on a set of information retrieval metrics, in order to measure information loss and risk of disclosure. As an example document set, we use a subset of the Wikileaks Cables, made up of documents relating to five key news items which were revealed by the cables. In order to sanitize the documents we have developed a semi-automatic anonymization process following the guidelines of Executive Order 13526 (2009) of the US
doi:10.1007/978-3-642-33627-0_24
fatcat:ecreun3syjgavcfbnrijo6taba