Towards Optimal Statistical Watermarking

Huang, Baihe; Zhu, Hanlin; Zhu, Banghua; Ramchandran, Kannan; Jordan, Michael I.; Lee, Jason D.; Jiao, Jiantao

Computer Science > Machine Learning

arXiv:2312.07930 (cs)

[Submitted on 13 Dec 2023 (v1), last revised 6 Feb 2024 (this version, v3)]

Title:Towards Optimal Statistical Watermarking

Authors:Baihe Huang, Hanlin Zhu, Banghua Zhu, Kannan Ramchandran, Michael I. Jordan, Jason D. Lee, Jiantao Jiao

View PDF HTML (experimental)

Abstract:We study statistical watermarking by formulating it as a hypothesis testing problem, a general framework which subsumes all previous statistical watermarking methods. Key to our formulation is a coupling of the output tokens and the rejection region, realized by pseudo-random generators in practice, that allows non-trivial trade-offs between the Type I error and Type II error. We characterize the Uniformly Most Powerful (UMP) watermark in the general hypothesis testing setting and the minimax Type II error in the model-agnostic setting. In the common scenario where the output is a sequence of $n$ tokens, we establish nearly matching upper and lower bounds on the number of i.i.d. tokens required to guarantee small Type I and Type II errors. Our rate of $\Theta(h^{-1} \log (1/h))$ with respect to the average entropy per token $h$ highlights potentials for improvement from the rate of $h^{-2}$ in the previous works. Moreover, we formulate the robust watermarking problem where the user is allowed to perform a class of perturbations on the generated texts, and characterize the optimal Type II error of robust UMP tests via a linear programming problem. To the best of our knowledge, this is the first systematic statistical treatment on the watermarking problem with near-optimal rates in the i.i.d. setting, which might be of interest for future works.

Subjects:	Machine Learning (cs.LG); Computation and Language (cs.CL); Cryptography and Security (cs.CR); Information Theory (cs.IT); Machine Learning (stat.ML)
Cite as:	arXiv:2312.07930 [cs.LG]
	(or arXiv:2312.07930v3 [cs.LG] for this version)
	https://doi.org/10.48550/arXiv.2312.07930

Submission history

From: Baihe Huang [view email]
[v1] Wed, 13 Dec 2023 06:57:00 UTC (132 KB)
[v2] Sun, 21 Jan 2024 05:22:22 UTC (143 KB)
[v3] Tue, 6 Feb 2024 21:01:28 UTC (147 KB)

Computer Science > Machine Learning

Title:Towards Optimal Statistical Watermarking

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Machine Learning

Title:Towards Optimal Statistical Watermarking

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators