Nov. 15, 2022, 2:16 a.m. | Jumon Nozaki, Yugo Murawaki

cs.CL updates on arXiv.org arxiv.org

Previous studies on neural linguistic steganography, except Ueoka et al.
(2021), overlook the fact that the sender must detokenize cover texts to avoid
arousing the eavesdropper's suspicion. In this paper, we demonstrate that
segmentation ambiguity indeed causes occasional decoding failures at the
receiver's side. With the near-ubiquity of subwords, this problem now affects
any language. We propose simple tricks to overcome this problem, which are even
applicable to languages without explicit word boundaries.

arxiv segmentation steganography

Software Engineer for AI Training Data (School Specific)

@ G2i Inc | Remote

Software Engineer for AI Training Data (Python)

@ G2i Inc | Remote

Software Engineer for AI Training Data (Tier 2)

@ G2i Inc | Remote

Data Engineer

@ Lemon.io | Remote: Europe, LATAM, Canada, UK, Asia, Oceania

Artificial Intelligence – Bioinformatic Expert

@ University of Texas Medical Branch | Galveston, TX

Lead Developer (AI)

@ Cere Network | San Francisco, US