denexter added a comment.
There's nothing in the spec that I've found, there's a more detailed explanation in the word parser https://cgit.kde.org/calligra.git/tree/filters/words/msword-odf/wv2/src/parser9x.cpp#n513 but it still doesn't cite any sources. Removing the entire string is excessive and may be a problem with some documents, but removing that without addressing the decoding issue gives you a string with junk or missing characters whereas addressing the decoding gives the full correct string. REPOSITORY R8 Calligra REVISION DETAIL https://phabricator.kde.org/D25034 To: denexter, pvuorela Cc: Calligra-Devel-list, davidllewellynjones, dcaliste, cochise, vandenoever