Non-Latin metadata is rendered broken
Given the following bug.html:
<html>
<head>
<meta http-equiv="Content-Type" content="text/html; encoding=utf-8">
<meta name="author" content="Дмитрий Козлюк (Dmitry Kozlyuk)">
<title>Текст заголовка (Title text)</title>
</head>
<body>
</body>
</html>
The command pagedjs-cli bug.html -o bug.pdf
produces a PDF with broken Cyrillic in both the XMP metadata and the PDF document metadata.
The problematic PDF is attached. My guess is that the strings must be converted to UTF-16 in the PDF metadata.
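
To illustrate the guess above, here is a minimal sketch of how a non-ASCII value could be encoded for the PDF document information dictionary: as UTF-16BE prefixed with the byte order mark FE FF. The helper names `pdf_text_string` and `pdf_hex_string` are hypothetical and are not part of pagedjs-cli; this is not a claim about how pagedjs-cli writes metadata today. The XMP packet is XML and is normally stored as UTF-8, so this sketch only concerns the Info dictionary strings.

```python
import codecs


def pdf_text_string(value: str) -> bytes:
    """Return the bytes of a PDF text string for `value` (hypothetical helper).

    Assumption: ASCII-only values are stored byte for byte; anything else is
    stored as UTF-16BE with a leading byte order mark (FE FF).
    """
    try:
        return value.encode("ascii")
    except UnicodeEncodeError:
        return codecs.BOM_UTF16_BE + value.encode("utf-16-be")


def pdf_hex_string(value: str) -> str:
    """Wrap the encoded bytes in PDF hex-string delimiters, e.g. <FEFF...>."""
    return "<" + pdf_text_string(value).hex().upper() + ">"


if __name__ == "__main__":
    # Values taken from bug.html above.
    print(pdf_hex_string("Дмитрий Козлюк (Dmitry Kozlyuk)"))
    print(pdf_hex_string("Текст заголовка (Title text)"))
```

If the Author and Title entries in the attached PDF do not start with the FE FF marker, that would be consistent with the strings having been written in some other encoding, although only inspecting the file can confirm it.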