Bug#976884: UDD: encoding issue in bug titles
Hi,
> â\u0080\u0098class
I had a very similar issue in Lintian's database a few weeks ago. It
was caused by uploading (properly UTF-8 encoded) JSON to Postgres. The
Perl driver DBD::Pg encoded the data again and picked the 7-bit clean
escape sequences according to RFC4627, which is what I believe you are
seeing. They are further described here:
https://metacpan.org/pod/JSON::PP#ascii
My solution was to disable the automatic decoding layer in DBD::Pg via
'pg_enable_utf8 => 0'. It mirrors how I handle encoding elsewhere
(i.e. explicitly and without PerlIO, Bug#972878) and works great, but
did not enjoy much support on IRC in either the Perl or the Postgres
communities..
Kind regards
Felix Lechner
Reply to: