[Date Prev][Date Next] [Thread Prev][Thread Next] [Date Index] [Thread Index]

Bug#976884: UDD: encoding issue in bug titles



Hi,

> â\u0080\u0098class

I had a very similar issue in Lintian's database a few weeks ago. It
was caused by uploading (properly UTF-8 encoded) JSON to Postgres. The
Perl driver DBD::Pg encoded the data again and picked the 7-bit clean
escape sequences according to RFC4627, which is what I believe you are
seeing. They are further described here:

    https://metacpan.org/pod/JSON::PP#ascii

My solution was to disable the automatic decoding layer in DBD::Pg via
'pg_enable_utf8 => 0'. It mirrors how I handle encoding elsewhere
(i.e. explicitly and without PerlIO, Bug#972878) and works great, but
did not enjoy much support on IRC in either the Perl or the Postgres
communities..

Kind regards
Felix Lechner


Reply to: