--- Begin Message ---
- To: Debian Bug Tracking System <submit@bugs.debian.org>
- Subject: tex4ht: you-on gets extra space
- From: Junichi Uekawa <dancer@netfort.gr.jp>
- Date: Wed, 26 Sep 2007 08:35:04 +0900
- Message-id: <87ejgmjp3r.dancerj%dancer@netfort.gr.jp>
Package: tex4ht
Version: 20070904-1
Severity: normal
Japanese text usually does not use space inside sentences (space is
not a delimiter). tex4ht seems to insert spaces after 'you-on'
(small-letters: "ゃゅょっャュョッ")
The original tex formatting is as follows:
「ところで,Debian Weekly News (DWN)は読んでいますか?Debian 界隈でおきていることについて書いているDebian Weekly News. 毎回読んでいるといろいろと分かって来ますが,一人で読んでいても,解説が少ないので,意味がわからないところもあるかも知れません.みんなでDWNを読んでみましょう.」
But it is rendered on HTML as:
「ところで, Debian Weekly News (DWN) は読んでいますか? Debian 界隈でおきていることについて書いて いる Debian Weekly News. 毎回読んでいるといろいろと分かっ て来ますが, 一人で読んでいても, 解説が少ないので, 意味がわからないところもあるかも知れません. みんなで DWN を読んでみま しょ う.」
A HTML snippet looks like (there is extra space in "</span> <span" after "っ")
class="dmjhira-10x-x-101">か</span><span
class="dmjhira-10x-x-101">っ</span> <span
class="dmjhira-10x-x-101">て</span><span
I'm using modified htlatex command to generate, called htplatex, as follows to use ptex-bin to process Japanese.
#!/bin/sh
#htplatex debianmeetingresume200708.tex jp,2,sections+
#apt-get install dvi2ps-fontdata-a2n dvi2dvi dvipng
if command -v xhtex > /dev/null 2>&1 ; then
true
else
export PATH=/usr/share/tex4ht:$PATH
fi
platex $5 '\makeatletter\def\HCode{\futurelet\HCode\HChar}\def\HChar{\ifx"\HCode\def\HCode"##1"{\Link##1}\expandafter\HCode\else\expandafter\Link\fi}\def\Link#1.a.b.c.{\g@addto@macro\@documentclasshook{\RequirePackage[#1,html]{tex4ht}}\let\HCode\documentstyle\def\documentstyle{\let\documentstyle\HCode\expandafter\def\csname tex4ht\endcsname{#1,html}\def\HCode####1{\documentstyle[tex4ht,}\@ifnextchar[{\HCode}{\documentstyle[tex4ht]}}}\makeatother\HCode '$2'.a.b.c.\input ' $1
platex $5 '\makeatletter\def\HCode{\futurelet\HCode\HChar}\def\HChar{\ifx"\HCode\def\HCode"##1"{\Link##1}\expandafter\HCode\else\expandafter\Link\fi}\def\Link#1.a.b.c.{\g@addto@macro\@documentclasshook{\RequirePackage[#1,html]{tex4ht}}\let\HCode\documentstyle\def\documentstyle{\let\documentstyle\HCode\expandafter\def\csname tex4ht\endcsname{#1,html}\def\HCode####1{\documentstyle[tex4ht,}\@ifnextchar[{\HCode}{\documentstyle[tex4ht]}}}\makeatother\HCode '$2'.a.b.c.\input ' $1
platex $5 '\makeatletter\def\HCode{\futurelet\HCode\HChar}\def\HChar{\ifx"\HCode\def\HCode"##1"{\Link##1}\expandafter\HCode\else\expandafter\Link\fi}\def\Link#1.a.b.c.{\g@addto@macro\@documentclasshook{\RequirePackage[#1,html]{tex4ht}}\let\HCode\documentstyle\def\documentstyle{\let\documentstyle\HCode\expandafter\def\csname tex4ht\endcsname{#1,html}\def\HCode####1{\documentstyle[tex4ht,}\@ifnextchar[{\HCode}{\documentstyle[tex4ht]}}}\makeatother\HCode '$2'.a.b.c.\input ' $1
dvi2dvi -F a2n ${1/.tex/.dvi} > ${1/.tex/.dvi.tmp}
mv ${1/.tex/.dvi.tmp} ${1/.tex/.dvi}
tex4ht -f/$1 -itex4ht/ht-fonts/mozilla/nttsub/$3
t4ht -f/$1 $4 -dhtml/
-- System Information:
Debian Release: lenny/sid
APT prefers unstable
APT policy: (500, 'unstable')
Architecture: amd64 (x86_64)
Kernel: Linux 2.6.23-rc5dancer (PREEMPT)
Locale: LANG=ja_JP.EUC-JP, LC_CTYPE=ja_JP.EUC-JP (charmap=EUC-JP) (ignored: LC_ALL set to ja_JP.eucJP)
Shell: /bin/sh linked to /bin/bash
Versions of packages tex4ht depends on:
ii libc6 2.6.1-2 GNU C Library: Shared libraries
ii libkpathsea4 2007-14 TeX Live: path search library for
ii tex4ht-common 20070904-1 LaTeX and TeX for Hypertext (HTML)
ii texlive-base-bin 2007-14 TeX Live: Essential binaries
tex4ht recommends no packages.
-- no debconf information
--- End Message ---