Htmlrecode

Htmlrecode is a HTML file character set converter.
Download

Htmlrecode Ranking & Summary

Advertisement

  • Rating:
  • License:
  • GPL
  • Price:
  • FREE
  • Publisher Name:
  • Joel Yliluoma
  • Publisher web site:
  • http://bisqwit.iki.fi/source/htmlrecode.html

Htmlrecode Tags


Htmlrecode Description

Htmlrecode is a HTML file character set converter. Htmlrecode is a HTML file character set converter.The Htmlrecode project recodes the HTML file using a new character set, while losing no characters at all. You can recode shift_jis to euc-jp, utf8 to latin1, iso-8859-15 to GB18030, iso-2022-jp to koi8-r etc if you wish, and none of the characters on the page will become unreadable (unless you specify -l switch, which disables making &#nnnn; escapes).Standard-correct HTML is a good thing. One of the goals in the development of this program is that it never makes the HTML more broken than it previously was. It should even make it better than it was.Usage: htmlrecode ]Reads stdin, writes stdout.Options: -I, --inset setname Assumed input character set (default: iso-8859-1) -O, --outset setname Wanted output character set (default: iso-8859-1) -V, --version Displays version information. -e, --usehex Use hexadecimal escapes. -g, --signature Prefix the file with an unicode signature. -h, --help This help. -l, --lossy Disable lossless conversion. -q, --quiet Be less verbose. -s, --strict Turn off support for slightly broken HTML. -v, --verbose Be less quiet. -x, --xmlmode XML mode: all tag param values quoted.Pipe in the html file and pipe the output to result file.Installation:$ make$ su# make installExample:This page template is locally stored in iso-8859-1, but is automatically converted to utf-8 to make the final version.Here are some latin letters: Here are some CJK (chinese/japanese/korean ideograms): 日本Here are some html escapes: >" Source code of the above:Here are some latin letters: Here are some CJK (chinese/japanese/korean ideograms): 日本Here are some html escapes: >"äöê What your browser is getting, is not 日 etc but the actual utf-8 characters.


Htmlrecode Related Software