charset

I had a website running fine on Ensim 3.12 which I later moved to Ensim 3.7
on RHE.

The website uses MySQL, Tomcat (JSP) and Apache 2.0, Tomcat and MYSQL are
default installations of Ensim 3.7. The website is updated frequently with
content from MS Word.

When the website has been moved to Ensim 3.7 some of the charcters from
Microsft Word do not display properly e.g. apostrophe, quotation Marks,
bullets etc display as "?"

I changed the charset of Apache 2.0 from UTF-8 to ISO-8559-1 as was used in
Ensim 3.12. Mysql has default charset latin1 (ISO-8559-1) as was on Ensim
3.12. Still the problem was not solved.

Since I do not know JSP I created a PHP Script to replace the few characters
I knew in existing database to HTML equivalent and those characters were
fixed.

$sometext=ereg_replace('‘', '‘', $sometext);
$sometext=ereg_replace('’', '’', $sometext);
$sometext=ereg_replace('“', '“', $sometext);
$sometext=ereg_replace('”', '”', $sometext);
$sometext=ereg_replace('·', '·', $sometext);
$sometext=ereg_replace('’', '’', $sometext);

But there were many more characters which I was not aware of and they are
still showing as "?" and also the new data entered also shows those
characters as "?"
As you can see above, I pasted the actual characters in ereg_replace but
they have been replaced with some unknown characters.

These characters actually show up as "?" in JSP and if I view them from PHP
they show as you can see above.

=======================================

 

 

 

 

Top