• My post are written in German, so letter like ? ? ü ? are used. When I download my posts, the letters are transformed, e.g. ü = ü

    Any chance to set a translation table once, so no need for further manual repetetive work?

    Regards, Hans

Viewing 2 replies - 1 through 2 (of 2 total)
  • Plugin Author Rick Hellewell

    (@rhellewellgmailcom)

    Thanks for the question. Language translations are not my strong point, so am not sure were to go. The plugin doesn’t really do any translation of the post data, but it may be that I just need to set the character code in the generated HTML file specifically.

    But, my research indicates that the DB_CHARSET and DB_COLLATE settings in wp-config.php are what WP uses to define the character set used in your database.

    Can you tell me what those values are in your site? That info might be helpful as I figure out what to do.

    Thanks…Rick…

    Thread Starter 4tegs

    (@4tegs)

    Hi Rick,

    thanks a lot for your immediate responsse.
    Here is what’s within wp-config.php

    define(‘DB_CHARSET’, ‘utf8’);
    /** The Database Collate type. Don’t change this if in doubt. */
    define(‘DB_COLLATE’, ”);

    I checked an example post with PHPMyAdmin. Here is a a copy & paste:

    post_title: Brückentag in Herzegowina: Mostar
    post_content:
    Adi’s Hostel in Mostar: 43°20′47.22″N 17°48′42.01″E
    Kloster Rama: 43°48′15.34″N 17°31′50.87″E

    With the plugin this translates into exported HTML code:

    <br><h1 class=”blog2html_h1″>Brückentag in Herzegowina: Mostar</h1><h2 class=”blog2html_h2″>5 Jun 2018</h2><p class=”blog2html_p”>Adi’s Hostel in Mostar: 43°20′47.22″N 17°48′42.01″E</p>

    so something in the export must do the translation from pure text into into &**** for ??ü and so on.
    As an example, when you look at the 43° , what happens is, that by any reason an additional character is added, here a &Acirc (the is no such character at all), ending up having now two charcters instead of one ( &Acirc and °) and in most cases (” or ‘ ) its translated wrong in total.

    Finally, even if I think this is only minor information: The HTML file itself is coded in UTF8 (checked it with Notepad++).

    I hope I could deliver what you expect.

    Best regards, Hans

Viewing 2 replies - 1 through 2 (of 2 total)
  • The topic ‘International Codepages?’ is closed to new replies.