• Hi

    I’m building a website for teaching Irish, which has á é í ó ú in its alphabet (you probably know where this is going).

    I have my database set to utf8_general_ci and noticed that when creating a post, the post cuts off after an accented letter. I have a remote create-post function written and thought I would add a character replacement function converting á etc. to ASCII.

    This seemed to work for the remote post; then I noticed why: it was storing the character as ASCII instead of á.

    Users trying to create posts through the dashboard are having the same problem. So it seems UTF8 doesn’t like the accents, although this seems to suggest otherwise. Does this mean I have to change the collation, following these steps? If so, is there an easier way (e.g. a plugin)?

    thanks
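
    A cut-off at the first accented letter, as described in the post above, is what you would expect if Latin-1 bytes reach a column that expects UTF-8: the byte for á is not valid UTF-8 on its own, so everything from that byte onward can be dropped. The sketch below only mimics that symptom in Python. The helper name `valid_utf8_prefix` is hypothetical, and the assumption that the client was sending Latin-1 is not confirmed anywhere in the thread.

```python
# Illustration only: this mimics the symptom, not the actual MySQL or
# WordPress code path.

def valid_utf8_prefix(data: bytes) -> str:
    """Return the longest prefix of `data` that decodes cleanly as UTF-8."""
    try:
        return data.decode("utf-8")
    except UnicodeDecodeError as err:
        # err.start is the offset of the first byte that is not valid UTF-8.
        return data[:err.start].decode("utf-8")


# Assumption: the text arrives encoded as Latin-1, where á is the single
# byte 0xE1, while the receiving side expects UTF-8.
latin1_bytes = "fáda tést".encode("latin-1")

print(valid_utf8_prefix(latin1_bytes))  # prints "f"
```

    If the same text is sent as real UTF-8 bytes instead, the helper returns it unchanged, which is why making both ends agree on one character set matters more than which one is chosen.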

Viewing 10 replies - 1 through 10 (of 10 total)
  • Moderator James Huff

    (@macmanx)

    As long as you have access to phpMyAdmin (most hosting providers offer this in their control panel) and a plain text editor, there is a much easier way to do it.

    I did a quick write-up on converting from latin1 to utf8 last month. Just swap the terms as necessary and you should be fine. The whole process is easier and certainly a lot safer than messing around with the MySQL command line.

    Thread Starter donal_og

    (@donal_og)

    Hi James

    I’ve tried this and still no joy. The only difference for me was that I was using TextPad on a PC, so step 5 (Select all of the text and select “Unicode (UTF-8)” from the Text/Reload Text With Encoding menu.) was skipped, as TextPad doesn’t have this option. I assume this is only a text formatting option anyway?

    The problem still exists when creating new posts in WordPress and comments (posted with BuddyPress). For example, fáda tést í ó abú appeared as f?da t?st ? ? ab? and is being saved in the DB like that; sometimes the accented letter is not displayed and the rest of the text is omitted completely.

    Thanks
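
    The f?da t?st ? ? ab? output quoted above has the shape of a lossy conversion: each character with no equivalent in the target character set is swapped for a single "?". A minimal sketch of that effect follows, using ASCII as a stand-in target. Which conversion actually happened on this site is not known from the thread, and `lossy_to_ascii` is a hypothetical helper name.

```python
# Illustration only: ASCII is a stand-in for "a charset that cannot
# represent the accented letters"; the real target charset is unknown.

def lossy_to_ascii(text: str) -> str:
    """Replace every character that ASCII cannot represent with '?'."""
    return text.encode("ascii", errors="replace").decode("ascii")


print(lossy_to_ascii("fáda tést í ó abú"))  # prints "f?da t?st ? ? ab?"
```

    Once the "?" has been written to the database the original letter is gone, so affected rows need to be re-entered rather than re-converted.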

    Moderator James Huff

    (@macmanx)

    Do you have latin1 set for the DB_CHARSET in the wp-config.php file?

    https://codex.www.ads-software.com/Editing_wp-config.php#Database_character_set

    Thread Starter donal_og

    (@donal_og)

    Hi James, that seems to have worked for the WP blog and BuddyPress usernames, thanks! The only problem remaining is the Twitter feeds I have displayed at the top of the page. I checked the charset for the page in header.php and it is set to charset=<?php bloginfo('charset'); ?>, which I assume is going to pick up the ‘latin1’ set in the reading section. Aithníonn is displayed OK in Twitter, but it is Aithn?-onn on the page and when I view source on the page. Any thoughts on that?

    Moderator James Huff

    (@macmanx)

    What are you using to bring in the Twitter feeds?

    Thread Starter donal_og

    (@donal_og)

    I’ve just realised that the home page is fine whereas the other pages aren’t. Surely they all use the same header.php?

    Thread Starter donal_og

    (@donal_og)

    Sorry, when I say home page, I mean the initial login page. Once the user logs in, the problem occurs, but again they should both be using header.php.

    Moderator James Huff

    (@macmanx)

    I’d assume so. You’ve done all the right steps. Are you using any caching plugins? Have you checked from any other browsers on any other computers?

    Thread Starter donal_og

    (@donal_og)

    I’ve noticed that comments (i.e. posts) on the general home page are working fine, but comments for groups, e.g. “what’s new in group 1”, are not rendering the accented letters.

    This also applies to the Twitter linkage, for which I am using Twitter for WordPress 1.9.7.

    Thread Starter donal_og

    (@donal_og)

    Hi

    Just wondering if anyone knows when BP 1.3 is out? I’ve had this bug for some time now, similar to this one for Portuguese: https://buddypress.org/community/groups/how-to-and-troubleshooting/forum/topic/i-have-a-charset-problem/

    I’ve done all the manual DB conversions etc. and the end result seems to be an AJAX Groups BuddyPress bug: if I type an accented Irish character (á, é, í, ó or ú) anywhere in the groups area, it comes up strange, e.g. ??r instead of ár (within an AJAX content area with subheadings of HOME, MEMBERS etc.), whereas on my homepage (non-AJAX) it seems fine. The initial thread was here: https://buddypress.org/community/groups/how-to-and-troubleshooting/forum/topic/from-latin1-to-uft8-issues/

    If 1.3 fixes this and is out soon, it would solve a major problem for me. I have been delaying properly releasing this site due to this bug and was considering migrating to another social system, as I heard BuddyPress was no longer supported. If that’s not true and 1.3 fixes this, I’d be happy to hold on a while.

    regards
    Dónal
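
    The two question marks in ??r (for the single letter á in ár) hint at a different failure from the single "?" seen earlier in the thread: á is two bytes in UTF-8, and if each byte is treated as its own Latin-1 character and then flattened, one letter turns into two "?". The sketch below reproduces only that pattern. Whether the BuddyPress AJAX path really does this is an assumption, and `misread_then_flatten` is a hypothetical name.

```python
# Illustration only: shows how one accented letter can turn into "??".
# It does not reproduce BuddyPress or WordPress internals.

def misread_then_flatten(text: str) -> str:
    """Encode as UTF-8, misread the bytes as Latin-1, then drop non-ASCII."""
    utf8_bytes = text.encode("utf-8")        # á becomes the bytes 0xC3 0xA1
    misread = utf8_bytes.decode("latin-1")   # each byte becomes its own character
    # Neither misread character is ASCII, so each is replaced with "?".
    return misread.encode("ascii", errors="replace").decode("ascii")


print(misread_then_flatten("ár"))  # prints "??r"
```

    If this is what is happening, the text is being handled as UTF-8 in one place and as Latin-1 in another, so the AJAX responses would be the place to compare charset settings against the non-AJAX pages.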

  • The topic ‘UTF8 to Latin’ is closed to new replies.