• Resolved kosis

    (@kosis)


    Hi,

    I installed Relevanssi and indexed my site (went fairly quickly, no timeouts, was also able to add some stopwords, which is a nice feature), but then found it had not taken over my WP search function; I’m unsure why. I uninstalled and will return to that issue later depending in part on the answer to this question:

    I’m curious how html entities are indexed. Many languages use them, but they render differently, of course, on the front end than in the html markup (which is the point). Visitors, however, will always search “Müller,” for example, instead of “M&uuml;ller.”

    I notice that Relevanssi does index, for example, “&uuml;” as “uuml”; but will it find “Müller” as a search term if it has been written as “M&uuml;ller” in the html markup? I use quite a few html entities.

    Thanks.

    https://www.ads-software.com/plugins/relevanssi/

Viewing 3 replies - 1 through 3 (of 3 total)
  • Thread Starter kosis

    (@kosis)

    Sorry, forgot to post the entity so it would not render:
    Should read:
    “Visitors, however, will always search ‘Müller,’ for example, instead of ‘M&uuml;ller’ using the html entity for u with an Umlaut.”
    And:
    “but will it find ‘Müller’ as a search term if it has been rendered ‘M&uuml;ller’ using the html entity for u with an Umlaut in the html markup?”

    Plugin Author Mikko Saari

    (@msaari)

    Relevanssi is UTF-8 compatible, so if your site says ü, Relevanssi will index it as ü. These days there’s very little reason to use entities instead of actual letters, especially for common cases like ä, ö or ü.

    Thread Starter kosis

    (@kosis)

    Thanks for the swift response. Seems I’ll have to replace some entities, but your explanation helps. Thanks again.
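
For anyone landing here with the same problem, replacing entities with the actual letters could be sketched roughly as below. This is only an illustration in Python, not anything Relevanssi or WordPress provides: the function name `decode_text_entities` and the sample string are made up, and it assumes you run it on exported post content (with a backup) rather than on a live database. Entities that stand for markup characters (`&lt;`, `&gt;`, `&amp;`, quotes) are deliberately left escaped so the HTML is not broken.

```python
import html
import re

# Matches one named, decimal, or hexadecimal character reference, e.g. &uuml; &#252; &#xFC;
ENTITY = re.compile(r"&(?:#\d+|#[xX][0-9a-fA-F]+|[A-Za-z][A-Za-z0-9]*);")

# Characters that must stay escaped, otherwise the surrounding HTML could change meaning.
KEEP_ESCAPED = {"<", ">", "&", '"', "'"}


def decode_text_entities(markup: str) -> str:
    """Replace HTML entities with UTF-8 letters, leaving markup-significant ones alone.

    Hypothetical helper for illustration; not part of Relevanssi or WordPress.
    """
    def replace(match: re.Match) -> str:
        entity = match.group(0)
        char = html.unescape(entity)  # standard-library entity decoding
        return entity if char in KEEP_ESCAPED else char

    return ENTITY.sub(replace, markup)


if __name__ == "__main__":
    print(decode_text_entities("M&uuml;ller &amp; S&#246;hne"))  # Müller &amp; Söhne
```

After a replacement like this, the stored text contains “Müller” itself, which is what visitors type into the search box, so the index would need to be rebuilt afterwards.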

  • The topic ‘How html entities are indexed’ is closed to new replies.