• Resolved redraphdk85

    (@redraphdk85)


    Hi,

    I’m having quite a few issues on a multilingual website. I use WPML and WooCommerce to handle products and translations. But when I look at the crawl reports from Google, I see that it crawls some URLs that have never existed. Example:

    site/en/product/english-single-product-name/another-german-single-product-name

    There are always approx. 15-20 of these in the crawl reports and/or sitemap. They have never existed on the site, but Google crawls them anyway and regards them as errors on my part. I can’t for the life of me figure out why this is happening.
    I’ve had an ongoing support case with WPML on this matter, since the translations are managed there, but they keep insisting that this problem is somehow caused by Yoast. They suggest disabling the sitemap altogether and switching it back on. Is that a valid suggestion, and what are the consequences of doing so?

    Thanks for your assistance.

    BR,
    Alex

Viewing 4 replies - 1 through 4 (of 4 total)
  • Plugin Support Md Mazedul Islam Khan

    (@mazedulislamkhan)

    If you look at the sitemap_index.xml, do you find all these nonexistent URLs in the sitemap? If yes, please share the relevant sitemap URL here with us so that we can take a look at it.

    If not, it isn’t something specific to Yoast SEO. Google’s crawlers are somehow finding these URLs on your site and crawling them, and when the crawlers see that the URLs are broken, you see the error in Google Search Console.
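One way to run this check without reading the XML by hand is to parse the sitemap and compare it against the URLs reported in Search Console. The following is a minimal sketch, assuming the sitemap has already been downloaded as text. The function names and the `example.com` URLs are hypothetical and are not part of Yoast SEO.

```python
import xml.etree.ElementTree as ET

# Namespace used by standard XML sitemaps (sitemaps.org protocol).
SITEMAP_NS = {"sm": "http://www.sitemaps.org/schemas/sitemap/0.9"}


def sitemap_urls(xml_text):
    """Return the set of <loc> values found in a sitemap document."""
    root = ET.fromstring(xml_text)
    return {
        loc.text.strip()
        for loc in root.findall(".//sm:loc", SITEMAP_NS)
        if loc.text
    }


def split_reported(reported, xml_text):
    """Split reported URLs into (in_sitemap, not_in_sitemap)."""
    known = sitemap_urls(xml_text)
    in_sitemap = [u for u in reported if u in known]
    not_in_sitemap = [u for u in reported if u not in known]
    return in_sitemap, not_in_sitemap


# Hypothetical sample data, for illustration only.
SAMPLE = """<urlset xmlns="http://www.sitemaps.org/schemas/sitemap/0.9">
  <url><loc>https://example.com/en/product/stove-a/</loc></url>
  <url><loc>https://example.com/produkt/plaat/</loc></url>
</urlset>"""
```

URLs that land in the first list point at the sitemap as the source. URLs in the second list are being discovered some other way, for example through internal links.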

    Thread Starter redraphdk85

    (@redraphdk85)

    I find some of them in the sitemap, but not all. It’s like there are two types of error that continuously occur:

    Type 1:
    Here is the URL for the product sitemap in question:
    https://heta.dk/product-sitemap.xml

    In the sitemap, the issues start to show approx. 65 lines above the bottom, from the line ‘…/produkt/plaat/’ downwards. This is a Swedish product name, and if you visit the link, you’ll be redirected to ‘…sv/produkt/plaat’. The issue continues through the rest of the sitemap, all the way to the bottom: the link has no Swedish language code in it but still contains a Swedish product name, and if you visit it, you are redirected to the right place.
    What I don’t understand is how these get into the sitemap in the first place, because the URLs are faulty and have never existed.
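The Type 1 entries can be listed mechanically, because each one redirects to a different address than the one in the sitemap. A rough sketch, assuming a plain GET request is enough to observe the redirect; `final_url` and `redirecting_entries` are hypothetical helper names, and the `resolve` parameter exists only so the logic can be tried without network access:

```python
import urllib.request


def final_url(url, timeout=10):
    """Request the URL, following redirects, and return the final address."""
    with urllib.request.urlopen(url, timeout=timeout) as response:
        return response.geturl()


def redirecting_entries(urls, resolve=final_url):
    """Return (listed, final) pairs for entries that redirect elsewhere."""
    pairs = []
    for url in urls:
        target = resolve(url)
        # Ignore differences that are only a trailing slash.
        if target.rstrip("/") != url.rstrip("/"):
            pairs.append((url, target))
    return pairs
```

Any pair this returns is a sitemap entry that points at a redirect instead of a final page, which is the pattern described above for the entries without a language code.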

    Type 2:
    In the crawl reports, I always get about 10-15 links that look something like this:

    site/en/product/english-product-name/seemingly-random-product-name-from-another-language

    The English product name is the same every time, but for some reason, a random product name from another language is appended, making for an obvious 404. Again, Google is crawling a URL that has never existed on the site, and I can’t figure out why.
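The Type 2 pattern can be picked out of a crawl-report export by looking for path segments after the product slug. This is a sketch only, assuming product permalinks have the shape `/<lang>/<base>/<slug>/` with nothing after the slug. The base names in `PRODUCT_BASES` are taken from the URLs quoted in this thread, and the function names are hypothetical.

```python
from urllib.parse import urlparse

# Assumption: the product permalink bases in use on the site.
# Adjust this set to match the real permalink settings.
PRODUCT_BASES = {"product", "produkt"}


def extra_segments(url):
    """Return the path segments that follow the product slug, if any."""
    parts = [p for p in urlparse(url).path.split("/") if p]
    for i, part in enumerate(parts):
        if part in PRODUCT_BASES:
            # parts[i + 1] is the product slug; anything after it is unexpected.
            return parts[i + 2:]
    return []


def malformed_product_urls(urls):
    """Keep only the URLs that carry something after the product slug."""
    return [u for u in urls if extra_segments(u)]
```

Grouping the flagged URLs by their extra segment may show whether the appended names come from one language or one template, which could help narrow down where the links are generated.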

    I’ve had WPML check this out, and they ended up suggesting that I deactivate the sitemap altogether. Any ideas for a solution?

    BR,
    Alex

    Plugin Support Md Mazedul Islam Khan

    (@mazedulislamkhan)

    Issue 1: Yoast SEO only includes URLs that exist on a site. However, if you think that Yoast SEO is including nonexistent URLs in the product-sitemap.xml, we’d like you to perform a conflict check first. Can you try to gather as much information for us as possible? Please perform the following:
    1. Check for conflicts.
    2. Check for JavaScript errors with your console.
    If you find any JavaScript errors related to Yoast SEO, or if there is a conflict with a plugin or a theme, you can create a new GitHub issue for our developers. Please report the issue to the third-party developer as well.
    If you don’t find any conflicts or errors, we think the issue is specific to your site. We’d need to investigate further but are unable to do so on these forums. You can purchase Yoast SEO Premium to receive our Premium email support, and we can help you further.

    Issue 2: If the nonexistent URLs are included in the XML sitemap that is submitted to Google, it is expected that you will see them in the crawl errors. So, we’d like you to perform the steps above first before moving forward with this issue.

    Plugin Support Jerlyn

    (@jerparx)

    Closed due to inactivity.

  • The topic ‘Google crawls URL’s that never existed’ is closed to new replies.