• I’m dealing with huge amount of 404s, because of google crawler, which try to access pages, that have never existed. I’m 100% sure, because they are in English, while my whole blog is in Polish. Look at these adress:
    eye_glasses
    health_insurance_obama_clinton
    holy_harmony_gratis_listen
    garage_door_opener_hbw0777
    xanax_is_harmless
    wallcovering
    exploited_college_girls_evangeline_in_the_free_online_encyclope
    how_to_program_lift-master_gate_opener_973lm

    and so on.
    I use Yoast WordPress SEO for generating xml site page, and there are none, even similar, pages.
    However, I’ve also noticed, that few of tables in my database have overhead of 4 mb, which can’t be fixed (4mb is in fact more than those tables have generally). Can it be any form of hack/malicious code in there? And if so, how to check and fix it?

    edit:
    Interesting, I’ve noticed, that these errors appeares only in www. version of my site. I have two: one www and one non-www, ’cause I wanted to use Google Speed service, and they demanded using www adress, so I had to verify www version in my webmaster tools separatly. In non-www version, there are no errors. Interesting, isn’t it?

Viewing 15 replies - 1 through 15 (of 17 total)
  • Please link to your site.

    Thread Starter Kamil Chmielewski

    (@kamster94)

    Interesting, I’ve noticed, that these errors appeares only in www. version of my site.

    Google does not return any results for https://www.kamster.pl

    I’m 100% sure, because they are in English, while my whole blog is in Polish

    I do not find anything in English as described by you. Possible that they may only occur occasionally, not all the time. If you really find them, it can be adware. Because adware do not show up in most of the scanning tools for infection/malware, I could not find anything after using such tools too.

    Non-existent URLs, as you describe them can be created by Google bots and they are inconsequential.

    Thread Starter Kamil Chmielewski

    (@kamster94)

    Now I’ve noticed, that number of tables with incurable overhead is bigger. Now it’s about 20 mb (5 tables, each with exactly 4 mb of overhead), while my whole db is about 8 mb. These tables are:
    options
    postmeta
    relevanssi
    statpress
    wfFileMods
    Is there any connection between that, and 404s?

    Is there any connection between that, and 404s?

    Yes, and NO, because some of the old URLs stored in your database may be returning those 404s, though theoretically they should not. Have you tried to optimize/ repair your database?

    Thread Starter Kamil Chmielewski

    (@kamster94)

    Yes, ofc, using many plugins, but none of them worked in this case (well, they are working, sice ‘normal’ overhead is deleted, but 4 mb in each of mentioned tables still remains)

    Restricting revisions can help a lot.
    https://codex.www.ads-software.com/Revision_Management
    Optimizing database also helps. You may use a plugin if you like.
    https://www.ads-software.com/extend/plugins/search.php?q=optimize+database
    Also review: https://codex.www.ads-software.com/WordPress_Optimization

    Backup your site first so that if anything goes wrong you can always restore.

    @kamster94

    Your ‘PressWork Theme’ looks suspicious enough:

    See the post on the bottom: Looks like presswork.me has been hacked

    Thread Starter Kamil Chmielewski

    (@kamster94)

    Developers of PressWork have given up, abandoning this theme. Look at https://twitter.com/PressWorkWP

    edit:
    Are you suggesting, that deleting presswork.me adress from everywhere in theme files could help? I don’t see a way, that this theme could cause this problem, because I’ve been using it for very long time, even back when my domain was kamster.tk. And there were no such problems.

    In other words, your English spam words came from your allegedly hacked Press Work theme written in English, right?

    Thread Starter Kamil Chmielewski

    (@kamster94)

    Read my edit please. I possibly can delete every occurence of presswork.me, but how could it help?

    I was working on a site having two Google Webmaster Tools profiles, with www prefix and without.

    There were tons of URLs Not Found errors, caused by editors typos. After cleaning the mess, I marked all errors in Google Webmaster Tools as fixed (you can see that option there)

    In a few days Google dropped all errors.

    If you remove your theme, soon enough you would get your Webmaster Tool record cleaned.

    Thread Starter Kamil Chmielewski

    (@kamster94)

    So you claim, it’s theme’s fault? I can’t see why? But ok, since presswork.me is not leading anywhere now, I’ve cut it from every single place. Or is there any code, that’s causing all this mess? But I’ve never dealt such errors, and I’ve been using this theme since beginnig of this year.

    Interesting, I’ve noticed, that these errors appeares only in www. version of my site.

    Sure, there is a backdoor which was used after you set www prefix profile.

    Not being a professional, neither you nor me would be able to find that backdoor.

    As long as we are saying that the theme is shitty, you need to get rid of it.

    Thread Starter Kamil Chmielewski

    (@kamster94)

    Well, if you say so. I like this theme. Do you know any similar? Or, at least, such professionally looking free theme? Anyway, the thread is to be closed.

Viewing 15 replies - 1 through 15 (of 17 total)
  • The topic ‘Google Crawler errors, 404s’ is closed to new replies.