No valid sitemap parsed for crawler
-
Hey, we got some really strange issue with the sitemap crawler on one of our sites. We do use the same config on several sites on the same server and only this page is having issues. The sitemap was working, then its gone. Sometimes it loads again, but then its gone again.
we have already added the test script for the var_dump and its generating the following outcome:
array(6) { ["headers"]=> object(WpOrg\Requests\Utility\CaseInsensitiveDictionary)#36393 (1) { ["data":protected]=> array(24) { ["x-powered-by"]=> string(9) "PHP/8.3.3" ["x-dns-prefetch-control"]=> string(2) "on" ["x-robots-tag"]=> string(7) "noindex" ["content-type"]=> string(23) "text/xml; charset=UTF-8" ["pragma"]=> string(6) "public" ["cache-control"]=> string(46) "no-cache, no-store, must-revalidate, max-age=0" ["expires"]=> string(1) "0" ["x-litespeed-cache-control"]=> string(8) "no-cache" ["vary"]=> string(26) "Accept-Encoding,User-Agent" ["strict-transport-security"]=> string(35) "max-age=15768000; includeSubDomains" ["referrer-policy"]=> string(31) "strict-origin-when-cross-origin" ["x-frame-options"]=> string(10) "sameorigin" ["x-xss-protection"]=> string(13) "1; mode=block" ["x-content-type-options"]=> string(7) "nosniff" ["expect-ct"]=> string(22) "enforce, max-age=21600" ["content-security-policy"]=> string(25) "upgrade-insecure-requests" ["permissions-policy"]=> string(113) "geolocation=(), midi=(),sync-xhr=(),accelerometer=(), gyroscope=(), magnetometer=(), camera=(), fullscreen=(self)" ["x-litespeed-tag"]=> string(4) "b79_" ["content-length"]=> string(3) "408" ["content-encoding"]=> string(2) "br" ["date"]=> string(29) "Sat, 24 Feb 2024 16:35:23 GMT" ["server"]=> string(9) "LiteSpeed" ["x-qc-pop"]=> string(28) "lej613-hyb-worker.quic.cloud" ["alt-svc"]=> string(167) "h3=":443"; ma=2592000, h3-29=":443"; ma=2592000, h3-Q050=":443"; ma=2592000, h3-Q046=":443"; ma=2592000, h3-Q043=":443"; ma=2592000, quic=":443"; ma=2592000; v="43,46"" } } ["body"]=> string(2246) " https://nookyyy.com/page-sitemap.xml 2024-02-21T16:04:38+00:00 https://nookyyy.com/review-sitemap1.xml 2024-02-23T21:53:41+00:00 https://nookyyy.com/review-sitemap2.xml 2024-02-23T20:40:38+00:00 https://nookyyy.com/review-sitemap3.xml 2024-02-23T13:04:35+00:00 https://nookyyy.com/review-sitemap4.xml 2024-02-23T13:03:43+00:00 https://nookyyy.com/review-sitemap5.xml 2024-02-23T12:53:40+00:00 https://nookyyy.com/review-sitemap6.xml 2024-02-23T12:50:49+00:00 https://nookyyy.com/review-sitemap7.xml 2024-02-23T12:37:02+00:00 https://nookyyy.com/review-sitemap8.xml 2024-02-23T12:35:39+00:00 https://nookyyy.com/review-sitemap9.xml 2024-02-23T12:31:10+00:00 https://nookyyy.com/review-sitemap10.xml 2024-02-23T12:29:58+00:00 https://nookyyy.com/review-sitemap11.xml 2024-02-23T12:20:30+00:00 https://nookyyy.com/review-sitemap12.xml 2024-02-23T12:18:48+00:00 https://nookyyy.com/review-sitemap13.xml 2024-02-23T12:16:40+00:00 https://nookyyy.com/review-sitemap14.xml 2024-02-23T12:14:45+00:00 https://nookyyy.com/custom-sitemap.xml 2024-02-24T18:35:16+00:00 " ["response"]=> array(2) { ["code"]=> int(200) ["message"]=> string(2) "OK" } ["cookies"]=> array(0) { } ["filename"]=> NULL ["http_response"]=> object(WP_HTTP_Requests_Response)#36392 (5) { ["data"]=> NULL ["headers"]=> NULL ["status"]=> NULL ["response":protected]=> object(WpOrg\Requests\Response)#36394 (10) { ["body"]=> string(2246) " https://nookyyy.com/page-sitemap.xml 2024-02-21T16:04:38+00:00 https://nookyyy.com/review-sitemap1.xml 2024-02-23T21:53:41+00:00 https://nookyyy.com/review-sitemap2.xml 2024-02-23T20:40:38+00:00 https://nookyyy.com/review-sitemap3.xml 2024-02-23T13:04:35+00:00 https://nookyyy.com/review-sitemap4.xml 2024-02-23T13:03:43+00:00 https://nookyyy.com/review-sitemap5.xml 2024-02-23T12:53:40+00:00 https://nookyyy.com/review-sitemap6.xml 2024-02-23T12:50:49+00:00 https://nookyyy.com/review-sitemap7.xml 2024-02-23T12:37:02+00:00 https://nookyyy.com/review-sitemap8.xml 2024-02-23T12:35:39+00:00 https://nookyyy.com/review-sitemap9.xml 2024-02-23T12:31:10+00:00 https://nookyyy.com/review-sitemap10.xml 2024-02-23T12:29:58+00:00 https://nookyyy.com/review-sitemap11.xml 2024-02-23T12:20:30+00:00 https://nookyyy.com/review-sitemap12.xml 2024-02-23T12:18:48+00:00 https://nookyyy.com/review-sitemap13.xml 2024-02-23T12:16:40+00:00 https://nookyyy.com/review-sitemap14.xml 2024-02-23T12:14:45+00:00 https://nookyyy.com/custom-sitemap.xml 2024-02-24T18:35:16+00:00 " ["raw"]=> string(3332) "HTTP/1.1 200 OK Connection: close x-powered-by: PHP/8.3.3 x-dns-prefetch-control: on x-robots-tag: noindex content-type: text/xml; charset=UTF-8 pragma: public cache-control: no-cache, no-store, must-revalidate, max-age=0 expires: 0 x-litespeed-cache-control: no-cache vary: Accept-Encoding,User-Agent strict-transport-security: max-age=15768000; includeSubDomains referrer-policy: strict-origin-when-cross-origin x-frame-options: sameorigin x-xss-protection: 1; mode=block x-content-type-options: nosniff expect-ct: enforce, max-age=21600 content-security-policy: upgrade-insecure-requests permissions-policy: geolocation=(), midi=(),sync-xhr=(),accelerometer=(), gyroscope=(), magnetometer=(), camera=(), fullscreen=(self) x-litespeed-tag: b79_ content-length: 408 content-encoding: br date: Sat, 24 Feb 2024 16:35:23 GMT server: LiteSpeed x-qc-pop: lej613-hyb-worker.quic.cloud alt-svc: h3=":443"; ma=2592000, h3-29=":443"; ma=2592000, h3-Q050=":443"; ma=2592000, h3-Q046=":443"; ma=2592000, h3-Q043=":443"; ma=2592000, quic=":443"; ma=2592000; v="43,46" https://nookyyy.com/page-sitemap.xml 2024-02-21T16:04:38+00:00 https://nookyyy.com/review-sitemap1.xml 2024-02-23T21:53:41+00:00 https://nookyyy.com/review-sitemap2.xml 2024-02-23T20:40:38+00:00 https://nookyyy.com/review-sitemap3.xml 2024-02-23T13:04:35+00:00 https://nookyyy.com/review-sitemap4.xml 2024-02-23T13:03:43+00:00 https://nookyyy.com/review-sitemap5.xml 2024-02-23T12:53:40+00:00 https://nookyyy.com/review-sitemap6.xml 2024-02-23T12:50:49+00:00 https://nookyyy.com/review-sitemap7.xml 2024-02-23T12:37:02+00:00 https://nookyyy.com/review-sitemap8.xml 2024-02-23T12:35:39+00:00 https://nookyyy.com/review-sitemap9.xml 2024-02-23T12:31:10+00:00 https://nookyyy.com/review-sitemap10.xml 2024-02-23T12:29:58+00:00 https://nookyyy.com/review-sitemap11.xml 2024-02-23T12:20:30+00:00 https://nookyyy.com/review-sitemap12.xml 2024-02-23T12:18:48+00:00 https://nookyyy.com/review-sitemap13.xml 2024-02-23T12:16:40+00:00 https://nookyyy.com/review-sitemap14.xml 2024-02-23T12:14:45+00:00 https://nookyyy.com/custom-sitemap.xml 2024-02-24T18:35:16+00:00 " ["headers"]=> object(WpOrg\Requests\Response\Headers)#36395 (1) { ["data":protected]=> array(24) { ["x-powered-by"]=> array(1) { [0]=> string(9) "PHP/8.3.3" } ["x-dns-prefetch-control"]=> array(1) { [0]=> string(2) "on" } ["x-robots-tag"]=> array(1) { [0]=> string(7) "noindex" } ["content-type"]=> array(1) { [0]=> string(23) "text/xml; charset=UTF-8" } ["pragma"]=> array(1) { [0]=> string(6) "public" } ["cache-control"]=> array(1) { [0]=> string(46) "no-cache, no-store, must-revalidate, max-age=0" } ["expires"]=> array(1) { [0]=> string(1) "0" } ["x-litespeed-cache-control"]=> array(1) { [0]=> string(8) "no-cache" } ["vary"]=> array(1) { [0]=> string(26) "Accept-Encoding,User-Agent" } ["strict-transport-security"]=> array(1) { [0]=> string(35) "max-age=15768000; includeSubDomains" } ["referrer-policy"]=> array(1) { [0]=> string(31) "strict-origin-when-cross-origin" } ["x-frame-options"]=> array(1) { [0]=> string(10) "sameorigin" } ["x-xss-protection"]=> array(1) { [0]=> string(13) "1; mode=block" } ["x-content-type-options"]=> array(1) { [0]=> string(7) "nosniff" } ["expect-ct"]=> array(1) { [0]=> string(22) "enforce, max-age=21600" } ["content-security-policy"]=> array(1) { [0]=> string(25) "upgrade-insecure-requests" } ["permissions-policy"]=> array(1) { [0]=> string(113) "geolocation=(), midi=(),sync-xhr=(),accelerometer=(), gyroscope=(), magnetometer=(), camera=(), fullscreen=(self)" } ["x-litespeed-tag"]=> array(1) { [0]=> string(4) "b79_" } ["content-length"]=> array(1) { [0]=> string(3) "408" } ["content-encoding"]=> array(1) { [0]=> string(2) "br" } ["date"]=> array(1) { [0]=> string(29) "Sat, 24 Feb 2024 16:35:23 GMT" } ["server"]=> array(1) { [0]=> string(9) "LiteSpeed" } ["x-qc-pop"]=> array(1) { [0]=> string(28) "lej613-hyb-worker.quic.cloud" } ["alt-svc"]=> array(1) { [0]=> string(167) "h3=":443"; ma=2592000, h3-29=":443"; ma=2592000, h3-Q050=":443"; ma=2592000, h3-Q046=":443"; ma=2592000, h3-Q043=":443"; ma=2592000, quic=":443"; ma=2592000; v="43,46"" } } } ["status_code"]=> int(200) ["protocol_version"]=> float(1.1) ["success"]=> bool(true) ["redirects"]=> int(0) ["url"]=> string(37) "https://nookyyy.com/sitemap_index.xml" ["history"]=> array(0) { } ["cookies"]=> object(WpOrg\Requests\Cookie\Jar)#36391 (1) { ["cookies":protected]=> array(0) { } } } ["filename":protected]=> NULL } }
The debug log shows the following:
02/24/24 18:42:37.598 [87.149.249.7:37550 1 QA8] ????? ? failed to parse custom sitemap: Failed to parse xml https://nookyyy.com/page-sitemap.xml 02/24/24 18:42:37.598 [87.149.249.7:37550 1 QA8] ????? Truncate sitemap 02/24/24 18:42:37.604 [87.149.249.7:37550 1 QA8] ????? Generate sitemap 02/24/24 18:42:37.638 [87.149.249.7:37550 1 QA8] ?? 301 from https://nookyyy.com/wp-admin/admin.php 02/24/24 18:42:37.638 [87.149.249.7:37550 1 QA8] ?? 301 to https://nookyyy.com/wp-admin/admin.php?page=litespeed-crawler 02/24/24 18:42:37.638 [87.149.249.7:37550 1 QA8] ?? Compare [from] https [to] https 02/24/24 18:42:37.638 [87.149.249.7:37550 1 QA8] ?? Compare [from] nookyyy.com [to] nookyyy.com 02/24/24 18:42:37.638 [87.149.249.7:37550 1 QA8] ?? Compare [from] /wp-admin/admin.php [to] /wp-admin/admin.php 02/24/24 18:42:37.638 [87.149.249.7:37550 1 QA8] ?? Compare [from] page=litespeed-crawler&LSCWP_CTRL=crawler&LSCWP_NONCE=6bdd7444a8&litespeed_type=refresh_map [to] page=litespeed-crawler 02/24/24 18:42:37.638 [87.149.249.7:37550 1 QA8] ?? 301 different redirection 02/24/24 18:42:37.639 [87.149.249.7:37550 1 QA8] ?? [Tag] Add --- HTTP.302 02/24/24 18:42:37.639 [87.149.249.7:37550 1 QA8] [Core] CHK html bypass: miss footer const 02/24/24 18:42:37.639 [87.149.249.7:37550 1 QA8] [Ctrl] not cacheable before ctrl finalize 02/24/24 18:42:37.639 [87.149.249.7:37550 1 QA8] [Router] get_role: administrator 02/24/24 18:42:37.639 [87.149.249.7:37550 1 QA8] [Vary] role in vary_group [group] 99 02/24/24 18:42:37.639 [87.149.249.7:37550 1 QA8] [Vary] show_admin_bar_front: true 02/24/24 18:42:37.639 [87.149.249.7:37550 1 QA8] [Vary] admin bar : true 02/24/24 18:42:37.639 [87.149.249.7:37550 1 QA8] [Vary] no custimzed vary 02/24/24 18:42:37.639 [87.149.249.7:37550 1 QA8] [Ctrl] ? forced no cache [reason] DONOTCACHEPAGE const 02/24/24 18:42:37.640 [87.149.249.7:37550 1 QA8] ?? X-LiteSpeed-Cache-Control: no-cache 02/24/24 18:42:37.640 [87.149.249.7:37550 1 QA8] [Vary] uid: 2 02/24/24 18:42:37.640 [87.149.249.7:37550 1 QA8] [Vary] role in vary_group [group] 99 02/24/24 18:42:37.640 [87.149.249.7:37550 1 QA8] [Vary] show_admin_bar_front: true 02/24/24 18:42:37.640 [87.149.249.7:37550 1 QA8] [Vary] admin bar : true 02/24/24 18:42:37.640 [87.149.249.7:37550 1 QA8] [Core] CHK html bypass: miss footer const 02/24/24 18:42:37.640 [87.149.249.7:37550 1 QA8] [Core] run hook litespeed_buffer_finalize 02/24/24 18:42:37.640 [87.149.249.7:37550 1 QA8] Response headers --- array ( 0 => 'X-Powered-By: PHP/8.3.3', 1 => 'X-LiteSpeed-Tag: b79_tag_priv,public:b79_HTTP.302', 2 => 'X-Redirect-By: WordPress', 3 => 'Location: https://nookyyy.com/wp-admin/admin.php?page=litespeed-crawler', 4 => 'X-LiteSpeed-Cache-Control: no-cache', ) 02/24/24 18:42:37.640 [87.149.249.7:37550 1 QA8] End response --------------------------------------------------Duration: 1,689.18 ms------------------------------
Any idea y it throws this error – the sitemap looks good to me?
Sidenote: if we disable the option “include images in sitemap” on rank math – it seems to work – so i guess it has something to do with this. But as i mentioned in the beginning – no issues on other sites with the same settings (images enabled)
regards
The page I need help with: [log in to see the link]
- The topic ‘No valid sitemap parsed for crawler’ is closed to new replies.