• Resolved FireSamurai

    (@firesamurai)


    Hi,

    I’ve installed your sitemap plugin, but from what I can tell, it doesn’t index content, just page titles.

    Maybe I’m off on how sitemaps should work, but shouldn’t an xml feed also index what’s in the page/post?

    For example, if you go to
    https://www.graniteschools.org/legal/policies, you’ll see a page full of PDF files, but if I go to to the sitemap, even when I click down through the sitemap’s links, it only ever gives me page titles.

    Usually when I see a sitemap, I see a large XML file full of xml code, but with this I’m only seeing links to pages. Am I misunderstanding something?

    I’m particularly interested in files because I’m using Google Custom Search and would like the google bots to also pick up the files for the search results.

    Looking at the Reading > Settings, it appears that robots.txt does allow for the uploads folder to be crawled.

    Any insights, instructions are appreciated. Much thanks!

    https://www.ads-software.com/plugins/xml-sitemap-feed/

Viewing 2 replies - 1 through 2 (of 2 total)
  • it doesn’t index content, just page titles.

    Not even the titles, just the location. Plus additional info like post lastmod date/time and priority. Read more about what’s in a typical XML Sitemap on https://www.sitemaps.org/protocol.html

    I’m particularly interested in files because I’m using Google Custom Search and would like the google bots to also pick up the files for the search results.

    If you linked to these PDF files in your post/page/site content, then Google will eventually find them. But if you want to help Google find them faster, you can add the file URLs in the “Include custom URLs” field on your Settings > Reading admin page.

    Looking at the Reading > Settings, it appears that robots.txt does allow for the uploads folder to be crawled.

    These rules are not used in your case since you are running this site on a subdirectory. A robots.txt file can only be in the root of any domain so your WP installation in the subdir simply does not create one… Add or change any rules that you want applied to the subdir to the robots.txt (static file?) in the root of your https://www.graniteschools.org/ installation.

    Thread Starter FireSamurai

    (@firesamurai)

    Ahhh! Makes sense. Thank you.

Viewing 2 replies - 1 through 2 (of 2 total)
  • The topic ‘Does this plugin index PDF files and other file types?’ is closed to new replies.