geschiedenisweb.nl
robots.txt

Robots Exclusion Standard data for geschiedenisweb.nl

Resource Scan

Scan Details

Site Domain geschiedenisweb.nl
Base Domain geschiedenisweb.nl
Scan Status Ok
Last Scan2024-10-06T11:48:02+00:00
Next Scan 2024-10-13T11:48:02+00:00

Last Scan

Scanned2024-10-06T11:48:02+00:00
URL https://geschiedenisweb.nl/robots.txt
Domain IPs 104.21.90.112, 172.67.200.97, 2606:4700:3031::ac43:c861, 2606:4700:3035::6815:5a70
Response IP 104.21.90.112
Found Yes
Hash 93c45eef4c86d1e24c8162ee92602b3505c69bccfe959a577d332632d1cbf5fe
SimHash 7beed82228b9

Groups

*

Rule Path
Disallow /wp-admin/
Allow /wp-admin/admin-ajax.php
Allow /*css?*
Allow /*js?*
Disallow /?s=
Disallow /search/
Disallow /wp-login.php

aspiegelbot
blexbot
barkrowler
dotbot
mj12bot
mauibot
nimbostratus-bot
petalbot
semrushbot
seznambot
sogou
serpstatbot
trendiction
textbulkerbot

Rule Path
Disallow /wp-admin/

Other Records

Field Value
crawl-delay 180

gptbot

Rule Path
Disallow /

Other Records

Field Value
sitemap https://geschiedenisweb.nl/sitemap-news.xml
sitemap https://geschiedenisweb.nl/sitemap_index.xml

Comments

  • XML Sitemap & Google News version 5.4.9 - https://status301.net/wordpress-plugins/xml-sitemap-feed/
  • START YOAST BLOCK
  • ---------------------------
  • ---------------------------
  • END YOAST BLOCK

Warnings

  • 1 invalid line.