
Robots Exclusion Standard data for wasserwacht-diessen.de

Resource Scan

Scan Details

Site Domain wasserwacht-diessen.de
Base Domain wasserwacht-diessen.de
Scan Status Ok
Last Scan 2024-10-06T11:35:16+00:00
Next Scan 2024-10-13T11:35:16+00:00

Last Scan

Scanned 2024-10-06T11:35:16+00:00
URL https://wasserwacht-diessen.de/robots.txt
Domain IPs 104.21.50.107, 172.67.204.241, 2606:4700:3031::6815:326b, 2606:4700:3035::ac43:ccf1
Response IP 172.67.204.241
Found Yes
Hash 3e05fe4b0ccb2d0b7d2c32c6e6a366c2127cf332852dd50290a2cffdfa707d23
SimHash 7927dc8228bb

Groups

*

Rule Path
Disallow (empty)

*

Rule Path
Disallow /wp-admin/
Allow /wp-admin/admin-ajax.php
Allow /*css?*
Allow /*js?*
Disallow /?s=
Disallow /search/
Disallow /wp-login.php
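
How the Allow and Disallow rules in this group interact can be sketched as follows. This is a minimal illustration that assumes longest-match precedence as described in RFC 9309 (the most specific matching pattern wins, and Allow wins a tie); individual crawlers may evaluate the rules differently. The names `RULES`, `pattern_to_regex` and `is_allowed` are hypothetical helpers, not part of the scan report.

```python
import re

# Rules copied from the "*" group listed above.
RULES = [
    ("disallow", "/wp-admin/"),
    ("allow", "/wp-admin/admin-ajax.php"),
    ("allow", "/*css?*"),
    ("allow", "/*js?*"),
    ("disallow", "/?s="),
    ("disallow", "/search/"),
    ("disallow", "/wp-login.php"),
]


def pattern_to_regex(pattern: str) -> "re.Pattern[str]":
    """Translate a robots.txt path pattern into a regex.

    '*' matches any character sequence, a trailing '$' anchors the end,
    and every other character (including '?') is taken literally.
    """
    anchored = pattern.endswith("$")
    if anchored:
        pattern = pattern[:-1]
    body = ".*".join(re.escape(part) for part in pattern.split("*"))
    return re.compile(body + ("$" if anchored else ""))


def is_allowed(path: str, rules=RULES) -> bool:
    """Return True if `path` may be crawled under longest-match precedence."""
    best_len = -1
    allowed = True  # assumption: a path that matches no rule is allowed
    for kind, pattern in rules:
        if not pattern:
            continue  # an empty pattern matches nothing
        if pattern_to_regex(pattern).match(path):
            length = len(pattern)
            if length > best_len or (length == best_len and kind == "allow"):
                best_len = length
                allowed = kind == "allow"
    return allowed


for path in ("/wp-admin/", "/wp-admin/admin-ajax.php", "/?s=test", "/"):
    print(path, is_allowed(path))
```

Under these assumptions, `/wp-admin/admin-ajax.php` stays crawlable because its Allow pattern is longer than the `/wp-admin/` Disallow pattern, while `/wp-admin/` itself and `/?s=test` are blocked.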

aspiegelbot
blexbot
barkrowler
dotbot
mj12bot
mauibot
nimbostratus-bot
petalbot
semrushbot
seznambot
sogou
serpstatbot
trendiction
textbulkerbot

Rule Path
Disallow /wp-admin/

Other Records

Field Value
crawl-delay 180

gptbot

Rule Path
Disallow /

Other Records

Field Value
sitemap https://wasserwacht-diessen.de/sitemap-news.xml
sitemap https://wasserwacht-diessen.de/sitemap_index.xml
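
The agent-specific groups and sitemap records above can be queried with Python's standard `urllib.robotparser`. The sketch below works on a partial reconstruction of the file, which is an assumption: the report does not show the original text, and only two of the listed crawlers are included for brevity. `site_maps()` requires Python 3.8 or later.

```python
from urllib.robotparser import RobotFileParser

# Assumed reconstruction of part of the file, based on the records above.
# It is NOT the original robots.txt.
ROBOTS_LINES = """\
User-agent: mj12bot
User-agent: semrushbot
Disallow: /wp-admin/
Crawl-delay: 180

User-agent: gptbot
Disallow: /

Sitemap: https://wasserwacht-diessen.de/sitemap-news.xml
Sitemap: https://wasserwacht-diessen.de/sitemap_index.xml
""".splitlines()

parser = RobotFileParser()
parser.parse(ROBOTS_LINES)

# Crawl delay (in seconds) requested from one of the listed crawlers.
delay = parser.crawl_delay("MJ12bot")

# gptbot is disallowed from the whole site.
gpt_ok = parser.can_fetch("GPTBot", "https://wasserwacht-diessen.de/")

# The listed crawlers are only kept out of /wp-admin/.
bot_home_ok = parser.can_fetch("MJ12bot", "https://wasserwacht-diessen.de/")
bot_admin_ok = parser.can_fetch("MJ12bot", "https://wasserwacht-diessen.de/wp-admin/")

# Sitemap records are independent of any user-agent group.
sitemaps = parser.site_maps() or []

print(delay, gpt_ok, bot_home_ok, bot_admin_ok)
print(sitemaps)
```

Note that `urllib.robotparser` matches user-agent names case-insensitively and by substring, and it does not interpret `*` wildcards inside paths, so it is not a suitable tool for checking the wildcard Allow rules in the `*` group.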

Comments

  • XML Sitemap & Google News version 5.4.9 - https://status301.net/wordpress-plugins/xml-sitemap-feed/
  • START YOAST BLOCK
  • ---------------------------
  • ---------------------------
  • END YOAST BLOCK

Warnings

  • 1 invalid line.
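
The report does not say which line was rejected or how the scanner defines an invalid line. As a rough sketch only, the check below assumes that a line is invalid when it is neither blank, nor a comment, nor of the form `field: value`. The function name `count_invalid_lines` and the sample text (including the `example.org` URL) are hypothetical.

```python
def count_invalid_lines(text: str) -> int:
    """Count lines that are not blank, not comments, and not 'field: value'."""
    invalid = 0
    for raw in text.splitlines():
        # Drop anything after '#', which starts a comment in robots.txt.
        line = raw.split("#", 1)[0].strip()
        if not line:
            continue  # blank line or comment-only line
        field, sep, _value = line.partition(":")
        if not sep or not field.strip():
            invalid += 1  # no ':' separator, or nothing before it
    return invalid


# Hypothetical sample, not the scanned file.
SAMPLE = """\
# START YOAST BLOCK
User-agent: *
Disallow:
this line has no separator
Sitemap: https://example.org/sitemap.xml  # trailing comment
"""

print(count_invalid_lines(SAMPLE))
```

A stricter validator might also reject unknown field names; this sketch deliberately accepts any field so that extensions such as `crawl-delay` are not flagged.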