bildungsserver.net
robots.txt

Robots Exclusion Standard data for bildungsserver.net

Resource Scan

Scan Details

Site Domain bildungsserver.net
Base Domain bildungsserver.net
Scan Status Ok
Last Scan2024-09-16T17:15:08+00:00
Next Scan 2024-10-16T17:15:08+00:00

Last Scan

Scanned2024-09-16T17:15:08+00:00
URL https://bildungsserver.net/robots.txt
Domain IPs 188.40.3.214
Response IP 188.40.3.214
Found Yes
Hash a16109918a006157dcb32c05dcbd5ca319ad6ce209bf9a35e66a1f435eb411d4
SimHash 1b6d5d6ba3a3

Groups

*

Rule Path
Disallow /wp-admin/
Allow /wp-admin/admin-ajax.php
Disallow /wp-includes/
Allow /wp-includes/js/
Allow /wp-includes/images/
Disallow /trackback/
Disallow /wp-login.php
Disallow /wp-register.php

mj12bot/v1.4.8
mj12bot
geedoproductsearch
dotbot/1.2
dotbot

Rule Path
Disallow /

facebookexternalhit/1.1

No rules defined. All paths allowed.

Other Records

Field Value
crawl-delay 120

Other Records

Field Value
sitemap https://bildungsserver.net/sitemap_index.xml

Comments

  • This virtual robots.txt file was created by the Virtual Robots.txt WordPress plugin: https://www.wordpress.org/plugins/pc-robotstxt/
  • Diese Webcrawler schließe ich aus:
  • Diese Webcrawler beschränke ich zeitlich:
  • User-agent: *
  • Crawl-delay: 60