imsueden.de
robots.txt

Robots Exclusion Standard data for imsueden.de

Resource Scan

Scan Details

Site Domain imsueden.de
Base Domain imsueden.de
Scan Status Ok
Last Scan 2024-10-03T18:55:14+00:00
Next Scan 2024-10-10T18:55:14+00:00

Last Scan

Scanned 2024-10-03T18:55:14+00:00
URL https://imsueden.de/robots.txt
Domain IPs 2a01:4f8:c011:284::1, 49.12.17.76
Response IP 49.12.17.76
Found Yes
Hash ac41eb869e2df2992e509f5bbe573a0b6b0cfda5391a9ae27d002fe2d288e928
SimHash b2320d18afe5

Groups

*

Rule Path
Disallow /wp-admin/*
Disallow /wp-includes/*
Disallow /*?

gptbot

Rule Path
Disallow /

chatgpt-user

Rule Path
Disallow /

ccbot

Rule Path
Disallow /

google-extended

Rule Path
Disallow /
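
To illustrate how the groups above apply to a given crawler and path, here is a minimal Python sketch. It is a simplified matcher, not a full Robots Exclusion Protocol implementation: the GROUPS table is transcribed by hand from this report, the function names pattern_to_regex and is_disallowed are hypothetical, and user agents are matched by exact lowercase name rather than by product-token rules. Since the scan lists no Allow rules, rule precedence is not modelled.

```python
import re

# Disallow rules transcribed from the scan report above (assumption: the
# report lists every rule in the file; group names are lowercase as shown).
GROUPS = {
    "*": ["/wp-admin/*", "/wp-includes/*", "/*?"],
    "gptbot": ["/"],
    "chatgpt-user": ["/"],
    "ccbot": ["/"],
    "google-extended": ["/"],
}


def pattern_to_regex(pattern):
    """Translate a robots.txt path pattern into a compiled regex.

    '*' matches any run of characters; a trailing '$' anchors the end.
    Everything else is matched literally as a path prefix.
    """
    anchored = pattern.endswith("$")
    if anchored:
        pattern = pattern[:-1]
    body = ".*".join(re.escape(part) for part in pattern.split("*"))
    return re.compile(body + ("$" if anchored else ""))


def is_disallowed(user_agent, path):
    """Return True if any Disallow rule of the matching group covers path.

    Falls back to the '*' group when the agent has no group of its own.
    """
    rules = GROUPS.get(user_agent.lower(), GROUPS["*"])
    return any(pattern_to_regex(rule).match(path) for rule in rules)


if __name__ == "__main__":
    for agent, path in [
        ("GPTBot", "/"),
        ("SomeBot", "/"),
        ("SomeBot", "/?s=test"),
        ("SomeBot", "/wp-admin/options.php"),
    ]:
        print(agent, path, "disallowed" if is_disallowed(agent, path) else "allowed")
```

Under these assumptions, the four named AI-related crawlers are blocked from the whole site, while any other crawler is blocked only from the WordPress admin and include directories and from any URL containing a query string.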

Other Records

Field Value
sitemap https://www.imsueden.de/sitemap.xml

Comments

  • Legal notice: www.imsueden.de and the respective subdomains expressly reserves the right to use its content for commercial text and data mining (§ 44b UrhG).
  • The use of robots or other automated means to access www.imsueden.de or any of the respective subdomains or collect or mine data without the express permission of www.imsueden.de or any of the respective subdomains is strictly prohibited.
  • If you would like to apply for permission to crawl www.imsueden.de or any of the respective subdomains, collect or use data, please contact info@imsueden.de.