selflib.me
robots.txt

Robots Exclusion Standard data for selflib.me

Resource Scan

Scan Details

Site Domain selflib.me
Base Domain selflib.me
Scan Status Ok
Last Scan2024-11-11T18:46:36+00:00
Next Scan 2024-11-18T18:46:36+00:00

Last Scan

Scanned2024-11-11T18:46:36+00:00
URL https://selflib.me/robots.txt
Domain IPs 82.118.242.218
Response IP 82.118.242.218
Found Yes
Hash cee54373f6cbe0f92c6101f60bcdda1ba97723d6bb69dba8a88e3da7dff476f9
SimHash 2f069e60a333

Groups

mediapartners-google

Rule Path
Disallow

gptbot

Rule Path
Disallow /

*

Rule Path
Disallow /internal/*

Other Records

Field Value
crawl-delay 1

Other Records

Field Value
sitemap https://selflib.me/sitemap_index.xml

Warnings

  • `clean-param` is not a known field.
  • `host` is not a known field.
  • `request-rate` is not a known field.