usagi.one
robots.txt

Robots Exclusion Standard data for usagi.one

Resource Scan

Scan Details

Site Domain usagi.one
Base Domain usagi.one
Scan Status Ok
Last Scan2024-09-15T22:43:27+00:00
Next Scan 2024-09-22T22:43:27+00:00

Last Scan

Scanned2024-09-15T22:43:27+00:00
URL https://usagi.one/robots.txt
Redirect https://web.usagi.one/robots.txt
Redirect Domain web.usagi.one
Redirect Base usagi.one
Domain IPs 104.21.86.13, 172.67.213.190, 2606:4700:3030::ac43:d5be, 2606:4700:3031::6815:560d
Redirect IPs 82.118.242.218
Response IP 82.118.242.218
Found Yes
Hash 9cbecb4779d124c0743d89348a3512748c50aa6db8cc9841ef7344ae12381c2b
SimHash 2b0c9e66c333

Groups

mediapartners-google

Rule Path
Disallow

gptbot

Rule Path
Disallow /

*

Rule Path
Disallow /internal/*

Other Records

Field Value
crawl-delay 1

Other Records

Field Value
sitemap https://web.usagi.one/sitemap_index.xml

Warnings

  • `clean-param` is not a known field.
  • `host` is not a known field.
  • `request-rate` is not a known field.