therillkawan.com
robots.txt

Robots Exclusion Standard data for therillkawan.com

Resource Scan

Scan Details

Site Domain therillkawan.com
Base Domain therillkawan.com
Scan Status Ok
Last Scan 2025-12-14T23:09:01+00:00
Next Scan 2026-01-13T23:09:01+00:00

Last Scan

Scanned 2025-12-14T23:09:01+00:00
URL https://therillkawan.com/robots.txt
Domain IPs 104.21.64.132, 172.67.151.10, 2606:4700:3031::6815:4084, 2606:4700:3032::ac43:970a
Response IP 172.67.151.10
Found Yes
Hash 202ebfb294c9983a90414df7cab0d96eb8c9a6322683b24cac96c305a6698c64
SimHash 6b408885c5b1
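
The Hash value above is 64 hexadecimal characters long, which is consistent with a SHA-256 digest of the fetched file, although the report does not name the algorithm. As a minimal sketch under that assumption, a later copy of robots.txt could be compared against the recorded value as follows. The names `RECORDED_HASH`, `body_digest`, and `is_unchanged` are hypothetical and chosen only for illustration.

```python
import hashlib

# Value copied from the "Hash" row of the scan report.
# ASSUMPTION: it is a SHA-256 hex digest of the raw response body.
RECORDED_HASH = "202ebfb294c9983a90414df7cab0d96eb8c9a6322683b24cac96c305a6698c64"


def body_digest(body: bytes) -> str:
    """Return the SHA-256 hex digest of a raw robots.txt body."""
    return hashlib.sha256(body).hexdigest()


def is_unchanged(body: bytes, recorded: str = RECORDED_HASH) -> bool:
    """Return True if the body hashes to the recorded digest."""
    return body_digest(body) == recorded.lower()
```

If the scanner normalizes the body before hashing (for example, line endings), a byte-for-byte digest like this one would not match even for an unchanged file.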

Groups

googlebot
slurp
bingbot

Rule Path
Allow /

*

Rule Path
Allow /

Other Records

Field Value
sitemap https://therillkawan.com/data-sitemap.xml
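
The groups and records listed above can be checked with Python's standard `urllib.robotparser`. The sketch below is only an illustration: `RECONSTRUCTED` is a hypothetical rebuild of the file from this report, not the file as served, and it leaves out the unrecognized `host` line because the report does not show its value. The helper `allowed` is likewise an illustrative name.

```python
from urllib.robotparser import RobotFileParser

# ASSUMPTION: a rebuild of robots.txt from the groups and records in this
# report. The real file may differ in order, case, comments, and extra lines.
RECONSTRUCTED = """\
User-agent: googlebot
User-agent: slurp
User-agent: bingbot
Allow: /

User-agent: *
Allow: /

Sitemap: https://therillkawan.com/data-sitemap.xml
"""

parser = RobotFileParser()
parser.parse(RECONSTRUCTED.splitlines())


def allowed(agent: str, path: str = "/") -> bool:
    """Return True if the given user agent may fetch the path on this site."""
    return parser.can_fetch(agent, "https://therillkawan.com" + path)
```

Because both groups contain a single `Allow: /` rule, `allowed` returns True for the named crawlers and for any other agent, and `parser.site_maps()` (Python 3.8 or later) returns the one sitemap URL.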

Warnings

  • `host` is not a known field.