derestaurantspace.com
robots.txt

Robots Exclusion Standard data for derestaurantspace.com

Resource Scan

Scan Details

Site Domain derestaurantspace.com
Base Domain derestaurantspace.com
Scan Status Ok
Last Scan2026-01-25T23:00:49+00:00
Next Scan 2026-02-24T23:00:49+00:00

Last Scan

Scanned2026-01-25T23:00:49+00:00
URL https://derestaurantspace.com/robots.txt
Domain IPs 103.133.1.1
Response IP 103.133.1.1
Found Yes
Hash 9546c85eaaae2359a3901211360d82e81d6b747b86973d82eaf8822882d18811
SimHash 7d6480f5e361

Groups

*

Rule Path
Disallow

gptbot

Rule Path
Disallow */episode/*
Disallow */tag/*

Comments

  • Limit OpenAI's GPTBot crawler to main content only