theseagull.net
robots.txt

Robots Exclusion Standard data for theseagull.net

Resource Scan

Scan Details

Site Domain theseagull.net
Base Domain theseagull.net
Scan Status Ok
Last Scan 2025-10-22T20:57:34+00:00
Next Scan 2025-11-21T20:57:34+00:00

Last Scan

Scanned 2025-10-22T20:57:34+00:00
URL https://theseagull.net/robots.txt
Redirect https://www.theseagull.net/robots.txt
Redirect Domain www.theseagull.net
Redirect Base theseagull.net
Domain IPs 199.34.228.159
Redirect IPs 199.34.228.159
Response IP 199.34.228.159
Found Yes
Hash d67656d41692bd0db4897ee5788089c432c9fe9eb4f0e568a24ce75db6486649
SimHash 4b54dc562f93

Groups

nerdybot

Rule Path
Disallow /

dotbot

No rules defined. All paths allowed.

Other Records

Field Value
crawl-delay 10

*

Rule Path
Disallow /ajax/
Disallow /apps/

Other Records

Field Value
sitemap https://www.theseagull.net/sitemap.xml
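
Example: evaluating these groups

The sketch below shows how the three groups above would be applied by a standards-following crawler, using Python's standard-library urllib.robotparser. The ROBOTS_TXT text is an assumption: it is reconstructed from the groups listed in this report, so the real file's casing, ordering, comments, and whitespace may differ. The names load_rules, BASE, and the "examplebot" user agent are hypothetical and used only for illustration.

```python
from urllib.robotparser import RobotFileParser

# Assumption: reconstructed from the groups in this report, not a copy of
# the file actually served at https://www.theseagull.net/robots.txt.
ROBOTS_TXT = """\
User-agent: nerdybot
Disallow: /

User-agent: dotbot
Crawl-delay: 10

User-agent: *
Disallow: /ajax/
Disallow: /apps/

Sitemap: https://www.theseagull.net/sitemap.xml
"""

# The scan followed a redirect to the www host, so URLs are built on it.
BASE = "https://www.theseagull.net"


def load_rules(text: str) -> RobotFileParser:
    """Parse robots.txt text (hypothetical helper, no network access)."""
    parser = RobotFileParser()
    parser.parse(text.splitlines())
    return parser


rules = load_rules(ROBOTS_TXT)

if __name__ == "__main__":
    checks = [
        ("nerdybot", "/"),          # blocked from the whole site
        ("dotbot", "/ajax/x"),      # own group has no rules, so allowed
        ("examplebot", "/ajax/x"),  # falls through to the * group
        ("examplebot", "/about"),   # not covered by any Disallow
    ]
    for agent, path in checks:
        print(agent, path, rules.can_fetch(agent, BASE + path))
    print("dotbot crawl-delay:", rules.crawl_delay("dotbot"))
    print("sitemaps:", rules.site_maps())  # site_maps() needs Python 3.8+
```

Note that dotbot matches its own group rather than the * group, so the /ajax/ and /apps/ rules do not apply to it; only the crawl-delay of 10 does. Crawl-delay is a nonstandard field, and each crawler decides whether and how to honor it.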