regex.com
robots.txt

Robots Exclusion Standard data for regex.com

Resource Scan

Scan Details

Site Domain regex.com
Base Domain regex.com
Scan Status Failed
Failure StageFetching resource.
Failure ReasonRequest timed out.
Last Scan2025-09-07T06:53:30+00:00
Next Scan 2025-12-06T06:53:30+00:00

Last Successful Scan

Scanned2025-01-18T05:04:17+00:00
URL http://www.regex.com/robots.txt
Domain IPs 172.217.194.121, 2404:6800:4003:c05::79
Response IP 142.251.12.121
Found Yes
Hash d890915d040b6a6c4e6e6ea7e848fb4ffb26e615265a6d47bed6d07d9559744d
SimHash 69051140c131

Groups

*

Rule Path
Disallow /feeds
Allow /_/rsrc/
Allow /_/atari/*
Disallow /_/

Other Records

Field Value
sitemap http://www.regex.com:80/system/feeds/sitemap