Robots Exclusion Standard data for uaruhr.org

Resource Scan

Scan Details

Site Domain uaruhr.org
Base Domain uaruhr.org
Scan Status Ok
Last Scan 2026-02-02T00:15:15+00:00
Next Scan 2026-03-04T00:15:15+00:00

Last Scan

Scanned 2026-02-02T00:15:15+00:00
URL https://uaruhr.org/robots.txt
Domain IPs 104.21.46.79, 172.67.136.107, 2606:4700:3033::6815:2e4f, 2606:4700:3037::ac43:886b
Response IP 172.67.136.107
Found Yes
Hash 86fb7802c2295c958cb69ca2fd8490617f175c32ec787d1ba1e62cf5d6f1d605
SimHash 4a147b70ee37

Groups

*

Rule Path
Disallow /search
Disallow /admin
Disallow /search?*
Disallow /search?search=
Disallow /*.pdf$
Disallow /?
Disallow /*?
Disallow /*?page=
Disallow /cgi-bin*
Allow /

gptbot

Rule Path
Disallow /

chatgpt-user

Rule Path
Disallow /

ccbot

Rule Path
Disallow /
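
The groups above can be read as a small rule table. The following is a minimal sketch, not the site's or any crawler's actual implementation, of how a client might evaluate these rules using longest-match precedence with `*` and `$` wildcards as described in RFC 9309. The names `GROUPS`, `pattern_to_regex`, and `is_allowed` are hypothetical, and the exact lowercase lookup of the user agent is a simplifying assumption (real crawlers match on their product token).

```python
import re

# Rules transcribed from the scan above: group name -> list of (rule, path).
GROUPS = {
    "*": [
        ("disallow", "/search"),
        ("disallow", "/admin"),
        ("disallow", "/search?*"),
        ("disallow", "/search?search="),
        ("disallow", "/*.pdf$"),
        ("disallow", "/?"),
        ("disallow", "/*?"),
        ("disallow", "/*?page="),
        ("disallow", "/cgi-bin*"),
        ("allow", "/"),
    ],
    "gptbot": [("disallow", "/")],
    "chatgpt-user": [("disallow", "/")],
    "ccbot": [("disallow", "/")],
}


def pattern_to_regex(pattern):
    """Translate a robots.txt path pattern into a compiled regex.

    '*' matches any run of characters; a trailing '$' anchors the end.
    """
    anchored = pattern.endswith("$")
    body = pattern[:-1] if anchored else pattern
    regex = ".*".join(re.escape(part) for part in body.split("*"))
    return re.compile(regex + ("$" if anchored else ""))


def is_allowed(user_agent, path):
    """Return True if `path` (path plus query string) may be fetched.

    Assumption: the group is chosen by exact lowercase name, falling
    back to '*'. The longest matching pattern wins; on a tie in length,
    an allow rule beats a disallow rule.
    """
    rules = GROUPS.get(user_agent.lower(), GROUPS["*"])
    best = None  # (pattern length, is_allow)
    for kind, pattern in rules:
        if pattern_to_regex(pattern).match(path):
            candidate = (len(pattern), kind == "allow")
            if best is None or candidate > best:
                best = candidate
    # No matching rule means the path is allowed by default.
    return True if best is None else best[1]


if __name__ == "__main__":
    for agent, path in [("*", "/about"), ("*", "/search?q=x"), ("GPTBot", "/about")]:
        print(agent, path, is_allowed(agent, path))
```

Under this reading, a generic crawler may fetch plain paths such as `/about` but not search pages, PDF files, or any URL carrying a query string, while `gptbot`, `chatgpt-user`, and `ccbot` are excluded from the whole site.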

Other Records

Field Value
sitemap https://uaruhr.org/sitemap.xml