hrc.org
robots.txt

Robots Exclusion Standard data for hrc.org

Resource Scan

Scan Details

Site Domain	hrc.org
Base Domain	hrc.org
Scan Status	Ok
Last Scan	2026-02-24T20:45:01+00:00
Next Scan	2026-03-26T20:45:01+00:00

Last Scan

Scanned	2026-02-24T20:45:01+00:00
URL	https://hrc.org/robots.txt
Redirect	https://www.hrc.org/robots.txt
Redirect Domain	www.hrc.org
Redirect Base	hrc.org
Domain IPs	44.242.58.44, 54.203.123.123, 54.71.0.213
Redirect IPs	44.242.58.44, 54.203.123.123, 54.71.0.213
Response IP	54.71.0.213
Found	Yes
Hash	9e58ed28761ee8002575ec85cfb5f428fe6b7a474099f55af4956be69447db29
SimHash	615499224f36

Groups

*

Rule	Path
Disallow	/cpresources/
Disallow	/vendor/
Disallow	/.env
Disallow	/cache/

gptbot

Rule	Path
Disallow	/

google-extended

Rule	Path
Disallow	/

perplexitybot

Rule	Path
Disallow	/
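
The groups listed above can be checked programmatically. The following is a minimal sketch using Python's standard `urllib.robotparser`. The `ROBOTS_TXT` text is reconstructed from the groups in this scan, so its exact layout is an assumption rather than a copy of the live file. The paths `/vendor/autoload.php` and `/resources` are hypothetical examples, and `Googlebot` stands in for any crawler that falls under the `*` group.

```python
from urllib.robotparser import RobotFileParser

# Reconstructed from the groups reported in the scan. The ordering and
# spacing of the live file are assumptions; agent names are as listed.
ROBOTS_TXT = """\
User-agent: *
Disallow: /cpresources/
Disallow: /vendor/
Disallow: /.env
Disallow: /cache/

User-agent: gptbot
Disallow: /

User-agent: google-extended
Disallow: /

User-agent: perplexitybot
Disallow: /
"""

BASE = "https://www.hrc.org"


def build_parser(text: str) -> RobotFileParser:
    """Parse robots.txt text without fetching anything over the network."""
    parser = RobotFileParser()
    parser.parse(text.splitlines())
    return parser


parser = build_parser(ROBOTS_TXT)

# (user agent, path) pairs; the paths are hypothetical illustrations.
checks = [
    ("Googlebot", "/"),
    ("Googlebot", "/resources"),
    ("Googlebot", "/vendor/autoload.php"),
    ("Googlebot", "/.env"),
    ("GPTBot", "/"),
    ("Google-Extended", "/resources"),
    ("PerplexityBot", "/"),
]

for agent, path in checks:
    allowed = parser.can_fetch(agent, BASE + path)
    print(f"{agent:16} {path:24} {'allowed' if allowed else 'disallowed'}")
```

Agent matching in `urllib.robotparser` is case-insensitive, so `GPTBot` matches the `gptbot` group. Crawlers without a group of their own fall back to the `*` group, which blocks only the four listed path prefixes.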

Other Records

Field	Value
sitemap	https://www.hrc.org/sitemaps-2-sitemap.xml
sitemap	https://www.hrc.org/es/sitemaps-2-sitemap.xml
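
The sitemap records above can be read with the same standard-library parser. This is a small sketch, assuming Python 3.8 or later (where `RobotFileParser.site_maps()` is available) and assuming the records appear in the file as ordinary `Sitemap:` lines. The variable names are illustrative only.

```python
from urllib.parse import urlparse
from urllib.robotparser import RobotFileParser

# The two sitemap records from the scan, written as robots.txt lines.
# Their position in the live file is an assumption.
SITEMAP_RECORDS = """\
Sitemap: https://www.hrc.org/sitemaps-2-sitemap.xml
Sitemap: https://www.hrc.org/es/sitemaps-2-sitemap.xml
"""

sitemap_parser = RobotFileParser()
sitemap_parser.parse(SITEMAP_RECORDS.splitlines())

# site_maps() returns None when no Sitemap lines are present.
sitemaps = sitemap_parser.site_maps() or []

for url in sitemaps:
    parts = urlparse(url)
    print(parts.netloc, parts.path)
```

Sitemap records are independent of the user-agent groups, so they apply to every crawler regardless of which group matches it.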

Comments

robots.txt for https://www.hrc.org/
live - don't allow web crawlers to index cpresources/ or vendor/
Disallow ChatGPT bot, as there's no benefit to allowing it to index your site
Disallow Google Bard and Vertex AI bots, as there's no benefit to allowing it to index your site
Disallow Perplexity bot, as there's no benefit to allowing it to index your site
