pridesource.com
robots.txt

Robots Exclusion Standard data for pridesource.com

Resource Scan

Scan Details

Site Domain pridesource.com
Base Domain pridesource.com
Scan Status Ok
Last Scan2024-10-05T07:31:11+00:00
Next Scan 2024-10-12T07:31:11+00:00

Last Scan

Scanned2024-10-05T07:31:11+00:00
URL https://pridesource.com/robots.txt
Domain IPs 104.21.28.25, 172.67.170.50, 2606:4700:3033::ac43:aa32, 2606:4700:3035::6815:1c19
Response IP 172.67.170.50
Found Yes
Hash 9cf5b5cf49a13c362ccccfd2adc078a724bc71bba630d1c19289990e981a9921
SimHash c1601c723f93

Groups

*

Rule Path
Disallow /cpresources/
Disallow /vendor/
Disallow /.env
Disallow /cache/
Disallow /admin/login

Other Records

Field Value
sitemap https://pridesource.com/sitemaps-1-sitemap.xml

Comments

  • robots.txt for https://pridesource.com/
  • live - don't allow web crawlers to index cpresources/ or vendor/