uspceu.com
robots.txt

Robots Exclusion Standard data for uspceu.com

Resource Scan

Scan Details

Site Domain uspceu.com
Base Domain uspceu.com
Scan Status Ok
Last Scan2025-11-08T17:25:28+00:00
Next Scan 2025-12-08T17:25:28+00:00

Last Scan

Scanned2025-11-08T17:25:28+00:00
URL https://uspceu.com/robots.txt
Redirect https://www.uspceu.com/robots.txt
Redirect Domain www.uspceu.com
Redirect Base uspceu.com
Domain IPs 104.18.22.6, 104.18.23.6, 2606:4700::6812:1606, 2606:4700::6812:1706
Redirect IPs 104.18.22.6, 104.18.23.6, 2606:4700::6812:1606, 2606:4700::6812:1706
Response IP 104.18.22.6
Found Yes
Hash 1f7f4f6472a2508bf2e3e7e6b4099c7b6dbcc4114d9c01442e565636987a578d
SimHash 790d1943a7c6

Groups

*

Rule Path Comment
Disallow /*/ctl/ Googlebot permits *
Disallow /admin/ -
Disallow /App_Browsers/ -
Disallow /App_Code/ -
Disallow /App_Data/ -
Disallow /App_GlobalResources/ -
Disallow /bin/ -
Disallow /Components/ -
Disallow /Config/ -
Disallow /contest/ -
Disallow /controls/ -
Disallow /Documentation/ -
Disallow /HttpModules/ -
Disallow /Install/ -
Disallow /Providers/ -
Disallow /Activity-Feed/userId/ Do not index user profiles

Other Records

Field Value
sitemap https://www.uspceu.com/sitemap.aspx

Comments

  • Begin robots.txt file
  • /-----------------------------------------------\
  • | In single portal/domain situations, uncomment the sitmap line and enter domain name
  • \-----------------------------------------------/
  • End of robots.txt file