pgecorp.com
robots.txt

Robots Exclusion Standard data for pgecorp.com

Resource Scan

Scan Details

Site Domain pgecorp.com
Base Domain pgecorp.com
Scan Status Ok
Last Scan2024-10-19T21:48:04+00:00
Next Scan 2024-11-18T21:48:04+00:00

Last Scan

Scanned2024-10-19T21:48:04+00:00
URL https://www.pgecorp.com/robots.txt
Domain IPs 23.215.7.12, 23.215.7.17, 2600:1413:b000:1b::17d7:70c, 2600:1413:b000:1b::17d7:711
Response IP 23.215.7.17
Found Yes
Hash 8fb3341722d9bb187d02fbea711da04eb486e51424debb0f93e62677454987a7
SimHash 291ccae582d3

Groups

*

Rule Path
Disallow /

gptbot

Rule Path
Disallow /

*

Rule Path
Disallow /cgi-bin/
Disallow /corp_responsibility/reports/2003/
Disallow /corp_responsibility/reports/2004/
Disallow /corp_responsibility/reports/2005/
Disallow /corp_responsibility/reports/2006/

googlebot-image

Rule Path
Disallow /

Other Records

Field Value
sitemap https://www.pgecorp.com/sitemap.xml