guidepia.com
robots.txt
Robots Exclusion Standard data for guidepia.com
Resource Scan
Scan Details
Site Domain | guidepia.com |
Base Domain | guidepia.com |
Scan Status | Failed |
Failure Reason | Scan timed out. |
Last Scan | 2025-07-21T14:11:31+00:00 |
Next Scan | 2025-10-19T14:11:31+00:00 |
Last Successful Scan
Scanned | 2025-03-24T02:27:27+00:00 |
URL | https://guidepia.com/robots.txt |
Redirect | https://www.guidepia.com/robots.txt |
Redirect Domain | www.guidepia.com |
Redirect Base | guidepia.com |
Domain IPs | 61.100.12.89 |
Redirect IPs | 61.100.12.89 |
Response IP | 61.100.12.89 |
Found | Yes |
Hash | b631e68dd30d1143d7205676d67f41bcdcfbe9b2f89a1b972b5f07c7b0b271f1 |
SimHash | c074738445f1 |
Groups
twitterbot
facebookexternalhit
kakaostory-og-reader
googlebot-news
googlebot
mediapartners-google
daum
daumoa
naverbot
yeti
zumbot
applebot
bingbot
Rule | Path |
---|---|
Allow | /$ |
Allow | /index.php |
Allow | /xml/ |
Allow | /posting/ |
Allow | /category/ |
Allow | /data/ |
Allow | /css/ |
Allow | /js/ |
Allow | /banner/ |
Allow | /ftp/ |
Disallow | / |
*
Rule | Path |
---|---|
Disallow | / |
Other Records
Field | Value |
---|---|
sitemap | https://www.guidepia.com/xml/index.php?act=xml |