guidepia.com
robots.txt

Robots Exclusion Standard data for guidepia.com

Resource Scan

Scan Details

Site Domain guidepia.com
Base Domain guidepia.com
Scan Status Failed
Failure Reason Scan timed out.
Last Scan 2025-07-21T14:11:31+00:00
Next Scan 2025-10-19T14:11:31+00:00

Last Successful Scan

Scanned 2025-03-24T02:27:27+00:00
URL https://guidepia.com/robots.txt
Redirect https://www.guidepia.com/robots.txt
Redirect Domain www.guidepia.com
Redirect Base guidepia.com
Domain IPs 61.100.12.89
Redirect IPs 61.100.12.89
Response IP 61.100.12.89
Found Yes
Hash b631e68dd30d1143d7205676d67f41bcdcfbe9b2f89a1b972b5f07c7b0b271f1
SimHash c074738445f1

Groups

daumoa-image

Rule Path
Allow /data*/

twitterbot
facebookexternalhit
kakaostory-og-reader
googlebot-news
googlebot
mediapartners-google
daum
daumoa
naverbot
yeti
zumbot
applebot
bingbot

Rule Path
Allow /$
Allow /index.php
Allow /xml/
Allow /posting/
Allow /category/
Allow /data/
Allow /css/
Allow /js/
Allow /banner/
Allow /ftp/
Disallow /

*

Rule Path
Disallow /
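
The interaction between the Allow rules and the trailing Disallow / in the groups above can be sketched in Python. This is a minimal illustration, assuming the longest-match semantics described in RFC 9309 (the longest matching pattern wins, Allow wins ties, `*` is a wildcard, and a trailing `$` anchors the end of the path). The names `pattern_to_regex`, `is_allowed`, and the three rule lists are hypothetical and exist only for this example; individual crawlers may interpret the file differently.

```python
import re

# Rules transcribed from the Groups section: (directive, path pattern).
DAUMOA_IMAGE = [("allow", "/data*/")]
NAMED_CRAWLERS = [  # twitterbot, googlebot, bingbot, and the other listed agents
    ("allow", "/$"), ("allow", "/index.php"), ("allow", "/xml/"),
    ("allow", "/posting/"), ("allow", "/category/"), ("allow", "/data/"),
    ("allow", "/css/"), ("allow", "/js/"), ("allow", "/banner/"),
    ("allow", "/ftp/"), ("disallow", "/"),
]
DEFAULT = [("disallow", "/")]  # the "*" group


def pattern_to_regex(pattern):
    """Translate a robots.txt path pattern into a compiled regex.

    Assumed semantics: '*' matches any run of characters and a
    trailing '$' anchors the pattern to the end of the path.
    """
    anchored = pattern.endswith("$")
    body = pattern[:-1] if anchored else pattern
    regex = ".*".join(re.escape(part) for part in body.split("*"))
    return re.compile(regex + ("$" if anchored else ""))


def is_allowed(rules, path):
    """Return True if `path` may be fetched under `rules`.

    The longest matching pattern wins, Allow wins ties, and a path
    that matches no rule is treated as allowed.
    """
    best_len, allowed = -1, True
    for directive, pattern in rules:
        # re.match anchors at the start, so patterns act as path prefixes.
        if pattern_to_regex(pattern).match(path):
            is_allow = directive == "allow"
            if len(pattern) > best_len or (len(pattern) == best_len and is_allow):
                best_len, allowed = len(pattern), is_allow
    return allowed


if __name__ == "__main__":
    for path in ["/", "/posting/123", "/admin/", "/xml/index.php?act=xml"]:
        print(path, is_allowed(NAMED_CRAWLERS, path), is_allowed(DEFAULT, path))
```

Under these assumptions, the named crawlers may fetch the home page (via /$) and the listed directories, while every other path falls through to Disallow /. The * group blocks everything, including the sitemap URL. The daumoa-image group contains only an Allow rule, so under this reading nothing is disallowed for that agent.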

Other Records

Field Value
sitemap https://www.guidepia.com/xml/index.php?act=xml