manpeace.org
robots.txt

Robots Exclusion Standard data for manpeace.org

Resource Scan

Scan Details

Site Domain manpeace.org
Base Domain manpeace.org
Scan Status Ok
Last Scan 2024-11-09T00:43:13+00:00
Next Scan 2024-11-16T00:43:13+00:00

Last Scan

Scanned 2024-11-09T00:43:13+00:00
URL https://manpeace.org/robots.txt
Domain IPs 112.171.184.147
Response IP 112.171.184.147
Found Yes
Hash c373c045f8cae0b06dd8bbd221de06ab7ee43afb04bea0a177cff35384d25fd7
SimHash 8406cc10cef5

Groups

googlebot-image

Rule Path
Allow /

mediapartners-google

Rule Path
Allow /

gooblebot

Rule Path
Allow /

yeti

Rule Path
Allow /

daum

Rule Path
Allow /

grapeshot

Rule Path
Allow /

*

Rule Path
Allow /
Allow /ads.txt

bingbot

No rules defined. All paths allowed.

Other Records

Field Value
crawl-delay 3600

*

Rule Path
Allow /
Disallow /adm/
Disallow /data/
Disallow /extend/
Disallow /install/
Disallow /js/
Disallow /lib/
Disallow /plugin/
Disallow /skin/
Disallow /theme/
Disallow /bbs/login.php
Disallow /bbs/board.php?bo_table=event

Other Records

Field Value
sitemap http://manpeace.org/sitemap.xml
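
The rules listed for the two `*` groups can be checked against sample paths with a small matcher. The sketch below is illustrative only: it assumes the longest-match precedence described in RFC 9309 (the most specific matching path wins, and Allow wins a tie), it assumes both `*` groups are merged into one rule set, and it does plain prefix matching with no support for the `*` or `$` wildcards. The names `RULES` and `is_allowed` are hypothetical and not part of the scan output.

```python
# Rules copied from the two "*" groups in the scan above, merged into one list.
RULES = [
    ("allow", "/"),
    ("allow", "/ads.txt"),
    ("disallow", "/adm/"),
    ("disallow", "/data/"),
    ("disallow", "/extend/"),
    ("disallow", "/install/"),
    ("disallow", "/js/"),
    ("disallow", "/lib/"),
    ("disallow", "/plugin/"),
    ("disallow", "/skin/"),
    ("disallow", "/theme/"),
    ("disallow", "/bbs/login.php"),
    ("disallow", "/bbs/board.php?bo_table=event"),
]


def is_allowed(path, rules=RULES):
    """Return True if `path` may be crawled under longest-match precedence."""
    best_len = -1
    allowed = True  # a path that matches no rule is allowed by default
    for kind, prefix in rules:
        if not path.startswith(prefix):
            continue
        longer = len(prefix) > best_len
        tie_goes_to_allow = len(prefix) == best_len and kind == "allow"
        if longer or tie_goes_to_allow:
            best_len = len(prefix)
            allowed = kind == "allow"
    return allowed


if __name__ == "__main__":
    for sample in ["/", "/ads.txt", "/adm/index.php",
                   "/bbs/board.php?bo_table=event&page=2",
                   "/bbs/board.php?bo_table=free"]:
        print(sample, is_allowed(sample))
```

Parsers that apply rules in file order rather than by longest match can give a different answer here, because `Allow /` appears first and matches every path. Results from this sketch should be read as one interpretation, not as a statement of how any particular crawler behaves.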