maangchi.com
robots.txt

Robots Exclusion Standard data for maangchi.com

Resource Scan

Scan Details

Site Domain maangchi.com
Base Domain maangchi.com
Scan Status Ok
Last Scan2024-11-16T00:41:34+00:00
Next Scan 2024-11-23T00:41:34+00:00

Last Scan

Scanned2024-11-16T00:41:34+00:00
URL https://maangchi.com/robots.txt
Domain IPs 162.159.134.42, 162.159.135.42, 2606:4700:7::a29f:862a, 2606:4700:7::a29f:872a
Response IP 162.159.135.42
Found Yes
Hash ae7aa4bb473705f30d41d664f63f4e41b35dcc9caa84330caf52475e63fce253
SimHash 1c7ac880a912

Groups

*

Rule Path
Disallow /post-comments
Disallow /post-comments/

gptbot

Rule Path
Disallow /

piplbot

Rule Path
Disallow /

ccbot

Rule Path
Disallow /

anthropic-ai

Rule Path
Disallow /

claude-web

Rule Path
Disallow /

google-extended

Rule Path
Disallow /

facebookbot

Rule Path
Disallow /