sudiachi.com
robots.txt

Robots Exclusion Standard data for sudiachi.com

Resource Scan

Scan Details

Site Domain sudiachi.com
Base Domain sudiachi.com
Scan Status Ok
Last Scan2025-04-08T19:40:58+00:00
Next Scan 2025-04-15T19:40:58+00:00

Last Scan

Scanned2025-04-08T19:40:58+00:00
URL https://sudiachi.com/robots.txt
Domain IPs 104.21.39.65, 172.67.143.143, 2606:4700:3033::ac43:8f8f, 2606:4700:3037::6815:2741
Response IP 172.67.143.143
Found Yes
Hash 9c7c47105b35137341af767a1ea63a0a0a730c80f45077c2934bb5c551f13fd8
SimHash 712d8140f5c9

Groups

daumoa

Rule Path
Disallow /

cliqzbot

Rule Path
Disallow /

crawler4j

Rule Path
Disallow /

getintent

Rule Path
Disallow /

coccoc

Rule Path
Disallow /

proximic

Rule Path
Disallow /

grapeshot

Rule Path
Disallow /

ltx71

Rule Path
Disallow /

jamesbot

Rule Path
Disallow /

smtbot

Rule Path
Disallow /

scrapy

Rule Path
Disallow /

gptbot

Rule Path
Disallow /

claudebot

Rule Path
Disallow /

amazonbot

Rule Path
Disallow /

blexbot

Rule Path
Disallow /

ccbot

Rule Path
Disallow /

bytespider

Rule Path
Disallow /

ahrefsbot

Rule Path
Disallow /

mj12bot

Rule Path
Disallow /

semrushbot

Rule Path
Disallow /

imagesiftbot

Rule Path
Disallow /

meta-externalagent

Rule Path
Disallow /

facebookexternalhit

Rule Path
Disallow /

serpstatbot

Rule Path
Disallow /

dataforseobot

Rule Path
Disallow /

barkrowler

Rule Path
Disallow /

petalbot

Rule Path
Disallow /

googlebot

Rule Path
Disallow /update/
Disallow /cdn-cgi/
Disallow /search
Disallow /search/auto
Disallow /en/search
Disallow /en/search/auto
Allow /

*

Rule Path
Disallow /update/
Disallow /cdn-cgi/
Disallow /search
Disallow /search/auto
Disallow /en/search
Disallow /en/search/auto
Allow /